Roll-Your-Own Parsers

Simon St.Laurent simonstl at
Fri Feb 12 22:17:17 GMT 1999

At 11:58 AM 2/12/99 -0700, roddey at wrote:
>We have taken that approach with our 'Version 2" parsers, Java and C++.
>They are pretty well layered and pluggable. Don't plug in a validation
>handler and you won't do any validation work. Don't plug in an entity
>handler, and you won't get any entity information, etc... Basically we've
>just extended the concept of a SAX-like handler all the way into the core
>of the parser. It allows both for extensibility by rolling your own
>handler, and for the client who is putting together a particular type of
>parser configuration to tell the lowest level of the parser "do the least
>work possible for this group of things, since I'm not even interested".

This sounds (and looks) promising.  I'm not clear exactly _how_ modular it
is, though.  Can I take info from the SAX parser, abuse (or nicely process)
it, and feed it back into the DOM tree builder?  Or am I stuck to choosing
validating/non-validating and DOM/SAX?  Is the 'SAX-like handler' really
SAX with extras, or is it incompatible?

Reading the API is kind of weird.  I'd like to know what this 'scanner'
critter is doing too.

Fun stuff, though!

Simon St.Laurent
XML: A Primer / Building XML Applications (April)
Sharing Bandwidth / Cookies

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list