XML parsing techniques

Peter Seibel peter at weblogic.com
Thu Feb 4 00:54:01 GMT 1999

Can anyone out there characterize the problems, challenges, and best
practices when it comes to parsing XML? Looking (briefly) at the source of
a couple of parsers (Lark, Microsoft's, and XP), it looks like they are all
some flavor of hand-written recursive descent. (Well, Lark has that funky
hand-coded DFA thing, which I didn't really spend much time trying to grok
-- that's not really recursive descent as I understand things.) Is there a
reason no one seems to be using parser generators (like ANTLR or JavaCC)?
This may be more a question about the limitations of those tools, which were
designed for parsing things that look a lot more like Algol than XML does.
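
To make "hand-written recursive descent" concrete, here is a minimal sketch
in Python for a tiny XML-like subset (elements and text only; no attributes,
entities, comments, or DTDs). It is not taken from Lark, Microsoft's parser,
or XP; the names Parser, parse_element, parse_content, and parse are
hypothetical, and the name rule is deliberately simplified.

```python
class ParseError(Exception):
    """Raised when the input does not match the tiny grammar below."""


class Parser:
    """Recursive descent over a tiny XML-like subset: <name>content</name>."""

    def __init__(self, text):
        self.text = text
        self.pos = 0

    def expect(self, literal):
        # Consume an exact literal or fail at the current position.
        if not self.text.startswith(literal, self.pos):
            raise ParseError(f"expected {literal!r} at offset {self.pos}")
        self.pos += len(literal)

    def parse_name(self):
        # Simplified name rule (an assumption): alphanumerics, '-', '_', '.'.
        start = self.pos
        while self.pos < len(self.text) and (
            self.text[self.pos].isalnum() or self.text[self.pos] in "-_."
        ):
            self.pos += 1
        if start == self.pos:
            raise ParseError(f"expected a name at offset {start}")
        return self.text[start:self.pos]

    def parse_element(self):
        # element ::= '<' name '>' content '</' name '>'
        self.expect("<")
        name = self.parse_name()
        self.expect(">")
        children = self.parse_content()
        self.expect("</")
        close = self.parse_name()
        # The end tag must repeat the start tag's name; this check lives in
        # ordinary code rather than in a grammar rule.
        if close != name:
            raise ParseError(f"mismatched end tag </{close}> for <{name}>")
        self.expect(">")
        return (name, children)

    def parse_content(self):
        # content ::= (text | element)*, stopping at an end tag or end of input
        children = []
        while self.pos < len(self.text):
            if self.text.startswith("</", self.pos):
                break
            if self.text[self.pos] == "<":
                children.append(self.parse_element())
            else:
                end = self.text.find("<", self.pos)
                if end == -1:
                    end = len(self.text)
                children.append(self.text[self.pos:end])
                self.pos = end
        return children


def parse(text):
    # Parse exactly one top-level element and reject anything after it.
    parser = Parser(text)
    tree = parser.parse_element()
    if parser.pos != len(text):
        raise ParseError(f"trailing input at offset {parser.pos}")
    return tree
```

Each grammar rule becomes one method, and nesting falls out of
parse_element and parse_content calling each other. Calling
parse("<a>hi<b>x</b></a>") returns a nested tuple of the element name and
its children.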


P.S. Are there any parsers out there that actually return DOM objects?

Peter Seibel          Perl/Java/English Hacker      peter at weblogic.com

                     Is Windows98 Y2K compliant? 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
