XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)

David Megginson david at megginson.com
Tue Sep 1 19:07:11 BST 1998

Ingo Macherius writes:

 > My afterall impression is that most available tools do well with
 > toy examples, but any input being in the MB range easily blasts
 > them. At least that's true for what came from MS so far.

I don't think that's true in general.  Most of the Java-based XML
parsers I've tried seem to be able to handle Jon Bosak's XML Old
Testament (nearly 4MB) just fine, if somewhat slowly -- I used ot.xml
for routine testing and profiling while developing AElfred, and
AElfred barely broke a sweat.

The problem comes if the parser tries to build a tree rather than
simply reporting an event stream.  Depending on the implementation,
in-memory document trees tend to be much larger than the source text.
With a naive tree implementation, a 10MB document might use 100MB or
more of virtual memory for the document tree -- that'll bring most
current desktop systems to a screeching halt.
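
To make the distinction concrete, here is a minimal sketch of the two
styles using Python's standard xml.sax and xml.dom.minidom modules.
These are stand-ins chosen purely for illustration, not AElfred or any
of the parsers discussed above, and the sample document and the
function names are made up.  Both functions count elements: the first
only reacts to events as they are reported and keeps a single integer,
while the second holds the whole document tree in memory before it
counts anything.

```python
import xml.sax
from xml.dom import minidom

# Hypothetical sample input; a real test would use a multi-megabyte file.
SAMPLE = b"<bible><book><v>In the beginning</v><v>And the earth</v></book></bible>"


class ElementCounter(xml.sax.ContentHandler):
    """Counts start tags as they are reported; never builds a tree."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def startElement(self, name, attrs):
        # Called once per start tag.  The only state kept is a counter.
        self.count += 1


def count_streaming(data):
    """Event-stream style: the handler's state does not grow with the input."""
    handler = ElementCounter()
    xml.sax.parseString(data, handler)
    return handler.count


def count_with_tree(data):
    """Tree style: the whole document becomes node objects before counting."""
    doc = minidom.parseString(data)
    return len(doc.getElementsByTagName("*"))


streamed = count_streaming(SAMPLE)
from_tree = count_with_tree(SAMPLE)
print(streamed, from_tree)
```

Both calls give the same answer; the difference is in what has to be
held in memory while getting there.  How much larger the tree is than
the source text depends entirely on the implementation, so treat this
as an illustration of the two approaches rather than a measurement.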

All the best,


David Megginson                 david at megginson.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
