XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)

Tyler Baker tyler at infinet.com
Wed Sep 2 23:52:09 BST 1998


Don Park wrote:

> >I would suspect that the DOM package provided by Don Park has similiar
> >performance and memory consumption.  Your best bet would probably be to
> look at
> >an XSL package which takes a DOM tree of your XML data, and a DOM tree of
> an XSL
> >stylesheet and spits out the content.  That way you are not stuck with an
> MS,
> >IBM, Oracle, or whatever implementation that you are not happy with.
>
> About 10 seconds and 10 meg of memory to convert each meg of XML into DOM
> with JIT enabled.

For my implementation, for ot.xml (a 4 meg document) only about 1-2 megs of RAM
is used to store the 4 meg file in RAM due to all Names being cached at the
parser level.  It also takes only 10-12 seconds with a P-120 running Symantec's
JIT for JDK 1.2 b4 to build the entire DOM tree.  For spitting out the DOM tree
(and normalizing all the Text nodes) it takes about 15-20 seconds of which 5
seconds is spent normalizing text nodes and most of the rest of this time is
actually spent in a brute force search and replace method that scans all
character data and attribute values and replaces any occurrences of entity values
with entity names.  This can be very expensive but I know no other way around it.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list