XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)

Michael Kay M.H.Kay at eng.icl.co.uk
Wed Sep 2 11:27:59 BST 1998

>... You could thereafter deal directly with the object
store or
>even just the view (indexed into the store) and not
repeatedly reparse
>the document.

The sirens have lured you!

I have a lot of experience with storing parsed document
trees in an object database and I have experimented with
storing the Java serialization of DOM-like models on disk,
and for what it's worth, in both cases retrieving the
document takes a lot longer than reparsing original XML. The
main reason is simply that there are more bytes to read.

The only technique that I find really effective for handling
large documents is to split them up into lots of small ones.
That way you only parse the bits the user actually wants to


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list