XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)

Michael Kay M.H.Kay at eng.icl.co.uk
Wed Sep 2 11:27:59 BST 1998


>... You could thereafter deal directly with the object
store or
>even just the view (indexed into the store) and not
repeatedly reparse
>the document.

The sirens have lured you!

I have a lot of experience with storing parsed document
trees in an object database and I have experimented with
storing the Java serialization of DOM-like models on disk,
and for what it's worth, in both cases retrieving the
document takes a lot longer than reparsing original XML. The
main reason is simply that there are more bytes to read.

The only technique that I find really effective for handling
large documents is to split them up into lots of small ones.
That way you only parse the bits the user actually wants to
see.

MK


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list