half-baked parsers vs binary XML

Gabe Beged-Dov begeddov at jfinity.com
Mon Mar 29 20:02:32 BST 1999

David Megginson wrote:

> In other words, it's not the XML *input* that you need to optimize,
> but the *output* -- for example, if you have a Perl script that
> renders XML in HTML, the best speed optimization is to cache the
> result and reserve it for any request with the same parameters.

Assume that caching isn't an option. I.e. you have to make all your processing reasonably
fast. Its not acceptable to make 80% of your processing really fast.

> The XML/SGML processing model is generally to walk through a document
> (as a collection of events or as a tree) and fire off handlers for
> different types of things.  Even a short to medium-length XML document
> can cause the handlers to be fired off many thousands of times, and if
> you're trying to handle hundreds of requests per second, that's going
> to cause problems with or without XML.

Are we talking about throughput or responsiveness?  It would be useful  to bring up some
use-cases where XML processing can't be employed using the default handler firing model and
try to understand what the alternatives are.

Matt Sergeant has brought up one that he might be able to flesh out involving large scale
usage. I'm sure there are others.

Gabe Beged-Dov

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list