Streams, protocols, documents and fragments

Mark Birbeck Mark.Birbeck at iedigital.net
Wed Feb 24 22:27:46 GMT 1999


David Megginson wrote:
> Tom Harding writes:
> 
>  > Mark Birbeck wrote:
>  > 
>  > > You know when you've reached the end by the closing tag. That's
>  > > it!
>  > 
>  > There's the pesky issue of the document "epilog" which is Misc
>  > markup that can follow the closing tag.  If the XML Rec were
>  > changed to disallow this then I would be in complete agreement.

But you do know that the *document* has ended. You may receive PIs, but
aren't they directed at applications? If so, then if you are not
expecting any why not ignore them? (And comments.) Then all you need to
wait for is the next '<?xml ... ?>' or <!DOCTYPE or element tag - that
is, the next document.

> More importantly, you don't want to have to parse an entire document
> just to find out where it ends because that forces your system into
> linear processing -- on a busy server, it is absolutely necessary to
> be to isolate the documents/packets quickly and pass them off to
> separate threads (or even separate boxes) for parsing and processing.

You may be able to *parse* parts of the document before the entire
document has completely arrived, but it is surely wrong to *process* the
document because you don't yet know if it's well-formed or valid. Some
of the parsing you did in one process might be invalidated as the result
of another process.

Regards,

Mark

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list