SAX and whitespace (was Re: Problems with whitespace and msxml)

David Megginson ak117 at freenet.carleton.ca
Thu Jan 1 15:01:23 GMT 1998


 > >   An XML processor must always pass all characters in a document
 > >   that are not markup through to the application. A validating
 > >   XML processor must distinguish white space in element content
 > >   from other non-markup

What the Proposed Recommendation (PR) means to say here is that a
DTD-driven XML parser has to treat whitespace in element content
differently from whitespace in mixed content.  This, of course, has
nothing to do with xml:space.
If there is no DTD, then all element types are assumed to allow mixed
content, so a DTD-driven XML parser ("validating XML processor") would
report all whitespace as significant.
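
To make the rule concrete, here is a minimal sketch in Python of the
classification described above.  The function name, the return labels,
and the idea of passing the DTD's element-content types in as a plain
set are all assumptions made for illustration; none of them come from
the PR or from any SAX draft.

```python
from typing import Optional, Set

# The four characters that XML treats as white space.
XML_WHITESPACE = " \t\r\n"


def classify_text(parent: str, text: str,
                  element_only: Optional[Set[str]]) -> str:
    """Classify a run of text found directly inside element `parent`.

    `element_only` is a hypothetical stand-in for what a DTD-driven
    parser learns from the DTD: the element types declared with
    element content (no #PCDATA).  Pass None when there is no DTD,
    in which case every element type is assumed to allow mixed content.
    """
    if text.strip(XML_WHITESPACE) != "":
        return "character data"
    if element_only is not None and parent in element_only:
        return "ignorable whitespace"
    return "significant whitespace"
```

With a DTD that gives "list" element content, a newline between its
children is classified as ignorable; with no DTD at all, the same
newline comes back as significant.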

What should SAX do with ignorable whitespace?

1) Report it as a distinct event, like Ælfred does?
2) Treat it as regular character data?
3) Ignore it (as in regular SGML)?

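(1) seems to be what the PR requires.  (2) would hide the distinction
that the PR asks a validating processor to make, and (3) would drop
characters that the PR says must always be passed through to the
application, so either could cause strange results.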
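
For comparison, the three options can be sketched on top of Python's
xml.sax module.  This is only an illustration under stated
assumptions: the class name, the policy strings, and the hard-coded
set of element-content types are hypothetical, and the handler does
the classification itself rather than relying on the underlying
parser to do it.

```python
import xml.sax
from xml.sax.handler import ContentHandler


class WhitespacePolicyHandler(ContentHandler):
    """Record events under one of three whitespace policies.

    policy is a hypothetical label for the options in the post:
      "distinct" -> (1) report ignorable whitespace as its own event
      "chardata" -> (2) treat it as regular character data
      "ignore"   -> (3) drop it
    element_only stands in for the element types that a DTD
    declares with element content.
    """

    def __init__(self, element_only, policy):
        super().__init__()
        self.element_only = element_only
        self.policy = policy
        self.stack = []    # names of the currently open elements
        self.events = []   # (event name, text) pairs, in document order

    def startElement(self, name, attrs):
        self.stack.append(name)

    def endElement(self, name):
        self.stack.pop()

    def characters(self, content):
        # Whitespace-only text directly inside an element-content type.
        ignorable = (bool(self.stack)
                     and self.stack[-1] in self.element_only
                     and content.strip(" \t\r\n") == "")
        if not ignorable or self.policy == "chardata":
            self.events.append(("characters", content))
        elif self.policy == "distinct":
            self.ignorableWhitespace(content)
        # policy == "ignore": the whitespace is silently dropped

    def ignorableWhitespace(self, whitespace):
        self.events.append(("ignorableWhitespace", whitespace))


def run(policy):
    """Parse a small sample document and return the recorded events."""
    handler = WhitespacePolicyHandler({"list"}, policy)
    xml.sax.parseString(b"<list>\n<item>a b</item>\n</list>", handler)
    return handler.events
```

Calling run("distinct"), run("chardata") and run("ignore") on the same
document shows the difference: only the first keeps the newlines
inside <list> while still letting the application tell them apart from
the text inside <item>.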


All the best,


David

-- 
David Megginson                 ak117 at freenet.carleton.ca
Microstar Software Ltd.         dmeggins at microstar.com
      http://home.sprynet.com/sprynet/dmeggins/




