SAX and whitespace (was Re: Problems with whitespace and msxml)
David Megginson
ak117 at freenet.carleton.ca
Thu Jan 1 15:01:23 GMT 1998
> > An XML processor must always pass all characters in a document
> > that are not markup through to the application. A validating
> > XML processor must distinguish white space in element content
> > from other non-markup
What the PR means to say here is that a DTD-driven XML parser has to
treat whitespace in element content differently than whitespace in
mixed content -- this, of course, has nothing to do with xml:space.
If there is no DTD, then all element types are assumed to allow mixed
content, so a DTD-driven XML parser ("validating XML processor") would
report all whitespace as significant.
What should SAX do with ignorable whitespace?
1) Report it as a distinct event, like Ælfred does?
2) Treat it as regular character data?
3) Ignore it (as in regular SGML)?
(1) seems to be what the PR requires. Either (2) or (3) could cause
strange results.
All the best,
David
--
David Megginson ak117 at freenet.carleton.ca
Microstar Software Ltd. dmeggins at microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list