CDATA by any other name... (was The raw and the cooked)

David Brownell db at Eng.Sun.COM
Sun Nov 1 03:55:12 GMT 1998


As another data point -- Sun's validating parser accepts Henry's
original example, no problems.  (And it does so very quickly,
but you knew that!  ;-)

A pragmatic answer "why":  it uses the data model implied by
SAX, which treats characters "quoted" by "<![CDATA[" ... "]]>"
like any other characters (but without using '&' and '<' as
markup delimiters).

I think that's the right model.  It's clear from 2.7 that the
text inside a CDATA section is character data, not markup; the
example is clear, if the text could be misunderstood.

Since 2.4 makes clear (sentence 1!) that the _only_ two sorts
of stuff in XML are "character data" and "markup", so there's
no way I could justify treating space inside a CDATA section
differently from other characters (in terms of data model).
Hence it's not possible to distinguish whitespace characters
that are the content of a CDATA section from the same text
that's outside of a CDATA section.

- Dave

p.s. Yes, if there's confusion, the spec probably needs to
	be clarified.  Not a crime in any 1.0 spec.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list