Canonical Encoding for XML Elements
David Megginson
ak117 at freenet.carleton.ca
Fri Jan 9 11:53:17 GMT 1998
Chris Smith writes:
> In particular, do parsers keep CDATA sections distinct from
> character data?
CDATA sections are part of the document's physical representation
rather than of its logical structure, so they would likely be reported
only by a specialised parser designed for authoring tools or
repositories. For other purposes, it doesn't matter; the following
two are exactly equivalent:
<example><![CDATA[
<sample>text</sample>
]]></example>
<example>
<sample>text</sample>
</example>
Switching between the two should produce exactly the same rendered
output from a formatting engine, exactly the same entries in a
database, etc. etc. SAX would report both as
start element: example
characters: "<sample>text</sample>"
end element: example
(some might break the second event into several smaller ones).
All the best,
David
--
David Megginson ak117 at freenet.carleton.ca
Microstar Software Ltd. dmeggins at microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list