Canonical Encoding for XML Elements

David Megginson ak117 at
Fri Jan 9 11:53:17 GMT 1998

Chris Smith writes:

 > In particular, do parsers keep CDATA sections distinct from
 > character data?

CDATA sections are part of the document's physical representation
rather than of its logical structure, so they would likely be reported
only by a specialised parser designed for authoring tools or
repositories.  For other purposes, it doesn't matter; the following
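two are exactly equivalent:

  <example><![CDATA[<sample>text</sample>]]></example>
  <example>&lt;sample&gt;text&lt;/sample&gt;</example>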

Switching between the two should produce exactly the same rendered
output from a formatting engine, exactly the same entries in a
database, etc. etc.  SAX would report both as

  start element: example
  characters: "<sample>text</sample>"
  end element: example

(Some parsers might split the characters event into several smaller
ones.)
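
As a rough illustration of the point above, here is a minimal sketch
using Python's standard xml.sax module.  The names EventRecorder and
sax_events are hypothetical helpers, and the two sample documents are
assumed forms of the CDATA and escaped variants; adjacent character
chunks are merged because a parser is free to split them.

```python
import xml.sax


class EventRecorder(xml.sax.ContentHandler):
    """Hypothetical helper: record SAX events, merging adjacent character chunks."""

    def __init__(self):
        super().__init__()
        self.events = []

    def startElement(self, name, attrs):
        self.events.append(("start element", name))

    def endElement(self, name):
        self.events.append(("end element", name))

    def characters(self, content):
        # A parser may deliver one run of text in several pieces,
        # so glue consecutive character events back together.
        if self.events and self.events[-1][0] == "characters":
            self.events[-1] = ("characters", self.events[-1][1] + content)
        else:
            self.events.append(("characters", content))


def sax_events(document):
    """Parse a document string and return the recorded event list."""
    handler = EventRecorder()
    xml.sax.parseString(document.encode("utf-8"), handler)
    return handler.events


# Assumed sample documents: the same text written two ways.
WITH_CDATA = "<example><![CDATA[<sample>text</sample>]]></example>"
WITH_ESCAPES = "<example>&lt;sample&gt;text&lt;/sample&gt;</example>"

if __name__ == "__main__":
    for kind, value in sax_events(WITH_CDATA):
        print(f"{kind}: {value}")
    # Both physical forms should yield the same logical events.
    print(sax_events(WITH_CDATA) == sax_events(WITH_ESCAPES))
```

The handler never learns whether the text came from a CDATA section or
from escaped character data, which is the behaviour described above for
an ordinary (non-authoring) parser.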

All the best,


David Megginson                 ak117 at
Microstar Software Ltd.         dmeggins at

