Normalization of attribute values

Toby Speight tms at
Thu Aug 6 14:22:53 BST 1998

Peter> Peter Jones <URL:mailto:peterj at>

0> In article
0> <c=GB%a=_%p=Wrox_Press%l=WROX2-980806114656Z-578 at>,
0> Peter wrote:

Peter> Can you tell me whether it is still the case that XML parsers
Peter> normalize the data passed to a surrounding application?  Or are
Peter> there smarter parsers out there now?

What do you mean by "normalise"?  Do you mean that whitespace is
collapsed/removed from IDREFS, NMTOKENS, and ENTITIES values?  Do you
mean that character and entity references are replaced?  Or something

In the first case, it's obviously not possible unless the parser reads
the DTD (if not, then all attributes are treated as CDATA).

[I'm assuming that by "data" you mean attribute values, since you
mention that in your Subject.  "data" might be a bad choice of term in
this case, as to many - particularly to those who use DSSSL -it means
all the character content of an element.]


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as:
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list