Normalization of attribute values

Toby Speight tms at ansa.co.uk
Thu Aug 6 14:22:53 BST 1998


Peter> Peter Jones <URL:mailto:peterj at wrox.com>

0> In article
0> <c=GB%a=_%p=Wrox_Press%l=WROX2-980806114656Z-578 at mail.wrox.co.uk>,
0> Peter wrote:

Peter> Can you tell me whether it is still the case that XML parsers
Peter> normalize the data passed to a surrounding application?  Or are
Peter> there smarter parsers out there now?

What do you mean by "normalise"?  Do you mean that whitespace is
collapsed/removed from IDREFS, NMTOKENS, and ENTITIES values?  Do you
mean that character and entity references are replaced?  Or something
different?

In the first case, that's only possible if the parser reads the DTD.
A parser that doesn't read it has no declared attribute types to go on,
so it must treat every attribute as CDATA.
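
To make the first case concrete, here is a rough sketch in Python of the
whitespace rule as I read it in the XML 1.0 Recommendation (section 3.3.3).
The function name normalize_attribute and its declared_type parameter are
made up for illustration and are not taken from any parser's API. The sketch
also ignores character and entity references, which a real parser expands
as part of the same step.

```python
import re

def normalize_attribute(raw, declared_type="CDATA"):
    """Hypothetical helper: whitespace normalization of one attribute value.

    Simplified sketch: references (&#...; and &name;) are NOT handled here.
    """
    # Step 1 (all types): each literal tab or line break becomes one space.
    # A CR LF pair counts as a single line break.
    value = re.sub(r"\r\n|[\t\n\r]", " ", raw)
    if declared_type == "CDATA":
        # No DTD read, or attribute declared CDATA: stop here.
        return value
    # Step 2 (tokenized types such as IDREFS, NMTOKENS, ENTITIES):
    # collapse runs of spaces and drop leading/trailing spaces.
    return re.sub(r" +", " ", value).strip(" ")

raw = "  a \t b\n c "
print(repr(normalize_attribute(raw, "IDREFS")))  # 'a b c'
print(repr(normalize_attribute(raw)))            # '  a   b  c '
```

The second call shows what an application sees when the parser has not read
the DTD: the value keeps its spacing, because the parser cannot know the
attribute was meant to be a list of tokens.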

[I'm assuming that by "data" you mean attribute values, since you
mention that in your Subject.  "data" might be a bad choice of term in
this case, as to many - particularly to those who use DSSSL - it means
all the character content of an element.]

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)



