Control Characters

MURATA Makoto murata at apsdc.ksp.fujixerox.co.jp
Thu Feb 4 08:05:19 GMT 1999


>Why are the control characters x80-x9F allowed in XML character data, while
>x0-x8,xB,xC,xE-x1F are illegal? Is it that the illegals have meanings that
>XML does not support? Just wondering.


I am afraid that you are right.  XML should have disallowed C0 control codes 
and C1 control codes except CR, LF, and HT, since the Unicode standard does 
not define semantics of these control codes.  U+007F should have been 
disallowed as well.  However, I do not think that this causes practical 
problems.

Cheers,

Makoto
 
Fuji Xerox Information Systems
 
Tel: +81-44-812-7230   Fax: +81-44-812-7231
E-mail: murata at apsdc.ksp.fujixerox.co.jp

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list