Control Characters

MURATA Makoto murata at
Thu Feb 4 08:05:19 GMT 1999

>Why are the control characters x80-x9F allowed in XML character data, while
>x0-x8,xB,xC,xE-x1F are illegal? Is it that the illegals have meanings that
>XML does not support? Just wondering.

I am afraid that you are right.  XML should have disallowed C0 control codes 
and C1 control codes except CR, LF, and HT, since the Unicode standard does 
not define semantics of these control codes.  U+007F should have been 
disallowed as well.  However, I do not think that this causes practical 
problems.
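
For reference, the ranges under discussion can be checked mechanically.  The 
following is only a minimal Python sketch, assuming the Char production of 
XML 1.0 (#x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | 
[#x10000-#x10FFFF]).  The names is_xml_char, is_control and allowed_controls 
are hypothetical, chosen here for illustration, and are not part of any 
XML library.

```python
def is_xml_char(cp: int) -> bool:
    """Return True if code point cp matches the XML 1.0 Char production."""
    return (
        cp in (0x9, 0xA, 0xD)          # HT, LF, CR
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def is_control(cp: int) -> bool:
    """Return True for C0 controls, DEL (U+007F), and C1 controls."""
    return 0x0 <= cp <= 0x1F or 0x7F <= cp <= 0x9F


# Control code points that XML 1.0 nevertheless accepts as character data
# (assumption: "control" means C0, DEL, and C1 as defined above).
allowed_controls = [
    cp for cp in range(0xA0) if is_control(cp) and is_xml_char(cp)
]
```

Under these assumptions, allowed_controls holds HT, LF, CR, U+007F, and all 
of U+0080 through U+009F, while every other C0 control is rejected.  That is 
the asymmetry the question points out.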


Fuji Xerox Information Systems
Tel: +81-44-812-7230   Fax: +81-44-812-7231
E-mail: murata at

