Character Encoding and the XML PR (was Re: PR.xml)

David Megginson ak117 at freenet.carleton.ca
Sat Jan 17 12:08:19 GMT 1998


James Clark writes:

 > Are you saying that Java's 16-bit characters prevent complete support
 > for some of those encodings in an XML parser?  If so, I don't see why,
 > since XML doesn't allow characters >= 0x110000, all legal XML characters
 > are representable in UTF-16 and hence in Java.

Quite right, I wasn't connecting the two -- Java supports UCS-4 only
to the extent allowed by surrogates in UTF-16, but that's the limit in
XML as well, so there should be no problem (at least, not until
Unicode starts assigning codes >= 0x110000, in which case the problem
will be both Java's and XML's).


All the best,


David

-- 
David Megginson                 ak117 at freenet.carleton.ca
Microstar Software Ltd.         dmeggins at microstar.com
      http://home.sprynet.com/sprynet/dmeggins/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list