Character Encoding and the XML PR (was Re: PR.xml)

James Clark jjc at jclark.com
Sat Jan 17 04:36:07 GMT 1998


David Megginson wrote:

> AElfred accepts the following encodings, and to my
> knowledge, supports them completely and correctly to the extent
> allowed by Java's 16-bit characters and by surrogates:
> 
> - UTF-8
> - ISO-10646-UCS-2 (both byte orders)
> - ISO-10646-UCS-4 (four byte orders)
> - UTF-16
> - ISO-8859-1

Are you saying that Java's 16-bit characters prevent complete support
for some of those encodings in an XML parser?  If so, I don't see why.
XML doesn't allow characters >= 0x110000, so all legal XML characters
are representable in UTF-16 and hence in Java.
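
To illustrate the arithmetic behind that claim, here is a minimal sketch
of how any code point below 0x110000 fits into one or two 16-bit Java
chars. The class and method names (SurrogateSketch, toUtf16,
fromSurrogates) are made up for this example and are not taken from
AElfred or any other parser; it only shows the standard UTF-16
surrogate calculation.

```java
public class SurrogateSketch {
    // Highest code point XML allows; also the highest UTF-16 can reach.
    public static final int MAX_CODE_POINT = 0x10FFFF;

    // Encode one code point as one or two 16-bit chars.
    // Note: this checks only the range, not whether the value is a
    // legal XML character (e.g. it does not reject 0xD800-0xDFFF).
    public static char[] toUtf16(int codePoint) {
        if (codePoint < 0 || codePoint > MAX_CODE_POINT) {
            throw new IllegalArgumentException(
                "not representable in UTF-16: " + codePoint);
        }
        if (codePoint < 0x10000) {
            // Fits directly in a single 16-bit char.
            return new char[] { (char) codePoint };
        }
        int offset = codePoint - 0x10000;               // at most 20 bits
        char high = (char) (0xD800 + (offset >> 10));   // top 10 bits
        char low = (char) (0xDC00 + (offset & 0x3FF));  // bottom 10 bits
        return new char[] { high, low };
    }

    // Recombine a high/low surrogate pair into the original code point.
    public static int fromSurrogates(char high, char low) {
        return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00);
    }

    public static void main(String[] args) {
        char[] pair = toUtf16(MAX_CODE_POINT);
        // prints "U+10FFFF -> DBFF DFFF"
        System.out.printf("U+%X -> %04X %04X%n",
            MAX_CODE_POINT, (int) pair[0], (int) pair[1]);
    }
}
```

The largest offset, 0x10FFFF - 0x10000 = 0xFFFFF, is exactly 20 bits,
which is what two 10-bit surrogate halves can carry, so nothing in the
legal range is left out.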

James


