UTF-8 vs UTF-16...? (Was: Feeling good about SML)
jtauber at jtauber.com
Wed Nov 17 11:04:42 GMT 1999
> Er... AFAIK you can represent the exactly the same characters in UTF-8
> and UTF-16, since both are transformation formats of the full 32 bit
> Unicode. Maybe I'm missing something here...?
I don't think this is true.
UTF-8 and UTF-16 are transformations formats of ISO10646, not Unicode.
UTF-8 gives access to all 2^31 characters from the UCS repertoire.
UTF-16, however, only gives access to the equivalent of Unicode with the
surrogate extension mechanism, ie the first 17 planes of the UCS.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev