UTF-8 vs UTF-16...? (Was: Feeling good about SML)

MURATA Makoto murata.makoto at fujixerox.co.jp
Mon Nov 22 10:19:30 GMT 1999

Tim Bray writes:
>In terms of actual usability, there's effectively no difference between
>UTF-16 and UTF-8.  UTF-16 seems to be an easier sell in Japan for reasons
>that I've not fully understood.

Many Japanese prefer UTF-8 to UTF-16, but I don't.

I prefer UTF-16, since XML documents in legacy encodings never parse 
as UTF-16 and those in UTF-16 never parse as legacy encodings.  On the 
other hand, they may parse as UTF-8 and lead to corrupted outputs.

As Tim knows very well, UTF-16 has a number of problems about byte 
ordering.  On the other hand, UTF-8 it not free from such problems.
UTF-8 from Microsoft appears to begin with the zero-width non-breaking 
space always ;-(

Fuji Xerox Information Systems
Tel: +81-44-812-7230   Fax: +81-44-812-7231
E-mail: murata.makoto at fujixerox.co.jp

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list