UTF-8 vs UTF-16...? (Was: Feeling good about SML)

Eric Bohlman ebohlman at netcom.com
Wed Nov 17 17:43:17 GMT 1999

On 17 Nov 1999, Steinar Bang wrote:

> >>>>> "Don Park" <donpark at docuverse.com>:
> > I was born in Korea so I appreciate your position.  However, your
> > opposition of SML is based on the assumption that SML will only
> > support UTF-8.  The list of SML features is preliminary and,
> > personally, I do not mind seeing SML support both UTF-8 and UTF-16
> > although not necessarily at the same time since the application
> > might require either encoding but not both.
> Er... AFAIK you can represent exactly the same characters in UTF-8
> and UTF-16, since both are transformation formats of the full 32-bit
> Unicode.  Maybe I'm missing something here...?

I think his point was that UTF-8 imposes a 50% size penalty over UTF-16
for typical CJKV text: most CJKV characters take three bytes in UTF-8
but only two in UTF-16.
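
To make the arithmetic concrete, here is a rough Python sketch that
compares the encoded sizes of a short string. The function name
size_penalty and the sample strings are my own illustrative choices, not
anything from SML or the XML spec, and the figure only holds for text
made up entirely of BMP CJKV characters; markup-heavy documents with lots
of ASCII tags will come out differently.

```python
def size_penalty(text: str) -> float:
    """Relative size of UTF-8 over UTF-16 for `text` (0.5 means 50% larger).

    Hypothetical helper for illustration only. Uses "utf-16-be" so that
    no byte order mark is counted in the UTF-16 length.
    """
    utf8_len = len(text.encode("utf-8"))
    utf16_len = len(text.encode("utf-16-be"))
    return (utf8_len - utf16_len) / utf16_len


# Hangul syllables: 3 bytes each in UTF-8, 2 bytes each in UTF-16.
korean = "\ud55c\uad6d\uc5b4"  # the three syllables of "Korean language"

# Plain ASCII goes the other way: 1 byte in UTF-8, 2 bytes in UTF-16.
ascii_text = "xml"

print(size_penalty(korean))      # 0.5
print(size_penalty(ascii_text))  # -0.5
```

So the penalty cuts both ways: pure CJKV content is half again as large
in UTF-8, while pure ASCII content is half the size.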

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)