Mix encodings in a document?

Michael Kay M.H.Kay at eng.icl.co.uk
Wed Sep 23 18:04:25 BST 1998


Jerome McDonough wrote:
>
>ISO-10646-UCS-2 (the 2-octet Basic Multilingual Plane) is the
>same as Unicode (which is a 16-bit character encoding), so
>that would be your "UTF-16." (I don't think that, technically,
>the 16-bit encoding gets referred to as a UCS Transmission
>Format).
>
No. UTF-16 is an encoding of ISO 10646 that uses a single
16-bit unit for each character in the Basic Multilingual
Plane (BMP, equivalent to Unicode) and a pair of 16-bit
units for each character outside the BMP. It is thus a
pure superset of UCS-2 or Unicode. See
http://osiris.dkuug.dk/jtc1/sc2/wg2/docs/N1334.html
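
To make the difference concrete, here is a small sketch in Python
(not part of the original discussion; the function names
utf16_units and surrogate_pair are made up for illustration). It
shows a BMP character taking one 16-bit unit and a character
outside the BMP taking two, and works the pair out by hand
assuming the usual surrogate arithmetic (subtract 0x10000, split
the remaining 20 bits into two 10-bit halves).

```python
def utf16_units(ch):
    """Return the 16-bit code units that encode ch in UTF-16."""
    data = ch.encode("utf-16-be")  # big-endian, no byte order mark
    return [int.from_bytes(data[i:i + 2], "big")
            for i in range(0, len(data), 2)]


def surrogate_pair(code_point):
    """Compute by hand the two units for a code point above U+FFFF.

    Hypothetical helper for illustration only; it does not check
    that code_point is actually outside the BMP.
    """
    offset = code_point - 0x10000           # 20 bits remain
    high = 0xD800 + (offset >> 10)          # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)         # bottom 10 bits
    return high, low


# A BMP character: one unit, identical to its UCS-2 value.
bmp_units = utf16_units("\u00e9")

# A character outside the BMP: two units, which UCS-2 cannot express.
beyond_units = utf16_units("\U0001D11E")
```

For text that stays inside the BMP the two encodings produce the
same 16-bit values, which is why UTF-16 can be read as a superset
of UCS-2 rather than a different scheme.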

Mike Kay



