Mix encodings in a document?
John Cowan
cowan at locke.ccil.org
Wed Sep 23 22:23:20 BST 1998
Michael Kay wrote:
> No. UTF-16 is an encoding of ISO 10646 that uses 16 bits to
> represent the characters in the Basic MultiLingual Plane
> (BMP, equivalent to Unicode) and longer sequences to
> represent characters outside the BMP. It is thus a pure
> superset of UCS-2 or Unicode. See
> http://osiris.dkuug.dk/jtc1/sc2/wg2/docs/N1334.html
Almost. Unicode = UTF-16; Unicode applications are not
allowed to support only the BMP, although there are no
characters on the Astral Planes yet.
--
John Cowan http://www.ccil.org/~cowan cowan at ccil.org
You tollerday donsk? N. You tolkatiff scowegian? Nn.
You spigotty anglease? Nnn. You phonio saxo? Nnnn.
Clear all so! 'Tis a Jute.... (Finnegans Wake 16.5)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list