Character encodings
John Cowan
cowan at locke.ccil.org
Mon Dec 7 21:06:40 GMT 1998
Chris von See scripsit:
> In section 4.3.3, the XML spec implies
> that support of ISO 10646 UCS-2 encoding (i.e. Unicode)
Unicode = UTF-16, not UCS-2. In practice there is no difference
at the moment, because no 10646 planes other than the BMP (plane 0)
contain any characters. UTF-16 (= Unicode) reaches planes 1 through 16
by means of surrogate pairs, whereas UCS-2 simply cannot represent the
Astral Planes. Therefore, UTF-16 encoding should be used instead of UCS-2.
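
To make the difference concrete, here is a minimal sketch in Python of how
UTF-16 splits a character outside the BMP into two 16-bit code units. The
function name to_utf16_units is purely illustrative and not taken from any
library. For BMP characters the result is a single unit, identical to UCS-2.

```python
from typing import List


def to_utf16_units(code_point: int) -> List[int]:
    """Return the UTF-16 code units for one code point (illustrative helper)."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the range UTF-16 can address")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate values are reserved, not characters")
    if code_point < 0x10000:
        # BMP (plane 0): one 16-bit unit, the same value UCS-2 would use.
        return [code_point]
    # Planes 1-16: subtract 0x10000, leaving a 20-bit offset to split in two.
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)    # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits -> low surrogate
    return [high, low]
```

A UCS-2-only processor has no way to produce or interpret the two-unit case,
which is the whole of the distinction drawn above.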
--
John Cowan http://www.ccil.org/~cowan cowan at ccil.org
You tollerday donsk? N. You tolkatiff scowegian? Nn.
You spigotty anglease? Nnn. You phonio saxo? Nnnn.
Clear all so! 'Tis a Jute.... (Finnegans Wake 16.5)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)