Mix encodings in a document?

Chris Olds colds at nwlink.com
Thu Sep 24 00:57:24 BST 1998

Tim Bray said:
>At 04:23 PM 9/23/98 -0400, John Cowan wrote:
>>Almost.  Unicode = UTF-16; Unicode applications are not
>>allowed to support only the BMP, although there are no
>>characters on the Astral Planes yet.
>I've been told that the geniuses in charge have blessed a whole
>bunch of language tagging characters on plane 14.  Anyone have
>a confirmation of this? -Tim

Yes, this is true.  It is not (yet) part of the full standard, but it is
"provided as information and guidance to implementers".  Details at

Additionally, there is a document that shows what characters and scripts are
"in the pipeline", which includes several scripts (Linear B, Etruscan,
Gothic, Western Musical Notation, etc.) that have or are expected to be
allocated space in Plane 1.  UCS-2 is history.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list