XML and special Characters : unicode v3.0 ?

John Cowan cowan at locke.ccil.org
Wed Mar 3 15:17:20 GMT 1999

Baden Hughes wrote:

> Uh, that's gonna be a problem. How would you put in a PUA character in an
> XML doc ?  Still by the U+... ? (we have around 800 of them for the languages
> we work with !!)

Well, first of all there are 6400 private-use characters on the BMP,
so that gives you plenty of room to play with.  You cannot use
any kind of private-use character in element or attribute names,
which is good for interoperability; to incorporate them in
character data or attribute values, use a character reference
like .

What will be more serious is that *normative* characters from the
Astral Planes aren't usable in XML names either.  Presumably,
when they actually show up, XML will be modified, so that we can
have element names in Egyptian hieroglyphics with attributes in

John Cowan	http://www.ccil.org/~cowan		cowan at ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list