SDATA or UNICODE

Tim Bray tbray at textuality.com
Wed Jan 28 22:11:46 GMT 1998


At 04:10 PM 28/01/98 -0500, Gavin McKenzie wrote:
>On this issue of accessing characters that aren't in Unicode...
>
>XML provides a way for specifying the encoding of an entity with the
>?XML pi encoding declaration.  Why wouldn't this be sufficient?  If the
>euro or florin symbol is available in some non-Unicode character
>encoding scheme

Good idea, but it doesn't quite work.  XML is very rigid in saying
that every character in a document has to be a Unicode character (which
the Euro is quickly becoming).  Take, for example, the symbol currently
used as a name by The Artist Formerly Known As Prince.  Even if I have
an encoding in which this symbol is available, say at code point 12352,
that doesn't make it a Unicode character, or make it usable in XML.

Non-Unicode *encodings* are OK (e.g. ASCII).  Non-Unicode *characters*
aren't. -T.
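
The distinction between encodings and characters can be sketched roughly
as follows, in Python.  This is only an illustration under stated
assumptions: the LEGACY_TO_UNICODE table and the decode_legacy and
parse_latin1_example helpers are hypothetical names invented here, and
the table stands in for whatever private encoding carries the symbol at
code point 12352.  The first helper shows that a code point with no
Unicode counterpart cannot be turned into document text; the second shows
that a document stored in a non-Unicode encoding (ISO-8859-1) parses
fine, because each of its bytes maps to a Unicode character.

```python
import xml.etree.ElementTree as ET

# Hypothetical legacy encoding, invented for this sketch: it maps its own
# code points to Unicode characters. None marks a symbol assumed to have no
# Unicode equivalent (the 12352 example from the message).
LEGACY_TO_UNICODE = {
    65: "A",
    66: "B",
    12352: None,
}


def decode_legacy(code_points):
    """Translate legacy code points to a Unicode string, or fail loudly."""
    chars = []
    for cp in code_points:
        ch = LEGACY_TO_UNICODE.get(cp)
        if ch is None:
            # Unknown or unmapped: there is no Unicode character to emit,
            # so there is nothing that could legally appear in XML.
            raise ValueError(f"code point {cp} has no Unicode character")
        chars.append(ch)
    return "".join(chars)


def parse_latin1_example():
    """Parse a document stored in ISO-8859-1, a non-Unicode encoding."""
    # Byte 0xE9 is e-acute in ISO-8859-1; the parser maps it to U+00E9.
    raw = b'<?xml version="1.0" encoding="ISO-8859-1"?><p>caf\xe9</p>'
    return ET.fromstring(raw).text
```

Calling decode_legacy([65, 66]) yields ordinary text, while
decode_legacy([12352]) raises ValueError: the encoding declaration only
tells the parser how to turn bytes into Unicode characters, so it cannot
help with a symbol that has no Unicode character to turn into.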

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)



