SDATA or UNICODE

Chris Maden crism at ora.com
Tue Jan 27 15:42:46 GMT 1998


[Martin Bryan]
> The official entity name is € - its definition will have the
> form:
> 
> <!ENTITY euro "&#128;">
> 
> if running on a Microsoft system as Microsoft have assigned Hex 80
> as the codepoint.

NO NO NO NO NO NO NO NO!  Martin, you should know better!

XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.
XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.
XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.
XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.
XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.
XML DECLARATIONS DO NOT DEPEND ON THE PLATFORM.

This is *why* ISO/IEC 10646 is the document character set.  The number
in a numeric character reference is ALWAYS ALWAYS ALWAYS to 10646.
NEVER TO THE PLATFORM.

This has caused me enough headaches with bad HTML browser
implementations, and I have *no* care to repeat it.

> TC304 have assigned it to Hexadecimal B1 in ISO 8859, and its ISO
> 10646 code point is 20AC, so the following should be the formal XML
> reference to it:
> 
> <!ENTITY euro &#X20ac; >

This is (almost) correct (needs quotes).  Also correct would be

<!ENTITY euro "&#8364;">

(if I converted correctly).

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list