Combining characters/Extenders, etc. in Unicode

Tim Bray tbray at
Mon Aug 9 19:59:13 BST 1999

At 12:56 PM 8/9/99 -0400, Paul W. Abrahams wrote:
>The XML spec defines certain character classes (in Appendix B) that are
>derived from the Unicode standard.   Could someone explain what is meant
>by Ideographics, Combining Characters, and Extenders?  I'd like to get
>this information without having to buy a whole book on Unicode, but so
>far I haven't been able to find it.

Advice: buy a whole book on Unicode - the Unicode standard.  It's an
excellent piece of work and really essential if you're going to be
doing any work with these things.   If it's not in your local
bookstore you can buy it from their website at

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list