XML and special Characters : unicode v3.0 ?

John Cowan cowan at locke.ccil.org
Mon Mar 1 19:43:07 GMT 1999

Tim Bray wrote:

> So... it seems that in UTF8,
> a ZWNBSP as first character in the file isn't a data character.

Can you quote chapter and verse for this, either Unicode or 10646?
The latter spec tells you that the sequence EF BB BF may be used as
a *signature* at the beginning of UTF-8 data (since it is unlikely
to occur in any other kind), but does not IMHO imply that the
sequence is removable or doesn't represent a real ZWNBSP.

John Cowan	http://www.ccil.org/~cowan		cowan at ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list