ANNOUNCE: xml encoding detector in C

John Cowan cowan at
Fri Apr 23 16:12:13 BST 1999

Paul Langer wrote:

> One remark on the EBCDIC handling:
> Your program returns "EBCDIC-CP-US" if it detects EBCDIC
> without an explicit encoding declaration (see comment:
>  /* better than nothing */).
> I do not think that this behaviour is "better than nothing".

In Java I could throw an error, but C doesn't have exception
handling, and I figure a server would rather return something
than nothing.  The routine is not meant to handle ill-formed
XML, and will return one of the other defaults ("UTF-8",
"UTF-16-BE", "UTF-16-LE") depending on just what bytes it sees.

> And if you want to define a default, what makes "EBCDIC-CP-US"
> more desirable than e.g. "ebcdic-cp-is"?


Thanks for taking the trouble to look at it.

John Cowan		cowan at
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list