ANNOUNCE: xml encoding detector in C

John Cowan cowan at locke.ccil.org
Fri Apr 23 16:12:13 BST 1999


Paul Langer wrote:

> One remark on the EBCDIC handling:
> 
> Your program returns "EBCDIC-CP-US" if it detects EBCDIC
> without an explicit encoding declaration (see comment:
>  /* better than nothing */).
> 
> I do not think that this behaviour is "better than nothing".

In Java I could throw an error, but C doesn't have exception
handling, and I figure a server would rather return something
than nothing.  The routine is not meant to handle ill-formed
XML, and will return one of the other defaults ("UTF-8",
"UTF-16-BE", "UTF-16-LE") depending on just what bytes it sees.

> And if you want to define a default, what makes "EBCDIC-CP-US"
> more desirable than e.g. "ebcdic-cp-is"?

Nothing.

Thanks for taking the trouble to look at it.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan at ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list