ANNOUNCE: xml encoding detector in C

Matthew Sergeant (EML) Matthew.Sergeant at
Fri Apr 23 16:42:13 BST 1999

> -----Original Message-----
> From:	John Cowan [SMTP:cowan at]
> Paul Langer wrote:
> > One remark on the EBCDIC handling:
> > 
> > Your program returns "EBCDIC-CP-US" if it detects EBCDIC
> > without an explicit encoding declaration (see comment:
> >  /* better than nothing */).
> > 
> > I do not think that this behaviour is "better than nothing".
>
> In Java I could throw an error, but C doesn't have exception
> handling, and I figure a server would rather return something
> than nothing.  The routine is not meant to handle ill-formed
> XML, and will return one of the other defaults ("UTF-8",
> "UTF-16-BE", "UTF-16-LE") depending on just what bytes it sees.
	If this gets turned into an Apache module, it would be better to
return DECLINED and let the next MIME sniffer module handle it. That's what
I do in my Apache::MimeXML. I don't think it's a good idea to make
assumptions about invalid XML; just return an error code.
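
	A minimal sketch of the "return an error code" approach in C, for
illustration only. It is not the posted detector: the function name, the
SNIFF_* codes and the `declared` parameter (the encoding name from the XML
declaration, or NULL if the caller found none) are all hypothetical. The
byte patterns follow the autodetection table in Appendix F of XML 1.0, and
the encoding names are the ones used in this thread. UCS-4 and leading
whitespace are not handled.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical status codes, not taken from the posted detector. */
#define SNIFF_OK        0    /* *name was set */
#define SNIFF_DECLINED  (-1) /* no safe answer; caller decides what to do */

/*
 * Guess the encoding of an XML entity from its first four bytes.
 * `declared` is the name from the encoding declaration, or NULL if none
 * was found (parsing the declaration is left to the caller here).
 */
int sniff_xml_encoding(const unsigned char *buf, size_t len,
                       const char *declared, const char **name)
{
    static const unsigned char utf8_bom[] = { 0xEF, 0xBB, 0xBF };
    static const unsigned char utf16be[]  = { 0x00, 0x3C, 0x00, 0x3F };
    static const unsigned char utf16le[]  = { 0x3C, 0x00, 0x3F, 0x00 };
    static const unsigned char ebcdic[]   = { 0x4C, 0x6F, 0xA7, 0x94 };

    *name = NULL;
    if (buf == NULL || len < 4)
        return SNIFF_DECLINED;

    /* Byte order marks are unambiguous. */
    if (buf[0] == 0xFE && buf[1] == 0xFF) { *name = "UTF-16-BE"; return SNIFF_OK; }
    if (buf[0] == 0xFF && buf[1] == 0xFE) { *name = "UTF-16-LE"; return SNIFF_OK; }
    if (memcmp(buf, utf8_bom, sizeof utf8_bom) == 0) { *name = "UTF-8"; return SNIFF_OK; }

    /* "<?" in UTF-16 without a byte order mark. */
    if (memcmp(buf, utf16be, sizeof utf16be) == 0) { *name = "UTF-16-BE"; return SNIFF_OK; }
    if (memcmp(buf, utf16le, sizeof utf16le) == 0) { *name = "UTF-16-LE"; return SNIFF_OK; }

    /* "<?xm" in some EBCDIC code page: only the declaration can say which
       one, so decline instead of guessing when there is none. */
    if (memcmp(buf, ebcdic, sizeof ebcdic) == 0) {
        if (declared == NULL)
            return SNIFF_DECLINED;
        *name = declared;
        return SNIFF_OK;
    }

    /* '<' followed by a non-zero byte: an ASCII-compatible encoding.
       UTF-8 is the default when nothing is declared. */
    if (buf[0] == 0x3C && buf[1] != 0x00) {
        *name = declared ? declared : "UTF-8";
        return SNIFF_OK;
    }

    return SNIFF_DECLINED; /* does not look like the start of XML */
}
```

	In an Apache handler, SNIFF_DECLINED could simply be mapped to
DECLINED so that the next module gets a chance at the file.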

	Just my 2p


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at
