ANNOUNCE: xml encoding detector in C

Matthew Sergeant (EML) Matthew.Sergeant at eml.ericsson.se
Fri Apr 23 16:42:13 BST 1999


> -----Original Message-----
> From:	John Cowan [SMTP:cowan at locke.ccil.org]
> 
> Paul Langer wrote:
> 
> > One remark on the EBCDIC handling:
> > 
> > Your program returns "EBCDIC-CP-US" if it detects EBCDIC
> > without an explicit encoding declaration (see comment:
> >  /* better than nothing */).
> > 
> > I do not think that this behaviour is "better than nothing".
> 
> In Java I could throw an error, but C doesn't have exception
> handling, and I figure a server would rather return something
> than nothing.  The routine is not meant to handle ill-formed
> XML, and will return one of the other defaults ("UTF-8",
> "UTF-16-BE", "UTF-16-LE") depending on just what bytes it sees.
> 
	If this gets turned into an Apache module it would be better to
return DECLINED and let the next mime sniffer module handle it - that's what
I do in my Apache::MimeXML. I don't think it's a good idea to make
assumptions for invalid XML - just return an error code.

	Just my 2p

	Matt.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list