ANNOUNCE: XML encoding detector in C

John Cowan cowan at
Fri Apr 23 00:10:59 BST 1999

I have written an XML encoding detector function in C.
It would be easy to translate to Java, but I thought that
C would be useful in the widest range of contexts.  It uses
only Standard C facilities.

There is a subroutine called "xmlenc" which accepts a FILE*
argument and returns a pointer to a static string naming the encoding.
I believe it handles all the cases in Appendix F of the XML 1.0
Recommendation correctly, including the EBCDIC one.
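
For readers who have not looked at Appendix F, the idea can be sketched
roughly as follows.  This is not the posted xmlenc source, only an
illustration under assumptions: the names guess_family and sketch_xmlenc
and the returned labels are made up here, and the table covers just the
leading-byte patterns of Appendix F as it stood in the 1998 Recommendation.
It does not go on to read the encoding declaration, which a real detector
would need to do for the ASCII-compatible and EBCDIC families.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, not the posted xmlenc: classify the first bytes
   of an XML entity by the byte patterns listed in Appendix F.
   The returned labels are illustrative, not official encoding names. */
static const char *guess_family(const unsigned char *b, size_t n)
{
    static const struct {
        unsigned char sig[4];
        size_t len;
        const char *name;
    } table[] = {
        {{0x00, 0x00, 0x00, 0x3C}, 4, "UCS-4BE"},          /* '<' in UCS-4, order 1234 */
        {{0x3C, 0x00, 0x00, 0x00}, 4, "UCS-4LE"},          /* order 4321 */
        {{0x00, 0x00, 0x3C, 0x00}, 4, "UCS-4-2143"},       /* unusual octet order */
        {{0x00, 0x3C, 0x00, 0x00}, 4, "UCS-4-3412"},       /* unusual octet order */
        {{0x00, 0x3C, 0x00, 0x3F}, 4, "UTF-16BE"},         /* "<?" with no byte order mark */
        {{0x3C, 0x00, 0x3F, 0x00}, 4, "UTF-16LE"},         /* "<?" with no byte order mark */
        {{0x3C, 0x3F, 0x78, 0x6D}, 4, "ASCII-compatible"}, /* "<?xm": read the declaration */
        {{0x4C, 0x6F, 0xA7, 0x94}, 4, "EBCDIC"},           /* "<?xm" in EBCDIC */
        {{0xFE, 0xFF}, 2, "UTF-16BE"},                     /* byte order mark */
        {{0xFF, 0xFE}, 2, "UTF-16LE"}                      /* byte order mark */
    };
    size_t i;

    for (i = 0; i < sizeof table / sizeof table[0]; i++) {
        if (n >= table[i].len && memcmp(b, table[i].sig, table[i].len) == 0)
            return table[i].name;
    }
    return "UTF-8"; /* no recognizable pattern: Appendix F falls back to UTF-8 */
}

/* Hypothetical FILE* wrapper.  It consumes up to four bytes from the
   stream, so a caller would have to rewind or reopen before parsing. */
const char *sketch_xmlenc(FILE *fp)
{
    unsigned char buf[4];
    size_t n;

    assert(fp != NULL);
    n = fread(buf, 1, sizeof buf, fp);
    return guess_family(buf, n);
}
```

A caller would open the file in binary mode ("rb") so that the leading
bytes reach the function untranslated.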

There is also a test-harness main program that can generate some
sample files in EBCDIC and 16-bit Unicode (8-bit ASCII-compatible
files are easy to find).  This part can be stripped out, as indicated
by the comments, in order to use the routine in some server program.
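
As an illustration of what generating a 16-bit sample involves, here is a
minimal sketch.  It is not taken from the posted test harness: the function
name ascii_to_utf16be is invented, and it assumes the host character set is
ASCII and that a big-endian sample with a leading byte order mark is wanted.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: encode a 7-bit ASCII string as big-endian UTF-16
   with a leading byte order mark (FE FF).  Returns the number of bytes
   written, or 0 if the output buffer is too small or a character
   outside the 7-bit range is found. */
size_t ascii_to_utf16be(const char *ascii, unsigned char *out, size_t cap)
{
    size_t len = 0, i, pos = 0;

    assert(ascii != NULL && out != NULL);
    while (ascii[len] != '\0')
        len++;
    if (cap < 2 + 2 * len)
        return 0;

    out[pos++] = 0xFE;              /* byte order mark, big-endian */
    out[pos++] = 0xFF;
    for (i = 0; i < len; i++) {
        unsigned char c = (unsigned char)ascii[i];
        if (c > 0x7F)
            return 0;               /* only 7-bit input is handled here */
        out[pos++] = 0x00;          /* high octet is zero for ASCII */
        out[pos++] = c;             /* low octet is the ASCII code */
    }
    return pos;
}
```

The resulting buffer can be written with fwrite to a stream opened in
"wb" mode to produce a sample file.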

No copyright, no warranty; I assert the moral right to be
known as the author.

Download from .

John Cowan		cowan at
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)
