ANNOUNCE: xml encoding detector in C

John Cowan cowan at locke.ccil.org
Fri Apr 23 00:10:59 BST 1999


I have written an XML encoding detector function in C.
It would be easy to translate it to Java, but I thought that
C would be the most useful in different contexts.  It uses
only Standard C facilities.

There is a subroutine called "xmlenc" which accepts a FILE*
argument and returns a (static) string representing the encoding.
I believe it handles all the cases in Appendix F correctly,
including the EBCDIC one.

There is also a test-harness main program that can generate some
sample files in EBCDIC and 16-bit Unicode (8-bit ASCII-compatible
files are easy to find).  This part can be stripped out, as indicated
by the comments, in order to use the routine in some server program.

No copyright, no warranty; I assert the moral right to be
known as the author.

Download from http://www.ccil.org/~cowan/XML/xmlenc.c .

-- 
John Cowan	http://www.ccil.org/~cowan		cowan at ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list