ANNOUNCE: xml encoding detector in C

Paul Langer Paul.Langer at softwareag.com
Fri Apr 23 12:09:43 BST 1999


At Friday, April 23, 1999 12:11 AM John Cowan wrote:

> I have written an XML encoding detector function in C.
> [snip]
> I believe it handles all the cases in Appendix F correctly,
> including the EBCDIC one.

One remark on the EBCDIC handling:

Your program returns "EBCDIC-CP-US" if it detects EBCDIC
without an explicit encoding declaration (see comment:
 /* better than nothing */).

I do not think that this behaviour is "better than nothing".
The XML spec says "Parsed entities which are stored in an encoding
other than UTF-8 or UTF-16 must begin with a text declaration containing
an encoding declaration"  (Chapter 4.3.3 Character Encoding in Entities,
see http://www.w3.org/TR/REC-xml#charencoding).

And if you want to define a default, what makes "EBCDIC-CP-US"
more desirable than e.g. "ebcdic-cp-is"?

All the best,
Paul

-----------------------------------------------------------
Paul Langer                           PL at softwareag.com
Software AG                           Tel. +49-6151-92-1912
Uhlandstr. 12                         Fax  +49-6151-92-1613
D-64297 Darmstadt 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list