multiple encoding specs (Re: IE5.0 does not conform to RFC2376)

John Cowan cowan at locke.ccil.org
Fri Apr 9 18:05:21 BST 1999


Rick Jelliffe wrote:

> Given that an XML processor may transcode the document without knowing
> the meanings of the elements (i.e., that the meta tag means something),
> the XML encoding has to have priority over the HTML meta tag value. And
> given that a proxies can transcode text/* files without knowing what
> kind of text it is (i.e., that it is XML, and so has a label), the MIME
> header has to have priority over the XML header PI. I think that is the
> logical order: generic operations must be allowed.

All extremely sound.

> However, it is all spoiled if there are systems which corrupt the
> labels: for example by rewriting the charset parameter incorrectly. It
> is far better to send the XML file without a charset parameter than to
> send it with a wrong one.

But there's the snag: in text/xml documents, a missing charset parameter
does not mean "Charset unspecified"; it means "Charset specified
as US-ASCII".  There is no way to fail to specify a charset in
text/* documents, and rightly so, because text without a charset
is uninterpretable.

In SGML terms, omitting the charset in text/* documents is a mere
minimization, whereas in application/* documents it is a true #IMPLIED.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan at ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list