Mix encodings in a document?

James Tauber jtauber at jtauber.com
Mon Sep 21 17:10:50 BST 1998

-----Original Message-----
From: Deke Smith <deke at tallent.com>

>I think I know the answer I am going to get, but I'll ask anyway.
>Within a single XML document, is it possible to have the text encoding
>change from element to element?

The way you've phrased the question, the answer is yes, but given your
examples, I suspect you are really asking whether is it possible to have the
text encoding change WITHIN A SINGLE ENTITY, in which case the answer is no.

There is nothing to stop you having

<?xml version="1.0"?>
<!ENTITY phrase2 SYSTEM "phrase2.xml">
<PHRASE encoding="ISO-8859-1" xml:lang="en">Hello!</PHRASE>

where phrase2.xml is

<?xml encoding="X-EUC-TW"?>
<PHRASE xml:lang="zh-TW"><!--chinese language text

This is within the one document (but two entities).

Mind you, your example:

<?xml version="1.0"?>
<PHRASE encoding="ISO-8859-1" xml:lang="en">Hello!</PHRASE>
<PHRASE encoding="X-EUC-TW" xml:lang="zh-TW"><!--chinese language text

won't cause any problems, simply because there is nothing special about the
attribute "encoding". Note that it's not valid but that isn't because of the
encoding attribute, it's simply because there isn't a DTD declaring the
elements and attributes used.

James Tauber / jtauber at jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list