Peter Murray-Rust peter at
Fri Jan 16 16:12:04 GMT 1998

At 10:43 16/01/98 -0500, David Megginson wrote:
>Peter Murray-Rust writes:
>The XML source for the PR is encoded in ISO-8859-1 but has no encoding
>declaration (so AElfred assumes UTF-8, and reports an encoding error,
>though not very helpfully, when it finds an invalid UTF-8 sequence).
>The WG is aware of the problem.
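
To make the quoted failure concrete, here is a minimal Python sketch (not AElfred's code; the sample word and the helper name `decodes_as_utf8` are made up for illustration). It shows that a character outside ASCII gets different bytes in the two encodings, and that the ISO-8859-1 bytes are rejected by a strict UTF-8 decoder.

```python
# Hypothetical sample text; the "é" is the only character outside ASCII.
text = "café"

latin1_bytes = text.encode("iso-8859-1")  # one byte for "é": 0xE9
utf8_bytes = text.encode("utf-8")         # two bytes for "é": 0xC3 0xA9


def decodes_as_utf8(data: bytes) -> bool:
    """Return True if data is a well-formed UTF-8 byte sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Pure ASCII text is byte-for-byte identical in both encodings,
# which is why the difference is easy to miss.
ascii_same = "cafe".encode("iso-8859-1") == "cafe".encode("utf-8")

print(decodes_as_utf8(latin1_bytes), decodes_as_utf8(utf8_bytes), ascii_same)
```

A document that is ISO-8859-1 but contains only ASCII characters would therefore parse without complaint as UTF-8; the error only shows up at the first accented character.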

Thanks. I am also aware of it now :-).  Can I make the assumption that:

	- ISO-8859-1 and UTF-8 look identical to not-very-experienced humans.
	- in principle I should be able to sort this by adding something like

<?xml version="1.0" encoding="ISO-8859-1"?>

	to the top of the document
	- in practice this fails because by the time the parser gets to the
encoding declaration it has already assumed the encoding is UTF-8 and has
crashed :-)
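
As a rough illustration of why the declaration can in principle be read before the encoding is known: the characters of the declaration itself are plain ASCII, so their bytes are the same in UTF-8 and ISO-8859-1. The Python sketch below relies on that. The names `declared_encoding` and `to_utf8` are hypothetical, the sample document is invented, and the code assumes an ASCII-compatible encoding (it does not handle UTF-16 or byte order marks).

```python
import re

# Matches encoding="..." or encoding='...' in an XML declaration at the very
# start of the data. Works on raw bytes because the declaration is ASCII.
_DECL = re.compile(
    rb'^<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def declared_encoding(data: bytes, default: str = "utf-8") -> str:
    """Return the encoding named in the XML declaration, else the default."""
    m = _DECL.match(data)
    return m.group(1).decode("ascii") if m else default


def to_utf8(data: bytes) -> bytes:
    """Re-encode a document as UTF-8 and update its declaration to match."""
    m = _DECL.match(data)
    if m is None:
        # No declaration: treat as UTF-8 (this raises if the bytes are not).
        return data.decode("utf-8").encode("utf-8")
    source_encoding = m.group(1).decode("ascii")
    # Swap the encoding name while the data is still bytes; the name is ASCII.
    relabelled = data[: m.start(1)] + b"UTF-8" + data[m.end(1):]
    return relabelled.decode(source_encoding).encode("utf-8")


# Hypothetical ISO-8859-1 document with one accented character (0xE9).
doc = b'<?xml version="1.0" encoding="ISO-8859-1"?>\n<p>caf\xe9</p>'
print(declared_encoding(doc))
print(to_utf8(doc))
```

Whether a given parser actually does this kind of look-ahead is a separate question; the sketch only shows that the bytes allow it.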

I am not quite clear why we have this problem. Do different tools emit
different encodings? If so, which should I work with? Can I convert this

	I know there have been lots of important discussions about encodings (which
I have not always read very carefully), so an authoritative statement from
a WG member would help at least one human :-)


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS, Virtual Hyperglossary

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as:
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at
