XHTML and character entities

Rick Jelliffe ricko at allette.com.au
Tue Mar 30 04:45:54 BST 1999

From: Gabe Beged-Dov <begeddov at jfinity.com>

>I mention tidy below but am asking about html->xhtml conversion in
>I use tidy to to convert html to xhtml using the -asxml switch. The
>result of many conversions is still not accepted as well-formed because
>entities like agrave and friends aren't defined unless you process the
>Wouldn't it be reasonable to convert these to character entities as
>of the html->xhtml process?

With tidy, you have to be a little creative with the switches. For
example, to process Big5 text, we have to use "-latin1".

Certainly it is the expectation of some people that the entities for
special characters will disappear with XML, that people will use NCRs.
I am not sure about it.

Rick Jelliffe

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list