Converting HTML to well formed XML

Don Park donpark at
Wed Nov 11 02:43:26 GMT 1998

>Unfortunately, no one yet (so far as I know) has created
>a friendly one-step legacy HTML->well-formed XML
>syntax HTML converter.  That's a nice opportunity there for
>some publicity, if not necessarily $$$...

Docuverse HTML SDK can be used to build such a converter easily.  What it
contains is a SAX parser interface to Swing's HTML parser which means you
can use your XML tools on HTML documents.

However, Swing's HTML parser has mishandles unknown tags so it is not a
perfect solution.

You can find HTML SDK at


Don Park

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as:
To (un)subscribe, mailto:majordomo at the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list