Converting HTML to well formed XML

Don Park donpark at quake.net
Wed Nov 11 02:43:26 GMT 1998


>Unfortunately, no one yet (so far as I know) has created
>a friendly one-step legacy HTML->well-formed XML
>syntax HTML converter.  That's a nice opportunity there for
>some publicity, if not necessarily $$$...


Docuverse HTML SDK can be used to build such a converter easily.  What it
contains is a SAX parser interface to Swing's HTML parser which means you
can use your XML tools on HTML documents.

However, Swing's HTML parser has mishandles unknown tags so it is not a
perfect solution.

You can find HTML SDK at http://www.docuverse.com/htmlsdk

Best,

Don Park
Docuverse



xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list