HTML to XML

Michael Kay M.H.Kay at eng.icl.co.uk
Thu Jul 16 16:29:50 BST 1998


-----Original Message-----
From: Peter Murray-Rust <peter at ursus.demon.co.uk>
To: xml-dev at ic.ac.uk <xml-dev at ic.ac.uk>
Date: 16 July 1998 08:50
Subject: Re: HTML to XML


>At 20:30 15/07/98 -0400, David Megginson wrote:
>>
>>We desperately need a SAX parser that parses HTML instead of XML --
>>any takers?
>
>The SwingSet (from com.sun.java) has HTML functionality. I'm not sure
>exactly what, but it can read in HTML and render it.

Good thinking. I've had a look at the Swing source. It
includes a parser (html32.java) generated using JavaCC, the
Java compiler-compiler. The parser calls a callback interface,
HTMLParserCallback.java, which is similar in concept to SAX,
though it seems to include both generic (start/end element)
and element-specific (e.g. startUL) callbacks. Of course the
main difference from a SAX application will be that the
elements are not guaranteed to be properly nested.
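
To make the nesting point concrete, here is a minimal sketch of how
an application might rebalance loosely nested start/end events
before passing them on as well-formed XML. The TagCallback interface
and the NestingRepairer class are hypothetical names invented for
illustration; they are not the Swing callback interface or the SAX
API, and attributes and HTML's implied end tags are left out.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical callback interface with generic start/end element
// events only. This is an assumption, not the Swing or SAX API.
interface TagCallback {
    void startElement(String name);
    void endElement(String name);
    void characters(String text);
}

// Rebalances badly nested events so the serialized output is
// well-formed. A stack tracks the elements that are still open.
class NestingRepairer implements TagCallback {
    private final Deque<String> open = new ArrayDeque<>();
    private final StringBuilder out = new StringBuilder();

    public void startElement(String name) {
        open.push(name);
        out.append('<').append(name).append('>');
    }

    public void endElement(String name) {
        if (!open.contains(name)) {
            return; // stray end tag with no matching start: drop it
        }
        // Close everything opened since the matching start tag,
        // finishing with the matching element itself.
        String top;
        do {
            top = open.pop();
            out.append("</").append(top).append('>');
        } while (!top.equals(name));
    }

    public void characters(String text) {
        // Escape the characters that are markup-significant in XML.
        out.append(text.replace("&", "&amp;")
                       .replace("<", "&lt;")
                       .replace(">", "&gt;"));
    }

    // Closes any elements still open at end of input and returns
    // the serialized result.
    public String finish() {
        while (!open.isEmpty()) {
            out.append("</").append(open.pop()).append('>');
        }
        return out.toString();
    }
}
```

With this sketch, the event sequence for <b><i>x</b></i> comes out as
<b><i>x</i></b>: the end of "b" forces "i" closed first, and the late
end of "i" is dropped. Whether dropping or reopening is the right
repair depends on the application.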

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)



