Wish lists for the Holidays

Simon St.Laurent simonstl at simonstl.com
Tue Dec 28 03:39:25 GMT 1999

At 10:17 PM 12/27/99 -0500, John Cowan wrote:
>Simon St.Laurent scripsit:
>> While I like Tidy a lot, I'd love to have a parser that tidies up the HTML
>> structure and then spits it out as SAX events or a DOM tree, rather than
>> the kind of document-to-document work that Tidy does.  Seems like that
>> shouldn't be much more difficult than the work Tidy does.
>It isn't, and in fact the Java version of Tidy (linked from Dave Raggett's
>page) provides a mini-DOM.  WIth my DOMParser, you can generate SAX events
>from the mini-DOM as well.

Looks promising!  For those who want to enjoy it, see:


Simon St.Laurent
XML: A Primer, 2nd Ed.
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list