xml parser

Tim Bray tbray at textuality.com
Wed Nov 4 16:23:10 GMT 1998


At 10:55 AM 11/4/98 -0000, Michael Kay wrote:
>My immediate answer to this is yes, all the information you need for a
>search engine is available via the SAX or DOM interface offered by many
>parsers.

I disagree.  Few parsers track byte offsets or other locational info in
the file, and I think you need that to do basic things like proximity
and phrase search.
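
To make the point concrete, here is a rough sketch of what position
tracking could look like. It uses Python's xml.parsers.expat, which
exposes a CurrentByteIndex attribute on the parser. The names
index_words and find_phrase are made up for illustration, and the sketch
assumes UTF-8 input and that each character-data callback starts at the
reported byte index. It also ignores element boundaries, so a phrase can
match across two adjacent elements.

```python
import re
import xml.parsers.expat


def index_words(xml_bytes):
    """Return postings as (word, byte_offset, word_position) tuples.

    Hypothetical helper: assumes UTF-8 input and that a word is not
    split across two character-data callbacks.
    """
    parser = xml.parsers.expat.ParserCreate()
    postings = []

    def on_text(data):
        # Byte index in the input where this chunk of text begins.
        chunk_start = parser.CurrentByteIndex
        for m in re.finditer(r"\w+", data):
            # Convert the character offset inside the chunk to bytes.
            prefix_len = len(data[:m.start()].encode("utf-8"))
            postings.append(
                (m.group().lower(), chunk_start + prefix_len, len(postings))
            )

    parser.CharacterDataHandler = on_text
    parser.Parse(xml_bytes, True)
    return postings


def find_phrase(postings, phrase):
    """Return byte offsets where the words of phrase occur consecutively."""
    words = [w.lower() for w in phrase.split()]
    seq = [w for w, _, _ in postings]
    return [
        postings[i][1]
        for i in range(len(seq) - len(words) + 1)
        if seq[i:i + len(words)] == words
    ]
```

Without the offset and the word position in each posting, a phrase query
could only report that all the words occur somewhere in the document.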

>Of course you don't need to build your own search engine either, all you
>need to do is write an XML filter for an existing search engine. I'm
>surprised no-one seems to have done this yet.

I think you do need to build your own engine.  The reason is that most
existing search engines take an atomic-document view of the world, and
break down completely when asked to model a general recursive
hierarchical structure like XML. -Tim
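
As an illustration of the difference, the following sketch indexes words
by the element path they occur in rather than by whole document, using
Python's xml.sax. PathIndexer, build_path_index and search_within are
hypothetical names, and the path scheme is an assumption chosen for
brevity: it does not tell sibling elements of the same name apart.

```python
import xml.sax
from collections import defaultdict


class PathIndexer(xml.sax.ContentHandler):
    """Hypothetical handler: maps each word to the element paths it occurs in."""

    def __init__(self):
        super().__init__()
        self.stack = []                # names of the currently open elements
        self.index = defaultdict(set)  # word -> set of "a/b/c" paths

    def startElement(self, name, attrs):
        self.stack.append(name)

    def endElement(self, name):
        self.stack.pop()

    def characters(self, content):
        path = "/".join(self.stack)
        for word in content.split():
            self.index[word.lower()].add(path)


def build_path_index(xml_bytes):
    handler = PathIndexer()
    xml.sax.parseString(xml_bytes, handler)
    return handler.index


def search_within(index, word, element):
    """Return paths containing word that pass through the named element."""
    return sorted(
        p for p in index.get(word.lower(), ()) if element in p.split("/")
    )
```

A query such as search_within(index, "xml", "chapter") is scoped to part
of the tree, which an index keyed only on whole documents cannot express.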

