xml parser

Michael Kay M.H.Kay at eng.icl.co.uk
Wed Nov 4 11:00:01 GMT 1998

>1.To develop a search engine for xml documents, can we use the xml parser
>already developed by microsoft and others or do we need to build our own

My immediate answer to this is yes, all the information you need for a
search engine is available via the SAX or DOM interface offered by many

This is certainly true for the indexing phase; for displaying hit documents
I can think of some requirements that a standard parser might not meet, such
as displaying the text around a search term without parsing the whole
document. So it depends on your detailed design. But in any case many XML
parsers are available with source code so you shouldn't need to write a new
one from scratch.

Of course you don't need to build your own search engine either, all you
need to do is write an XML filter for an existing search engine. I'm
surprised no-one seems to have done this yet.

Mike Kay

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list