Indexing XML documents

Jacek Ambroziak - Sun Microsystems Labs BOS Jacek.Ambroziak at East.Sun.COM
Thu Mar 12 18:34:51 GMT 1998


Rakesh,

I've done some work in this area adapting our Conceptual Indexing
framework to index XML documents. I have a paper on this work
almost ready. If you'll be in Seattle (XML conf) we can talk about it.

The interesting fact about Conceptual Indexing is the expressive
richness of its index database, which lets you store both offsets
of various meaningful chunks of the indexed documents, and relations
(structural and semantic) between these chunks.

I use a Java XML parser and a set of small Java objects
(one per element type) which know how to render each encountered
element into structures appropriate for the index, consistent
w/ the designed conceptualization of a given document type.

This has been a cool work, and as always, more remains to be done :-)

--Jacek


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list