"Multiple" Namespaces? (but NOT for HTML)

Walter Underwood wunder at infoseek.com
Thu Oct 28 18:42:44 BST 1999


At 12:08 PM 10/22/99 -0400, John Robert Gardner wrote:
>Greetings:
>
>One of the goals of our project--so kindly written up by Robin at
>http://www.oasis-open.org/cover/atlas.html--is to allow the searching of
>our records (which will be linked to XML instances of the articles in gif,
>then later in keyed/tagged text) from a variety of existing search
>interfaces, as transparently as possible.  In other words, someone using
>DC can search for articles where Creator is Tillich, while someone using
>GILS can search for Originator as Tillich, and so on.  

It may be that markup is not the right hammer for this problem.
Our search engine handles multiple DTDs by mapping the elements
into common search meta data elements.

   DC:Creator      -> author
   GILS:Originator -> author
   TEI:docAuthor   -> author

and so on. So the documents can remain legal and "pure" with
respect to TEI or GILS, but users can search them with a 
common model.

In a sense, this is the same as having separate style sheets for
each format, but making them all look the same. You could even
consider making a search-specific style sheet (XSLT-only) which
transformed each DTD into the common search form. Whether it
is implemented with a literal style sheet or not, that is a
useful model for thinking about it. If there are separate
sets of meta data elements (Dublin Core, GILS, US-MARC), then
you'd need a separate transformation for each.

Ultraseek Server doesn't implement it that way, partly because
XSLT didn't yet exist at the time, and mostly because the
indexer has no other use for a DOM, and SAX-style stream/event
parsing is more efficient for our purposes.

At first glance, it sounds like Ultraseek Server might already
implement what you want, so it is at least worth checking it
out. If you want more details about our XML support, I wrote 
it, so you can ask me.

And I like the idea of "Creator is Tillich". Shouldn't that
be "Ground of Creator is Tillich"?

wunder
--
Walter R. Underwood
Staff Engineer, Infoseek Corp.
wunder at infoseek.com
http://software.infoseek.com/cce/ (my product)
http://www.best.com/~wunder/
1-408-543-6946

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)





More information about the Xml-dev mailing list