Software for markup? (was Re: XML Search Engine)
Peter Murray-Rust
peter at ursus.demon.co.uk
Tue Nov 10 19:14:20 GMT 1998
I have what (I hope) is an easier problem and would be grateful for
pointers to existing software that can be used to mark up XML documents
[primarily in English].
We have developed an XML representation (VHG) for controlled vocabularies
which are widely used for "encoding" reports and other information. These
vocabularies have about 10K terms each, consisting of single words or
phrases (e.g. "acne", "Asian cholera", etc.). [Our current examples are
dictionaries of disease terms from world authorities.] The [pharmaceutical]
industry/regulators spend much time in "encoding" (== markup).
In general the dictionaries do not have indexHeadings, stemming,
pronunciations, US/Eur variants etc. (though we are hoping to promote the
communal capture of such knowledge through WWW-based collaboration). We
wish to be able to mark up such terms as automatically as possible, e.g.
<P>The patient developed acne</P>
could be transformed to
<P>The patient developed <simpleLink href="icd10_L70.0">acne</simpleLink></P>
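A minimal sketch of the kind of transformation shown above, assuming a plain
term-to-target lookup and simple whole-word, case-insensitive matching. The
names VOCAB, build_matcher and mark_up are hypothetical, and the second
vocabulary entry is an invented placeholder rather than a real code. It
searches only the text between tags and makes no attempt to handle comments,
CDATA sections, or text that is already inside a simpleLink.

```python
import re
from xml.sax.saxutils import quoteattr

# Hypothetical vocabulary: term -> link target. Only the "acne" entry comes
# from the example above; the second entry is a made-up placeholder.
VOCAB = {
    "acne": "icd10_L70.0",
    "Asian cholera": "placeholder_code",
}

def build_matcher(vocab):
    """Compile one regex that matches any vocabulary term as a whole word."""
    # Longest terms first, so a phrase wins over a shorter term it contains.
    terms = sorted(vocab, key=len, reverse=True)
    pattern = r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b"
    return re.compile(pattern, re.IGNORECASE)

def mark_up(xml_text, vocab):
    """Wrap vocabulary terms found in character data in <simpleLink> elements."""
    matcher = build_matcher(vocab)
    lookup = {term.lower(): target for term, target in vocab.items()}

    def link(match):
        found = match.group(1)
        href = lookup[found.lower()]
        # quoteattr escapes the value and adds the surrounding quotes.
        return "<simpleLink href=%s>%s</simpleLink>" % (quoteattr(href), found)

    # Split into tag chunks and text chunks; tags are passed through untouched
    # so that attribute values are never rewritten.
    parts = re.split(r"(<[^>]+>)", xml_text)
    return "".join(
        part if part.startswith("<") else matcher.sub(link, part)
        for part in parts
    )

if __name__ == "__main__":
    print(mark_up("<P>The patient developed acne</P>", VOCAB))
```

With 10K terms a single alternation like this may become slow; a trie or
Aho-Corasick style matcher would be the usual next step, but that is beyond
this sketch.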
The current encoding procedures are not complex (often manual) and I'm not
looking for rocket science (e.g. no automatic extraction of terms, concept
analysis, etc.). I want to avoid re-inventing the wheel because I assume
lots of people have already done this.
TIA
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)