possibility of an RTF, LaTex XML conversion process (fwd)
Don Park
donpark at quake.net
Wed Apr 15 00:37:45 BST 1998
>Amen! I am up-converting a technical book in LaTeX that has literally
>thousands of format directives, each of which must be replaced by
>a descriptor showing the author's intent. I used Perl to do some
>automatically, but about half needed decisions by a content expert.
My recommendation would be to do a dumb translation of LaTeX into XML. By
doing so, you are deferring all the critical decisions which, if made
prematurely, could cause information loss and taint.
Once you have the XML-lized LaTeX document you have a core document to
create more application-oriented XML documents from. For example, if you
are interested in duplicating the layout of the original LaTeX document, you
could extract the layout information and create a PGML document. If you are
interested in an indexable XML document, you can extract the contents and
structural elements and massage them into an easily indexable format.
At later point, you can inject elements representing the author's intent as
well as some other content expert's interpretation (such element should have
an attribute indicating the point of view).
Regards,
Don Park
http://www.docuverse.com/personal/index.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list