Paul R. Brown
prb at uic.edu
Sun Dec 26 07:00:56 GMT 1999
> I am currently looking out for converting Word Perfect, MS
> Word and ASCII files into XML.
As Robert DuCharme suggested, you're better off dealing with RTF, but even
that is a moving target. (Microsoft says that the spec is subject to
change...) Writing an RTF-to-TXT conversion in Perl is a good exercise (all
you really need are regular expressions), and then it's up to you how much
of the RTF markup you want to preserve.
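
To give a feel for the regular-expression approach, here is a rough sketch, written in Python rather than Perl. It is only an illustration under simplifying assumptions: the function name rtf_to_text, the list of skipped groups, and the cp1252 decoding of \'hh escapes are my own choices, and real-world RTF (Unicode escapes, embedded objects, fields) needs a good deal more care.

```python
import re

# One alternative per kind of RTF token.
TOKEN = re.compile(r"""
    \\([a-z]+)(-?\d+)?[ ]?    # control word, optional number, optional delimiter space
  | \\'([0-9a-fA-F]{2})       # hex-escaped 8-bit character
  | \\([^a-z])                # control symbol (escaped brace or backslash, etc.)
  | ([{}])                    # group delimiters
  | ([^\\{}]+)                # plain text
""", re.VERBOSE)

# Groups whose contents are not document text (assumed list, not exhaustive).
SKIP = {"fonttbl", "colortbl", "stylesheet", "info", "pict"}

def rtf_to_text(rtf):
    out = []
    stack = []          # "skipping" flag of each enclosing group
    skipping = False
    for m in TOKEN.finditer(rtf):
        word, _param, hexch, symbol, brace, text = m.groups()
        if brace == "{":
            stack.append(skipping)
        elif brace == "}":
            skipping = stack.pop() if stack else False
        elif word:
            if word in SKIP:
                skipping = True
            elif not skipping and word in ("par", "line"):
                out.append("\n")
            elif not skipping and word == "tab":
                out.append("\t")
            # every other control word (formatting) is simply dropped
        elif symbol:
            if symbol == "*":
                skipping = True          # \* marks an ignorable destination
            elif not skipping and symbol in "{}\\":
                out.append(symbol)       # escaped literal character
        elif hexch:
            if not skipping:
                out.append(bytes.fromhex(hexch).decode("cp1252", errors="replace"))
        elif text and not skipping:
            # raw line breaks in an RTF file carry no meaning
            out.append(text.replace("\r", "").replace("\n", ""))
    return "".join(out)

sample = (r"{\rtf1\ansi{\fonttbl{\f0 Times New Roman;}}\f0\fs24 Hello, "
          r"{\b bold} world.\par Second line with a brace \{ and caf\'e9.\par}")
print(rtf_to_text(sample))
```

Preserving some of the markup would mean emitting something for control words such as \b in the branch that currently drops them, instead of throwing them away.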
For marking up ASCII as XML, you're going to run into a host of problems.
(Capturing tables into markup is an interesting exercise, for instance.)
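
As a small illustration of the table problem, here is a Python sketch that handles only the easiest case. It assumes columns are separated by runs of two or more spaces and that no cell is empty; the function name and the table/row/cell element names are made up for the example. Anything messier (ragged columns, wrapped cells, ruled borders) is where the real work starts.

```python
import re
import xml.etree.ElementTree as ET

def ascii_table_to_xml(text):
    """Turn a space-aligned ASCII table into a <table> element (as a string)."""
    table = ET.Element("table")
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue                      # skip blank lines
        row = ET.SubElement(table, "row")
        # Assumption: two or more spaces separate columns, so a single
        # space inside a cell ("Apples and pears") is kept intact.
        for cell in re.split(r"\s{2,}", line):
            ET.SubElement(row, "cell").text = cell
    return ET.tostring(table, encoding="unicode")

sample = """\
Item              Qty
Apples and pears  3
Salt & pepper     12
"""
print(ascii_table_to_xml(sample))
```

ElementTree takes care of escaping characters such as & when it serializes, which is one less thing to get wrong by hand.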
What is it that you want to accomplish?