Ps --> xml (or html)

Eve L. Maler elm at arbortext.com
Fri Sep 10 14:05:33 BST 1999


Ghostscript claims to be able to "extract text" (so that you could do a 
PostScript-to-text-to-XML process), but at a quick glance I can't see how 
you do it.
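
For what it's worth, here is a rough Python sketch of what such a PostScript-to-text-to-XML process might look like. It assumes a Ghostscript build that includes the "txtwrite" output device, which may not be the case for every installation. The function names and the <doc>/<para> element names are made up for illustration. The XML step only wraps blank-line-separated blocks in generic paragraphs, so it does nothing to recover real structure.

```python
import xml.etree.ElementTree as ET


def gs_text_command(ps_path, txt_path):
    """Build a Ghostscript command line that writes the text of ps_path to txt_path."""
    return [
        "gs", "-q", "-dNOPAUSE", "-dBATCH",
        "-sDEVICE=txtwrite",          # assumption: this device is compiled in
        "-sOutputFile=" + txt_path,
        ps_path,
    ]


def text_to_xml(text):
    """Wrap blank-line-separated blocks of text in generic <para> elements."""
    root = ET.Element("doc")          # hypothetical element names
    block = []
    # The trailing "" guarantees the last block is flushed.
    for line in text.splitlines() + [""]:
        if line.strip():
            block.append(line.strip())
        elif block:
            ET.SubElement(root, "para").text = " ".join(block)
            block = []
    return ET.tostring(root, encoding="unicode")
```

The command list could be handed to subprocess.run(..., check=True), and the resulting text file read back and passed to text_to_xml. Escaping of characters such as "<" is left to ElementTree.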

         Eve

At 04:53 PM 9/10/99 +1000, James Robertson wrote:
>At 16:27 10/09/1999 , David LeBlanc wrote:
>
>>Hi;
>>
>>Anyone know of a PS to xml (or html) converter?
>>
>>TIA
>>
>>Dave LeBlanc
>
>If "PS" means PostScript, then that's a big
>ask ...
>
>PostScript can contain absolutely anything
>that can be output to a printer: text, graphics,
>equations, line art, anything ...
>
>Even worse, a lot of the text is broken up
>and specified according to exact placement
>on the page, etc, so it's not trivial to
>try and reconstruct a text flow ...
>
>Also, if XML is your target, how would
>you determine what tags to use for the
>text?
>
>That being said, who knows what some
>brilliant mind has created out there, but
>I wouldn't hold your breath.
>
>J
>
>
>-------------------------
>James Robertson
>Step Two Designs Pty Ltd
>SGML, XML & HTML Consultancy
>http://www.steptwo.com.au/
>jamesr at steptwo.com.au
>
>"Beyond the Idea"
>  ACN 081 019 623
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on 
>CD-ROM/ISBN 981-02-3594-1
>To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
>

