Ps --> xml (or html)

Lisa Rein lisarein at finetuning.com
Fri Sep 10 15:45:25 BST 1999


Actually, Ghostscript does it pretty easily.  You just go to the "Edit"
menu nand then "Text Extract", but you have to make sure that 
the "PS to Text" is checked on "Normal". ("PS to Text" is on the
"Options" menu)

lisa

Eve L. Maler wrote:
> 
> Ghostscript claims to be able to "extract text" (so that you could do a
> PostScript-to-text-to-XML process), but at a quick glance I can't see how
> you do it.
> 
>          Eve
> 
> At 04:53 PM 9/10/99 +1000, James Robertson wrote:
> >At 16:27 10/09/1999 , David LeBlanc wrote:
> >
> >>Hi;
> >>
> >>Anyone know of a PS to xml (or html) converter?
> >>
> >>TIA
> >>
> >>Dave LeBlanc
> >
> >If "PS" means Postscript, then that's a big
> >ask ...
> >
> >Postscript can contain absolutely anything
> >that can be output to a printer: test, graphics,
> >equations, line art, anything ...
> >
> >Even worse, a lot of the text is broken up
> >and specified according to exact placement
> >on the page, etc, so it's not trivial to
> >try and reconstruct a text flow ...
> >
> >Also, if XML is your target, how would
> >you determine what tags to use for the
> >text?
> >
> >That being said, who know's what some
> >brilliant mind has created out there, but
> >I wouldn't hold your breath.
> >
> >J
> >
> >
> >-------------------------
> >James Robertson
> >Step Two Designs Pty Ltd
> >SGML, XML & HTML Consultancy
> >http://www.steptwo.com.au/
> >jamesr at steptwo.com.au
> >
> >"Beyond the Idea"
> >  ACN 081 019 623
> >
> >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
> >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
> >CD-ROM/ISBN 981-02-3594-1
> >To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
> >(un)subscribe xml-dev
> >To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
> >subscribe xml-dev-digest
> >List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
> >
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)





More information about the Xml-dev mailing list