SAX-J and the DPH (DJH?)

Sean Mc Grath digitome at iol.ie
Tue Dec 30 16:31:03 GMT 1997


[Sean Mc Grath]
>> He is the guy staring at 10GB of corporate XML docs and just wants
>> to change
>> 
>> <telephone>555-1234</telephone>
>> 
>> into
>> 
>> <telephone>555-4321</telephone>
>> 
>> throughout the whole lot. If he uses SAX-J he blows all the
>> aide-memoires stuffed into his comments? He also blows his CDATA
>> marked sections. Is the DPH destined to always use a fully blown
>> parser to output XML?
>
[Chris Maden]
>Hell, no!  He'd be a fool to use Java for this when
>s/555-1234/555-4321/g will do the trick.  (I suspect that in any
>real-world situation like this, *all* numbers would want to be
>changed, but if not, just include the markup in the sed command.)

My example was a bit simplistic. This transformation is
no more difficult in fully blown SGML than it is in XML.

The fun starts for the DPH when including the markup in the
sed command is not an option because the task is sensitive to the
element hierarchy, e.g. changing only the telephone numbers occurring
within the appendix elements, and skipping those where the client
attribute has the value "Jones". That sort of thing.

Maybe nothing short of a fully blown XML parser will do for these
situations?
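
A rough sketch of that sort of task, using Python's standard xml.sax
module rather than SAX-J itself. The element and attribute names
(appendix, telephone, client) come from the example above; placing the
client attribute on the telephone element, and the names PhoneRewriter
and rewrite, are assumptions made purely for illustration. It also
assumes telephone elements contain only text.

```python
import io
import xml.sax
from xml.sax.saxutils import XMLGenerator


class PhoneRewriter(XMLGenerator):
    """Copy input to output, rewriting selected telephone numbers.

    Hypothetical helper: only numbers inside <appendix> are changed, and
    any <telephone client="Jones"> is left alone.
    """

    def __init__(self, out, old, new, skip_client="Jones"):
        super().__init__(out, encoding="utf-8")
        self.old, self.new, self.skip_client = old, new, skip_client
        self.appendix_depth = 0  # > 0 while inside an <appendix>
        self.phone_text = None   # text chunks of a targeted <telephone>

    def startElement(self, name, attrs):
        if name == "appendix":
            self.appendix_depth += 1
        elif (name == "telephone" and self.appendix_depth
              and attrs.get("client") != self.skip_client):
            self.phone_text = []
        super().startElement(name, attrs)

    def characters(self, content):
        if self.phone_text is not None:
            # Buffer: the parser may deliver text in several pieces.
            self.phone_text.append(content)
        else:
            super().characters(content)

    def endElement(self, name):
        if name == "telephone" and self.phone_text is not None:
            text = "".join(self.phone_text).replace(self.old, self.new)
            self.phone_text = None
            super().characters(text)
        elif name == "appendix":
            self.appendix_depth -= 1
        super().endElement(name)


def rewrite(xml_text, old="555-1234", new="555-4321"):
    """Return xml_text with the targeted telephone numbers replaced."""
    out = io.StringIO()
    xml.sax.parseString(xml_text.encode("utf-8"),
                        PhoneRewriter(out, old, new))
    return out.getvalue()
```

Run over a document that contains comments or CDATA marked sections,
this makes the original complaint concrete: the plain content handler
is never told about comments, and CDATA content arrives as ordinary
character data, so neither survives into the output in its original
form.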


Sean Mc Grath
sean at digitome dot com




xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)



