DOM & Entities

Leigh Dodds ldodds at ingenta.com
Mon Apr 12 11:50:42 BST 1999


Hi,

I'd like some clarification for how DOM handles entities. The 
spec suggests that entities are resolved into their text 
equivalent ("...are replaced by the single character that 
makes up the entities equivalent..."). 

I'm curious as to how this is handled with entities such as those 
used in mathematical equations, or accented characters, or 
other special characters that aren't strictly 'plain text'?

I'm writing an XML processing application which reads in an 
XML document, performs some processing (based on another 
XML 'rules' document) and then produces a final XML document.
Ideally I'd like the entities retained from start to finish, so 
that I can be sure that they survive the transformation unchanged.
But I'm unclear how I can ensure this? Will I have to wrap all 
entity references in CDATA sections before parsing?

Incidentally I plan on using the IBM xml4j parser and its DOM 
implementation

Tips, comments, clarifications welcomed.

L.

==================================================================
    "Never Do With More, What Can Be Achieved With Less"
				---William of Occam
==================================================================
Leigh Dodds                             Eml:  ldodds at ingenta.com
ingenta ltd                             Tel:  +44 1225 826619
BUCS Building, University of Bath       Fax:  +44 1225 826283
==================================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list