XML::Parser and numeric entities

Eric Bohlman ebohlman at netcom.com
Sat Aug 14 06:19:22 BST 1999


On Fri, 13 Aug 1999 mookie at unagi.undef.com wrote:

> Anybody know how I can get XML::Parser to pass along entities like &#233;?
> I'm just translating my XML into HTML, so it'd be convenient.
> 
> 
> use strict;
> use XML::Parser;
> 
> my $p = XML::Parser->new(Style => 'Debug');
> $p->parse('<element>s&#97;ut&#233;</element>');
> 
>  \\ ()
> element || s
> element || a
> element || ut
> element || #xC3;#xA9;
>  //
> 
> that's supposed to be "saute" with an acute accent over the e.
> what the heck is C3A9?  

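It's the UTF-8 encoding of the character whose ISO-8859-1 (and Unicode)
code point is 233 (0xE9, e with an acute accent).  In UTF-8 that character
takes two bytes, 0xC3 0xA9.  Remember that regardless of the input encoding,
XML::Parser always returns its results in UTF-8, and that includes expanded
numeric character references such as &#233;.  If you need to output in
another encoding, you'll have to translate it yourself.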
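
For the HTML case, one way to do that translation is sketched below.  It
is only a sketch under some assumptions: it uses the Encode module (not
mentioned above, and only bundled with later Perl releases), the helper
names utf8_bytes_to_entities and utf8_bytes_to_latin1 are made up for
illustration, and it assumes the handler receives raw UTF-8 bytes.  If your
Perl and XML::Parser hand you already-decoded character strings, the
decode() step should be skipped.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: take UTF-8 bytes and return pure ASCII, with every
# non-ASCII character rewritten as a decimal character reference.
sub utf8_bytes_to_entities {
    my ($bytes) = @_;
    my $chars = decode('UTF-8', $bytes);    # bytes -> character string
    $chars =~ s/([^\x00-\x7F])/'&#' . ord($1) . ';'/ge;
    return $chars;
}

# Hypothetical helper: take UTF-8 bytes and return ISO-8859-1 bytes.
sub utf8_bytes_to_latin1 {
    my ($bytes) = @_;
    return encode('ISO-8859-1', decode('UTF-8', $bytes));
}

# "saut" followed by the two bytes C3 A9, as in the Debug output above.
my $utf8 = "saut\xC3\xA9";

print utf8_bytes_to_entities($utf8), "\n";    # prints: saut&#233;

# Show the Latin-1 result as hex so it survives any terminal encoding.
my $latin1 = utf8_bytes_to_latin1($utf8);
print join(' ', map { sprintf '%02X', ord } split //, $latin1), "\n";
# prints: 73 61 75 74 E9
```

The entity form is the safer choice for HTML output, since it stays plain
ASCII whatever charset the page is eventually served with.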


