XML::Parser and numeric entities
Eric Bohlman
ebohlman at netcom.com
Sat Aug 14 06:19:22 BST 1999
On Fri, 13 Aug 1999 mookie at unagi.undef.com wrote:
> Anybody know how I can get XML::Parser pass along entities like &#233;?
> I'm just translating my XML into HTML, so it'd be convenient.
>
>
> use strict;
> use XML::Parser;
>
> my $p = new XML::Parser(Style => 'Debug');
> $p->parse('<element>saut&#233;</element>');
>
> \\ ()
> element || s
> element || a
> element || ut
> element || #xC3;#xA9;
> //
>
> that's supposed to be "saute" with an acute accent over the e.
> what the heck is C3A9?
It's the UTF-8 encoding of that character: the two bytes 0xC3 0xA9
represent U+00E9 (e with an acute accent), whose ISO-8859-1 code point
is 233 (0xE9). Remember that regardless of the input encoding,
XML::Parser always returns its results in UTF-8. If you need to output
in another encoding, you'll have to translate it yourself.
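
As a rough sketch of that translation step (not something XML::Parser
does for you), the following uses the core Encode module to turn the
UTF-8 text back into either numeric character references for HTML or
ISO-8859-1 bytes. The helper names to_html_entities and to_latin1 are
made up for illustration, and the check with utf8::is_utf8 assumes that
any unflagged string holds raw UTF-8 bytes, as in the Debug output
quoted above.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: make sure we are working with Perl characters.
# Assumption: an unflagged string contains raw UTF-8 bytes.
sub as_characters {
    my ($text) = @_;
    return utf8::is_utf8($text) ? $text : decode('UTF-8', $text);
}

# Hypothetical helper: replace every non-ASCII character with a decimal
# numeric character reference, e.g. U+00E9 becomes &#233;
sub to_html_entities {
    my $text = as_characters(shift);
    $text =~ s/([^\x00-\x7F])/sprintf('&#%d;', ord($1))/ge;
    return $text;
}

# Hypothetical helper: re-encode the text as ISO-8859-1 bytes.
# Characters outside Latin-1 are replaced by Encode's default substitute.
sub to_latin1 {
    my $text = as_characters(shift);
    return encode('ISO-8859-1', $text);
}

# The bytes XML::Parser's Debug style displayed as #xC3;#xA9;
my $from_parser = "saut\xC3\xA9";

print to_html_entities($from_parser), "\n";   # prints: saut&#233;
```

In a real handler you would call one of these on the string passed to
your Char handler before printing it. Note that this sketch only deals
with non-ASCII characters; escaping &, < and > for HTML output is a
separate step.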