Attribute normalisation and character entities

Richard Tobin richard at
Wed Jan 26 13:02:38 GMT 2000

In article <003901bf6776$5d426f20$0201a8c0 at tomshp>,
Thomas B. Passin <tpassin at> wrote:
>The XML rec says

For character references, the table in 4.4 specifies "Included" rather
than "Included in literal", but in any case, the first point in
section 3.3.3 ("a character reference is processed...") must be
explaining the same thing (rather than something that happens
afterwards), otherwise there would be no character references present.

The algorithm in 3.3.3 applied to a non-CDATA attribute

   "foo &#x9; bar"

will first replace the character reference with a tab character, so
between "foo" and "bar" we will have

  space tab space

Then the "further processing" will do nothing, because there are no
sequences of more than one space character.
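
To make the steps above concrete, here is a minimal sketch of the
3.3.3 procedure in Python, reading the points as alternatives. The
function name normalize_attribute and its cdata parameter are my own
for illustration, not taken from any parser. As a simplification, it
handles character references and literal characters only: entity
references and the special case for #xD#xA are left out.

```python
import re

# One token is a hex character reference, a decimal character
# reference, or any other single character.
_TOKEN = re.compile(r"&#x([0-9A-Fa-f]+);|&#([0-9]+);|(.)", re.DOTALL)

def normalize_attribute(raw, cdata=True):
    """Sketch of XML 1.0 section 3.3.3 (hypothetical helper)."""
    out = []
    for m in _TOKEN.finditer(raw):
        hex_ref, dec_ref, ch = m.groups()
        if hex_ref is not None:
            # First point: append the referenced character as it is.
            out.append(chr(int(hex_ref, 16)))
        elif dec_ref is not None:
            out.append(chr(int(dec_ref)))
        elif ch in " \t\r\n":
            # Third point: a literal whitespace character becomes #x20.
            out.append(" ")
        else:
            # Fourth point: any other character is appended unchanged.
            out.append(ch)
    value = "".join(out)
    if not cdata:
        # Further processing: trim leading and trailing #x20 and
        # collapse runs of #x20. Only #x20 is touched, never a tab.
        value = re.sub(" +", " ", value.strip(" "))
    return value
```

Under this reading, normalize_attribute("foo &#x9; bar", cdata=False)
keeps the tab, while a literal tab in the same position is turned into
a space and then collapsed together with its neighbours.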

The only way that the tab could become a space is if both the first
and third points of section 3.3.3 were applied to it, but the natural
reading is that those points are alternatives.
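
For comparison, the other reading, in which the third point is applied
to the character produced by the first, might be sketched as follows.
The name normalize_both_points is hypothetical, and only hexadecimal
character references are handled, for brevity. This illustrates the
reading I consider less natural, not what the rec requires.

```python
import re

def normalize_both_points(raw):
    """Non-CDATA normalisation applying points 1 and 3 in sequence."""
    # First point: expand hexadecimal character references.
    expanded = re.sub(
        r"&#x([0-9A-Fa-f]+);",
        lambda m: chr(int(m.group(1), 16)),
        raw,
    )
    # Third point, applied to the expanded text as well, so a tab
    # that came from a reference is also turned into #x20.
    spaced = re.sub(r"[ \t\r\n]", " ", expanded)
    # Further processing for non-CDATA: trim and collapse #x20.
    return re.sub(" +", " ", spaced.strip(" "))
```

Here the tab from the reference is lost: "foo &#x9; bar" comes out as
"foo bar", which is the result you get only if both points are applied
to the same character.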

-- Richard
Spam filter: to mail me from a .com/.net site, put my surname in the headers.

"The Internet is really just a series of bottlenecks joined by high
speed networks." - Sam Wilson

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: or CD-ROM/ISBN 981-02-3594-1