SAX2 RFD: LexicalHandler draft v.1.1
David Megginson
david at megginson.com
Thu Mar 25 12:01:09 GMT 1999
Lars Marius Garshol writes:
> http://xml.org/sax/features/normalize-text
> Ensure that all consecutive text is returned in a single callback to
> DocumentHandler.characters or DocumentHandler.ignorableWhitespace
> (true) or explicitly do not require it (false).
>
>
> This is potentially problematic, since it's unspecified what the
> parser should do about CDATA sections in this case. (I suspect we will
> see more problems of this kind when we start using really using and
> stacking filters.) Should they be normalized, or should they be
> reported separately? (Ie: what is consecutive text, exactly?) The same
> problem appears with entity boundaries and character references.
Thanks, Lars -- this is an excellent point. I think that the
specification belongs, not with the normalize-text feature, but with
the LexicalHandler (since people may define other types of handlers
that we cannot predict).
> Possible solutions:
>
> - reject normalize-text true if a LexicalHandler has been registered,
> and reject LexicalHandler registration if normalize-text has been set
> to true
> - make normalize-text have a logical interpretation by default, and
> switch to lexical if a LexicalHandler has been registered
> - make normalize-text always have a lexical interpretation
> - have separate normalize-text-logical and normalize-text-lexical
> events, with reject-behaviour for the first
The DOM's text-normalisation feature does *not* normalise CDATA
sections, but I think that SAX's should.
All the best,
David
--
David Megginson david at megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list