White Space
Simon St.Laurent
simonstl at simonstl.com
Thu Aug 12 00:00:00 BST 1999
At 04:42 PM 8/11/99 -0400, jadams at touchpointsw.com wrote:
>Am using IBM xml4c2_2_0 SAXPrint, and
>It appears that leading and trailing spaces (whitespace) surrounding an
>element, e.g.
> <name>mumble</name> ,
> <name>mumble </name>,
> and lastly
> <name>mumble
> </name>,
>offer up via the characters handler mumble with no space, a trailing space,
>and lastly a trailing NL (0x0A) respectively.
>My difficulty lies in comparing the many forms of mumble with the string
>"mumble" because of the white space. Simon's "Building ..." p. 87 suggests
>that maybe (hopefully) the parser is removing white space.
>Should the underlying SAX parser be removing the troublesome white space, or
>should I be removing this problem white space in the characters handler?
>Thanks much
It's that wacky distinction between what the parser does (which is most of
what the XML spec discusses) and what the application does (which is where
xml:space comes in). The _parser_ should return all whitespace to the
application, apart from the end-of-line handling in section 2.11, which
normalizes line breaks to a single line feed (0x0A). That means you'll need
to have your application, or a filter (as David Megginson suggested),
eliminate whitespace you don't consider significant.
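For what it's worth, here is a rough sketch of doing the trimming on the
application side. It uses Python's standard xml.sax module rather than
xml4c, purely so that it runs on its own; the class name TrimmingHandler
and the helper trimmed_names are made up for illustration, and it assumes
that leading and trailing whitespace inside <name> is never significant
for your data. One thing to watch: a SAX parser is allowed to hand you a
single run of text in several characters() calls, so the sketch buffers
the pieces and trims only when the element ends.

```python
import xml.sax


class TrimmingHandler(xml.sax.ContentHandler):
    """Hypothetical handler: collects <name> text with outer whitespace removed."""

    def __init__(self):
        super().__init__()
        self.names = []      # trimmed text of every <name> element seen
        self._buffer = None  # None while we are outside a <name> element

    def startElement(self, tag, attrs):
        if tag == "name":
            self._buffer = []

    def characters(self, content):
        # The parser may split one text node across several calls,
        # so accumulate here and trim once, in endElement.
        if self._buffer is not None:
            self._buffer.append(content)

    def endElement(self, tag):
        if tag == "name" and self._buffer is not None:
            # Assumption: outer whitespace is not significant for <name>.
            self.names.append("".join(self._buffer).strip())
            self._buffer = None


def trimmed_names(xml_text):
    """Parse xml_text and return the trimmed content of each <name>."""
    handler = TrimmingHandler()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.names
```

With something like this in place, each of the three forms in the
question compares equal to the plain string "mumble". The same idea
should carry over to a characters handler in xml4c, though the details
of the C++ API will differ.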
Whitespace seems to be an issue that just never dies...
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com