Preserving white space and entity references in DataChannel XJP
Erik James Freed
ejfreed at infocanvas.com
Wed Jul 28 05:24:26 BST 1999
Vance,
Yes indeed this is not a pure DOM/XML issue, however the DC extension does
purport to
adhere to standard XML concepts.
The following is from the datachannel documentation on the
PreserveWhiteSpace parameter:
"As per the XML Language Specification, this specifies the white space
handling for the application; that is, the default white space handling to
apply when xml:space="default". If preserveWhiteSpace is true, all white
space will be preserved regardless of the setting of any xml:space
attributes in the document. The white space will be preserved by additional
text nodes being present in the tree. If preserveWhiteSpace is false, then
the values of the xml:space attribute specified in the document will
determine whether white space is preserved or not. "
So with that clarification is this a bug or a feature?
erik
-----Original Message-----
From: Vance Christiaanse [mailto:vance at textwise.com]
Sent: Tuesday, July 27, 1999 11:49 AM
To: Erik James Freed
Cc: Xml-Dev; Keith Swenson
Subject: Re: Preserving white space and entity references in DataChannel
XJP
> Erik James Freed wrote:
>
> I am experiencing some strange behavior with the datachannel XML
> parser package (the most recent one).
> In my reading of the DOM spec, this is not appropriate behavior, but
> perhaps I am missing something.
>
> The behavior is that when I do a 'setPreserveWhiteSpace(true)' before
> parsing a document, and the document
> contains strings with entity references such as:
>
> <element1> < fubar > </element1>
>
> when I then do a getText() on element1, what is returned is a
> java.lang.String that contains a null (char 0) for each entity
> reference.
>
> These nulls of course confound the rest of the code I am writing. In
> side the DOM tree the entity reference objects are happily holding the
> appropriate text representation i.e. '<' and '>'.
>
> Turning off white space preservation makes the getText() place
> appropriate decoded entity references in the resulting string.
>
> Bug or feature?
>
> erik
>
>
> Erik James Freed
> InfoCanvas Software
I don't see a setPreserveWhiteSpace(...) method or preserveWhiteSpace
class or instance variable in the DOM spec and I don't see getText() or
a text variable either. The answer to "bug or feature" would be up to
the
> datachannel XML parser package
Vance
Vance Christiaanse
TextWise, LLC
vance at textwise.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
More information about the Xml-dev
mailing list