Preserving white space and entity references in DataChannel XJP

Vance Christiaanse vance at textwise.com
Fri Jul 30 21:25:41 BST 1999


Step 1:
> > Erik James Freed wrote:
> >
> > I am experiencing some strange behavior with the datachannel XML
> > parser package (the most recent one).
> > In my reading of the DOM spec, this is not appropriate behavior, but
> > perhaps I am missing something.
> >
> > The behavior is that when I do a 'setPreserveWhiteSpace(true)' before
> > parsing a document, and the document
> > contains strings with entity references such as:
> >
> >     <element1> &lt; fubar &gt; </element1>
> >
> > when I then do a getText() on element1, what is returned is a
> > java.lang.String that contains a null (char 0) for each entity
> > reference.
> >
> > These nulls of course confound the rest of the code I am writing. In
> > side the DOM tree the entity reference objects are happily holding the
> > appropriate text representation  i.e. '<' and  '>'.
> >
> > Turning off white space preservation makes the getText() place
> > appropriate decoded entity references in the resulting string.
> >
> > Bug or feature?

Step 2:
I wrote:
> I don't see a setPreserveWhiteSpace(...) method or preserveWhiteSpace
> class or instance variable in the DOM spec and I don't see getText() or
> a text variable either. The answer to "bug or feature" would be up to
> the
> 
> > datachannel XML parser package

Step 3:
Eric wrote
> Vance,
> 
> Yes indeed this is not a pure DOM/XML issue, however the DC extension does
> purport to
> adhere to standard XML concepts.
> 
> The following is from the datachannel documentation on the
> PreserveWhiteSpace parameter:
> 
>         "As per the XML Language Specification, this specifies the white space
> handling for the        application; that is, the default white space handling to
> apply when      xml:space="default". If preserveWhiteSpace is true, all white
> space will be preserved         regardless of the setting of any xml:space
> attributes in the document. The white space     will be preserved by additional
> text nodes being present in the tree. If        preserveWhiteSpace is false, then
> the values of the xml:space attribute specified in      the document will
> determine whether white space is preserved or not. "
> 
> So with that clarification is this a bug or a feature?

Step 4:
I don't know, unfortunately. I've been studying the DOM and I just
wanted to clarify its boundaries. Hopefully someone familiar with the
DataChannel XML parser package will answer!

Vance

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)





More information about the Xml-dev mailing list