Whitespace rules (v2)

Andrew Greene agreene at bitstream.com
Mon Aug 11 14:40:38 BST 1997


I'm troubled by one aspect of that suggestion:

> RULE 4.  A remaining line-end code is converted into a space, except
> when it is preceded by a normal (hard) hyphen, or by a soft hyphen
> ('°'), in which case it is removed (a soft hyphen is also then
> removed).                                ^^^^^^^^^^^^^^^^^^^^^^^^^^
  ^^^^^^^

That could alter the semantics of the data stream. The incoming data
stream may have been broken at that point, but we don't want to lose
the fact that such a break is legal -- it may be required again down-
stream.

So, using your example, I think that

> A[CR]
> line-[CR]
> end code sep°[CR]
> arates lines.

should become

 A line-end code sep°arates lines.

and not, as you suggest,

> A line-end code seperates lines.

An individual application may choose to ignore soft hyphens when it
displays (or otherwise handles) the data. 

Does that make sense?

- Andrew Greene



xml-dev: A list for W3C XML Developers
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To unsubscribe, send to majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
List coordinator, Henry Rzepa (rzepa at ic.ac.uk)




More information about the Xml-dev mailing list