Whitespace rules (v2)

Rick Jelliffe ricko at allette.com.au
Sun Aug 24 17:10:37 BST 1997


 
> From: Liam Quin <liamquin at interlog.com>
  
> The hyphen is 0255 octal (173 decimal).  It is a hyphen, not a soft hyphen.
> There is no soft hyphen in Latin 1.
> I don't have the necessary copy of Unicode in front of me, 

In both Unicode 1.0 and Unicode 2.0   &#x00AD;  is called "soft hyphen"
or "discretionary hyphen", so it is available, but perhaps not reliably 
supported by 8859-1 applications.

Also available is the zero-width
space  &#x200B;  which can be used to provide non-hyphenating line-break
points inside long technical terms (this might be useful in chemical names,
where a dash of any kind might be misleading) and in languages in which 
words are not delimited by spaces.

For example,  supercali&#x200B;fragalistic&x200B;expialladocious.



Rick Jelliffe


xml-dev: A list for W3C XML Developers
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To unsubscribe, send to majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
List coordinator, Henry Rzepa (rzepa at ic.ac.uk)




More information about the Xml-dev mailing list