character encoding questions
Alice.Portillo at PSS.boeing.com
Tue Jul 15 17:18:03 BST 1997
I am new to the list and have some basic questions on the use of
ISO10646 that may have already been answered.
Has there been any discussion of how the character sets currently in
use will be mapped to ISO10646?
In the SGML ATA documents currently being generated by Boeing there are
references to the ISO TECH, ISO PUB, ISO NUM, ISO GRK1, ISO BOX, ISO
GRK3, ISO AMSO, ISO AMSC, and ISO LAT1 entity sets in order to access
all the characters required in the manuals being produced in SGML.
My questions are:
1. How are the software vendors (browser, parser, authoring) planning on
supporting documents which utilize the UNICODE character set?
2. a) Can all the characters referenced in ISO LAT1, positions 0-256, be
referenced in the document without benefit of escape codes?
2. b) What about positions 0-125?
2. c) Must the characters above 126 be escaped?
3. At what point in the ISO10646 character set must escaping be
instituted in order to reference a character within the set?
4. Has anyone mapped the ISO TECH, ISO PUB, ISO NUM, ISO GRK1, ISO BOX,
ISO GRK3, ISO AMSO, and ISO AMSC to the UNICODE equivalent escape codes?
5. How does SHUNCHAR set to NONE in the XML SGML DECLARATION interplay
with Char, Letters, Ignorable and other character class definitions? Has
the character class IGNORABLE taken care of this problem?
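To make questions 2 through 4 concrete, here is a minimal Python sketch
of the kind of mapping and escaping I have in mind. It assumes that an
"escape code" means an SGML-style numeric character reference (&#N;) and
that the cutoff is position 126, which is exactly what I am unsure of.
The table and function names are hypothetical, and the four entries are
only an excerpt, not a complete mapping of any entity set.

```python
# Hypothetical excerpt of an entity-name to ISO10646 code-point table.
# A full table would cover every name in the ISO entity sets.
ENTITY_TO_CODEPOINT = {
    "eacute": 0x00E9,  # ISO LAT1: small e with acute accent
    "copy": 0x00A9,    # ISO NUM: copyright sign
    "alpha": 0x03B1,   # ISO GRK3: Greek small letter alpha
    "infin": 0x221E,   # ISO TECH: infinity
}


def to_numeric_reference(name):
    """Return the decimal numeric character reference for an entity name."""
    return "&#{};".format(ENTITY_TO_CODEPOINT[name])


def escape_above(text, limit=126):
    """Escape every character whose position is above `limit`.

    The default of 126 is an assumption taken from question 2. c).
    """
    return "".join(
        ch if ord(ch) <= limit else "&#{};".format(ord(ch)) for ch in text
    )


if __name__ == "__main__":
    print(to_numeric_reference("eacute"))
    print(escape_above("caf\u00e9"))
```

Whether the second function is needed at all, and with what limit, is
the substance of questions 2 and 3.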
Product Definition and Image
The Boeing Company Phone: 425.237.3351
PO Box 3707 M/S 6H-AF Fax: 425.237.3428
Seattle, WA 98124-2207 christina.portillo at boeing.com
xml-dev: A list for W3C XML Developers
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
List coordinator, Henry Rzepa (rzepa at ic.ac.uk)