Non-Unicode Character Sets
paul at prescod.net
Fri Jan 28 22:05:49 GMT 2000
I am told that conversion of some character sets through Unicode is
lossy and cannot be round-tripped. But it occurs ot me that as long as
one has the private use area, "unknown" characters can always be
preserved. If a particular mapping loses information, isn't that more a
weakness in the mapping then in Unicode itself? Are there some
standardized national character sets with so many non-Unicode characters
that they cannot fit into the PUA? Even with planes 15 and 16?
Is there any character set in the world that cannot be considered a
"subset of Unicode"?
Paul Prescod - ISOGEN Consulting Engineer speaking for himself
The new revolutionaries believe the time has come for an aggressive
move against our oppressors. We have established a solid beachhead
on Friday. We now intend to fight vigorously for 'casual Thursdays.'
-- who says America's revolutionary spirit is dead?
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ or CD-ROM/ISBN 981-02-3594-1
Unsubscribe by posting to majordom at ic.ac.uk the message
unsubscribe xml-dev (or)
unsubscribe xml-dev your-subscribed-email at your-subscribed-address
Please note: New list subscriptions now closed in preparation for transfer to OASIS.
More information about the Xml-dev