How to process Japanese Code with XMLDSO(MS-XML)

MURATA Makoto murata at apsdc.ksp.fujixerox.co.jp
Thu Aug 6 05:49:01 BST 1998


TAKAHASHI Masayoshi wrote:
> Can we define standard (or recommended) conversion (mapping table)
> between Unicode(UCS-2) and such encodings used in XML?

JIS X 0221, Java, and Microsoft appear to have their own.  Terrible 
confusion will happen in the near future.  This is not at all a 
fault of the Unicode consortium or ISO.  If somebody should be 
criticized, Japanese should be criticized.

An interesting document about this issue is available at:

http://hp.vector.co.jp/authors/VA001240/article/ucsnote.html

> I know that "Japanese profile" in KAISETSU of TR X 0008-1998
> (http://www.y-adagio.com/public/standards/xml/tutr.htm)
> define encodings used in japanese XML document, but it doesn't
> define conversions between encodings. I think it's not enough
> to guarantee exchange.

Quite.

TAKAHASHI Masayoshi wrote:
> 
> # ...masaka "UTF-{8|16} igai no encoings ha buji ni koukan dekiru
> # hoshou ga nai kara jissai niha tsukatte ha ikenai" to an ni
> # niowaseteiru wake deha nai desuyone? :-) > japanese profile

So, do you plan to contribute?  I can give you a SJIS XML file 
containing a number of problematic characters.  If you convert 
them to UTF-16 and then back to SJIS by a number of software 
tools and report the result to the public, that would be a great 
contribution.



Makoto
 
Fuji Xerox Information Systems
 
Tel: +81-44-812-7230   Fax: +81-44-812-7231
E-mail: murata at apsdc.ksp.fujixerox.co.jp

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list