Why XML data typing is hard

David Brownell db at Eng.Sun.COM
Mon Nov 30 22:46:22 GMT 1998


Toby Speight wrote:
> 
> Henry> Henry S. Thompson <URL:mailto:ht at cogsci.ed.ac.uk>
>  
> Henry> In other words, if there are (natural) language/culture dependent
> Henry> aspects to our documents, then if we are good citizens we should
> Henry> use the xml:lang attribute to signal this.

A good "if" ... related:  "if" the document is directly created or
consumed by humans, we should use some locale tagging.  ("xml:lang"
identifies language, not locale!)


> It looks attractive at first sight, but think of the burden you're
> placing on processors ... readers now
> need to know (enough about) *all* the locales from which they might
> receive data.

I'd contend that the "float" (or "r4") data _type_ doesn't have any
such localization issues.  IEEE floating point is a binary spec, and
nobody's proposed not assuming IEEE floating point.

Rather, this issue is an encoding issue.  By and large, having just
one (canonical) form is a lot easier for programs to deal with:  smaller
code, easier to debug, faster in normal cases, and so on.

But:  is this data being generated by/for programs, or people?  People
have different priorities than programs.

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list