Is there anyone working on a binary version of XML?

David Megginson david at megginson.com
Sat Mar 27 15:56:18 GMT 1999


Rick Jelliffe writes:

 > One trivial way to minimise file sizes for transmission is to
 > collapse white-space inside markup (e.g. [\ \t \n\ r]+ becomes
 > [\n]),

Yes, that might be helpful (but only minimally in most cases).

 > sure that newlines are not CR LF pairs, 

Yes, that will make a small difference.  You might get a bigger bang
by doing some quick analysis to determine which character encoding
will provide the smallest object size: UTF-8, ISO-8859-1, UTF-16,
etc. (mileage will vary depending on the languages used in the text).

 > and to minimize whitespace in data: (removing trailing spaces, [\
 > \t]+\n) becomes [\n], is a safe transformation, for example.)

No.  It might be a safe transformation for specific XML formats, but
not for XML in general, because you don't know what people might be
using that whitespace for.

In general, though, what we need is a transport layer that takes care
of things like this for us.  Document type designers should optimise
for readability and usability, and let protocol designers worry about
the optimisations.


All the best,


David

-- 
David Megginson                 david at megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)




More information about the Xml-dev mailing list