From David.Brownell at Eng.Sun.COM Fri May 1 00:23:17 1998
From: David.Brownell at Eng.Sun.COM (David Brownell)
Date: Mon Jun 7 17:00:59 2004
Subject: SAX 1.0beta: Three bugs so far
Message-ID: <199804302221.PAA24043@argon.eng.sun.com>
> > BUG #2: Parser.setLocale takes only one String argument
> >
> > As will quickly become apparent, I am not an expert in
> > localisation. I have discovered that localisation requires both
> > a language code _and_ a country code, so I have changed the
> > interface prototype to
> >
> > public abstract void setLocale (String language, String country)
> > throws SAXException;
> >
> > Does this look correct? Would people prefer that I use the
> > java.util.Locale class?
Locale ...
> I think a single string, or unspecified parts, would be better. XML
> allows RFC 1766 language identifiers, which can include i-cherokee and
> x-klingon. The language-country form is only one class of valid
> language identifier.
But this is instructions to the parser, which will be using the
standard Java localisation facilities which center around Locale
objects in order to format any messages. It's not related to the
text of the document.
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 1 00:28:30 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:00:59 2004
Subject: Separation of formatting...
References: <199804301555.QAA00020@GPO.iol.ie>
Message-ID: <3548B8B6.10DFFC2E@technologist.com>
Sean Mc Grath wrote:
>
...
> Here is a pieces of data marked up three ways:-
>
> Version 1 : Purely Formatting Mindset (RTF)
> "{\i Customer} Joe Bloggs \par"
>
> Version 2 : SGML - Generic Markup
>
CustomerJoe Bloggs
>
> Version 3 : SGML - Data Modelling
> Joe Bloggs
>
> Somewhere along the line, people started thinking
> as in version 3 above. I have no idea when this
> started to happen. Anyone out there know?
>...
These live on a continuum. I don't think that there is a shift from one to
another. SGML allows you to decide what is formatting and what is
abstraction. It is decided on an element by element basis and most DTDs
will have different parts at different points along the continuum. For
instance bibliographies typically tend towards data modelling, predictable
body text towards what the middle, and unpredictable body text (e.g. web
pages) towards the formatting side.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From srn at techno.com Fri May 1 01:14:46 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun 7 17:00:59 2004
Subject: Separation of formatting...
In-Reply-To: <199804301555.QAA00020@GPO.iol.ie> (message from Sean Mc Grath on
Thu, 30 Apr 1998 16:55:17 +0100)
References: <199804301555.QAA00020@GPO.iol.ie>
Message-ID: <199804302313.SAA03170@bruno.techno.com>
[Sean Mc Grath :]
> Version 2 : SGML - Generic Markup
>
CustomerJoe Bloggs
Maybe the
is generic markup, but the
is procedural markup, not generic markup.
> Version 3 : SGML - Data Modelling
> Joe Bloggs
The above is generic markup, for sure.
"Generic" means "according to kind". "Generic markup" means markup
that indicates *what kind of thing* is being delimited, rather than
*what to do with the thing* that is being delimited. The latter is
usually called "procedural markup".
> I think for some poople, SGML is all about
> Version 2 above. Entire books have been written that
> use SGML to abstract the concepts of "paragraph",
> "artwork" etc. from the typographic codes required to
> achieve the result.
Ain't it the truth. Well, evidently it's a useful attitude.
> Somewhere along the line, people started thinking
> as in version 3 above. I have no idea when this
> started to happen. Anyone out there know?
I think it was long before there was SGML. The radical implications
of the modeling power of DTDs still escape most people, even though
they were surprisingly well understood right at the outset.
I think SGML's roots are firmly in generic markup, way back to
the mid-1960s and the precursorial beginnings of the GenCode
committee. I don't know the details, but the history is long,
has many details, and is only slightly contentious. Norm Scharpf,
President of the GCA, is one authority on the early history.
-Steve
--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com http://www.techno.com ftp.techno.com
voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax +1 972 994 0087 (at ISOGEN: +1 214 953 3152)
3615 Tanner Lane
Richardson, Texas 75082-2618 USA
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Fri May 1 01:53:22 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:00:59 2004
Subject: Off track
References:
Message-ID: <35490EB4.A58F6437@allette.com.au>
Simon St.Laurent wrote:
> Apart from the number of people working on XML who are from assorted empires
> and settler states, I'm really not sure what on earth you're talking about. I
> know there are several issues with Unicode (65000 characters isn't enough),
> but what in XML is so 'colonial'?
Nothing in the sense that you are thinking of - what I meant was that a relatively
small albeit it open, etc. group of people are making
decisions that are going to have a profound impact on many millions of people for
some time to come. I believe we are preparing to unleash a new culture on
everyone; I wasn't referring to any particular cultural or ethnic group or issues
therewith.
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Fri May 1 02:24:48 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:00 2004
Subject: Off track
Message-ID: <3.0.32.19980430192254.006ae490@postoffice.swbell.net>
At 09:52 AM 5/1/98 +1000, Marcus Carr wrote:
>Simon St.Laurent wrote:
>
>> Apart from the number of people working on XML who are from assorted
empires
>> and settler states, I'm really not sure what on earth you're talking
about. I
>> know there are several issues with Unicode (65000 characters isn't enough),
>> but what in XML is so 'colonial'?
>
>Nothing in the sense that you are thinking of - what I meant was that a
relatively
>small albeit it open, etc. group of people are
making
>decisions that are going to have a profound impact on many millions of
people for
>some time to come. I believe we are preparing to unleash a new culture on
>everyone; I wasn't referring to any particular cultural or ethnic group or
issues
>therewith.
But isn't this almost always how these things happen? There's maybe 20
people who have direct and continuous input into SGML, DSSSL, etc. It must
be the same for things like HTML, internet standards, products, laws, etc.
Certainly a large number of people will influence the decisions of these
small groups, but ultimately it comes down to a few people making critical
decisions. No matter how open a process is, it always comes down to what
the people who do the work do.
The US constitution was framed by no more than about 50 people, of whom
maybe 10 were involved in the bulk of the decisions made, with the core
ideas coming from just two or three people (Jefferson, Madison, etc.). The
decisions of that small group of people working largely in secret in 1789
have profoundly affected world history.
I would like to think that, as a body and as individuals, that the members
of the XML WG take their resposibility very seriously--my experience is
that we do--it's one of the reasons that I'm willing and proud to
participate in the process. It doesn't mean we'll always make the best
technical decision--we're only human, but we certainly don't make any
decisions lightly. And of course, we have to play the hand we're dealt,
whether we like it or not.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From greyno at mcs.com Fri May 1 03:12:13 1998
From: greyno at mcs.com (Gregg Reynolds)
Date: Mon Jun 7 17:01:00 2004
Subject: Separation of formatting...
References: <199804301555.QAA00020@GPO.iol.ie> <199804302313.SAA03170@bruno.techno.com>
Message-ID: <354913CC.28B1@mcs.com>
Steven R. Newcomb wrote:
>
> [Sean Mc Grath :]
>
> > Version 2 : SGML - Generic Markup
> >
CustomerJoe Bloggs
>
> Maybe the
is generic markup, but the
> is procedural markup, not generic markup.
>
> > Version 3 : SGML - Data Modelling
> > Joe Bloggs
>
> The above is generic markup, for sure.
>
> "Generic" means "according to kind". "Generic markup" means markup
> that indicates *what kind of thing* is being delimited, rather than
> *what to do with the thing* that is being delimited. The latter is
> usually called "procedural markup".
In light of the ongoing and eternal debate over just what markup
is/means/does, the following passages may interest you. I think the
line between, on the one hand, what you call generic and what Paul (if I
understand him properly) calls abstract markup, and procedural or
formatting instructions is pretty thin when you look closely. From "Ad
Herennium", usually attributed to Cicero:
"Colon or Clause is the name given to a sentence member, brief and
complete, which does not express the entire thought, but is in turn
supplemented by another colon, as follows: 'On the one hand you were
helping your enemy.' That is one so-called colon; it ought then to be
supplemented by a second: 'And on the other you were hurting your
friend." This figure can consist of two cola, but it is neatest and
most complete when composed of three, as follows: 'You were helping
your enemy, you were hurting your friend, and you were not consulting
your own best interests.'
(Was the modern editor just being a smarty-pants by omitting colons from
the quoted examples?) Tully's not talking about written punctuation
marks here (markup), but about rhetorical techniques of the public
orator - phrasing, rhythm, delivery. In other words, processing
instructions. The text goes on to discuss "Comma or Phrase" and
"Period" among other things, all in terms of the orator's art. Viewed
in this light, the common punctuation marks (including paragraph and
other layout conventions) which most of us probably think of as abstract
or generic demarcations of thought-units (or something like that), may
be viewed as notes on how delivery: distincly how-to. Still works that
way, as anybody who's ever heard a recitation by a bad reader can
attest.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Fri May 1 11:16:36 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:00 2004
Subject: Off track
In-Reply-To: <35490EB4.A58F6437@allette.com.au>
References:
Message-ID: <3.0.1.16.19980501090437.62bf2ec4@pop3.demon.co.uk>
At 09:52 01/05/98 +1000, Marcus Carr wrote:
[... in response to others ...]
[...]
>Nothing in the sense that you are thinking of - what I meant was that a
relatively
>small albeit it open, etc. group of people are
making
>decisions that are going to have a profound impact on many millions of
people for
>some time to come. I believe we are preparing to unleash a new culture on
>everyone; I wasn't referring to any particular cultural or ethnic group or
issues
>therewith.
We are undoubtedly part of a very radical process in human history. When
Faraday was asked about the use of one of his discoveries (I think it was
electromagnetic induction) he is reputed to have said:
"Madam, what is the use of a new born baby?". [BTW without electromagnetic
induction there would be no XML...]
I am very grateful to be part of a virtual community which is international
and includes many disciplines, especially those related to linguistics. For
example the curators of the TEI hold much of the world's heritage in their
care; the quotation from Cicero is an example of the thread that binds us
over space and time. [BTW I favour return to Roman monumental inscriptions
- no case-sensitivity and no whitespace, and the *practice* has remained
future-proof for 2 millennia.] Although it is confidential, the discussion
about i18n and especially case-sensitivity on XML-SIG relied heavily on the
members' knowledge of non-ISOLatin languages and cultures, and the SIG/WG
possibly suffers through that not being widely known.
I am very conscious of the separation of the world - I spent two years in
W.Africa and am very grateful for the insight I gained into
neo-colonialism. The information revolution has the power to bring great
benefit to the world, especially in making education more easily available.
I am sure that XML forms a major platform for the educational technology of
the future - we have a clear responsibility to use it wisely.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tony at ems.uq.edu.au Fri May 1 11:20:16 1998
From: tony at ems.uq.edu.au (Anthony B. Coates)
Date: Mon Jun 7 17:01:00 2004
Subject: Announce: xml-litprog-l
Message-ID: <35499355.5D10F3B0@ems.uq.edu.au>
I have created a mailing list, "xml-litprog-l", for discussions about
how best to use XML/XSL/XLink/XPtr for constructing literate programming
frameworks and tools. I'm interested in being able to share ideas with
(i) people who know more about XML and/or SGML than I do;
(ii) people who know more about literate programming than I do;
(iii) anyone else who is interested.
If you think that you fit into one of these groups, then please consider
joining (I'm not expecting it to be a high traffic list, but I hope it
will be a profitable list, in non-monetary terms). The Web page with
all the details is at
Some people have already expressed to me an interest in being on this
list. However, for reasons of "netiquette", I won't subscribe anyone
automatically. If you are interested, please visit the Web page (or
send the message "subscribe xml-litprog-l" to "majordomo@ems.uq.edu.au".
Cheers,
Tony.
--
Educational Multimedia Services
= reduced workloads for lecturers, teachers, and tutors
= better results for students.
--
Another 100% Pure Java e-mail. Is yours?
--
Anthony B. Coates.
Multimedia Developer (Software Design)
Educational Multimedia Services
TEDI, The University of Queensland.
AJUG-QLD Steering Committee Member
(Australian Java Users' Group ).
QMUG Committee Member
(Queensland Macromedia Users' Group ).
E-mail: tony@ems.uq.edu.au
WWW: http://www.ems.uq.edu.au/People/Tony/
UIN: 5191015
--
All opinions are mine, and may not represent
those of The University of Queensland.
--
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tony at ems.uq.edu.au Fri May 1 11:22:44 1998
From: tony at ems.uq.edu.au (Anthony B. Coates)
Date: Mon Jun 7 17:01:00 2004
Subject: Announce: xml-litprog-l
Message-ID: <35499355.5D10F3B0@ems.uq.edu.au>
I have created a mailing list, "xml-litprog-l", for discussions about
how best to use XML/XSL/XLink/XPtr for constructing literate programming
frameworks and tools. I'm interested in being able to share ideas with
(i) people who know more about XML and/or SGML than I do;
(ii) people who know more about literate programming than I do;
(iii) anyone else who is interested.
If you think that you fit into one of these groups, then please consider
joining (I'm not expecting it to be a high traffic list, but I hope it
will be a profitable list, in non-monetary terms). The Web page with
all the details is at
Some people have already expressed to me an interest in being on this
list. However, for reasons of "netiquette", I won't subscribe anyone
automatically. If you are interested, please visit the Web page (or
send the message "subscribe xml-litprog-l" to "majordomo@ems.uq.edu.au".
Cheers,
Tony.
--
Educational Multimedia Services
= reduced workloads for lecturers, teachers, and tutors
= better results for students.
--
Another 100% Pure Java e-mail. Is yours?
--
Anthony B. Coates.
Multimedia Developer (Software Design)
Educational Multimedia Services
TEDI, The University of Queensland.
AJUG-QLD Steering Committee Member
(Australian Java Users' Group ).
QMUG Committee Member
(Queensland Macromedia Users' Group ).
E-mail: tony@ems.uq.edu.au
WWW: http://www.ems.uq.edu.au/People/Tony/
UIN: 5191015
--
All opinions are mine, and may not represent
those of The University of Queensland.
--
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tfj at apusapus.demon.co.uk Fri May 1 12:42:47 1998
From: tfj at apusapus.demon.co.uk (Trevor Jenkins)
Date: Mon Jun 7 17:01:00 2004
Subject: Announce: xml-litprog-l
In-Reply-To: <35499355.5D10F3B0@ems.uq.edu.au>
Message-ID: <199805010948.tfj.21089@apusapus.demon.co.uk>
> I have created a mailing list, "xml-litprog-l", for discussions
> about how best to use XML/XSL/XLink/XPtr for constructing literate
> programming frameworks and tools. I'm interested in being able to
> share ideas with
> (i) people who know more about XML and/or SGML than I do;
> (ii) people who know more about literate programming than I do;
> (iii) anyone else who is interested.
What a good idea. Back in 1985/6 I proposed that the ISO Technical
Report "Techniques for Using SGML" include an example of Literate
Programming! Some of the other committee members prefered a Z
example. I gave way to their "pressure" and forgot the literate
programming example; no one provided the Z example either.
There are some things in SGML that would mean existing web files
could have been processed whereas they could not be processed with
XML.
Regards, Trevor.
--
<>< Re: deemed
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Fri May 1 13:32:44 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:00 2004
Subject: SAX 1.0beta: Three bugs so far
References: <199804300045.UAA00461@unready.microstar.com>
Message-ID: <3549A239.C0199123@jclark.com>
David Megginson wrote:
>
> BUG #2: Parser.setLocale takes only one String argument
>
> As will quickly become apparent, I am not an expert in
> localisation. I have discovered that localisation requires both a
> language code _and_ a country code, so I have changed the interface
> prototype to
>
> public abstract void setLocale (String language, String country)
> throws SAXException;
>
> Does this look correct? Would people prefer that I use the
> java.util.Locale class?
I would prefer java.util.Locale:
setLocale(java.util.Locale locale)
is the standard Java pattern. Since Java also provides a 3-argument
variant of the Locale constructor, I don't think setLocale(String
language, String country) is sufficient.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From M.H.Kay at eng.icl.co.uk Fri May 1 14:38:01 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun 7 17:01:00 2004
Subject: SAX 1.0: problem with InputSource
Message-ID: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk>
I am trying to subclass InputSource to allow a File to be
supplied as an alternative to a SystemId, CharacterStream,
etc. (The implementation
of this goes through the magic incantations to turn a file
name into a URL).
This is proving clumsy because the InputSource class has no
default constructor.
Since there is a full range of setX() methods, and given the
general usefulness of having a default constructor e.g. for
JavaBeans / ActiveX,
should this omission be rectified?
Mike Kay
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 1 14:53:45 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:00 2004
Subject: SAX 1.0: problem with InputSource
In-Reply-To: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk>
References: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <199805011252.IAA00387@unready.microstar.com>
Michael Kay writes:
> Since there is a full range of setX() methods, and given the
> general usefulness of having a default constructor e.g. for
> JavaBeans / ActiveX,
> should this omission be rectified?
Yes. Thanks for pointing this out.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 1 16:02:15 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:00 2004
Subject: Open standards processes
Message-ID: <3549D5D2.CE54C193@technologist.com>
Here are some more thoughts about open standards processes:
The XML SIG was pretty wide open, which was admirable, but XSL the DOM,
etc. seem to have no equivalent. My hypothesis is that the XML SIG became
large before anyone "important" realized it was going to be a crucial
technology.
On the other hand, since the W3C groups do much of their work online,
there is more of a "bit-trail" (available only to members, but available
nevertheless) that I cannot see in ISO standards processes. I am involved
in the ISO process, but do not go to the meetings, and proposals seem to
drop down from heaven for us to vote on or dismiss. Surely we could
achieve a more global, technology-mediated discussion in this day and age!
Clearly no process is perfect, but I think that whining about them on
XML-DEV is an important part of determining the imperfections. Standards
bodies derive all of their legitimacy from public acquiescence (else, I
would start one myself!). If the public were more aware of their goings
on, they could be forced open and the vendors would still have no choice
but to play along. They also derive their legitimacy from the public.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From M.H.Kay at eng.icl.co.uk Fri May 1 17:15:59 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun 7 17:01:00 2004
Subject: Open standards processes
Message-ID: <005201bd7514$1ed4cc20$1e09e391@mhklaptop.bra01.icl.co.uk>
>Here are some more thoughts about open standards processes:
>
To add my tuppenceworth, I've been involved in the past in
both de jure and consortium standards-making processes,
though all before the days of the web.
To get a successful standard you need a core team who work
hard, who are technically highly competent, and who
understand the needs of the users as well as (a more common
reason for failure) the needs of potential vendors. You need
a consensus on the general principles and objectives, an
aversion to introducing unproven innovations, and an absence
of people with an interest in obstructing the process. You
don't need consultation or democracy or legal authority;
these can sometimes help to achieve the necessary consensus
but can also slow things down or send things off in the
wrong direction.
But I don't think it serves any purpose to be secretive. I
have certainly always believed that the more people knew
what was going on, the greater the chance of success.
Publishing work in progress will enable the user and vendor
community to respond more rapidly when the thing is finally
published, and will harness the resources of a wider group
of people to spot the errors. I find it a little
disappointing, now that there is no cost argument to prevent
open dissemination, that W3C should (apparently) have a
policy of secrecy which goes beyond anything I ever
encountered in ISO or ANSI or X/Open or OMG committees.
Perhaps the problem is that they would be deluged by
feedback, but I doubt it.
Michael Kay
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at arbortext.com Fri May 1 17:32:05 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun 7 17:01:00 2004
Subject: Open standards processes
Message-ID: <98May1.112853edt.26884@thicket.arbortext.com>
At 10:01 1998 05 01 -0400, Paul Prescod wrote:
>Here are some more thoughts about open standards processes:
>
>The XML SIG was pretty wide open, which was admirable, but XSL the DOM,
>etc. seem to have no equivalent.
I don't quite understand this statement.
The open, public www-dom@w3.org mailing list has been in existence "forever,"
and the DOM work has been receiving some very useful feedback from people on
that list.
The open, public xsl-list@mulberrytech.com has been in existence for only
two weeks less than the xsl-wg itself, and it has been quite active since
its inception.
The "problem" with the XML SIG was that everyone on that list had to be
an "invited expert" and they were all bound by W3C-member confidentiality
regulations. This was a logistic nightmare, and it actually meant that
some important discussion was pre-empted from truly public lists like xml-dev.
The XSL WG decided instead not to have this "intermediate" (and confusing
and problematic) level mailing group, but instead to have just the public
xsl-list (kindly hosted by Mulberry Technologies). We felt that was the
more open, democratic thing to do.
As far as XSL, if you think you're not seeing something that W3C members
are seeing, you're wrong. We do not have anything yet to show (other than
the requirements document that we are in the process of releasing to the
public as we speak). We plan to have a first draft of the XSL 1.0 spec
released to the public in July--the same time it is released to W3C members,
as I understand it [mega-apologies if I'm wrong about this, but this is my
current understanding].
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From bckman at ix.netcom.com Fri May 1 17:47:55 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun 7 17:01:01 2004
Subject: Open standards processes
Message-ID: <01bd7532$54288c40$efaddccf@uspppBckman>
>>But I don't think it serves any purpose to be secretive. I
have certainly always believed that the more people knew
what was going on, the greater the chance of success.
Publishing work in progress will enable the user and vendor
community to respond more rapidly when the thing is finally
published, and will harness the resources of a wider group
of people to spot the errors. I find it a little
disappointing, now that there is no cost argument to prevent
open dissemination,<<
I think thats very well said. Even if there were a deluge of feed back,
the compilers of the standard would be free to ignore it.
I can think of no legitimate reason for secrecy.
Even if the member-developers did want an edge on non-member developers,
(assuming that that's a legimitate reason, which is debateable) if they need
that kind of edge they have real problems and are not going to last very
long!!
Frank
-----Original Message-----
From: Michael Kay
To: xml-dev
Date: Friday, May 01, 1998 8:21 AM
Subject: Re: Open standards processes
>>Here are some more thoughts about open standards processes:
>>
>To add my tuppenceworth, I've been involved in the past in
>both de jure and consortium standards-making processes,
>though all before the days of the web.
>
>To get a successful standard you need a core team who work
>hard, who are technically highly competent, and who
>understand the needs of the users as well as (a more common
>reason for failure) the needs of potential vendors. You need
>a consensus on the general principles and objectives, an
>aversion to introducing unproven innovations, and an absence
>of people with an interest in obstructing the process. You
>don't need consultation or democracy or legal authority;
>these can sometimes help to achieve the necessary consensus
>but can also slow things down or send things off in the
>wrong direction.
>
>But I don't think it serves any purpose to be secretive. I
>have certainly always believed that the more people knew
>what was going on, the greater the chance of success.
>Publishing work in progress will enable the user and vendor
>community to respond more rapidly when the thing is finally
>published, and will harness the resources of a wider group
>of people to spot the errors. I find it a little
>disappointing, now that there is no cost argument to prevent
>open dissemination, that W3C should (apparently) have a
>policy of secrecy which goes beyond anything I ever
>encountered in ISO or ANSI or X/Open or OMG committees.
>Perhaps the problem is that they would be deluged by
>feedback, but I doubt it.
>
>Michael Kay
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jcupp at essc.psu.edu Fri May 1 17:54:36 1998
From: jcupp at essc.psu.edu (Jason R. Cupp)
Date: Mon Jun 7 17:01:01 2004
Subject: Namespaces & Behavior
Message-ID: <3549EFB3.6A24E89E@essc.psu.edu>
Hello,
If I were writing an XML parser (in PERL) and I wanted to consider
adding functionallity to recognize tags in particular namespaces, then
would it make sense that the parser could import locally, dynamically
link, include, request from the namespace URI (broker
fashion?)--whatever you want to call it--behaviors for that tag and its
content such as simple tag replacement, special formatting, re-ordering
of elements, basically full XSL I think?
Jason R. Cupp (jcupp@essc.psu.edu)
Deasy GeoGraphics
The Pennsylvania State University
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Patrice.Bonhomme at loria.fr Fri May 1 19:32:57 1998
From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme)
Date: Mon Jun 7 17:01:01 2004
Subject: XML 1.0 :
Message-ID: <199805011731.TAA14818@chimay.loria.fr>
I am very surprise to see that the XML spec 1.0 cannot authorize the following
element declaration (because of the +):
I am wrong ?
Pat.
--
==============================================================
bonhomme@loria.fr | Office : B.228
http://www.loria.fr/~bonhomme | Phone : 03 83 59 30 52
--------------------------------------------------------------
* Serveur Silfide : http://www.loria.fr/projets/Silfide
* Projet Aquarelle : http://aqua.inria.fr
==============================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 1 19:47:19 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:01 2004
Subject: XML 1.0 :
In-Reply-To: <199805011731.TAA14818@chimay.loria.fr>
References: <199805011731.TAA14818@chimay.loria.fr>
Message-ID: <199805011746.NAA05101@unready.microstar.com>
Patrice Bonhomme writes:
> I am very surprise to see that the XML spec 1.0 cannot authorize
> the following element declaration (because of the +):
>
>
>
> I am wrong ?
Entirely correct; however, there is no XML content matched by
(#PCDATA | s+)*
that is not also matched by
(#PCDATA | s)*
More generally,
(a+)* = (a*)+ = (a*)* = (a?)+ = (a?)* = (a)*
XML forces you to reduce mixed content models to the simplest
representation.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Patrice.Bonhomme at loria.fr Fri May 1 19:53:19 1998
From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme)
Date: Mon Jun 7 17:01:01 2004
Subject: ELEMENT P (#PCDATA | S+)*
In-Reply-To: Your message of "Fri, 01 May 1998 12:52:59 CDT."
<199805011752.MAA07891@ACADCOMP.SIL.ORG>
Message-ID: <199805011752.TAA15485@chimay.loria.fr>
robin@ACADCOMP.SIL.ORG said:
] Hmmm... (#PCDATA | (s)+ )* should be OK.
Yes it should be, but :
/* Mixed-content Declaration */
[51] Mixed ::= '(' S? '#PCDATA' (S? '|' S? Name)* S? ')*' | '(' S? '#PCDATA'
S? ')'
Pat.
--
==============================================================
bonhomme@loria.fr | Office : B.228
http://www.loria.fr/~bonhomme | Phone : 03 83 59 30 52
--------------------------------------------------------------
* Serveur Silfide : http://www.loria.fr/projets/Silfide
* Projet Aquarelle : http://aqua.inria.fr
==============================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Fri May 1 19:57:09 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:01 2004
Subject: XML 1.0 :
Message-ID: <3.0.32.19980501105343.00b662e0@pop.intergate.bc.ca>
At 07:31 PM 5/1/98 +0200, Patrice Bonhomme wrote:
>I am very surprise to see that the XML spec 1.0 cannot authorize the following
>element declaration (because of the +):
>
>
That's right. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 1 20:31:11 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:01 2004
Subject: Open standards processes
References: <98May1.112853edt.26884@thicket.arbortext.com>
Message-ID: <354A151D.B718CB2C@technologist.com>
Paul Grosso wrote:
>
> At 10:01 1998 05 01 -0400, Paul Prescod wrote:
> >Here are some more thoughts about open standards processes:
> >
> >The XML SIG was pretty wide open, which was admirable, but XSL the DOM,
> >etc. seem to have no equivalent.
>
> I don't quite understand this statement.
>
> The open, public www-dom@w3.org mailing list has been in existence "forever,"
> and the DOM work has been receiving some very useful feedback from people on
> that list.
www-dom@w3.org and xsl-list (which is not even a W3C mailing list) are NOT
comparable to w3c-xml-sig@w3.org. The latter has published minutes,
focused and moderated discussions that were carefully watched by
essentially ALL members of the working group. The former is essentially a
"feedback list" with little influence and hardly enough information to be
influential even if it had more. I know, because I have the same role in
both.
> The "problem" with the XML SIG was that everyone on that list had to be
> an "invited expert" and they were all bound by W3C-member confidentiality
> regulations. This was a logistic nightmare, and it actually meant that
> some important discussion was pre-empted from truly public lists like xml-dev.
So the XML effort had three levels: "Voting members", "Influential
non-voting experts" and "XML-DEV" and the DOM effort has essentially two
levels "voting members" and "non-influential, non-voting experts" and you
see that as more open? How so? Before there were some discussions I could
only have "inside" (based on inside knowledge) and some I could only have
"outside" (based on open knowledge) and now I can only have the "outside"
discussions because nobody except the 20 people on the WG have inside
knowledge. Wiping out a level of quasi-openness does not make the process
more open, it makes it less.
It's like abolishing the congress so that they don't "get in the way" of
the president speaking directly to the people. It's good to give people
more power (in this case information and voice). It doesn't have to be at
the expense of someone else's power, however.
> As far as XSL, if you think you're not seeing something that W3C members
> are seeing, you're wrong. We do not have anything yet to show (other than
> the requirements document that we are in the process of releasing to the
> public as we speak).
That the W3C members are as in the dark about what you are doing as we are
is not comforting. I think that anyone should have access to the minutes
of meetings as soon as they are released, as well as mailing list
archives. In other words, the "audit trail" from the first idea to the
last page-break in the rendered version of the spec. should be as
transparent as possible.
No, we do not achieve that in the XML WG. Yes, we come closer. This is why
so many people in this mailing list who participated in that effort have
warm, fuzzy feelings about the W3C right now. My point is that this warm
and fuzzy cooperative effort was an aberration. As someone wrote in the
Atlantic Monthly not too long ago: "Democracy's moment (such as it was)
has passed."
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From digitome at iol.ie Fri May 1 21:29:43 1998
From: digitome at iol.ie (Sean Mc grath)
Date: Mon Jun 7 17:01:01 2004
Subject: Open standards processes
Message-ID: <1.5.4.32.19980501192208.00922c0c@gpo.iol.ie>
On Wed 31 Dec 1997 I asked some basic questions about XSL on
the DSSSL List. This was before the formation of the XSL list and
after Jon Bosak saying that the DSSSL list was a suitable forum
to discuss XSL. I sort of assumed that the XML-SIG style banter
between the core experts and dabblers would be the order of the
day.
Paul Grosso replied to my enquiries with this comment
>As a submission, it [XSL] is only a partial starting point,
>and detailed discussion of it is of very questionable
>utility (and I suspect most of the authors of the submission
>will not find it well worth their time to reply).
Some weeks later Arbortext announced an XSL product.
My point? Any suggestion that the XSL etc. lists are
run along the same "open" lines as XML-SIG is/was seems
rather dubious to me.
Sean Mc Grath
http://www.digitome.com
"there are two types of people in the world. Those who
think the world consists of two types of people, and
those who don't"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From bckman at ix.netcom.com Sat May 2 00:26:38 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun 7 17:01:01 2004
Subject: Imbed XML in HTML
Message-ID: <01bd7569$ca762c40$e8acdccf@uspppBckman>
Some of you might be interested in a demonstration of embedding XML in an
HTML file.
Its the third tip at.
http://www.hypermedic.com/style/tips/tipindex.htm
When you click on the link a msgbox with the generated HTML code appears
followed by the Display of the XML document.
Look at the source code to see how its done.
This will only work on IE4 because the script uses the IE4 document object
model.
Frank Boumphrey
XML and style sheet info at Http://www.hypermedic.com/style/index.htm
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 2 01:07:01 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:02 2004
Subject: Open standards processes
In-Reply-To: <354A151D.B718CB2C@technologist.com>
References: <98May1.112853edt.26884@thicket.arbortext.com>
Message-ID: <3.0.1.16.19980501230607.77873dc8@pop3.demon.co.uk>
At 14:31 01/05/98 -0400, Paul Prescod wrote:
[...]
>
>www-dom@w3.org and xsl-list (which is not even a W3C mailing list) are NOT
>comparable to w3c-xml-sig@w3.org. The latter has published minutes,
>focused and moderated discussions that were carefully watched by
>essentially ALL members of the working group. The former is essentially a
>"feedback list" with little influence and hardly enough information to be
>influential even if it had more. I know, because I have the same role in
>both.
>
>> The "problem" with the XML SIG was that everyone on that list had to be
>> an "invited expert" and they were all bound by W3C-member confidentiality
>> regulations. This was a logistic nightmare, and it actually meant that
>> some important discussion was pre-empted from truly public lists like
xml-dev.
>
>So the XML effort had three levels: "Voting members", "Influential
>non-voting experts" and "XML-DEV" and the DOM effort has essentially two
>levels "voting members" and "non-influential, non-voting experts" and you
>see that as more open? How so? Before there were some discussions I could
Just to clear up any confusion - XML-DEV is not a W3C mailing list and has
no formal part in the XML process - it's offered by the goodwill of Henry
and myself in case people find it useful. Also -as far as XML-DEV is
concerned - there is no reason why it can't play the same role for DOM as
XML if people want (I at least am going to start hacking the DOM at some
stage and will no doubt be asking the same sort of hairy questions as I ask
here).
I am reading the current discussion with interest.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From dan at intervista.com Sat May 2 01:45:49 1998
From: dan at intervista.com (Dan Ancona)
Date: Mon Jun 7 17:01:02 2004
Subject: Separation of formatting...
Message-ID: <3.0.32.19980501165640.00afa1e0@mail.intervista.com>
At 08:14 PM 4/30/98 -0400, Gregg Reynolds wrote:
> [ ... ]
> about rhetorical techniques of the public
> orator - phrasing, rhythm, delivery. In
> other words, processing instructions.
> [ ... ]
IOW, the writing is to some extent output device independent, and oration
is just one such device? That is SO cool.
Anyway, the gist I'm getting from this thread is that a rigorous separation
of formatting from any purely generic or abstract markup in a given XML
file format is not strictly necessary. In fact, it could even be
beneficial in some circumstances and for some formats. Correct?
If I'm right, this is a good thing, since it always seemed to me that
(despite how horribly broken in so many other ways) the unholy combination
of a partial procedural markup and the complete lack of generic markup
built into HTML was part of why it caught on. At least you could easily
include the elements that you wanted, even if you couldn't actually lay
them out at first (and could only do so by using kloogey stuff like tables
and later).
Dan
__________________________________________________________
Dan Ancona "engage!"
Tech Support, Evangelism and Cookery Intervista Software
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lauren at sqwest.bc.ca Sat May 2 01:54:12 1998
From: lauren at sqwest.bc.ca (Lauren Wood)
Date: Mon Jun 7 17:01:02 2004
Subject: Open standards processes
In-Reply-To: <3.0.1.16.19980501230607.77873dc8@pop3.demon.co.uk>
References: <354A151D.B718CB2C@technologist.com>
<98May1.112853edt.26884@thicket.arbortext.com>
Message-ID: <199805012354.QAA17523@sqwest.bc.ca>
At 14:31 01/05/98 -0400, Paul Prescod wrote:
>So the XML effort had three levels: "Voting members", "Influential
>non-voting experts" and "XML-DEV" and the DOM effort has essentially two
>levels "voting members" and "non-influential, non-voting experts" and you
>see that as more open? How so?
I don't think equating lack of voting rights and lack of influence is fair
in terms
of the DOM work.
Just as in the XML SIG, where what was said was read and discussed by members
of the XML WG, so are all postings to www-dom read and discussed by members
of the DOM WG. Not all postings get direct replies, but the influence on the
specs has been large. The people who take the time to read the specs and
post to the www-dom list are not being ignored.
Those who are interested in the current DOM discussions will find the
archives
for www-dom at http://lists.w3.org/Archives/Public/www-dom/
Lauren
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sat May 2 02:47:35 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:02 2004
Subject: Announcement: SAX 1.0gamma
Message-ID: <199805020046.UAA02818@unready.microstar.com>
Implementors have requested more time to test SAX 1.0 before it
becomes final.
There will be no additional changes other than bug fixes; however,
since bugs have required several minor changes already, and since the
final, official, righteous SAX 1.0 version might still be a week or
more away, in the interest of openness I have uploaded SAX 1.0gamma (I
don't like "beta2") to the following (new) URL:
http://www.megginson.com/SAX/
(The final SAX 1.0 will be mirrored at
http://www.microstar.com/XML/SAX/ as well).
The latest JavaDoc is also available online. SAX is changing very
little now, and it is probably safe to start basing implementations on
it, as long as you're ready for a little tweaking. Here are the (very
minor) changes since 1.0beta:
Interface Changes
-----------------
- bug fix: changed parameter to Parser.setLocale() to java.util.Locale
(a single string is insufficient)
- bug fix: added default constructor to InputSource, to simplify
subclassing
- bug fix: removed "extends IOException" from InputSource (it was
retained accidentally from an earlier revision)
- bug fix: in the ParserFactory helper class, used
"org.xml.sax.parser" instead of "sax.parser" for the system property
name
Documentation Changes
---------------------
- doc fix: clarify that attributes in AttributeList are zero-indexed
- doc fix: clarify that Parser objects are non-re-entrant but reusable
- doc fix: clarify that the parser should never modify the InputSource
provided by the application
- doc fix: clarify that the parser should not deliver the endDocument
event until EOF or a non-recoverable error
- moved sample drivers to com.megginson.sax package
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From greyno at mcs.com Sat May 2 04:22:43 1998
From: greyno at mcs.com (Gregg Reynolds)
Date: Mon Jun 7 17:01:02 2004
Subject: Separation of formatting...
References: <3.0.32.19980501165640.00afa1e0@mail.intervista.com>
Message-ID: <354A75D1.4348@mcs.com>
Dan Ancona wrote:
>
> Anyway, the gist I'm getting from this thread is that a rigorous separation
> of formatting from any purely generic or abstract markup in a given XML
> file format is not strictly necessary. In fact, it could even be
> beneficial in some circumstances and for some formats. Correct?
>
Personally I've come around to a pragmatist position on this, after some
time as a radical Free The Text purist. It's just not possible to
encode information without also encoding something about what we're
supposed to do with it, any more than it's possible to draw a "real"
line segment. But we can certainly do some very useful things by
trying.
An interesting paper on a similar topic is at
http://www.sil.org/sgml/ohco1.html, "Refining Our Notion of What Text
Really Is: The Problem of Overlapping Hierarchies."
--
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Sat May 2 05:19:13 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:02 2004
Subject: Separation of formatting...
Message-ID: <004501bd7579$505bfba0$890b4ccb@NT.JELLIFFE.COM.AU>
From: Gregg Reynolds
>Dan Ancona wrote:
>>
>> Anyway, the gist I'm getting from this thread is that a rigorous
separation
>> of formatting from any purely generic or abstract markup in a given XML
>> file format is not strictly necessary. In fact, it could even be
>> beneficial in some circumstances and for some formats. Correct?
>
>Personally I've come around to a pragmatist position on this, after some
>time as a radical Free The Text purist
If we subdivide the logical/presentation split we can come up with
the following 6 subdivisions:
topical structures (e.g. )
editorial structures (e.g.
)
characters
glyphs
page objects (e.g. a line)
layout structures (e.g. a column)
If there is a straightforward flow of dependence and synchronization
between these different structures, then is possible to start at the
topic structure and format all the way down to the layout (i.e. starting
from a higher and piggybacking the settings of the lower based on the
higher ones.) This is the underlying model of, e.g. CSS & DSSSL
to a great extent: in DSSSL there is no feedback from the layout
program to the stylesheet language.
However, there are many kinds of documents where the flow of dependence
is not one-way or straightforward. For example, in a magazine the page
layout
determines how many characters a heading sould have, and in some designs
it might determine what kinds of editorial structures are possible
I came across an extreme example in a loose leaf system: the writers of
a prose section knew they had a fixed space to work in, but this could not
be determined at authoring time. So they wrote knowing that the last
paagraph might be deleted by the pagination system if it could not
be fitted. This is a case where the layout structure determined the
page-objects which determined which editorial structures made it into
the final publication.
Developers should be aware that if their publication requires more than
just a straight-forward flow of dependence between the various structures
(which are concurrently present in their publication) they may have to
spend a lot more development effort. A simple flow of dependence with
no reverse feeding back is easier to implement.
Rick Jelliffe
"The XML & SGML Cookbook" Prentice Hall, out in May 1998
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sat May 2 06:39:14 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:02 2004
Subject: Open standards processes
References: <354A151D.B718CB2C@technologist.com>
<98May1.112853edt.26884@thicket.arbortext.com> <199805012354.QAA17523@sqwest.bc.ca>
Message-ID: <354AA396.E5F1608C@technologist.com>
Lauren Wood wrote:
>
> At 14:31 01/05/98 -0400, Paul Prescod wrote:
> >So the XML effort had three levels: "Voting members", "Influential
> >non-voting experts" and "XML-DEV" and the DOM effort has essentially two
> >levels "voting members" and "non-influential, non-voting experts" and you
> >see that as more open? How so?
>
> I don't think equating lack of voting rights and lack of influence is fair
> in terms
> of the DOM work.
I didn't mean to equate lack of voting rights and lack of influence. In
the XML effort there were people (like me) that had no voting rights but
could still influence things sometimes. It isn't clear to me the extent
that the DOM is the same, because I haven't tried to influence the DOM and
haven't been watching the conversations clearly.
I will say that in several months of watching the list I haven't seen any
heated debates similar to those we had in the XML group. My *guess* is
that these heated (and important) debates are taking place elsewhere. That
impression is further supported by the fact that I often see
questions/answers on the DOM list of the form:
"Feature X seems strange to me."
"Yes, we thought about that quite a bit and argued about it. Do you think
we did the right thing?"
which strikes me as nearly the opposite of the XML-SIG version:
"We've got this hard problem to solve. We've got this group of smart
people assembled. WHAT DO YOU THINK? We'll make a decision after hearing
all of your opinions."
Casting an opinion on an issue that was debated two months ago is *not*
the same as participating in the debate.
Paul Grosso said that the XML-SIG way was a procedural nightmare. In my
experience, all forms of democracy, even limited ones, are procedural
nightmares. But as Len Bullard once told me: "Chaos is the engine." I
can't prove that the XML spec. is better because I could participate in
ways that I cannot with the DOM and XSL (so far). But I know that it is
the fact that some of my suggestions were taken seriously and even adopted
in times of heated discussions that would never have come about at all
under the DOM/XSL processes. Even given read-only access to the main
conversational mailing lists (and meeting minutes), outsiders could
influence the process through private email in support of or rebuttal to
positions.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cmatei at agora.ro Sat May 2 14:38:23 1998
From: cmatei at agora.ro (Crystian)
Date: Mon Jun 7 17:01:02 2004
Subject: Loops in DTD?
Message-ID: <01BD75E0.75CF83E0@jackson.agora.ro>
Hi,
I make an vadidating XML processor. Until now, the processor can make a complex tree from data from Document Type Definition and validat the declaration from it. I am not sure if this tree can contain loops.
I mean something like this:
...
...
]>
Thank you,
Cristian Matei
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 2 15:34:50 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:02 2004
Subject: Announcement: SAX 1.0gamma
In-Reply-To: <199805020046.UAA02818@unready.microstar.com>
Message-ID: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk>
Many thanks David,
At 20:46 01/05/98 -0400, David Megginson wrote:
>Implementors have requested more time to test SAX 1.0 before it
>becomes final.
FWIW this suits my timetable as well :-)
My plan is to assemble a manageable number (2-5) of Java-based parsers
under JUMBO2 on the CDROM. The present parsers I have been looking at (with
comments and queries) follow. I have listed these in public rather than
approach individual authors - I hope that's OK. The selection is pragmatic
- I can only deal with a smallish number at this stage (the CLASSPATHs,
version differences, etc. can make my operation unstable at this stage). I
also plan to use JUMBO to extract DTD information from at least one parser
so that people can browse DTD-provided information such as attribute types
and content specs. This is obviously a non-standard process at present and
will either be through subclassing the parsers to implement a further
(JUMBO-hacked) interface or simply if ... else if's.
I'd be very grateful for the authors to confirm that I have got things
right. If I have made any errors, please forgive me. The latest revision
info is taken from RobinC's http://www.sil.org/sgml/xml.html#xmlSoftware.
If there are later versions, I'd be very grateful to know what the official
position is.
Some parsers have their latest update pre-XML-V1.0. Unless I hear to the
contrary I have to assume that these are not currently XML1.0-compliant
(unless the author had a (paranormal) inside track to the spec).
AElfred:
Version: V1.1
Date in distribution 980309
XML1.0-compliant Yes
SAX1.0pre-beta compliant Yes
SAX1.0beta compliant after SAX completion + a little bit ?
DTD-related information Yes?
Validating No
DXP:
Version: V1.0beta1c
Date in distribution 19980410
XML1.0-compliant Yes
SAX1.0pre-beta compliant Yes
SAX1.0beta compliant Presumably (timescale?)
DTD-related information Yes?
Validating Yes
Lark:
Version: V1.0beta (final)
Date in distribution 19980105
XML1.0-compliant No?
SAX1.0pre-beta compliant No
SAX1.0beta compliant Megginsonian SAX wrapper(?)
DTD-related information Yes?
Validating No (but Larval, codistributed, is)
XP:
Version: V0.2 (from Version.java)
Date in distribution 19980307
XML1.0-compliant Yes
SAX1.0pre-beta compliant Yes
SAX1.0beta compliant RSN
DTD-related information Yes?
Validating No
These are the parsers I have worked with. They are all 100% Java and I plan
to run them under JUMBO as both applications and applets. JUMBO will
distribute a JRE for W95 and Solaris and for applets will assume a
JDK1.1-compliant browser such as Netscape 4.05 (I haven't actually tested
this - that's this afternoon's fun).
***AUTHORS***. Please correct any info here - I may well have missed later
versions. I hope that the authors' timescales and JUMBO converge and that
all 4 parsers will be available. I shall certainly wait for AElfred :-).
The other *Java* parsers on Robin's list are:
MSXML:
Version: ?
Date in distribution 19971204 (RobinC)
XML1.0-compliant No??
SAX1.0pre-beta compliant No
SAX1.0beta compliant Megginsonian SAX wrapper(?)
DTD-related information ?
Validating No?
Pure Java ??
[PMR: I tried to install this some time ago and couldn't, but didn't give
it lots of time. Clearly lots of people *can* run it. If it's trivial to
run under (a) java (b) in an applet and there is a SAX wrapper, I might
find the time to try it out.]
IBM:
Version: ?
Date in distribution 19980416 (RobinC)
XML1.0-compliant Yes?
SAX1.0pre-beta compliant ?
SAX1.0beta compliant ?
DTD-related information ?
Validating Yes
Pure Java ?
[PMR: I have no experience of this and all the information above comes from
RobinC's page. If there are plans to make it SAX-compliant and it's easy to
install I'd be interested in having a look.]
XML-by-hand (Bert Bos). Non-validating. I haven't looked further than
Robin's page.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 2 15:34:53 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:02 2004
Subject: Loops in DTD?
In-Reply-To: <01BD75E0.75CF83E0@jackson.agora.ro>
Message-ID: <3.0.1.16.19980502125757.7787e944@pop3.demon.co.uk>
At 15:39 02/05/98 +-300, Crystian wrote:
>Hi,
>
>I make an vadidating XML processor. Until now, the processor can make a
complex tree from data from Document Type Definition and validat the
declaration from it. I am not sure if this tree can contain loops.
>
>I mean something like this:
>
> ...
>
> ...
>
> ...
>]>
This is legal XML. Anyone who is in doubt about whether their XML is legal
should download one of the free parsers and see if it throws errors. For
some sorts of people (e.g. me) it's easy to rely on the computer first of
all rather than reading the spec. Apart from a very few minor bugs (and
pre-XML V1.0 implementations) the parsers should be the touchstone of your
XML. I shall be releasing several parsers on the JUMBO-CDROM - see separate
posting today.
An XML document instance never contains loops. If the DTD is expanded so
that element content is recursively displayed may contain loops. This can
make it quite difficult to display. For example Earl Hood's very nice
dtd2html tool showed the content of the first occurrence of an element and
added an ellipsis for others. An alternative way is not to expand the
children recursively but to add hyperlinks. This works fine on screen but
not on cellulose.
HTH
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jtauber at jtauber.com Sat May 2 15:37:53 1998
From: jtauber at jtauber.com (James K. Tauber)
Date: Mon Jun 7 17:01:02 2004
Subject: RFD: comp.text.xml
Message-ID: <000001bd75ce$fdf449c0$d86118cb@caleb>
REQUEST FOR DISCUSSION (RFD)
unmoderated group comp.text.xml
This is a formal Request For Discussion (RFD) for the creation of a
comp.text.xml newsgroup. This is not a Call for Votes (CFV);
you cannot vote at this time. Procedural details are below.
Newsgroups line:
comp.text.xml The Extensible Markup Language (XML)
RATIONALE: comp.text.xml
Abstract
A newsgroup for the discussion of the Extensible Markup
Language (XML); including, but not limited to the specifications
and syntax, document creation and editing, interchange, software,
processing and database integration. This applies not only to XML
itself but also the Extensible Linking Language (XLL), the Extensible
Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML
documents, and to document types and applications of XML.
Justification
XML is a new language, and it appears that it will be extremely
popular.
Over the past few months, traffic on the XML-DEV and XML-L mailing lists has
grown rapidly. With the release of the W3C Recommendation for XML 1.0,
developer interest is growing rapidly and an increasing amount of software
is
being released. A newsgroup would make it much easier for a wider audience
to participate in and benefit from these discussions.
While it may appear at first that comp.text.sgml is appropriate, there will
be those interested in XML who do not care about the arcana of SGML, and
there will be those interested in SGML who do not care to see many basic
questions about XML.
CHARTER: comp.text.xml
comp.text.xml shall be an unmoderated newsgroup for
the general discussion of the Extensible Markup Language (XML) and its
application.
This includes, but is not limited to the specifications
and syntax, document creation and editing, interchange, software,
processing and database integration. This applies not only to XML
itself but also the Extensible Linking Language (XLL), the Extensible
Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML
documents, and to document types and applications of XML.
Policy on Advertising
We encourage discussion of the merits and shortcomings of
commercial products. A certain amount of advertising (both objective
and advocative) is to be expected and this is to be encouraged as
long as the products and services relate directly to XML and the
announcements are brief. Company representatives are expected to
participate in discussions of their product that they themselves did
not initiate.
Please refer to the widely available Internet literature on the
subject of common net abuses (indiscriminate newsgroup spamming,
multiple advertisement posting, and chain letters being among the
most frequent).
END CHARTER.
PROCEDURE:
This is a request for discussion, not a call for votes. In this phase
of the process, any potential problems with the proposed newsgroups
should be raised and resolved. The discussion period will continue for
a minimum of 21 days (starting from when the first RFD for this
proposal is posted to news.announce.newgroups), after which a Call For
Votes (CFV) may be posted by a neutral vote taker if the discussion
warrants it. Please do not attempt to vote until this happens.
All discussion of this proposal should be posted to news.groups.
This RFD attempts to comply fully with the Usenet newsgroup creation
guidelines outlined in "How to Create a New Usenet Newsgroup" and
"How to Format and Submit a New Group Proposal". Please refer to
these documents (available in news.announce.newgroups) if you have
any questions about the process.
END PROCEDURE.
DISTRIBUTION:
This RFD shall be posted to the w3c-xml-sig, XML-DEV and XML-L mailing lists
and to the following newsgroups:
comp.text
comp.text.sgml
alt.etext
alt.hypertext
comp.infosystems.www.announce
comp.infosystems.www.authoring.html
comp.infosystems.www.authoring.misc
comp.infosystems.www.authoring.stylesheets
Please see the newsgroup news.groups for discussions about this
RFD, as mentioned above. Readers of the mailing lists and additional
newsgroups listed in this section shall be referred to
news.announce.newgroups for the eventual Call For Votes (CFV).
END DISTRIBUTION.
Proponent: James Tauber (jtauber@jtauber.com)
With assistance from Chris Maden, Simon St Laurent, BK DeLong and Susan
Lesch
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sat May 2 17:26:19 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:02 2004
Subject: Java-based XML Parsers
In-Reply-To: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk>
References: <199805020046.UAA02818@unready.microstar.com>
<3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk>
Message-ID: <199805021525.LAA00247@unready.microstar.com>
Peter Murray-Rust writes:
> The other *Java* parsers on Robin's list are:
>
> MSXML:
> Version: ?
> Date in distribution 19971204 (RobinC)
> XML1.0-compliant No??
> SAX1.0pre-beta compliant No
> SAX1.0beta compliant Megginsonian SAX wrapper(?)
> DTD-related information ?
> Validating No?
> Pure Java ??
> [PMR: I tried to install this some time ago and couldn't, but didn't give
> it lots of time. Clearly lots of people *can* run it. If it's trivial to
> run under (a) java (b) in an applet and there is a SAX wrapper, I might
> find the time to try it out.]
I have managed to use 1.8 (I think) out of the box on my notebook, and
have taken the SAX driver as far as I can. I don't know if it's "pure
Java", but it certainly didn't break under Linux.
There are several serious and well-known bugs in the current version
of MSXML that make it unreliable for production use (such as bogus
validation errors, the failure to report defaulted attribute values
and badly broken entity management); however, since future versions of
MSXML will probably ship with MSIE, MSXML has a very strong
distribution channel and deserves attention. It might be be
worthwhile to plan to support it when Microsoft has a chance to go
back and finish development.
> IBM:
> Version: ?
> Date in distribution 19980416 (RobinC)
> XML1.0-compliant Yes?
> SAX1.0pre-beta compliant ?
> SAX1.0beta compliant ?
> DTD-related information ?
> Validating Yes
> Pure Java ?
>
> [PMR: I have no experience of this and all the information above comes from
> RobinC's page. If there are plans to make it SAX-compliant and it's easy to
> install I'd be interested in having a look.]
IBM's XML for Java shipped with a SAX driver for the January draft
(thanks, guys), and they included SAX 1.0 compliance on the time line
during their presentation at the XML Developers' Day in Seattle. You
should probably watch this one closely.
XML for Java does provide DTD information and even guided authoring
(though I haven't tested those features), and again, it runs under
Linux JDK 1.1.5.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Sat May 2 17:34:11 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:02 2004
Subject: RFD: comp.text.xml
References: <000001bd75ce$fdf449c0$d86118cb@caleb>
Message-ID: <354B3C86.6C4@hiwaay.net>
James K. Tauber wrote:
>
> REQUEST FOR DISCUSSION (RFD)
> unmoderated group comp.text.xml
>
> This is a formal Request For Discussion (RFD) for the creation of a
> comp.text.xml newsgroup. This is not a Call for Votes (CFV);
> you cannot vote at this time. Procedural details are below.
>
> Newsgroups line:
> comp.text.xml The Extensible Markup Language (XML)
>
> RATIONALE: comp.text.xml
>
> Abstract
>
> A newsgroup for the discussion of the Extensible Markup
> Language (XML); including, but not limited to the specifications
> and syntax, document creation and editing, interchange, software,
> processing and database integration. This applies not only to XML
> itself but also the Extensible Linking Language (XLL), the Extensible
> Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML
> documents, and to document types and applications of XML.
>
> Justification
>
> XML is a new language, and it appears that it will be extremely
> popular.
XML is a subset of SGML as specified by ISO. It is a
consortium-owned subset without normative reference to
the parent standard.
> Over the past few months, traffic on the XML-DEV and XML-L mailing lists has
> grown rapidly. With the release of the W3C Recommendation for XML 1.0,
> developer interest is growing rapidly and an increasing amount of software
> is
> being released. A newsgroup would make it much easier for a wider audience
> to participate in and benefit from these discussions.
This is true. XML-DEV supports the public discussions. Invitation-only
special interest group maillists support the XML core. The XML
applications (eg, XSL, XLL) are supported as well.
> While it may appear at first that comp.text.sgml is appropriate, there will
> be those interested in XML who do not care about the arcana of SGML, and
> there will be those interested in SGML who do not care to see many basic
> questions about XML.
This is speculative not substantive. The membership of the XML and XML
application communities overlaps the SGML community. The extent of
overlap should be examined, but it is probably greater than a majority.
> CHARTER: comp.text.xml
>
> comp.text.xml shall be an unmoderated newsgroup for
> the general discussion of the Extensible Markup Language (XML) and its
> application.
At this time, the comp-text-sgml newsgroup can support the discussion
of SGML subsets such as the W3C consortium subset, XML and its
applications.
There is no need for a separate group which will weaken the tenuous
and often politically difficult to maintain spirit of cooperation
between
international standards groups and consortium product working groups.
This
is not in the best interests of that alliance.
Len Bullard
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 2 18:32:05 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:03 2004
Subject: Java-based XML Parsers
In-Reply-To: <199805021525.LAA00247@unready.microstar.com>
References: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk>
<199805020046.UAA02818@unready.microstar.com>
<3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980502162740.5977dfea@pop3.demon.co.uk>
Thanks very much David,
At 11:25 02/05/98 -0400, David Megginson wrote:
>Peter Murray-Rust writes:
[...MSXML...]
>
>I have managed to use 1.8 (I think) out of the box on my notebook, and
>have taken the SAX driver as far as I can. I don't know if it's "pure
>Java", but it certainly didn't break under Linux.
Thanks - glad to hear this - there was discussion at one stage about
MS-specific classes.
>
>There are several serious and well-known bugs in the current version
>of MSXML that make it unreliable for production use (such as bogus
>validation errors, the failure to report defaulted attribute values
>and badly broken entity management); however, since future versions of
>MSXML will probably ship with MSIE, MSXML has a very strong
>distribution channel and deserves attention. It might be be
>worthwhile to plan to support it when Microsoft has a chance to go
>back and finish development.
Thanks. If this is the case (and I trust David's accuracy) I would not feel
it appropriate to offer it as part of the currently planned JUMBO-CDROM.
(I shall try to make sure that all parsers give 'the same results' - at
least for my examples). However I would certainly wish to support MSXML in
the future (assuming the distribution policy allows me to.) Any further
clarification from MS would be welcome - I certainly don't want to
undervalue the role that it plays.
>
> > IBM:
>
>IBM's XML for Java shipped with a SAX driver for the January draft
>(thanks, guys), and they included SAX 1.0 compliance on the time line
>during their presentation at the XML Developers' Day in Seattle. You
>should probably watch this one closely.
I downloaded it just before I got your mail :-) and shall hack this evening.
[I haven't looked at the distribution conditions yet.]
>
>XML for Java does provide DTD information and even guided authoring
>(though I haven't tested those features), and again, it runs under
>Linux JDK 1.1.5.
Excellent. Any info from IBM about timescales for SAX1.0 would be most
welcome.
>
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 2 18:34:25 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:03 2004
Subject: RFD: comp.text.xml
In-Reply-To: <000001bd75ce$fdf449c0$d86118cb@caleb>
Message-ID: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk>
[reply to xml-dev only - I'm sure JamesT will read this].
At 21:34 02/05/98 +0800, James K. Tauber wrote:
> REQUEST FOR DISCUSSION (RFD)
> unmoderated group comp.text.xml
>
>This is a formal Request For Discussion (RFD) for the creation of a
>comp.text.xml newsgroup. This is not a Call for Votes (CFV);
>you cannot vote at this time. Procedural details are below.
I am deliberately neutral about this - this message is simply to state how
I see XML-DEV's position w.r.t. the propose comp.text.xml
[...]
>A newsgroup for the discussion of the Extensible Markup
>Language (XML); including, but not limited to the specifications
>and syntax, document creation and editing, interchange, software,
>processing and database integration. This applies not only to XML
>itself but also the Extensible Linking Language (XLL), the Extensible
>Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML
>documents, and to document types and applications of XML.
XML-DEV has had and continues to have robust and IMO valuable discussion in
these areas.
>
>Justification
>
>XML is a new language, and it appears that it will be extremely
>popular.
>
>Over the past few months, traffic on the XML-DEV and XML-L mailing lists has
>grown rapidly. With the release of the W3C Recommendation for XML 1.0,
>developer interest is growing rapidly and an increasing amount of software
>is
>being released. A newsgroup would make it much easier for a wider audience
>to participate in and benefit from these discussions.
I will leave Henry to answer whether there are any current problems of
scale. [I would hate to promise his services without checking :-)]
>
[...]
>
>
>Policy on Advertising
>
>We encourage discussion of the merits and shortcomings of
>commercial products. A certain amount of advertising (both objective
>and advocative) is to be expected and this is to be encouraged as
>long as the products and services relate directly to XML and the
>announcements are brief. Company representatives are expected to
>participate in discussions of their product that they themselves did
>not initiate.
There has never been the slightest hint of abuse of this on XML-DEV and for
this I thank the XML community. We aren't yet at the stage where there are
a large number of XML products in common use, so haven't got to the 'I
bought XML-FOO last week and when I try to run DUCKBOOK under it it ...'.
>
So far there have also been very few 'What is a DDT?' 'Where do I find the
< and > on my keyboard?'. Peter Flynn and Robin Cover (and many others)
must take great credit for providing answers to every conceivable question.
Since I have asked these sort of Qs on comp.text.sgml 3 years ago, I expect
that there needs to be a similar place for XML. XML-DEV isn't appropriate
for large volumes of such Qs. I'm neutral whether it should be on
comp.text.xml, comp.text.sgml or XML-L.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Sat May 2 19:20:26 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:03 2004
Subject: RFD: comp.text.xml
Message-ID:
Len made some comments about the RFD which I heartily disagree with.
>> XML is a new language, and it appears that it will be extremely
>> popular.
>
>XML is a subset of SGML as specified by ISO. It is a
>consortium-owned subset without normative reference to
>the parent standard.
XML is more than a subset of SGML. It has its own limitations, its own
practices, and its own audience. You can debate whether or not it is a 'new
language' just as the XML-DEV list has argued whether the XML spec contains
just syntax or syntax and semantics, but I think you're splitting hairs, at
best. New acronym, new approach, fine - new language.
Personally, I'd like to see XML make a clean parent-child break with SGML.
XML should acknowledge SGML as the parent, the teacher, the source of
inspiration, and move on to live its own life. I'd like to see XML have its
own place to grow. If that means that it acquires features from SGML (like
architectural forms, which David Megginson pointed out on XML-L), then fine -
but it shouldn't be forever limited to being a 'mere' subset.
>> While it may appear at first that comp.text.sgml is appropriate, there will
>> be those interested in XML who do not care about the arcana of SGML, and
>> there will be those interested in SGML who do not care to see many basic
>> questions about XML.
>
>This is speculative not substantive. The membership of the XML and XML
>application communities overlaps the SGML community. The extent of
>overlap should be examined, but it is probably greater than a majority.
The extent of overlap may be examined, but I doubt it will be a majority for
more than few months. XML is arousing interest in many computing sectors
(HTML developers, database developers, OOP programmers) who in the past had
nothing (or as little as possible) to do with SGML.
I can go through my email archive and find at least fifty people who have come
to XML but had no previous experience in SGML and show no interest in learning
it. There may be bias because my book deliberately targeted a non-SGML
audience, but that audience certainly exists and is growing rapidly.
>At this time, the comp-text-sgml newsgroup can support the discussion
>of SGML subsets such as the W3C consortium subset, XML and its
>applications.
I've been perusing comp.text.sgml for the last few months. While I'm
impressed with the quality of discussion (as I have been on XML-DEV and
XML-L), I don't think it's a great place to learn about XML. It's not always
easy to determine when SGML-only features are being discussed, and the
question overlaps can lead to some fairly significant confusion, especially
for beginners.
>There is no need for a separate group which will weaken the tenuous
>and often politically difficult to maintain spirit of cooperation
>between international standards groups and consortium product working groups.
>This is not in the best interests of that alliance.
There is a lot of need for a separate group which will address the needs of
large numbers of people moving into a new technology. 'Maintaining the spirit
of cooperation between international standards groups and consortium product
working groups' is _not_ the purpose of this newsgroups. Those groups, for
better or worse, need to learn to work with each other while the rest of get
on with learning the technologies that are appropriate to the tasks _we_ want
to accomplish.
I see comp.text.xml as a place for people, both new and experts, to
communicate about the new things opened up by XML, not as a place for a
relatively small club of experts to form and maintain alliances.
XML-L is good for this, but newsgroups are easy places for the whole world to
find and participate. I strongly hope comp.text.xml arrives soon.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From digitome at iol.ie Sat May 2 20:06:26 1998
From: digitome at iol.ie (Sean Mc grath)
Date: Mon Jun 7 17:01:03 2004
Subject: Separation of formatting...
Message-ID: <1.5.4.32.19980502175845.0090f500@gpo.iol.ie>
[Rick Jelliffe]
>
>Developers should be aware that if their publication requires more than
>just a straight-forward flow of dependence between the various structures
>(which are concurrently present in their publication) they may have to
>spend a lot more development effort. A simple flow of dependence with
>no reverse feeding back is easier to implement.
>
Even reverse feeding has its limits. I suspect that no amount
of algorithm driven formatting will ever produce the same result
as a skilled human. IOW, there are limits to how good a pure
SGML/XML driven formatting system can be. Thusly we need processing
instructions and format oriented markup.
I recently grappled with a large piece of legislation on paper
which proved easy enough to model but I gave up trying to re-produce
the "look" of the paper version. Individual sections exhibited
clear signs of aesthetic tweaking. Infuriating because the darn
thing looks so much nicer than mine yet I cannot determine why:-(
To see this aesthetic effect in its extremes I heartily recommend
"Le Ton beau de Marot" by Douglas Hofstadter. For example, according
to the introduction, the section layout of chapter 2 was
heavily influenced by the avoidence of descender characters in
the first word!
Sean Mc Grath
http://www.digitome.com
"there are two types of people in the world. Those who
think the world consists of two types of people, and
those who don't"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From h.rzepa at ic.ac.uk Sat May 2 20:38:22 1998
From: h.rzepa at ic.ac.uk (Rzepa, Henry)
Date: Mon Jun 7 17:01:03 2004
Subject: RFD: comp.text.xml
In-Reply-To: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk>
References: <000001bd75ce$fdf449c0$d86118cb@caleb>
Message-ID:
>I will leave Henry to answer whether there are any current problems of
>scale. [I would hate to promise his services without checking :-)]
It scales more or less linearly, with about 10 individual addresses
"decaying" each week and about ten new ones needing attention,
out of a total of 1000 subscribers. I dont see a problem up to say
2000 subscribers. The spam/junk seems to have stabilised.
Henry Rzepa. +44 171 594 5774 (Office) +44 171 594 5804 (Fax)
http://www.ch.ic.ac.uk/rzepa/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sat May 2 21:14:53 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:03 2004
Subject: Announcement: AElfred 1.2, with SAX 1.0gamma support
Message-ID: <199805021914.PAA01433@unready.microstar.com>
A new version of Microstar's AElfred XML parser is available at the
following URL:
http://www.microstar.com/XML/
There are several user-visible changes since version 1.1:
1. the XmlParser.parse method for parsing from a URI now has a third
argument (for an encoding, if known);
2. The XmlHandler.resolveEntity method is more powerful: you may
return a String (for a URI), an InputStream, _or_ a Reader. If you
return null, the parser will take the default action.
3. The SAX driver has been updated to SAX 1.0gamma (released 1 May
1998). SAX is available from http://www.megginson.com/SAX/
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sun May 3 00:23:08 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:03 2004
Subject: Announcement: AElfred 1.2, with SAX 1.0gamma support
In-Reply-To: <199805021914.PAA01433@unready.microstar.com>
Message-ID: <3.0.1.16.19980502221849.2f776e8a@pop3.demon.co.uk>
At 15:14 02/05/98 -0400, David Megginson wrote:
[... marvellous announcements about SAX and AElfred ...]
Thanks very much David. I shall assume that the application-side of the
SAX API will remain constant and therefore hope to release an alpha version
of JUMBO2.0 quite shortly.
P.
I was just going to bed. [I thought I had a few weeks to relax in :-)]
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jtauber at jtauber.com Sun May 3 08:30:30 1998
From: jtauber at jtauber.com (James K. Tauber)
Date: Mon Jun 7 17:01:03 2004
Subject: RFD: comp.text.xml
In-Reply-To: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk>
Message-ID: <000801bd765c$dcc66420$e66118cb@caleb>
A couple of points:
1. The proposal of comp.text.xml in no way is to be taken as a negative
comment on xml-dev or any other mailing list. I love xml-dev.
2. Discussion on the pros and cons of comp.text.xml should be held on the
news.groups newsgroup as stated in the RFD.
James
--
James Tauber / jtauber@jtauber.com
Perth, Western Australia
XML Pages: http://www.jtauber.com/xml/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sun May 3 12:46:28 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:03 2004
Subject: RFD: comp.text.xml
In-Reply-To: <000801bd765c$dcc66420$e66118cb@caleb>
References: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980503075802.3617e848@pop3.demon.co.uk>
At 14:28 03/05/98 +0800, James K. Tauber wrote:
>
>A couple of points:
>
>1. The proposal of comp.text.xml in no way is to be taken as a negative
>comment on xml-dev or any other mailing list. I love xml-dev.
Thanks :-). I didn't take it as negative. I posted primarily because one
posting could be read to give the wrong impression of the role of XML-DEV.
Our intention (H+P) is that XML-DEV continue and if there are topics which
are better suited to be on other fora we'll gently point that out.
XML-DEV's role is clearly evolving and so our goals are deliberately simple
'a list for XML developers'.
>
>2. Discussion on the pros and cons of comp.text.xml should be held on the
>news.groups newsgroup as stated in the RFD.
Fully agreed. Any burgeoning discussion of c.t.x. will be gently pointed
towards n.g
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Sun May 3 13:09:46 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:03 2004
Subject: Announcement: SAX 1.0gamma
References: <199805020046.UAA02818@unready.microstar.com>
Message-ID: <354C4EF0.81ADF185@jclark.com>
I did a first pass at implementing a SAX 1.0gamma driver for XP today.
Some nits:
It should be specified whether a byte order mark at the beginning of a
XML byte stream is included as part of the character stream. I don't
think it should be since the byte-order mark isn't included the XML
document production, and the XML spec says explicitly that the byte
order mark "is an encoding signature, not part of either the markup or
the character data of the XML document".
How are relative system identifiers supposed to be handled in
DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd, which
declares an unparsed entity with a system id of foo.eps (which refers to
dir/foo.eps). If the systemId argument to DTDHandler.unparsedEntityDecl
is foo.eps, then the application is going to have problems. There's a
similar issue with EntityResolver.resolveEntity.
Parser.parse should be allowed to throw IOException in addition to
SAXException. Since InputSource includes a Reader and InputStream, and
methods on Reader and InputStream throw IOException, parse needs to
throw IOException. It's ridiculous to require the parser to wrap an
IOException in a SAXException when you know that the parser needs to
throw IOException.
There's nothing in the XML spec that says parsers have to make attribute
types available. So I think the doc for AttributeList.getType should say
that CDATA may be returned not only if the parser has not read the
declaration, but also if the parser does not make this information
available (alternatively it could return null in this case).
Supporting changing of Locale in the middle of a parse would require me
to redesign my native interface in a way I consider very undesirable,
and I don't see any need for this, so I don't plan to support this. The
basic issue is that my counterpart of Parser is reentrant unlike SAX's
and I want to keep it that way. I think changing of Locale mid-parse is
going to be difficult for anyone with a similar style of interface.
It's probably too late for this, but I'm having problems seeing the
logic in the exception handling design. The design seems to make things
inconvenient for both users and implementors: implementors have to wrap
SAXException in order to pass it up through their parsers, and in
handler methods users have to wrap their exceptions in SAXException.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alain at cabinfo.com Sun May 3 14:55:17 1998
From: alain at cabinfo.com (Alain DESEINE)
Date: Mon Jun 7 17:01:03 2004
Subject: announce : innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1
Message-ID: <354C68CF.2B7A@cabinfo.com>
innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1
The first release (beta 1) of IRIS XML DTD GENERATOR is now available
for download. This first
beta version can open or create DTD for XML applications. The user can
edit the DTD with a text
view (like if you edit it in the Windows Notepad), or in a logical view.
Both the text and
logical view provide tools to insert ELEMENTS, ATTRIBUTES, COMMENTS, OR
ENTITIES. In logical view
you can also modify ***very easily*** ELEMENTS, ATTRIBUTES, ENTITIES,
and COMMENTS. This software
is design for user that don't know (and don't want to know) XML
standards details.
You can download the IRIS XML DTD GENERATOR from the following URL.
http://www.cabinfo.com/download.htm
All comments, bugs reports, functionnalities wishes are welcome at
alain@cabinfo.com
Well known in FRANCE for his HTML EDITOR ODYSSEE ( see
http://www.mondial-telecom.com/odyssee ),
Innovation Partners has start to think about XML many months ago. This
thought tell us to develop
3 XML software. The first one, IRIS XML DTD GENERATOR, is a tool for
building XML DTDs. The second
is IRIS XML EDITOR for editing XML files. And the third one is a XSL
stylesheet EDITOR for
presenting and processing XML files. These 3 softwares can naturally
work together, building a complete
sofware suite for XML editing. For limitations, supported
functionnalities, and bugs of beta
versions, please see the readme file of the downloaded software. Now
lets have a look to the
DTD functionnalities.
IRIS XML DTD GENERATOR
- Editing existing DTDs
- Creating DTDs
- Logical and Text view mode for editing DTDs
- Wizzard for creating ELEMENTS both in text an logical view
- Wizzard for creating ATTRIBUTES both in text an logical view
- Wizzard for creating ENTITIES both in text an logical view
- Wizzard for creating COMMENTS both in text an logical view
- Support of additionnals informations for IRIS XML EDITOR (saved as
comment in the DTD)
- Generation of XML DATA schemas
- Find an replace both in logical and text view
- Wizzard for creating xml standard attributes (like xml:lang for
example)
- support of XLL linking attributes
- Glossaries
- Printing a DTD
- multi language software (currently French and English only - You can
change the language in the file menu)
- Automatic save capabilities
- backup on save capabilities
- MDI environment
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
IRIS XML EDITOR
- Creating XML files according to a DTD generated by IRIS XML DTD
GENERATOR, or generated by something else.
- Project notion for managing XML file collections (like in the ODYSSEE
HTML EDITOR).
- glossaries
- Dynamic toolbars with generated icons that can insert ELEMENTS,
ATTRIBUTES, or ENTITIES you define in your DTD
- Dynamic Wizzard generated automaticly for ELEMENTS with attributes,
that ask user for informations.
- Text view for editing XML files
- Hierachical view for for editing the XML file (the tree object flow
view)
- Visualisation view for seeing the rendering of the XML file :
- If an XSL file is specified in your XML document these one is use to
generate the
visualisation view.
- if none is provide, a minimal built in style sheet is use to render
your document.
- Automatic save capabilities
- backup on save capabilities
- XML document printing
- XML Visualisation view printing
- FTP publication of XML files
- Specific Wizzard adding capabilities through an Standardize DLL API
(like in ODYSSEE HTML EDITOR)
- Friend application capabilities
- multi language software (currently French and English only - You can
change the language in the file menu)
- Find an replace both in tree and text view
- Insertion of XML Standard Attributes (like xml:lang)
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
IRIS XSL STYLER
- Creating XSL stylesheet
- full support of XSL proposal and future ones.
- glossaries
- Text view for editing XSL files
- Hierachical view for for editing the XSL rules
- Import a DTD to re-use its elements, attributes, etc. in the style
sheet
- Visualisation view for seeing the rendering produce with the XSL file
:
- You can provide your own XML file to see the rendering.
- if none is provide, a minimal built in XML file based on the
target-elements is used.
- Automatic save capabilities
- backup on save capabilities
- XSL document printing
- XSL Visualisation view printing
- Specific Wizzard adding capabilities through an Standardize DLL API
(like in ODYSSEE HTML EDITOR)
- Friend application capabilities
- multi language software (currently French and English only - You can
change the language in the file menu)
- Find an replace both in tree and text view
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
Currently, only the IRIS XML DTD GENERATOR beta 1 is available. IRIS XML
EDITOR beta 1 will be available
in May, and IRIS XSL STYLER beta 1 will be available in June. All theses
software are commercial, so
they are copyright. The beta version of these software are freely usable
for evaluation purpose,
bugs report, and functionnality discussions. Once the final release
(Version 1.00) launch up, you
***MUST*** register the software if you want to use it.
For Bugs reports, Functionnalities discussion, technical questions, XML
and XSL discussion, you
can use the DTD GENERATOR mailling list, for details about the mailling
list, please see the readme
file include in the beta 1 distribution. You can download the Beta 1
distribution of IRIS XML
DTD GENERATOR at :
http://www.cabinfo.com/download.htm
Paris the 20/04/1998
Alain DESEINE, Technical contact.
********************************************************************
Innovation Partners
Commercial contact : M. Emmanuel CHAMBON innovationpartners@hol.fr
Technical contact : M. Alain DESEINE alain@cabinfo.com
http://www.mondial-telecom.com/odyssee
http://www.cabinfo.com
Postal Adress :
9, bis rue des Besnards
92260 FONTENAY AUX ROSES
FRANCE
Tel : 33 01 41 41 00 89 or 33 01 41 41 91 91 or 33 06 09 80
10 16 (GSM)
Fax : 33 01 41 41 02 34
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sun May 3 15:13:49 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:03 2004
Subject: Announcement: SAX 1.0gamma
In-Reply-To: <354C4EF0.81ADF185@jclark.com>
References: <199805020046.UAA02818@unready.microstar.com>
<354C4EF0.81ADF185@jclark.com>
Message-ID: <199805031312.JAA00291@unready.microstar.com>
James Clark writes:
> I did a first pass at implementing a SAX 1.0gamma driver for XP today.
>
> Some nits:
>
> It should be specified whether a byte order mark at the beginning
> of a XML byte stream is included as part of the character stream.
> I don't think it should be since the byte-order mark isn't included
> the XML document production, and the XML spec says explicitly that
> the byte order mark "is an encoding signature, not part of either
> the markup or the character data of the XML document".
My first hunch is the opposite: the XML productions deal with
characters, not bytes. When I provide a raw byte stream
(java.io.InputStream), I'm requiring the XML parser to take on two
logical tasks:
1) convert the bytes to characters
2) apply the XML productions to the characters
You have already mentioned that, unlike many XML parsers (including
AElfred), XP does not perform these as independent, serial steps;
conceptually, however, the tasks are still distinct. The BOM is part
of the raw byte stream, but not part of the character stream.
I think that it also simplifies Java implementation if the parser can
behave the same way with an InputStream from a URLConnection and an
InputStream supplied explicitly by an application.
> How are relative system identifiers supposed to be handled in
> DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd,
> which declares an unparsed entity with a system id of foo.eps
> (which refers to dir/foo.eps). If the systemId argument to
> DTDHandler.unparsedEntityDecl is foo.eps, then the application is
> going to have problems. There's a similar issue with
> EntityResolver.resolveEntity.
This does seem to be a serious problem. One solution is to require
the parser to fully resolve system identifiers before reporting them
(as AElfred already does). This approach will work well with URLs,
but may break for other URI schemes.
Any other solutions?
> Parser.parse should be allowed to throw IOException in addition to
> SAXException. Since InputSource includes a Reader and InputStream, and
> methods on Reader and InputStream throw IOException, parse needs to
> throw IOException. It's ridiculous to require the parser to wrap an
> IOException in a SAXException when you know that the parser needs to
> throw IOException.
This suggestion sounds quite reasonable. Any objections?
> There's nothing in the XML spec that says parsers have to make
> attribute types available. So I think the doc for
> AttributeList.getType should say that CDATA may be returned not
> only if the parser has not read the declaration, but also if the
> parser does not make this information available (alternatively it
> could return null in this case).
I don't recall anything in the spec that requires parser to report the
start and end of elements, or even character data other than
whitespace -- this area needs attention from the XML WG.
That said, I think that this documentation change would be useful.
> Supporting changing of Locale in the middle of a parse would
> require me to redesign my native interface in a way I consider very
> undesirable, and I don't see any need for this, so I don't plan to
> support this. The basic issue is that my counterpart of Parser is
> reentrant unlike SAX's and I want to keep it that way. I think
> changing of Locale mid-parse is going to be difficult for anyone
> with a similar style of interface.
I doubt that anyone else is supporting Locale at all right now, so
this change should not cause any trouble. I have no objection to
making it.
> It's probably too late for this, but I'm having problems seeing the
> logic in the exception handling design. The design seems to make
> things inconvenient for both users and implementors: implementors
> have to wrap SAXException in order to pass it up through their
> parsers, and in handler methods users have to wrap their exceptions
> in SAXException.
The former problem exists only when SAX support comes through a
separate driver (as, admittedly, is usually be the case). A new
parser, written from scratch, could include SAXException in its throw
clauses without using a wrapper.
The latter problem is very annoying, but there seems to be no
obviously correct solution. I received a very, very large number of
objections to my use of java.lang.Exception. I don't want to
vacillate any more, and have settled on a SAXException wrapper simply
as the (slightly) lesser of two evils.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Gil_Zehavi at ibcomverse.com Sun May 3 15:18:18 1998
From: Gil_Zehavi at ibcomverse.com (Zehavi, Gil)
Date: Mon Jun 7 17:01:04 2004
Subject: Using schemas with XML parser
Message-ID:
I wrote xml-data schema and instances that are based on that schema, but
I don't know how to combine them in order that the parser (i'm using the
Microsoft parser which is supposed to know how to deal with schemas)
will recognize that a schema is being used. Putting both the schema and
the instances in one files is not good since the parser recognizes only
one root. I don't know how to link the two parts if they are defined in
different files (my application is not a web application). I can use DTD
but the use of schemas is very useful in my case.
I would appreciate any information I can get.
Gil Zehavi
Comverse Network System
email: gil_zehavi@ibcomverse.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Sun May 3 15:43:36 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:04 2004
Subject: Announcement: SAX 1.0gamma
Message-ID: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca>
At 09:12 AM 5/3/98 -0400, David Megginson wrote:
>James Clark writes:
> > It should be specified whether a byte order mark at the beginning
> > of a XML byte stream is included as part of the character stream.
> > I don't think it should be
>
>My first hunch is the opposite: the XML productions deal with
>characters, not bytes.
I think James has the spec on his side here. The BOM should *definitely*
be hidden from SAX customers.
-Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From elharo at sunsite.unc.edu Sun May 3 16:25:02 1998
From: elharo at sunsite.unc.edu (Elliotte Rusty Harold)
Date: Mon Jun 7 17:01:04 2004
Subject: Cafe con Leche
In-Reply-To: <199805021914.PAA01433@unready.microstar.com>
Message-ID:
I've begun building a new XML news and resources site called Cafe con
Leche, modeled after my Java site Cafe au Lait. You'll find it at
http://sunsite.unc.edu/xml/
It's pretty nascent at this point. Comments, suggestions, and critiques are
apreciated.
+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@sunsite.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
| Java Secrets (IDG Books 1997) |
| http://www.amazon.com/exec/obidos/ISBN=0764580078/cafeaulaitA/ |
+----------------------------------+---------------------------------+
| Cafe au Lait |
| http://sunsite.unc.edu/javafaq/ |
+----------------------------------+---------------------------------+
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From boumhome at agents-tech.com Sun May 3 17:05:40 1998
From: boumhome at agents-tech.com (Martin Bouchard Home)
Date: Mon Jun 7 17:01:04 2004
Subject: unsubsribe
Message-ID: <000d01bd76a5$168f1880$c002aa98@PETER.ATC>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980503/9b89f881/attachment.htm
From cbullard at hiwaay.net Sun May 3 17:57:43 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:04 2004
Subject: RFD: comp.text.xml
References:
Message-ID: <354C9391.60FA@hiwaay.net>
Simon St.Laurent wrote:
>
> XML is more than a subset of SGML. It has its own limitations, its own
> practices, and its own audience.
So far, it has a new name.
>You can debate whether or not it is a 'new
> language' just as the XML-DEV list has argued whether the XML spec contains
> just syntax or syntax and semantics, but I think you're splitting hairs, at
> best. New acronym, new approach, fine - new language.
As I said, so far, a new name. This is not splitting hairs. It is
maintaining a correct and accurate history.
> Personally, I'd like to see XML make a clean parent-child break with SGML.
> XML should acknowledge SGML as the parent, the teacher, the source of
> inspiration, and move on to live its own life. I'd like to see XML have its
> own place to grow. If that means that it acquires features from SGML (like
> architectural forms, which David Megginson pointed out on XML-L), then fine -
> but it shouldn't be forever limited to being a 'mere' subset.
It isn't limited. There is no normative reference and it is a product
of
the W3C Consortium whose rules and processes govern its growth. At any
time the Director chooses, he can cast away that parentage and the
alliance
with ISO.
> >This is speculative not substantive. The membership of the XML and XML
> >application communities overlaps the SGML community. The extent of
> >overlap should be examined, but it is probably greater than a majority.
>
> The extent of overlap may be examined, but I doubt it will be a majority for
> more than few months. XML is arousing interest in many computing sectors
> (HTML developers, database developers, OOP programmers) who in the past had
> nothing (or as little as possible) to do with SGML.
Nice. Take the ideas, abuse the parent in public, claim to have created
a
new technology.
o HTML acknowledges it is an SGML application
o XML acknowledges it is an SGML subset.
o Both database and OOP programmers have in the past
worked with SGML and developed applications of SGML.
> I can go through my email archive and find at least fifty people who have come
> to XML but had no previous experience in SGML and show no interest in learning
> it. There may be bias because my book deliberately targeted a non-SGML
> audience, but that audience certainly exists and is growing rapidly.
Bias is bias but that is a different issue. Those who learn XML do not
need to learn SGML. That has been a clear goal of the development of
SGML On The Web. So?
> >At this time, the comp-text-sgml newsgroup can support the discussion
> >of SGML subsets such as the W3C consortium subset, XML and its
> >applications.
>
> I've been perusing comp.text.sgml for the last few months. While I'm
> impressed with the quality of discussion (as I have been on XML-DEV and
> XML-L), I don't think it's a great place to learn about XML. It's not always
> easy to determine when SGML-only features are being discussed, and the
> question overlaps can lead to some fairly significant confusion, especially
> for beginners.
This is speculative but so far, the one issue that can be taken
seriously.
> There is a lot of need for a separate group which will address the needs of
> large numbers of people moving into a new technology. 'Maintaining the spirit
> of cooperation between international standards groups and consortium product
> working groups' is _not_ the purpose of this newsgroups. Those groups, for
> better or worse, need to learn to work with each other while the rest of get
> on with learning the technologies that are appropriate to the tasks _we_ want
> to accomplish.
It is that spirit of cooperation which has enabled the standard (SGML)
and
the consortium-owned product (XML) to remain aligned. If you want to
use
SGML ideas (rename them if you must) and keep this alignment, maintain
that
spririt. It is in your own best interest. As for the royal "we", that
can
be dismissed. Technologies and standards are separate entities.
> I see comp.text.xml as a place for people, both new and experts, to
> communicate about the new things opened up by XML, not as a place for a
> relatively small club of experts to form and maintain alliances.
That relatively small club created everything that you
are now claiming is a new language. You are being somewhat less than
ethical
in that comment.
> XML-L is good for this, but newsgroups are easy places for the whole world to
> find and participate. I strongly hope comp.text.xml arrives soon.
The creation of comp.text.xml is inevitable. The reasons given should
be
truthful, legitimate, and compelling, not based on the politics of
claiming the work of others. If you plan to be a successful author,
do your homework and keep to the facts.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Sun May 3 18:47:28 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:04 2004
Subject: Announcement: SAX 1.0gamma
References: <199805020046.UAA02818@unready.microstar.com>
<354C4EF0.81ADF185@jclark.com> <199805031312.JAA00291@unready.microstar.com>
Message-ID: <354C9E40.E13ED372@jclark.com>
David Megginson wrote:
>
> James Clark writes:
> > It should be specified whether a byte order mark at the beginning
> > of a XML byte stream is included as part of the character stream.
> > I don't think it should be since the byte-order mark isn't included
> > the XML document production, and the XML spec says explicitly that
> > the byte order mark "is an encoding signature, not part of either
> > the markup or the character data of the XML document".
>
> My first hunch is the opposite: the XML productions deal with
> characters, not bytes. When I provide a raw byte stream
> (java.io.InputStream), I'm requiring the XML parser to take on two
> logical tasks:
>
> 1) convert the bytes to characters
>
> 2) apply the XML productions to the characters
>
> You have already mentioned that, unlike many XML parsers (including
> AElfred), XP does not perform these as independent, serial steps;
> conceptually, however, the tasks are still distinct. The BOM is part
> of the raw byte stream, but not part of the character stream.
>
> I think that it also simplifies Java implementation if the parser can
> behave the same way with an InputStream from a URLConnection and an
> InputStream supplied explicitly by an application.
I'm a bit confused by your reply. You say you're disagreeing with me,
but the points you make don't seem to contradict my suggestion. I agree
there are conceptually two stages. My point is that the BOM bytes are
removed as part of the first stage because they are part of the encoding
signature not part of the sequence of characters that matches the
document production. Thus the InputStream should include the BOM bytes,
but the Reader shouldn't include the 0xFEFF character.
> > How are relative system identifiers supposed to be handled in
> > DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd,
> > which declares an unparsed entity with a system id of foo.eps
> > (which refers to dir/foo.eps). If the systemId argument to
> > DTDHandler.unparsedEntityDecl is foo.eps, then the application is
> > going to have problems. There's a similar issue with
> > EntityResolver.resolveEntity.
>
> This does seem to be a serious problem. One solution is to require
> the parser to fully resolve system identifiers before reporting them
> (as AElfred already does). This approach will work well with URLs,
> but may break for other URI schemes.
>
> Any other solutions?
In XP, my analog of InputSource has both an InputStream and optionally a
URL to use a base URL for system identifiers in that InputStream. In
each case where the application is passed a system identifier (whether
for parsed or unparsed entity), the parser passes both the specified
system identifier and the base URL from the InputSource analog. This
gives the application complete control over resolving relative URLs,
although at the cost of some complexity.
In implementing the SAX driver for XP I try to make an absolute URL from
the specified system identifier and the base; if that succeeds I pass
the result (after conversion to a String); if it fails (for example
because it is parsing from an InputStream with no specified system
identifier) I pass the specified system identifier. That is the
approach I would suggest for SAX.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Sun May 3 19:01:23 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:04 2004
Subject: Parents and children (was RFD: comp.text.xml)
Message-ID:
>> XML is more than a subset of SGML. It has its own limitations, its own
>> practices, and its own audience.
>
>So far, it has a new name.
So far it has its own limitations, and 20-odd announced books (3 of which have
SGML in the title) coming out, as well as a new name. It also has press
attention like SGML hasn't _ever_ seen. It has ringing endorsements from
major corporations, and promising new products. None of it's here yet,
really, but the standard only arrived two months ago.
Not a bad start. Hopefully, it will grow.
>> The extent of overlap may be examined, but I doubt it will be a majority
for
>> more than few months. XML is arousing interest in many computing sectors
>> (HTML developers, database developers, OOP programmers) who in the past had
>> nothing (or as little as possible) to do with SGML.
>
>Nice. Take the ideas, abuse the parent in public, claim to have created
>a
>new technology.
Um... where did I invent a new technology? I was discussing additional
sectors of users who are taking advantage of XML and who didn't use SGML much
previously.
As for abusing the parent, I think you may be in denial about SGML's
less-than-friendly reputation. There were good reasons that the W3C created
XML, I understand, as well as good reasons for naming it XML rather than
SGML-Lite. This doesn't mean that SGML is evil incarnate - it just means that
it involved a learning curve that many people found less than exciting.
I also said "XML should acknowledge SGML as the parent, the teacher, the
source of inspiration, and move on to live its own life." This doesn't sound
like I'm denying SGML's beneficent influence.
>> I see comp.text.xml as a place for people, both new and experts, to
>> communicate about the new things opened up by XML, not as a place for a
>> relatively small club of experts to form and maintain alliances.
>
>That relatively small club created everything that you
>are now claiming is a new language. You are being somewhat less than
>ethical in that comment.
I'm very thankful to that small club, as I've said repeatedly. I just don't
think that comp.text.xml is meant as a forum for diplomacy. XML-DEV or other
forums are probably more appropriate. I don't understand why you seem so
dismissive of the needs of people who are coming to XML from non-SGML (and
non-standards body) backgrounds.
And, to end on a positive note:
>> I've been perusing comp.text.sgml for the last few months. While I'm
>> impressed with the quality of discussion (as I have been on XML-DEV and
>> XML-L), I don't think it's a great place to learn about XML. It's not
always
>> easy to determine when SGML-only features are being discussed, and the
>> question overlaps can lead to some fairly significant confusion, especially
>> for beginners.
>This is speculative but so far, the one issue that can be taken
>seriously.
Well, at least we seem to agree on the pedagogical value of a separate
comp.text.xml newsgroup. That's a start.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sun May 3 19:35:27 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:04 2004
Subject: Using schemas with XML parser
In-Reply-To:
References:
Message-ID: <199805031734.NAA00245@unready.microstar.com>
Zehavi, Gil writes:
> I wrote xml-data schema and instances that are based on that
> schema, but I don't know how to combine them in order that the
> parser (i'm using the Microsoft parser which is supposed to know
> how to deal with schemas) will recognize that a schema is being
> used. Putting both the schema and the instances in one files is not
> good since the parser recognizes only one root. I don't know how to
> link the two parts if they are defined in different files (my
> application is not a web application). I can use DTD but the use of
> schemas is very useful in my case. I would appreciate any
> information I can get.
You can have the best of both worlds: define your document type using
any high-level schema language (it doesn't have to be XML-Data), then
compile the schema into a standard XML 1.0 DTD for parsing or
distribution. When you make any changes, make them to the high-level
original rather than to the DTD, then recompile.
This way, you can enjoy any advantages that the higher-level schema
language offers, but you are still interoperable with conformant XML
tools. You also have the advantage of being able to choose (or
invent) any schema language.
I can imagine specialised fields inventing their own abstract (and
much simpler) schema languages that deal with only a few high-level,
domain-specific units (like tasks or forms or financial tables) rather
than low-level elements and attributes. XML-Data will undoubtedly
satisfy some needs, there is probably not a one-size-fits-all solution
to schemas.
Has anyone written a compiler for XML-Data yet?
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Sun May 3 19:38:26 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:04 2004
Subject: Announcement: SAX 1.0gamma
In-Reply-To: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca>
References: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca>
Message-ID: <199805031736.NAA00257@unready.microstar.com>
Tim Bray writes:
> At 09:12 AM 5/3/98 -0400, David Megginson wrote:
> >James Clark writes:
> > > It should be specified whether a byte order mark at the beginning
> > > of a XML byte stream is included as part of the character stream.
> > > I don't think it should be
> >
> >My first hunch is the opposite: the XML productions deal with
> >characters, not bytes.
>
> I think James has the spec on his side here. The BOM should *definitely*
> be hidden from SAX customers.
My fault -- I am very tired (for non-SAX reasons), and misread James's
comment: I somehow assumed that he was suggesting that the BOM be
removed from _byte_ streams. Tim and James are entirely correct: the
BOM should not appear in character streams.
Sorry for any confusion,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From schampeo at hesketh.com Sun May 3 21:47:54 1998
From: schampeo at hesketh.com (Steven Champeon)
Date: Mon Jun 7 17:01:04 2004
Subject: RFD: comp.text.xml
In-Reply-To: <354C9391.60FA@hiwaay.net>
Message-ID:
On Sun, 3 May 1998, len bullard wrote:
> Simon St.Laurent wrote:
> >
> > XML is more than a subset of SGML. It has its own limitations, its own
> > practices, and its own audience.
>
> So far, it has a new name.
Len, you're sounding a bit defensive. Do you think C would have gotten as
far as it did if every time someone brought it up, someone else popped up
to remind everyone that it was inspired by BCPL? Need we say that Java was
once named Oak in order to reap its benefits?
I started out working with SGML, waited for the CALS table model, hung
out wondering when DSSSL would be done, learned Author/Editor's internal
styles language as a desperation measure, bemoaned the ridiculous complexity
of FOSI, and when the Web hit I never looked back. I'm thrilled to death
that the idea of separating presentation from markup has become a reality
for those of us without multi-million dollar consulting budgets, and I'm
thrilled that XML will be part of the next generation of browsers. I'm
afraid that Microsoft has cornered the market on Windows-based XML parsing,
by virtue of its being built into the "Internet Explorer OS Upgrade" blob
they seem to be pushing as a pre-req for any new application installs,
but the fact that there is a standard to appeal to - a simple standard,
which doesn't give MS much cover should they decide to slightly alter
their implementation. I think it's amazing that there are fully-functional
parsers out there today. AFAIK, there isn't a single application out there
which fully supports all of the SGML arcana.
Len, I've known you since my first days reading comp.text.sgml, and I
respect your enlightened approach to subjects both technical and humane.
Please don't expect XML to be a bait-and-switch move to bring SGML into
everyone's home. And please recognize that XML is a powerful, simple,
iteration on some of the ideas espoused by SGML. Where XML will not suffice,
perhaps SGML will thrive. But where XML will suffice, its users need not
know thing one about SGML. In my mind, this is a good thing.
--
Steven Champeon | "While we're all very dependent on
http://hesketh.com/schampeo/ | technology, it doesn't always work."
http://a.jaundicedeye.com | - Bill Gates
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sun May 3 21:54:20 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:04 2004
Subject: Using schemas with XML parser
References: <199805031734.NAA00245@unready.microstar.com>
Message-ID: <354CC7F8.9D3B78B5@technologist.com>
David Megginson wrote:
>
> Has anyone written a compiler for XML-Data yet?
I haven't tried it, but my *guess* is that if someone tried to implement
XML-Data they would find it is more a repository for interesting ideas
than a finished, implementable spec. When last I looked at it there were
some things tht I considered to be holes. Some things seemed specified by
example rather than by specification, for example.
Of course none of that is a criticism. It's a NOTE, not a REC.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sun May 3 21:54:32 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:05 2004
Subject: Parents and children (was RFD: comp.text.xml)
References:
Message-ID: <354CC975.ACB97DFC@technologist.com>
I think that we could end this flamewar quite simply. XML is an SGML
subset. That's a verifable, mathematical fact. As far as I can tell, Simon
St. Laurent has no concrete proposal for how to change that fact and I
can't think of any reasonable ones off of the top of my head. We could
just throw in a feature to break compliance and if ISO adopts it, we could
throw in another, and keep doing so until they gave up.
NYAH NYAH! So there! We win!
Since there is no proposal, there is nothing to argue about and
argumentation is wasted breath.
Nevertheless, XML practice is often quite different from SGML practice and
there are many questions that apply only to one or the other (especially
tools questions). That seems justification enough for separate newsgroups.
Where concerns overlap, there is cross posting.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Sun May 3 23:08:54 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:05 2004
Subject: Parents and children (was RFD: comp.text.xml)
Message-ID:
>XML is an SGML
>subset. That's a verifable, mathematical fact. As far as I can tell, Simon
>St. Laurent has no concrete proposal for how to change that fact and I
>can't think of any reasonable ones off of the top of my head.
Great. I haven't actually suggested that anyone start throwing spare parts
into XML for the sake of breaking SGML compliance. Nor have I suggested that
the W3C should declare its independence of the ISO, like there's actually a
tie there. What I have suggested is that the SGML community stop mixing SGML
and XML like they are identical practices. They aren't.
Note that XML-L isn't a subset of HyTime, XSL isn't a subset of DSSL, and
XPointers don't fit TEI precisely either. This could change, but I think a
lot of people would be disappointed.
XML practice and the XML spec itself appear to be two different things; the
child is in fact diverging from the ways of the parent. If the parent chooses
not to acknowledge that, fine. It's early in the child's development, after
all.
As Paul went on to say,
>Nevertheless, XML practice is often quite different from SGML practice and
>there are many questions that apply only to one or the other (especially
>tools questions). That seems justification enough for separate newsgroups.
>Where concerns overlap, there is cross posting.
Here we seem to be saying the same thing. That, to me, suggests the end of an
argument.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sun May 3 23:16:37 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:05 2004
Subject: Software specifications (was Re: announce : innovation
Partners and CEI announce IRIS XML DTD GENERATOR beta 1)
In-Reply-To: <354C68CF.2B7A@cabinfo.com>
Message-ID: <3.0.1.16.19980503161019.2fcf7a6e@pop3.demon.co.uk>
Thanks for the posting,
At 14:53 03/05/98 +0200, Alain DESEINE wrote:
>innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1
[... impressive specification of XML tools ...]
I couldn't immediately see a statement of what platforms and/or language
this system uses (Please forgive me if I've missed it). I suggest that
those posting software announcements on XML-DEV include a very short
summary of attributes such as:
- platform
- or language if it can run, or be compiled on multiple platforms
- source availability
- any usage or distribution restrictions
- free or costs-money (no need to give prices).
- any other software required (e.g. uses FOO-XML as a parser, BAR-DSSSL
for stylesheet support, etc.)
This helps some classes of user decide whether it is unnecessary to visit
the site (e.g. can't run on a MAC, etc.). Sometimes (not on XML-DEV :-)
with large downloads I have spent an hour or more (costs money) to find
that the kit could not run on what I have.
It also helps people like Robin Cover and many other who compile lists of
software and related resources.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sun May 3 23:36:04 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:05 2004
Subject: Crossposting to XML-DEV and XML-L (was Re: Parents and
children (was RFD: comp.text.xml))
In-Reply-To: <354CC975.ACB97DFC@technologist.com>
References:
Message-ID: <3.0.1.16.19980503212724.2f1759c0@pop3.demon.co.uk>
[crossposted to XML-L intentionally]
At 15:45 03/05/98 -0400, Paul Prescod wrote:
>I think that we could end this flamewar quite simply. XML is an SGML
[...]
>
>Since there is no proposal, there is nothing to argue about and
>argumentation is wasted breath.
This discussion (?flame war?) is being carried on in duplicate on XML-L and
XML-DEV. I believe it's a good thing to post only to a single list unless
the posting is of *very* general value. (Automatic replies without reading
the list of recipients is hardly ever a Good Thing.)
I also suggest that on XML-DEV we favour discussions that seem to be
leading to something concrete - whether software, resources, protocols or
merely reshaping the future of the human race.
In arguing for a new newsgroup it's worth making sure that the current
organs fill different and non-overlapping roles. As I said I'm deliberately
neutral.
Relax... hack some SAX...
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From t.mueller at terus.envmtl.com Mon May 4 02:54:27 1998
From: t.mueller at terus.envmtl.com (T.Mueller)
Date: Mon Jun 7 17:01:05 2004
Subject: comp.text.xml
In-Reply-To: <000001bd75ce$fdf449c0$d86118cb@caleb>
Message-ID: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com>
James K. Tauber wrote:
> Policy on Advertising
>
> We encourage discussion of the merits and shortcomings of
> commercial products. A certain amount of advertising (both objective
> and advocative) is to be expected and this is to be encouraged...
While I can certainly buy into James premise that advertising can advocate
and I can accept that in something like a legal context advocacy can be
objective I am, however, having trouble trying to figuring out just what
is "objective advertising" ?
A small thought... you humour may vary...
Terry
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jtauber at jtauber.com Mon May 4 04:05:33 1998
From: jtauber at jtauber.com (James K. Tauber)
Date: Mon Jun 7 17:01:05 2004
Subject: comp.text.xml
In-Reply-To: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com>
Message-ID: <000401bd7701$17434880$ea6118cb@caleb>
> James K. Tauber wrote:
> > Policy on Advertising
> >
> > We encourage discussion of the merits and shortcomings of
> > commercial products. A certain amount of advertising (both objective
> > and advocative) is to be expected and this is to be encouraged...
>
> While I can certainly buy into James premise that advertising
> can advocate and I can accept that in something like a legal context
> advocacy can be objective I am, however, having trouble trying to figuring
> out just what is "objective advertising" ?
Ok, I confess to pinching that sentence from various sample RFDs around the
place :-) I can see how one might imagine ads on an objective-advocative
continuum but "objective advertising" is an amusing oxymoron.
James
--
James Tauber / jtauber@jtauber.com
Perth, Western Australia
XML Pages: http://www.jtauber.com/xml/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Mon May 4 04:44:53 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:05 2004
Subject: comp.text.xml
References: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com>
Message-ID: <354D2BB4.B1E30408@technologist.com>
T.Mueller wrote:
>
> While I can certainly buy into James premise that advertising can advocate
> and I can accept that in something like a legal context advocacy can be
> objective I am, however, having trouble trying to figuring out just what
> is "objective advertising" ?
"Here's our product. It isn't the best in its category, but we think it's
pretty good value for the money. You may or may not agree. However you
feel, please tell us so that we can add your comments to our next round of
advertising."
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alain at cabinfo.com Mon May 4 10:14:33 1998
From: alain at cabinfo.com (Alain DESEINE)
Date: Mon Jun 7 17:01:05 2004
Subject: Software specifications (was Re: announce : innovation Partners and CEI announce IRIS XML
Message-ID: <354D7882.1967@cabinfo.com>
Peter Murray-Rust wrote:
>
> Thanks for the posting,
>
> At 14:53 03/05/98 +0200, Alain DESEINE wrote:
> >innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1
>
> [... impressive specification of XML tools ...]
>
> I couldn't immediately see a statement of what platforms and/or language
> this system uses (Please forgive me if I've missed it). I suggest that
> those posting software announcements on XML-DEV include a very short
> summary of attributes such as:
> - platform
> - or language if it can run, or be compiled on multiple platforms
> - source availability
> - any usage or distribution restrictions
> - free or costs-money (no need to give prices).
> - any other software required (e.g. uses FOO-XML as a parser, BAR-DSSSL
> for stylesheet support, etc.)
>
> This helps some classes of user decide whether it is unnecessary to visit
> the site (e.g. can't run on a MAC, etc.). Sometimes (not on XML-DEV :-)
> with large downloads I have spent an hour or more (costs money) to find
> that the kit could not run on what I have.
>
> It also helps people like Robin Cover and many other who compile lists of
> software and related resources.
>
> P.
>
> Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
> net connection
> VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
> http://www.venus.co.uk/vhg
>
We effectivly miss this informations so here they are :
- Platform : Windows 3.x, Windows 95, Windows NT (Win 16 bits
application)
- No sources
- Beta versions are free for evaluation purpose, feedback, bug report,
and functionnalities wishes.
- Future 1.00 version will be Commercial, so they won't be free ...
You can download it at http://www.cabinfo.com/download.htm
Alain.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 11:06:04 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:05 2004
Subject: comp.text.xml
In-Reply-To: <354D2BB4.B1E30408@technologist.com>
References: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com>
Message-ID: <3.0.1.16.19980504075617.297f3254@pop3.demon.co.uk>
At 22:45 03/05/98 -0400, Paul Prescod wrote:
[...]
>
>"Here's our product. It isn't the best in its category, but we think it's
>pretty good value for the money. You may or may not agree. However you
>feel, please tell us so that we can add your comments to our next round of
>advertising."
About the maximum number of words for a general statement.
On xml-dev we have always assumed that some tools will be commercial and
that an 'objective' announcement may - if appropriately presented - be of
value to the XML development community. In the early stages of development
there might only be a single (commercial) tool for an operation, or a
conflict between syntactic/semantic interpretation (see following message).
It's useful to know about these tools. Factual statements will often
benefit the community.
As any set of tools matures, then we would expect the value of posting such
information to diminish, being replaced by those that address the next set
of problems.
I should compliment the XML/SGML commercial community in that the balance
appears to have been well kept.
Some criteria might be:
- will my product help the XML development community in general? [This is
the most critical.]
- is my product almost entirely concerned with XML?
- does the use of my product address horizontal problems rather than
vertical ones?
- does the use of my product raise important issues for
software/protocols/syntax/semantics
- is a free version of my product available for
test/proof_of_concept/academics/non-profits/personal?
- am I responding to queries addressed on the XML-DEV list (or possibly
elsewhere, but raising important XML-DEV-related issues).
- is my product novel?
It will *not* be appropriate to mention competitors's products ("FOO-BAR
doesn't do this"). Any comparison of products should be for the general
development of the XML-DEV community and preferably by someone not involved
with a manufacturer.
If so, a *brief* statement of the attributes of the product (see previous
message about missing platform information) is valuable at the start of the
posting. Try to refrain from describing every sub-menu item unless they are
novel.
There are specialist fora and resources for the announcement of XML
software and other products, and it may be more advantageous to post them
there.
We shall *certainly* reach the stage where, if someone posts "convert your
SGML documents to XML for only $5000" (yes, I got a paper flier offering
this), we shall suggest it is inappropriate for XML-DEV as there are
already many robust ways of doing this.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 11:06:10 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:05 2004
Subject: Attributes with Intent
Message-ID: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
The XML Recommendation creates a category of attributes which have intent.
(2.10 xml:space, 2.12 xml:lang). I believe that this raises implementation
problems for which we may wish to devise a common protocol. The same
problem arises in implementing XLink which, although not final, appears
likely to contain the same (or closely related constructs).
Consider the following macaronic document (about as much as I can manage :-):
]>
gut
What are the attributes of ? Correct: there aren't any, but the
"intent" of xml:lang="de" applies to . I regard this as a subtle form
of minimisation which may create considerable problems further downstream.
I think this will be particularly problematical for software which relies
on attributes to identify or process parts of a document.
It's reasonable to assume that generic text-aware XML software might be
asked "please find all elements in a document which are expressed in
German." It can't look for all elements with xml:lang="de" because they
don't have this explicitly. So general mechanisms such as XPointers can't
be used, and bespoke software must be written. This software (presumably)
finds all elements which *do* have the attribute and continues recursively.
At this point an application developer has to ask: "[how] should I include
support for xml:lang in my *application*?". The spec makes it clear that
the *parser* does not add an attribute xml:lang, i.e. the above document is
not equivalent to:
]>
gut
(which is well-formed but *invalid*).
or
]>
gut
which explicitly shows the author's intent.
The three documents will presumably *behave* similarly (I hope identically).
The same concern applies to xml:space.
XLink is similar but uses a different phrasing (4.3): "If any such
[semantic] attribute [such as role] is omitted from a locator element, the
value providing on the containing linking element is to be used". Example:
The locator elements *behave* *as if* they were represented as:
Note: this would be well-formed but *invalid* unless was
declared with
----------------------------------
The problem is that *all* applications dealing with XML and XLink now have
to consider three separate non-trivial questions. They can ignore the first
two and assume that humans can deal with the resulting fuzziness (though
the automation of the language aspect is important for machine-based
translation and terminology). If XLink is to work, some mechanism *must* be
provided and I am struggling with this at present :-)
Since the three mechanisms seem to be supportable by a single software
module it seem to make sense to discuss whether this is possible. If so
perhaps we can devise a communal mechanism/interface. Otherwise we shall
have semantic fragmentation where some tools support parts of some of these
and other support other parts and some behave inconsistently w.r.t others.
I have voiced these concerns before - are there others who see this as
needing addressing at an implementation level? If so, we should tackle this
quickly before the fuzziness increases [and certainly before other drafters
of new specifications adopt this approach, thus increasing the load on the
implementers.]
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jtauber at jtauber.com Mon May 4 11:46:57 1998
From: jtauber at jtauber.com (James K. Tauber)
Date: Mon Jun 7 17:01:05 2004
Subject: expat question: processStream vs processFile/filemap
Message-ID: <000001bd7741$84c62680$ea6118cb@caleb>
As I've mentioned before, I'm writing some documentation for expat and I'm
currently doing a walkthrough of xmlwf. My lack of knowledge of C file I/O
is letting me down.
Can somebody who has worked with the expat source (or who is at least
willing to have a quick look at xmlwf.c) tell me what the difference is
between using processStream and using processFile/filemap? Why is it a
command-line option?
Why would you pick one over the other?
Thanks
James
--
James Tauber / jtauber@jtauber.com
Perth, Western Australia
XML Pages: http://www.jtauber.com/xml/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Mon May 4 11:58:44 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:05 2004
Subject: New version of expat
Message-ID: <354D9000.64F6C1D2@jclark.com>
A new version of expat (my XML parser written in C) is now available.
See
http://www.jclark.com/xml/expat.html
expat is an XML parser written in C.
This version just fixes bugs (some of which are quite serious).
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Mon May 4 12:43:43 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:05 2004
Subject: expat question: processStream vs processFile/filemap
References: <000001bd7741$84c62680$ea6118cb@caleb>
Message-ID: <354D9260.1A7A7944@jclark.com>
James K. Tauber wrote:
>
> As I've mentioned before, I'm writing some documentation for expat and I'm
> currently doing a walkthrough of xmlwf. My lack of knowledge of C file I/O
> is letting me down.
>
> Can somebody who has worked with the expat source (or who is at least
> willing to have a quick look at xmlwf.c) tell me what the difference is
> between using processStream and using processFile/filemap? Why is it a
> command-line option?
> Why would you pick one over the other?
Using file mapping is typically more efficient, but you might need to
use processStream if:
- your OS doesn't support file mapping (although xmlwf can fake it by
reading the entire file into memory)
- you don't want to deal with the portability problems that using file
mapping brings
- you have files larger than can be mapped (eg > 2Gb)
- you are reading from a network connection or pipe rather than a file
It's a command-line option for testing purposes.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Mon May 4 12:51:15 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:05 2004
Subject: SAX Exception Handling
Message-ID: <354D9C4F.8FF94AA8@jclark.com>
David Megginson wrote:
>
> James Clark writes:
> > It's probably too late for this, but I'm having problems seeing the
> > logic in the exception handling design. The design seems to make
> > things inconvenient for both users and implementors: implementors
> > have to wrap SAXException in order to pass it up through their
> > parsers, and in handler methods users have to wrap their exceptions
> > in SAXException.
>
> The former problem exists only when SAX support comes through a
> separate driver (as, admittedly, is usually be the case). A new
> parser, written from scratch, could include SAXException in its throw
> clauses without using a wrapper.
>
> The latter problem is very annoying, but there seems to be no
> obviously correct solution. I received a very, very large number of
> objections to my use of java.lang.Exception. I don't want to
> vacillate any more, and have settled on a SAXException wrapper simply
> as the (slightly) lesser of two evils.
With my application writer's hat on (I just converted XMLTest to the new
SAX), the current solution is a significant step backwards. The typical
simple SAX program that processes an XML document and produces some sort
of output is going to have to make calls in its DocumentHandler
implementation that throw IOExceptions, and each of these methods now
have to do:
try {
...
}
catch (IOException e) {
throw new SAXException(e);
}
The original goal of SAX was to be simple and easy to use for
application writers. I don't think requiring this sort of mumbo jumbo
even for trivial programs is consistent with this goal.
I don't see an ideal solution, but I can think of several possibilities
in addition to the old solution and the current solution, any of which I
think would be an improvement over the current solution:
1. The handler methods are declared to throw both SAXException and
IOException (as I proposed for parse). My guess is that throwing
IOException is going to be very common, and will avoid the need for the
user to wrap exceptions themselves in a large proportion of simple
programs. I can't see any disadvantages over the current proposal.
2. A variation on this would be to make SAXException extend IOException
and then declare both the handler methods and parse as throwing
IOException.
3. Another possibility would be keep the handler methods declared as
throwing java.lang.Exception but declare parse as throwing
SAXException. In other words it would be the responsiblity of the
parser to do something like:
try {
handler.startDocument();
}
catch (RuntimeException e) {
throw e;
}
catch (Exception e) {
throw new SAXException(e);
}
around each call of a SAX handler method. This makes life slightly
harder for application writers but makes life much easier for users. I
can see the objection to having parse throw java.lang.Exception, but I
can't see the objection to having the interface handler methods throwing
java.lang.Exception. Implementations of that interface are free to
declare that they throw only some subclass of java.lang.Exception, and
so no exception type checking is lost. It does make life a little
harder for the parser writer, but I think they can cope: you could even
declare a convenience class that wrapped around a DocumentHandler and
dealt with the exception wrapping:
class DocumentHandlerWrapper {
DocumentHandler handler;
DocumentHandlerWrapper(DocumentHandler handler) {
this.handler = handler;
}
void startDocument() throws SAXException {
try {
handler.startDocument();
}
catch (RuntimeException e) {
throw e;
}
catch (Exception e) {
throw new SAXException(e);
}
}
// ...
}
The parser writer could just wrap a DocumentHandlerWrapper around the
users DocumentHandler and get the same convenience they get with the
current solution.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 14:29:20 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:05 2004
Subject: SAX Exception Handling
In-Reply-To: <354D9C4F.8FF94AA8@jclark.com>
Message-ID: <3.0.1.16.19980504114900.3bb74718@pop3.demon.co.uk>
At 17:45 04/05/98 +0700, James Clark wrote:
>The original goal of SAX was to be simple and easy to use for
>application writers.
I hope that it still is - I appreciate that we have had some drift, but
much of it has centered on Exceptions.
>I don't think requiring this sort of mumbo jumbo
>even for trivial programs is consistent with this goal.
>
>I don't see an ideal solution,
This seems to be a problem that is fundamental to Exception handling in
general (not SAX-specific). It's compounded by the fact that we need to (a)
interoperate with Java Exceptions (b) assume that SAX will be put alongside
packages that have yet another set of Exceptions.
I spent yesterday afternoon hacking the new SAX/AElfred including the
extraction of error messages and transferring them to display of nonwf
documents. My blunderings weren't pretty (I failed to realise that one test
example threw FileNotFoundException rather than a parsing error, and
couldn't understand why I couldn't extract the SAXException message). But I
think I can work with the current SAX. (I was one of those who suggested
SAX should 'wrap' Exceptions, so pillory me.)
>but I can think of several possibilities
>in addition to the old solution and the current solution, any of which I
>think would be an improvement over the current solution:
[... workable solutions snipped ...]
I am sure that David will look at these carefully - I think the final
decision is essentially his - he has put in *so* much work and got enough
public and private mail to make a balanced decision. David must feel like
he's on a never-ending marathon - if he feels that SAX1.0gamma is 'it', I
think we should accept it. [Every finalisation has bits that someone
doesn't like, even XML...]
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Mon May 4 14:35:31 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
In-Reply-To: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
Message-ID: <199805041233.IAA00288@unready.microstar.com>
Peter Murray-Rust writes:
> It's reasonable to assume that generic text-aware XML software
> might be asked "please find all elements in a document which are
> expressed in German." It can't look for all elements with
> xml:lang="de" because they don't have this explicitly. So general
> mechanisms such as XPointers can't be used, and bespoke software
> must be written. This software (presumably) finds all elements
> which *do* have the attribute and continues recursively.
>From a logical perspective, your query is simply
Find all elements in a document where the element or the nearest
ancestor with a value for the xml:lang attribute specifies "de" as
the attribute's value.
With SDQL, for example, this test is surprisingly simple:
(equal? (inherited-attribute-string "xml:lang") "de")
Your point is well-taken, though: fragments of XML documents are often
tightly bound to their context (similar situations would involve
attributes for effectivity, security level, revision status, etc.).
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Mon May 4 14:54:24 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:06 2004
Subject: SAX: Exception Handling
In-Reply-To: <354D9990.5410F401@jclark.com>
References: <199805020046.UAA02818@unready.microstar.com>
<354C4EF0.81ADF185@jclark.com>
<199805031312.JAA00291@unready.microstar.com>
<354D9990.5410F401@jclark.com>
Message-ID: <199805041253.IAA00394@unready.microstar.com>
James Clark writes:
> With my application writer's hat on (I just converted XMLTest to the new
> SAX), the current solution is a significant step backwards. The typical
> simple SAX program that processes an XML document and produces some sort
> of output is going to have to make calls in its DocumentHandler
> implementation that throw IOExceptions, and each of these methods now
> have to do:
>
> try {
> ...
> }
> catch (IOException e) {
> throw new SAXException(e);
> }
>
> The original goal of SAX was to be simple and easy to use for
> application writers. I don't think requiring this sort of mumbo jumbo
> even for trivial programs is consistent with this goal.
I stood by the same position for nearly five months, in the face of
very loud opposition from people whose opinions I have great reason to
respect (as I have yours).
However, upon reflection (or perhaps simply mental exhaustion), I
realised that throwing java.lang.Exception was a false economy. When
I write
public void endElement (String name)
throws java.lang.Exception
{
[..]
}
The Java compiler will not help me discover which constructors or
methods invoked in the body can throw exceptions that I have not yet
considered. With
public void endElement (String name)
throws org.xml.sax.SAXException
{
[..]
}
The Java compiler will raise an error the first time I compile. I
will discover that java.sql.Connection.createStatement (for example)
can throw a SQLException, and I will take steps either to deal with
the exception here in the handler (perhaps by reformulating the query)
or for packing it up to send it to the top level. Of course, I can
always avoid doing so by wrapping the entire contents of each handler
in a try...catch statement
public void endElement (String name)
throws org.xml.sax.SAXException
{
try {
[..]
} catch (SAXException e) {
throw e;
} catch (Exception e) {
throw new SAXException(e);
}
}
By doing so, however, I have consciously pulled the pin out of the
grenade myself, rather than having the grenade handed to me with the
pin already missing.
> I don't see an ideal solution, but I can think of several possibilities
> in addition to the old solution and the current solution, any of which I
> think would be an improvement over the current solution:
>
> 1. The handler methods are declared to throw both SAXException and
> IOException (as I proposed for parse). My guess is that throwing
> IOException is going to be very common, and will avoid the need for the
> user to wrap exceptions themselves in a large proportion of simple
> programs. I can't see any disadvantages over the current proposal.
Your guess supposes that one class of XML application dominates.
Database applications (and they may come to dominate XML) will more
typically need to throw java.sql.SQLException, while GUI applications
will often need to throw java.awt.AWTException.
> 2. A variation on this would be to make SAXException extend IOException
> and then declare both the handler methods and parse as throwing
> IOException.
Again, this variation benefits only one group of users. If
SQLException were a subclass of IOException (for example), then the
choice might be clearer.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Mon May 4 16:02:06 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:06 2004
Subject: SAX: Exception Handling
References: <199805020046.UAA02818@unready.microstar.com>
<354C4EF0.81ADF185@jclark.com>
<199805031312.JAA00291@unready.microstar.com>
<354D9990.5410F401@jclark.com> <199805041253.IAA00394@unready.microstar.com>
Message-ID: <354DC8FC.B4291CEC@jclark.com>
David Megginson wrote:
>
> James Clark writes:
>
> > With my application writer's hat on (I just converted XMLTest to the new
> > SAX), the current solution is a significant step backwards. The typical
> > simple SAX program that processes an XML document and produces some sort
> > of output is going to have to make calls in its DocumentHandler
> > implementation that throw IOExceptions, and each of these methods now
> > have to do:
> >
> > try {
> > ...
> > }
> > catch (IOException e) {
> > throw new SAXException(e);
> > }
> >
> > The original goal of SAX was to be simple and easy to use for
> > application writers. I don't think requiring this sort of mumbo jumbo
> > even for trivial programs is consistent with this goal.
>
> I stood by the same position for nearly five months, in the face of
> very loud opposition from people whose opinions I have great reason to
> respect (as I have yours).
>
> However, upon reflection (or perhaps simply mental exhaustion), I
> realised that throwing java.lang.Exception was a false economy. When
> I write
>
> public void endElement (String name)
> throws java.lang.Exception
> {
> [..]
> }
>
> The Java compiler will not help me discover which constructors or
> methods invoked in the body can throw exceptions that I have not yet
> considered.
Right, so don't do that. Declaring an *interface* as throws
java.lang.Exception does not constrain an *implementation* of that
interface to be declared as throws java.lang.Exception. It can be
declared to throw the appropriate subclass of java.lang.Exception.
If you declare the interface
public void endElement(String name) throws java.lang.Exception;
then I can declare my implementation as
public void endElement(String name) throws java.lang.IOException {
}
or
public void endElement(String name) throws java.awt.AWTException {
}
or whatever is in fact appropriate to my implementation. In other words,
it is only be declaring the interface as java.lang.Exception, that I can
correctly declare the exceptions that by implementation throws and
thereby take advantage of Java's exception checking.
I just don't follow your argument at all.
> > I don't see an ideal solution, but I can think of several possibilities
> > in addition to the old solution and the current solution, any of which I
> > think would be an improvement over the current solution:
> >
> > 1. The handler methods are declared to throw both SAXException and
> > IOException (as I proposed for parse). My guess is that throwing
> > IOException is going to be very common, and will avoid the need for the
> > user to wrap exceptions themselves in a large proportion of simple
> > programs. I can't see any disadvantages over the current proposal.
>
> Your guess supposes that one class of XML application dominates.
> Database applications (and they may come to dominate XML) will more
> typically need to throw java.sql.SQLException, while GUI applications
> will often need to throw java.awt.AWTException.
I am not saying it will dominate but I am saying it will be a
significant class, and the other classes wouldn't be hurt by making it
easier for this class. I don't see much merit in the argument that
because we can't make things simpler for *all* applications we therefore
shouldn't make things simpler for *some*.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 16:07:11 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
In-Reply-To: <199805041233.IAA00288@unready.microstar.com>
References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
<3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980504140252.3bb717e8@pop3.demon.co.uk>
At 08:33 04/05/98 -0400, David Megginson wrote:
>Peter Murray-Rust writes:
>
> > It's reasonable to assume that generic text-aware XML software
> > might be asked "please find all elements in a document which are
> > expressed in German." It can't look for all elements with
> > xml:lang="de" because they don't have this explicitly. So general
> > mechanisms such as XPointers can't be used, and bespoke software
> > must be written. This software (presumably) finds all elements
> > which *do* have the attribute and continues recursively.
>
>From a logical perspective, your query is simply
>
> Find all elements in a document where the element or the nearest
> ancestor with a value for the xml:lang attribute specifies "de" as
> the attribute's value.
>
>With SDQL, for example, this test is surprisingly simple:
>
> (equal? (inherited-attribute-string "xml:lang") "de")
>
>Your point is well-taken, though: fragments of XML documents are often
>tightly bound to their context (similar situations would involve
>attributes for effectivity, security level, revision status, etc.).
>
>
>All the best,
>
>
>David
>
>--
>David Megginson ak117@freenet.carleton.ca
>Microstar Software Ltd. dmeggins@microstar.com
> http://home.sprynet.com/sprynet/dmeggins/
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 16:09:49 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:06 2004
Subject: SAX in Applets
Message-ID: <3.0.1.16.19980504140216.33370ee6@pop3.demon.co.uk>
I am trying to put together the JUMBO package to run as an applet as well
as an application. [Jumbo1.0 does this]. Since JUMBO2.0 is written in
JDK1.1, with Swing, there are some new aspects for me. I'd be very grateful
for any pointers - if you think they are general to XML-DEV, please reply
here, else to me.
I need three types of operation:
1. java/jre applications
2. applets from a remote site (i.e. from a server)
3. applets on the same file system (a CDROM) as the data and *.html
2 and 3 are close but there are differences in that the local CLASSPATH can
interact differently with the browser. For example, in 3 it was important
not to have a CLASSPATH set.
My current approach is to structure directories:
jumbo
classes
greetings.html (for driving the applet)
*.jar (aelfred.jar, sax.jar. swing.jar)
jumbo
xml
Jumbo.class (class files, etc.)
greetings.xml example files
*.cfg configuration files (e.g. parsers available, etc)
To run a jre we have:
jre -cp
C:\jumbo\classes;C:\jumbo\classes\aelfred.jar;C:\jumbo\classes\sax.jar;C:\ju
mbo\classes;C:\jdk1.1.5\classes.zip jumbo.xml.Jumbo greetings.xml
This works!
To run an applet the applet contains:
[I have not put Jumbo in a jar, though presumably I could do so.]
I'd be grateful for the following info:
- will the jar files sax.jar, swing.jar, etc. work in applets (are they
compressed? does it matter?)
- can the swing.jar be put in the Netscape *.jar library (I'm using
Netscape 4.05 which says it supports JDK1.1.5).
- do the top-of-the-classes and *.jar *have* to be in the same directory as
the *.html (I believe this used to be a requirement).
- how do I manage resource files? In applications they can be picked up
either from a -D argument or - ?deprecated - from an environment variable.
This isn't possible for applets.
- if there are other problems (or solutions) I haven't thought of, I'd be
grateful.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Mon May 4 16:20:10 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
Message-ID: <3.0.32.19980504091654.00762d10@postoffice.swbell.net>
At 08:33 AM 5/4/98 -0400, David Megginson wrote:
>Your point is well-taken, though: fragments of XML documents are often
>tightly bound to their context (similar situations would involve
>attributes for effectivity, security level, revision status, etc.).
But one of the advantages of a hierarchically-structured data
representation is the ability to have scopes to which properties apply. If
a processor can't at least maintain a stack of the attributes of the
ancestors of the current element, it's pretty darn braindead.
I mean come on, we have to presume *some* intelligence in processors.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 16:23:25 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
In-Reply-To: <199805041233.IAA00288@unready.microstar.com>
References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
<3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980504134317.2fcfada0@pop3.demon.co.uk>
At 08:33 04/05/98 -0400, David Megginson wrote:
>Peter Murray-Rust writes:
>
>
>From a logical perspective, your query is simply
>
> Find all elements in a document where the element or the nearest
> ancestor with a value for the xml:lang attribute specifies "de" as
> the attribute's value.
Absolutely. And with the tools I have written it's trivial (for me). And
for those who have SQDL implementations, it's trivial as well. And so on.
But for the newcomer to XML this is a major concern that they have to address.
>
>With SDQL, for example, this test is surprisingly simple:
>
> (equal? (inherited-attribute-string "xml:lang") "de")
>
>Your point is well-taken, though: fragments of XML documents are often
>tightly bound to their context (similar situations would involve
>attributes for effectivity, security level, revision status, etc.).
The real point is that X*L/W3C approaches require *almost any* application
writer to implement this. If I have to write
>
(equal? (inherited-attribute-string "xml:lang") "de")
or (as I do)
String lang =
(currentNode.getXPointerNode("ANCESTOR(1,*,xml-lang,*)").getAttributeValue("
xml:lang"));
it would be nice if we all had a consistent way of doing it.
Whatever the solution it appears that processing links is significantly
more effort because of this. Respecting confidentiality, I may or may not
have made this point on XML-SIG but the decision has been taken and we have
to find the best way of implementing it. The software not only has to
search for (say) those elements with (say) role attributes, but those which
are 'intended' to have role elements.
It gets more fun with attribute remapping:
which says that wherever you encounter the attribute "my-role" in a
it is *really* a 'role' attribute. So we have:
What is the 'role' of the ? [BTW it matters getting this right,
since if you get the wrong reactants there might be an unexpected
exothermic reaction.]
Implementer 1 says:
"map all molecule|new-role to behave like molecule|role"
"make all locator molecules inherit the value of the parent roles (after
mapping"
Implementer 2 says:
"make all locator molecules inherit the value of the parent roles (without
mapping)"
I suspect I side with 2, but can I be sure that everyone sees it that way?
In any case there will be lots of times that we want an attribute from an
element. For certain types of element/attribute we may have to ask:
- give me the attributes
- does this also have some 'intended attributes'
- oh, and does it also have some attributes which are mapped by yet
another level of indirection?
The only way that I can see myself offering reliable software is if we
agree communally on what the semantics of this are *and* implement some
software/APIs to help ensure consistency. For the present I suspect I shall
say:
JUMBO does not support attribute remapping. Sorry.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Mon May 4 16:57:58 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
Message-ID:
>But one of the advantages of a hierarchically-structured data
>representation is the ability to have scopes to which properties apply. If
>a processor can't at least maintain a stack of the attributes of the
>ancestors of the current element, it's pretty darn braindead.
This attribute discussion is interesting, because I'm not sure you want to
expect the processor to have the entire document available for stack building.
One of the projects that I'm working on (slowly) is an implementation of
XPointers that uses the server's processing power to chop just out the section
of a document the user wants and avoiding sending out millions of bytes of
wasted data. I suspect that we'll need to make the server handle the xml:lang
and xml:space stack, and re-enter their values into the topmost element
returned, or something like that.
xml:lang and xml:space seem like they're worth the extra effort, though.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From David.Brownell at Eng.Sun.COM Mon May 4 17:57:45 1998
From: David.Brownell at Eng.Sun.COM (David Brownell)
Date: Mon Jun 7 17:01:06 2004
Subject: SAX: Exception Handling
Message-ID: <199805041556.IAA27904@argon.eng.sun.com>
> > throws java.lang.Exception
> > {
> > [..]
> > }
> >
> > The Java compiler will not help me discover which constructors or
> > methods invoked in the body can throw exceptions that I have not yet
> > considered.
>
> Right, so don't do that. Declaring an *interface* as throws
> java.lang.Exception does not constrain an *implementation* of that
> interface to be declared as throws java.lang.Exception. It can be
> declared to throw the appropriate subclass of java.lang.Exception.
But then since application programmers write to interfaces, not
implementations, then application programmers would have all those
nasty problems with getting handed random exceptions.
A primary intent of exception declarations is to constrain the
kinds of errors that a substystem (e.g. parser) reports, so that
its clients have a clean "contract" and know exactly the kinds of
"expected faults" they have to deal with.
Clauses like "throws Exception" are basically counter to the core
philosophy of exceptions in Java.
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Mon May 4 18:25:31 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
Message-ID: <3.0.32.19980504112217.00da4a20@postoffice.swbell.net>
At 02:56 PM 5/4/98 UT, Simon St.Laurent wrote:
>>But one of the advantages of a hierarchically-structured data
>>representation is the ability to have scopes to which properties apply. If
>>a processor can't at least maintain a stack of the attributes of the
>>ancestors of the current element, it's pretty darn braindead.
>
>This attribute discussion is interesting, because I'm not sure you want to
>expect the processor to have the entire document available for stack
building.
You don't need the entire document, only the attributes of the ancestry of
the root of the subchunk you're processing. This is not a big deal.
Cheers,
Eliot
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Mon May 4 21:16:47 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:06 2004
Subject: AElfred: URL Change
Message-ID: <199805041915.PAA01628@unready.microstar.com>
Microstar is moving the AElfred XML parser a little. Please change
all links and bookmarks to point to
http://www.microstar.com/XML/AElfred/
The old URL, "http://www.microstar.com/XML/", still contains a
prominent pointer to AElfred for now, so stale links will not be
disastrous.
Thanks, and all the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From maillist at chris.hubick.com Mon May 4 22:07:55 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun 7 17:01:06 2004
Subject: PEReference
Message-ID:
I am writing an XML analization tool upon which I am building a
parser. At the bottom level it reads in XML and generates
start/end/character events based on the productions in the XML spec. For
example, when the parser encounters something matching the Name
production, say the Name "foo", it would generate events:
foo
When it is complete I hope to build an actual parser on top of it.
The analizer can currently read most any document, "all" it is lacking
support for, and which I am working on, is production [29] markupdecl, and
all of it's dependencies. Now this is where I hit a snag.
In the XML spec at:
http://www.w3.org/TR/REC-xml#NT-markupdecl
It states:
> The markup declarations may be made up in whole or
> in part of the replacement text of parameter entities.
> The productions later in this specification for
> individual nonterminals (elementdecl, AttlistDecl,
> and so on) describe the declarations after all the
> parameter entities have been included.
I want the productions for an XML document BEFORE the parameter entities
have been included. I really think the XML spec should have included
productions for before as well as after PEReference inclusion.
I want to do PEReference inclusion at the parser level, not at my lower
"analizer" level, which I want to generate events that directly reflect
what is in the document (before inclusion).
So for my purposes, I need to figure out the grammer for an _unprocessed_
XML document.
So my first step/idea was to just look at the current grammer, and start
adding PEReferences where I thought necessary:
[45] elementdecl ::= ''
[48] cp ::= (Name | PEReference | choice | seq) ('?' | '*' | '+')?
[51] Mixed ::= '(' S? '#PCDATA' (S? '|' S? (Name | PEReference))* S? ')*' | '(' S? '#PCDATA' S? ')'
[52] AttlistDecl ::= ''
[53] AttDef ::= S (Name | PEReference) S AttType S DefaultDecl
[54] AttType ::= StringType | TokenizedType | EnumeratedType | PEReference
[58] NotationType ::= 'NOTATION' S '(' S? (Name | PEReference) (S? '|' S? (Name | PEReference))* S? ')'
[59] Enumeration ::= '(' S? (Nmtoken | PEReference) (S? '|' S? (Nmtoken | PEReference))* S? ')'
Where I get really confused is:
[9] EntityValue ::= '"' ([^%&"] | PEReference | Reference)* '"' | "'" ([^%&'] | PEReference | Reference)* "'"
If, as the spec states, these are the declarations AFTER PE inclusion, how
can there be PEReferences???
Part of the reason I am writing this is to get a better grip on (read
learn) XML. Any guidance would be much appreciated, thanks!
---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Mon May 4 22:09:58 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
In-Reply-To: <3.0.32.19980504091654.00762d10@postoffice.swbell.net>
Message-ID: <3.0.1.16.19980504200839.29af6aba@pop3.demon.co.uk>
At 09:17 04/05/98 -0500, W. Eliot Kimber wrote:
>At 08:33 AM 5/4/98 -0400, David Megginson wrote:
>
>>Your point is well-taken, though: fragments of XML documents are often
>>tightly bound to their context (similar situations would involve
>>attributes for effectivity, security level, revision status, etc.).
>
>But one of the advantages of a hierarchically-structured data
>representation is the ability to have scopes to which properties apply. If
>a processor can't at least maintain a stack of the attributes of the
>ancestors of the current element, it's pretty darn braindead.
I suppose most of my code must be pretty darn undead then... The point I
was making is that 'scope' is introduced in a total of about 3-4 sentences
in total in the XML + XLink specs. It's actually quite easy for an
implementer to miss. I'm not complaining - I'm saying that it would be
useful to discuss whether we can all implement this easily and if so how.
[BTW how much freely available is actually out there which implements
any/all of these attributes?] The touchstone of XML-applications - the
desperate Perl Hacker - is not necessarily going to build a stack of the
attributes of an/every element. And I'd represent - gently - that some
people (at least one) are going to have met attribute re-mapping for the
first time and possibly get muddled.
For example it seems that a Node must have 2-3 sets of methods:
getAttributeValue(String attName);
getPseudoAttributeValue(String specialAttnameInContext)
and
getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName)
If this is a useful way forward (other than the somewhat contrived names)
it would be nice to know. If not, what?
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at quake.net Mon May 4 23:12:14 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun 7 17:01:06 2004
Subject: ANN: SAXDOM Updated
Message-ID: <002b01bd77a0$51b597d0$2ee044c6@arcot-main>
A preliminary version of SAXDOM supporting the latest DOM spec (04/16/98) is
released. There are still some clean ups (i.e. syncronization blocks for
concurrent access to DOM objects) and plenty of tests to go.
This version of SAXDOM includes NodeIterator.release method which is
currently not in the spec. I am hoping it is included in the spec. You
will find SaxContainerIterator class particularly interesting because it is
implemented as a Node to allow 'between' iterator position and 'live' model
requirements.
Enough said. Here is the URL:
http://www.docuverse.com/personal/saxdom.html
Regards,
Don Park
http://www.docuverse.com/personal/index.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Mon May 4 23:35:38 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:06 2004
Subject: Attributes with Intent
Message-ID: <3.0.32.19980504163150.006fd2f8@postoffice.swbell.net>
At 08:08 PM 5/4/98, Peter Murray-Rust wrote:
>For example it seems that a Node must have 2-3 sets of methods:
> getAttributeValue(String attName);
> getPseudoAttributeValue(String specialAttnameInContext)
>and
> getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName)
Pretty much. I usually have these functions:
- AttValue(string LocalAttName)
Returns value of attribute given name as specified for the element.
- LocalAttName(string ArchAttName)
Returns local name mapped to architectural name (e.g., whatever xml:lang
has been mapped to for this element).
- ArchAttvalue(string ArchAttName)
Returns the value of an attribute given its architectural name. Resolves
remapping as necessary.
- AttEffectiveValue(string LocalAttName)
Returns the effective value of an attribute given its local (to the
document)
name.
- ArchAttEffectiveValue(string ArchAttName)
Returns the effective value of an attribute given its architectural
name.
Obviously, the last two are implemented using the first three to recursively
examine the ancestry of the current element.
There's another case, less common, but to think about, which is the case
where the effective value of an ancestor element's attribute is dependent
on the value in its content. You see this most often with security
classification attributes, where the effective security classification of
an element is the highest classification of any of its content, e.g., in
this example, the document element's effective security "eyes-only":
]>
......
Where "eyes-only" is a higher classification than "top-secret".
I would not expect to see a generalized method for determining the
effective value of attributes in this way.
Propogation of attribute values down the hierarchy is also trickier than
just looking up the ancestry, because you might have rules about how
defaults get set or reset. HyTime's "default value list" (clause A.5.6,
Default value list, part of the General Architecture), provides a more
complete facility for specifying exactly what the defaults for implied
attributes are and how they should be propogated. It's an example of the
sorts of options you need to anticipate in the general case.
>The touchstone of XML-applications - the
>desperate Perl Hacker - is not necessarily going to build a stack of the
>attributes of an/every element.
How hard can it be?:
# NOTE: forgive my pseudo Perl syntax-- you know what I mean.
@attstack() ; # stack of attlist structures
@attlist{name} = "value" ;# Associative array of atts values by name
sub process_element() {
push(attstack, attlist); # Push associative array of atts on stack
for each att in element.atts ; # Method that returns array of att objects
$attlist{att.name} = att.value
next
}
Cheers,
Eliot
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Tue May 5 00:58:02 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:07 2004
Subject: Attributes with Intent
In-Reply-To: <3.0.32.19980504163150.006fd2f8@postoffice.swbell.net>
Message-ID: <3.0.1.16.19980504225658.302f3ca6@pop3.demon.co.uk>
At 16:32 04/05/98 -0500, W. Eliot Kimber wrote:
>At 08:08 PM 5/4/98, Peter Murray-Rust wrote:
>
>>For example it seems that a Node must have 2-3 sets of methods:
>> getAttributeValue(String attName);
>> getPseudoAttributeValue(String specialAttnameInContext)
>>and
>> getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName)
>
>Pretty much. I usually have these functions:
>
>- AttValue(string LocalAttName)
> Returns value of attribute given name as specified for the element.
>- LocalAttName(string ArchAttName)
> Returns local name mapped to architectural name (e.g., whatever xml:lang
> has been mapped to for this element).
>- ArchAttvalue(string ArchAttName)
> Returns the value of an attribute given its architectural name. Resolves
> remapping as necessary.
>- AttEffectiveValue(string LocalAttName)
> Returns the effective value of an attribute given its local (to the
>document)
> name.
>- ArchAttEffectiveValue(string ArchAttName)
> Returns the effective value of an attribute given its architectural
> name.
>
>Obviously, the last two are implemented using the first three to recursively
>examine the ancestry of the current element.
This is exactly was I was after :-). These may seem nobrainers to you, but
they aren't to people like me :-)
[...]
>>The touchstone of XML-applications - the
>>desperate Perl Hacker - is not necessarily going to build a stack of the
>>attributes of an/every element.
>
>How hard can it be?:
>
># NOTE: forgive my pseudo Perl syntax-- you know what I mean.
>@attstack() ; # stack of attlist structures
>@attlist{name} = "value" ;# Associative array of atts values by name
>
>sub process_element() {
>
> push(attstack, attlist); # Push associative array of atts on stack
> for each att in element.atts ; # Method that returns array of att objects
> $attlist{att.name} = att.value
> next
>
The point is not that it is not hard to *do*, it's knowing exactly *what*
you have to do when. For you it's second nature :-). That's why I'm asking...
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Tue May 5 01:11:44 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:07 2004
Subject: Attributes with Intent
Message-ID: <3.0.32.19980504180736.00e25644@postoffice.swbell.net>
At 10:56 PM 5/4/98, Peter Murray-Rust wrote:
>At 16:32 04/05/98 -0500, W. Eliot Kimber wrote:
>>
>>Obviously, the last two are implemented using the first three to recursively
>>examine the ancestry of the current element.
>
>This is exactly was I was after :-). These may seem nobrainers to you, but
>they aren't to people like me :-)
My appologies: it does seem obvious to me, but I've been doing this sort of
thing about 15 years now, so I guess it would. I've also had the advantage
of being able to crib from lots of good code (such as the old sgmls.pl that
you got with James Clark's old sgmls package...)...
There's probably an "XML Processing Patterns" book in there somewhere,
although I suspect that Bob Ducharme's book is that (I haven't read it yet).
>The point is not that it is not hard to *do*, it's knowing exactly *what*
>you have to do when. For you it's second nature :-). That's why I'm
asking...
Again, good point...keep asking. I'll try to be a bit more supportive in my
answers.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From dan at intervista.com Tue May 5 02:00:17 1998
From: dan at intervista.com (Dan Ancona)
Date: Mon Jun 7 17:01:07 2004
Subject: Separation of formatting...
Message-ID: <3.0.32.19980504171122.0097fe20@mail.intervista.com>
At 09:24 PM 5/1/98 -0400, Gregg Reynolds wrote:
>An interesting paper on a similar topic is at
>http://www.sil.org/sgml/ohco1.html, "Refining Our Notion of What Text
>Really Is: The Problem of Overlapping Hierarchies."
Ooh, that's a *really* good link. Unfortunately, it melted my cerebral
cortex. I'll try to read it in more detail later.
Re: the later posts in this thread, I hadn't considered the
text/presentation feedback loop or how to handle it, but that will
definitely have an effect on what I'm working on. Thanks for the
thoughtful answers...
Dan
__________________________________________________________
Dan Ancona "engage!"
Tech Support, Evangelism and Cookery Intervista Software
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Tue May 5 08:29:49 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:07 2004
Subject: SAX: Exception Handling
References: <199805041556.IAA27904@argon.eng.sun.com>
Message-ID: <354EB022.28130075@jclark.com>
David Brownell wrote:
> James Clark wrote:
> > Declaring an *interface* as throws
> > java.lang.Exception does not constrain an *implementation* of that
> > interface to be declared as throws java.lang.Exception. It can be
> > declared to throw the appropriate subclass of java.lang.Exception.
>
> But then since application programmers write to interfaces, not
> implementations, then application programmers would have all those
> nasty problems with getting handed random exceptions.
>
> A primary intent of exception declarations is to constrain the
> kinds of errors that a substystem (e.g. parser) reports, so that
> its clients have a clean "contract" and know exactly the kinds of
> "expected faults" they have to deal with.
You're not distinguishing two quite different uses of interface:
1. There are interfaces like Parser. These are interfaces that are
implemented by the SAX implementation and called by clients of the
application. For interfaces that are used in this way, I completely
agree with the points you make. I did not suggest that methods on
parser should be declared as throws Exception: they should be declared
as throws SAXException.
2. There are interfaces like DocumentHandler. These are not implemented
by the SAX implementation, nor are they called by the client. Instead
the reverse is true: these are called by the SAX implementation and
implemented by the client. Your arguments don't carry through to
interfaces that are used in this way. Declaring methods in these
interfaces as throws Exception does not cause a client to have to deal
with random errors because they are not called by the client. Nor does
it require the parser to deal with random errors
> Clauses like "throws Exception" are basically counter to the core
> philosophy of exceptions in Java.
It's not that simple.
The real problem ehere is the lack of parameterized types in Java. If
Java had a concept of parameterized types, we could get the typing
exactly write, eg (using a C++ like syntax):
template
interface DocumentHandler {
void endElement(String name) throws ET;
}
template
interface Parser {
void setDocumentHandler(DocumentHandler handler);
void parser(InputSource in) throws ET;
}
class MyDocumentHandler implements DocumentHandler {
void endElement(String name) throws IOException {
}
}
class Demo {
void foo(Parser parser) {
parser.parse(new MyDocumentHandler());
}
}
The use of Exception that I'm arguing for is analogous to the use of
Object in a class like Vector. Because Java doesn't have parameterized
types, you can't express the idea that the type you can get out of the
Vector constrains the type you can put in (or vice-versa), so you have
to settle for using Object. Similarily with SAX you can't express that
the idea that the type of the exception that the Parser can throw
contrains the type of the exception that the handler methods can throw,
so you have to settle for using Exception.
Just as with Vectors, you can easily add a type-safe wrapper:
interface IODocumentHandler extends DocumentHandler {
void endElement(String name) throws IOException;
}
class IOParser {
Parser parser;
public void setDocumentHandler(IODocumentHandler handler) {
parser.setDocumentHandler(handler);
}
public void parser(InputSource in) throws IOException {
try {
parser.parse(in);
}
catch (SAXException e) {
// The cast can't fail since the DocumentHandler is an
// IODocumentHandler that can only throw IOExceptions
throw (IOException)e.getException();
}
}
}
With this wrapper I now get a version of SAX specialized for handlers
that throw IOExceptions. Using this wrapper I get complete exception
type-safety. I can similarily do a wrapper for AWTException or
SQLException which will give me exception type-safety for applications
using AWT or SQL. Note that these wrappers have minimal run-time cost,
because the handler methods are called directly.
The more I think about this the more certain I am that having the
handler methods declared as throws Exception is the right solution.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Tue May 5 12:59:14 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:07 2004
Subject: AElfred: Missing files added
Message-ID: <199805051057.GAA00457@unready.microstar.com>
There were a couple of non-critical files missing from the AElfred 1.2
distribution (in particular, README.txt and ReaderDemo). I have
updated the distribution with no other changes; you may download it
from the following URL:
http://www.microstar.com/XML/AElfred/aelfred-1.2.zip
Please note that if you have already downloaded AElfred 1.2, you do
not need to do so again unless you are interested in the missing
files.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Tue May 5 13:01:20 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:07 2004
Subject: SAX: Exception Handling
References: <199805041556.IAA27904@argon.eng.sun.com> <354EB022.28130075@jclark.com>
Message-ID: <354EEFF9.BC000335@jclark.com>
I've just switched XP to use the exception handling approach I'm
advocating for SAX, and I'm very happy with it:
- if you
import com.jclark.xml.parse.*;
import com.jclark.xml.parse.io.*;
you get an interface where the handler methods can throw IOException and
where the parse method throws IOException (the same as the old XP
interface)
- if you
import com.jclark.xml.parse.*;
import com.jclark.xml.parse.awt.*;
you get an interface where the handler methods can throw AWTException
and where the parse method throws AWTException and IOException;
- if your application needs to throw some other kind of exception, you
can either create a package with 4 trivial wrapper classes (which could
be automatically generated from the com.jclark.xml.parse.awt.* source),
or you can use com.jclark.xml.parse.base.* where the handler methods are
declared as throwing Exception and the parse method is declared as
throwing IOException and ApplicationException (which is a wrapper
analagous to SAXException).
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From clovett at microsoft.com Tue May 5 19:04:09 1998
From: clovett at microsoft.com (Chris Lovett)
Date: Mon Jun 7 17:01:07 2004
Subject: Is MSXML pure Java?
Message-ID: <2F2DC5CE035DD1118C8E00805FFE354C0395C744@red-msg-56.dns.microsoft.com>
> I just installed it to check it out and I noticed that it installs
> a DLL. It also doesn't come in a standard jar or zip file,
> but uses an installer which appends to the Microsoft Java VM
> classes.zip file in \winnt\java\classes. In order words, it
> seems too close and comfy with the Microsoft VM which scares me
> into thinking that perhaps it uses one of MS's horible extensions
> to tie it to IE/Windows.
>
> Is it non-pure? Or is the DLL stuff just some kind of ActiveX/DSO
> bridge junk?
>
[Chris Lovett] The DLL that is an optimization that only works under
Windows to improve the speed of Java IO by a large amount (I think it is a
factor of 4 or something). This will simply not load on any other platform
and become a no-op. To be sure you can remove the XMLStream classes all
together. The rest of the parser is very much standard plain vanilla Java,
although it does use some JDK 1.1 features. There is also a switch in their
that looks for JDK 1.1, so if you have a newer JDK than 1.1 you will need to
disable that test and simply set the jdk11 member variable to "true".
> The docs aren't JavaDoced either :(, but it does appear to
> have its own Object Model which seems suspiciously like
> DOM (which is understandable since MS is one of the submitters),
> but different enough to be confusing. Any word whether
> MSXML will eventually support DOM native?
>
[Chris Lovett] Yes, the plan is to support DOM. The Java parser was
written last year when the DOM was still in its infancy.
So far the DOM is still a moving target.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From clovett at microsoft.com Tue May 5 19:16:55 1998
From: clovett at microsoft.com (Chris Lovett)
Date: Mon Jun 7 17:01:07 2004
Subject: [xX][mM][lL] is reserved
Message-ID: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com>
Spec issue:
Section 2.3 paragraph 3:
"A Name is a token beginning with a letter or one of a few
punctuation characters, and continuing with letters, digits, hyphens,
underscores, colons, or full stops, together known as name characters.
Names beginning with the string "xml", or any string which wouild match
(('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved for standardization in
this or future versions of this specification".
Currently MSXML disallows "xml" from being used as a "namespace", for
example: "xml:foo" since this seems to be the way the XML spec is heading
with "xml:space" and "xml:lang". So is the above paragraph just a little
too restrictive and should it include the colon ? If not then the following
very common XML example is illegal:
This is a test.
Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that
"xmlu" is an illegal name, but James Clark's expat seems to allow this ??
Can the XML spec gurus clarify this paragraph ? Thanks.
Chris Lovett.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From crism at ora.com Tue May 5 19:55:44 1998
From: crism at ora.com (Chris Maden)
Date: Mon Jun 7 17:01:07 2004
Subject: [xX][mM][lL] is reserved
In-Reply-To:
<2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com>
(message from Chris Lovett on Tue, 5 May 1998 10:15:35 -0700)
Message-ID: <199805051754.NAA21373@ruby.ora.com>
[Chris Lovett]
> "A Name is a token beginning with a letter or one of a few
> punctuation characters, and continuing with letters, digits,
> hyphens, underscores, colons, or full stops, together known as name
> characters. Names beginning with the string "xml", or any string
> which wouild match (('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved
> for standardization in this or future versions of this
> specification".
[...]
> Can the XML spec gurus clarify this paragraph ? Thanks.
The spec seems clear to me. Names may not begin with "xml" or any
case variation thereof. What's unambiguous?
If you're asking if it *should* say what it does, the answer is yes,
to me. The XML WG can always choose to free the string for wider use
later, but would be unable to capture it again having done so. As
with many other decisions, they've chosen to err (if an error it is)
on the side of caution.
However, one side effect of this is that a conformant XML 1.0
processor may reject a conformant XLink document for using "xml:link"
as an attribute name.
-Chris
--
http://www.oreilly.com/people/staff/crism/ +1.617.499.7487
90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Tue May 5 20:45:29 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:07 2004
Subject: [xX][mM][lL] is reserved
Message-ID: <3.0.32.19980505114421.00c608b0@pop.intergate.bc.ca>
At 10:15 AM 5/5/98 -0700, Chris Lovett wrote:
>Currently MSXML disallows "xml" from being used as a "namespace", for
>example: "xml:foo" since this seems to be the way the XML spec is heading
>with "xml:space" and "xml:lang". So is the above paragraph just a little
>too restrictive and should it include the colon ?
Yes, this should be disallowed as a namespace prefix... the latest
namespace draft supports this point of view.
>If not then the following
>very common XML example is illegal:
>
> This is a test.
>
Yes.
>Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that
>"xmlu" is an illegal name, but James Clark's expat seems to allow this ??
Well, end-users certainly shouldn't invent such names in their documents.
On the other hand, it's not clear that a processor should reject it,
because...
>Can the XML spec gurus clarify this paragraph ? Thanks.
Well, actually, the spec is a bit mushy on this; for PI targets, it
excludes "^xml*" right in the grammar, but for other names, it just
says such names are "reserved," whatever that means.
My interpretation has been that "reserved" meant, in practical terms,
reserved precisely for things like xlink and namespaces and so on;
but the spec doesn't do a good job of making this clear. I think,
Chris, you ought to get Charles or Jean to raise this formally in the
WG. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at quake.net Tue May 5 22:17:05 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun 7 17:01:07 2004
Subject: Is MSXML pure Java?
Message-ID: <000a01bd785a$5c558460$2ee044c6@arcot-main>
>> The docs aren't JavaDoced either :(, but it does appear to
>> have its own Object Model which seems suspiciously like
>> DOM (which is understandable since MS is one of the submitters),
>> but different enough to be confusing. Any word whether
>> MSXML will eventually support DOM native?
>>
>[Chris Lovett] Yes, the plan is to support DOM. The Java parser was
>written last year when the DOM was still in its infancy.
>So far the DOM is still a moving target.
FYI, My FREE-DOM package (formerly called SAXDOM) can be used with MSXML if
you combine two drivers: one for bridging FREE-DOM to SAX and another for
bridging SAX to MSXML. A direct driver from FREE-DOM to MSXML as well as
other popular parsers is planned.
FREE-DOM can be obtained at:
http://www.docuverse.com/personal/freedom/index.html
Regards,
Don Park
http://www.docuverse.com/personal/index.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From elm at arbortext.com Wed May 6 01:35:26 1998
From: elm at arbortext.com (Eve L. Maler)
Date: Mon Jun 7 17:01:07 2004
Subject: [xX][mM][lL] is reserved
In-Reply-To: <98May5.140037edt.26883@thicket.arbortext.com>
References: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com>
Message-ID: <3.0.5.32.19980505193327.00a2bac0@village.promanage-inc.com>
At 01:54 PM 5/5/98 -0400, Chris Maden wrote:
>However, one side effect of this is that a conformant XML 1.0
>processor may reject a conformant XLink document for using "xml:link"
>as an attribute name.
FYI, because of the namespace work, we voted to change the xml:link
attribute to xlink:form, though this doesn't appear in the spec yet.
You'll see it in the next draft. (No, I'm not sure when that will be. :-)
Eve
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Wed May 6 03:19:05 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:07 2004
Subject: RFD: comp.text.xml
References:
Message-ID: <354FBA1C.B61@hiwaay.net>
Steven Champeon wrote:
>
> Len, you're sounding a bit defensive.
Quite true. Some ideas and principles are worth defending to
a point. This post is the last point. I've not time for a
flamewar, and anyway, that is what the icon that looks like
crossed sabres on the Outlook menu is there to stop. :-)
> Do you think C would have gotten as
> far as it did if every time someone brought it up, someone else popped up
> to remind everyone that it was inspired by BCPL? Need we say that Java was
> once named Oak in order to reap its benefits?
No. That we remember this is the case is sufficient. OTOH, in my
own descriptions of these, I decline to call Java a standard. It is
a product of Sun well on its way to becoming a standard technology
similar to RealMedia. These distinctions enable me to look at
customers and say, "yes, runs best on X machine" and not endure
the endless "But Why Won't They Implement the Standard?" questions.
When it goes to ISO, as HTML has, I will call it a standard. Until
then, XML is a standard technology and the property of the W3C.
> I started out working with SGML, waited for the CALS table model, hung
> out wondering when DSSSL would be done, learned Author/Editor's internal
> styles language as a desperation measure, bemoaned the ridiculous complexity
> of FOSI, and when the Web hit I never looked back.
With you all the way. I spent time and personal capital pleading for a
simplification. When the SGML On The Web project was announced, I
stepped up to endorse it. It was an idea whose time had come and
in fact, was overdue. I had watched customers, friends, companies,
pay large prices. OTOH, I was also there to see certain ones
pay off as SGML delivered on the lifecycle properties promised.
Markup works. Since XML is the same subset of SGML that most
of us had always been using, I feel confident it will deliver
the same benefits. No quarrel there.
> I'm thrilled to death
> that the idea of separating presentation from markup has become a reality
> for those of us without multi-million dollar consulting budgets,
So am I Steve, but frankly, I am the rogue who didn't buy into the
whole arcana and was using an SGML hypertext system with stylesheets
by 1992-93. I even arranged to give it away: IADS. It wasn't
everything I wanted, but it sure proved the point, and hey, it
even let you design and use your own DTDs. That reality has been
real for almost a decade.
> and I'm
> thrilled that XML will be part of the next generation of browsers. I'm
> afraid that Microsoft has cornered the market on Windows-based XML parsing,
> by virtue of its being built into the "Internet Explorer OS Upgrade" blob
> they seem to be pushing as a pre-req for any new application installs,
This doesn't bother me a bit. The rules for introducing standard
technologies using the W3C processes have been followed apparently
to the letter. Practicality prevails. At this point, MS is the
defacto standard platform. Having XML on it is the right thing.
It will be interesting to see how well it is used and by whom.
There are some real issues about integrating Chrome with other
standard languages, but that is another problem to be worked.
I think XML and MS may soon face a pretty unruly crowd over that one.
> but the fact that there is a standard to appeal to - a simple standard,
> which doesn't give MS much cover should they decide to slightly alter
> their implementation. I think it's amazing that there are fully-functional
> parsers out there today.
Since the goal was to enable the DPH to write a parser in a week or so,
this shouldn't amaze you too much. Meeting the design requirements is
usually good practice in engineering efforts. Given that no one had to
start from scratch and most of the major decisions were pruning
decisions
over existing solid work, I would be very surprised if that weren't the
case.
> AFAIK, there isn't a single application out there
> which fully supports all of the SGML arcana.
AFAIK, you are right and that hasn't made a warts worth
of difference. SGML has been and is applied to large
publishing and database projects with much success. Cost
of tools varied, but that was mainly in the rendering.
The SGML vendor community overcharged and often
overemphasized complete compliance. It was suicide
and opened the door to the events that followed. The SGML
Way was a recipe for commercial failure. XML proves
that lesson was learned. Good.
> Len, I've known you since my first days reading comp.text.sgml, and I
> respect your enlightened approach to subjects both technical and humane.
Thank you.
> Please don't expect XML to be a bait-and-switch move to bring SGML into
> everyone's home.
I don't expect it to be. Nor did I expect SGML On The Web to be a bait
and switch move to replace SGML with XML, but I was wrong. OTOH, at
certain levels, I don't care about that either. For those of us who
were fortunate enough to watch Dr. Goldfarb, James Mason, Lynn Price,
Martin Bryan, Steve Newcomb, Sharon Adler, Anders Berglund, and the
others who created
SGML, XML is the crowning achievement, the proof that their ideas and
their work have succeeded. If a name change is required to ensure
the continuing success, so be it.
But I will not sit by and watch them be robbed by the also
rans and newtoBes of the industry of the credit for that. Not
only does that deny what I know to be true, it is wrong. It also
repeats and legitimizes that same precedent set by those who have
publicly
derided a work to capture their own market, then turn to
take that same work, rename it, and call themselves fathers
of a new language. History requires a certain
ethical and moral payment: respect.
> And please recognize that XML is a powerful, simple,
> iteration on some of the ideas espoused by SGML.
To be sure: SGML On The Web.
> Where XML will not suffice,
> perhaps SGML will thrive.
ISO 8879 is dead. Without a base of implementors, no ideas however
good can thrive. But there will be a price for it. ISO
rules and processes had a hint of openness and rule of law.
A Don Park stood a chance in that. Yet these rules and
processes are made by men and in that, justice is the
gift of the individual. I remember that very strange
fellow with the prison tattoos who made it into the
HyTime meeting in Almaden (sorry eliot, this predates
you). I thought it awkward that he was there, yet, Dr
Goldfarb neither feared him nor took measures to remove him.
It was an awesome demonstration of integrity. If this
is the case with the Director, then let him change the
rules and open the meetings. This technology which he
espouses and has done so much to promote enables it and
without nearly the problems Dr. Goldfarb overcame. Let him
be that much a man, and I will give him that much credit.
> But where XML will suffice, its users need not
> know thing one about SGML. In my mind, this is a good thing.
In my view, they are the same thing and that is a good thing. If
we have to accept that our *standards* are the product of
The Director's approval, and that only the elect can know
the shape of things to come such that a Don Park doesn't even
have half a chance to compete, then it is time to go back to
playing guitar for happy hour crowds. It is something I
understand and can see the good of doing.
I appreciate your comments, Steve. I really do. But my
time here is past and there are other tasks to be done.
There will be comp.text.xml, then there will be whatever
follows that. As the twig is bent....
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Wed May 6 03:34:53 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:07 2004
Subject: Parents and children (was RFD: comp.text.xml)
References:
Message-ID: <354FBD8D.329C@hiwaay.net>
Simon St.Laurent wrote:
>
> So far it has its own limitations, and 20-odd announced books (3 of which have
> SGML in the title) coming out, as well as a new name.
Congratulations. That is a good start.
> It also has press
> attention like SGML hasn't _ever_ seen.
Umm... press in the web is a taken for granted thing now. As for
SGML, check the references in the early part of the decade including
a whole issue of Byte. Remember, those were different times before
the deflated value of the press today. In those days, even a
column in Byte caused quite a stir.
> It has ringing endorsements from
> major corporations, and promising new products. None of it's here yet,
> really, but the standard only arrived two months ago.
Since it is a pruned version of SGML, one might say
that even Frankenstein could walk on the first day since
the legs were already able. "It's Alive!!!"
> Not a bad start. Hopefully, it will grow.
Hopefully it will stay small and its applications will grow.
> As for abusing the parent, I think you may be in denial about SGML's
> less-than-friendly reputation.
No. I have had to contend with that for some years.
> There were good reasons that the W3C created
> XML, I understand, as well as good reasons for naming it XML rather than
> SGML-Lite.
I agree with the first, not the last. We have to let that go.
> This doesn't mean that SGML is evil incarnate - it just means that
> it involved a learning curve that many people found less than exciting.
So does XML. I think once it is out there awhile, you will get to
endure that as well. You see, SGML did not start out with that
reputation. It grew as some found it convenient. Compared to
tagging Digital Standard Runoff by hand, it was a piece of cake.
> I just don't
> think that comp.text.xml is meant as a forum for diplomacy. XML-DEV or other
> forums are probably more appropriate. I don't understand why you seem so
> dismissive of the needs of people who are coming to XML from non-SGML (and
> non-standards body) backgrounds.
You will find that the first time a Jorn Barger drops on the list,
diplomacy will be of some use. I don't dismiss their needs. I hope
to see them met. I dismiss the arguments that say they become
confused by SGML arcana. It is the same people answering both sets of
questions. So far, that hasn't been a problem.
> I don't think it's a great place to learn about XML. It's not always
> >> easy to determine when SGML-only features are being discussed, and the
> >> question overlaps can lead to some fairly significant confusion, especially
> >> for beginners.
>
> >This is speculative but so far, the one issue that can be taken
> >seriously.
>
> Well, at least we seem to agree on the pedagogical value of a separate
> comp.text.xml newsgroup. That's a start.
Yes. Much success to you.
BTW, rumors are always suspect. ;-)
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at quake.net Wed May 6 03:50:39 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun 7 17:01:07 2004
Subject: RFD: comp.text.xml
Message-ID: <001a01bd7890$62c2a8b0$2ee044c6@arcot-main>
Guys,
>ISO 8879 is dead. Without a base of implementors, no ideas however
>good can thrive. But there will be a price for it. ISO
>rules and processes had a hint of openness and rule of law.
>A Don Park stood a chance in that. Yet these rules and
What is A Don Park? Are you guys using my name in the same spirit as John
Smith or as in Roadkill?
Don Park /don bark'/ n 1. Hopeless idealistic person who is destined to be a
roadkill in the information age dominated by large corporations. || vt 2. to
place and leave (a vehicle) temporarily only to find it flattened under a
giant lowercase letter E.
At least my name will be above Don Quixote whose entry will be amended to
read "See Above".
As silly as a Donut,
Don Park
http://www.docuverse.com/personal/index.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 6 05:48:42 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:08 2004
Subject: Parents and children (was RFD: comp.text.xml)
References: <354FBD8D.329C@hiwaay.net>
Message-ID: <354FDDC1.A80CEBB@technologist.com>
len bullard wrote:
>
> Umm... press in the web is a taken for granted thing now. As for
> SGML, check the references in the early part of the decade including
> a whole issue of Byte.
And even another misleading Economist article! And articles in PC
Magazine, etc.
> > This doesn't mean that SGML is evil incarnate - it just means that
> > it involved a learning curve that many people found less than exciting.
>
> So does XML. I think once it is out there awhile, you will get to
> endure that as well. You see, SGML did not start out with that
> reputation.
To me, this is such a crucial point that it deserves re-emphasis (and
re-re-emphasis). XML inherits 95% of everything that made SGML difficult
to apply and use. Not 95% of what made it hard to memorize the SGML
standard, but that was never relevant, because only one person ever
memorized the standard (at which point, presumably, SGML got boring and he
moved on to other things). But 95% of the stuff that makes SGML hard to
*use* is in XML. The flexibility, the dangerous parameter entities, the
responsibility for defining your own tags, your own stylesheets and your
own systems. The responsibility to define standards and stick to them. The
perceived and real need to bend them. It's all there.
Compared to HTML, XML is rocket science, just as SGML was before it.
The Byte/Salon/comp....html backlash is a re-occurrence of a very old,
very boring phenomenon. Generic markup makes people's lives harder (in the
short term) in several ways. People have enough on their plates without
digital elites trying to shove down their throats complicated,
multi-layered junk that makes their lives with the promise that it will
have long term benefits.
The best (only?) way to get past that is with instant feedback, immediate
gratification software. Using XML for designing data protocols provides
pretty immediate gratification, which is why it has taken off in that
capacity. On the other hand (and perhaps this is heretical...sorry), the
data protocols world got along okay without XML in the first place. I have
personally documented some interesting applications of XML to software
interoperability problems, but they are essentially incremental changes: a
more expressive IDL, HTTP with richer objects, and so forth. XML makes
these things easier, but not fundamentally more powerful. In my opinion,
XML's real power remains in the ability to make persistent data truly
persistent -- across technological sea changes.
XML browsers will one day provide more or less instant gratification for
Jane Schmoe. She will still have to endure an extra level of abstraction,
understanding and responsibility, but at least she can have her web page
up and running in an evening. At that point, SGML/XML will have really
rounded a corner. Right now, it merely seems to have done so, because it
is the media's interest to promote it as such. And of course it will be
similarly in their benefit to report and perhaps exaggerate the backlash.
Luckily, people are willing (for now) to believe us that we will one day
provide them with more interesting, interactive web pages. This is in
large part because it is in the media's interest to promote that vision
with us. I hope we do not keep the webmasters waiting so long that they
start to disbelieve. Considering that the browser vendors cannot even
implement CSS right, I sometimes have moments of shaky belief myself.
Without the browsers, XML is a programmer's toy and we must begin our
long, slow climb to usability again. Still, with Netscape's source
available, we should not be subject to their whims anymore, so I am not
too worried.
In the meantime, if you teach XML, as I do, I would suggest that the
closest thing to instant gratification we have today is Jade+Docbook for
XML. Creating a series of web pages and a nice printed document from the
same source file is pretty damn cool and much more exciting than looking
at the output of a parser.
> In my view, they are the same thing and that is a good thing. If
> we have to accept that our *standards* are the product of
> The Director's approval, and that only the elect can know
> the shape of things to come such that a Don Park doesn't even
> have half a chance to compete, then it is time to go back to
> playing guitar for happy hour crowds. It is something I
> understand and can see the good of doing.
It wasn't so long ago that we were having a similarly pessimistic
conversation about web standards (at that time, HTML and CSS) and you
admitted that despite all of the setbacks, it was a good time for
hypertext. That is more true today than ever. XML may still be due for a
backlash, but lights are going on in people's heads all over the place. As
long as the pattern is two steps forward, one step back, we are still
making progress.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From pl at xmatrix.com Wed May 6 05:54:55 1998
From: pl at xmatrix.com (Philippe Lourier)
Date: Mon Jun 7 17:01:08 2004
Subject: ANN: Tim Bray on TechNetcast, Transcript Released
Message-ID: <3.0.5.32.19980505235220.00798ca0@mail.bettynet.com>
The transcript of Tim Bray's appearance on
DDJ TechNetcast has been posted on our
website,
www.technetcast.com
Thanks to all who provided comments and
suggestions
PL
Dr. Dobb's TechNetcast
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Wed May 6 06:25:41 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:01:08 2004
Subject: Parents and children (was RFD: comp.text.xml)
References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com>
Message-ID: <354FE614.2958E331@allette.com.au>
Paul Prescod wrote:
>
Thank you. And thank you Len Bullard also. Although SGML has its warts, many of
us have been using it and providing real solutions for many years - it is
disheartening to see it kicked from pillar to post, often by those who have never
had a commercial imperative to make it work. (This is a general observation and
is not necessarily directed at any of the current participants of this thread.)
That said, the parent/child analogy is entirely appropriate - the child has the
responsibility for forging its own way in a different world than the parent grew
up in, however the genetic bond is indisputable. XML wasn't invented, it was
selectively bred.
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Wed May 6 08:52:19 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:08 2004
Subject: Parents and children (was RFD: comp.text.xml)
Message-ID: <00a901bd78bb$c06d2dc0$880b4ccb@NT.JELLIFFE.COM.AU>
From: Paul Prescod
> XML inherits 95% of everything that made SGML difficult
>to apply and use...The flexibility, the dangerous parameter entities, the
>responsibility for defining your own tags, your own stylesheets and your
>own systems. The responsibility to define standards and stick to them. The
>perceived and real need to bend them. It's all there.
I think this is a good point.
Markup languages have to cope with text--it is a different world to tagging
nice simple fielded records of databases. People who are coming to
text new usually have no idea the enormous richness of text--standard
generalized markup languages (i.e. XML aka SGML) attempt to provide
a very simple mechanism to cope with it--an element hierarchy with
attributes
and ID/IDREFs to allow graph structures to be represented. But the
richness of text, and the sophistication of what people want to do with it,
means that a simple tool like a standard generalized markup language
makes many problems tractable but still not simple.
People will blame the tool (XML), but the enemy is text itself.
Anyway, people have trouble with standards (e.g. look at all the W3C
applications
which give a puported EBNF syntax), with the generalized approach (e.g. that
famous web page "The Web is ruined and I ruined it"--sorry no URL), with
markup (e.g. those who want documents to be embedded in programming
languages) and with languages (e.g. the people who want to use a binary
format for everything because of sometimes misplaced notions of efficiency).
People already baulk at each of these 4 issues, sometimes appropriately,
sometimes inappropriately. These central issues are much more important
than the readability or implementability of ISO 8879 versus REC-XML.
No matter how simple XML became, IMHO people will still resist buying into
XML's standard generalized markup langage approach.
In the Byte issue about XML, Gates says something about XML being preferred
for the moment. So from their top we can see that Microsoft has not bought
into XML as a strategy, but just as a tactic. The RDF people still see XML
as a serialization format, which suggests that they may see DOM as more
important than XML. So I think we should not fool ourselves that XML's
future
is assured. People who have not grown used them often have a mental block
against standard generalized markup languages and always wish they would
go away in favour of semi-proprietary, low-level, procedural, binary
formats.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jmulet at datalab.es Wed May 6 15:04:20 1998
From: jmulet at datalab.es (Jordi Mulet)
Date: Mon Jun 7 17:01:08 2004
Subject: Architectural Forms, separation of formatting and loose-leaf management
Message-ID: <01bd78ee$cb8ff240$69ac92c0@jordi.praxis.es>
Hello,
We are involved in a SGML project and we are finalisizing the analysis phase.
At present we are thinking about how to implement the management of loose-leaf pages updates because any availabe off-the-shelf comercial implementation can cope with our needs.
We think that the concept of architectural forms can help to manage the processing the flow between the intial information structures (SGML semantic tags ) and the presentational/layout structures of the final pages.
As Rick Jelliffe said in a past discussion on XML-dev DSSSL ( with the MIF/RTF backend for example) can help to do the transformations to the the layout structures ( down-sizing), but we can't obtain any feedback from the layout program to the stylesheet language.
We know that others programs as Omnimark or Balise can help to down-size the SGML instances but we will need a flow of dependence from the pages/layout to the original SGML.
A possible implemention of our system would be to build three independent Databases:
- Information SGML Database.
- Layout Database (Quark, FrameMaker,...).
- Page Objects Database ( PDF).
and a "meta-Database" to manage all the "processing" an "relationships" between the elements of each database. This "meta-Database" would control and manage the workflow and updates of the different elements.
Can architectural forms model this meta-database schema to control the three database from a top structure? Architectural forms can store all the information about processing.
How difficult would be to build a similar system ?
It will be necessary to define property sets and grove plans for the Layout scheme and Page scheme, doesn't ? Is there any working experience on this topic ? ( PDF, Quark, FrameMaker,...)
Any feedack will be of great value for us!
Thanks,
Jordi Mulet
Editorial Praxis S.A
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/b2514885/attachment.htm
From jmarckel at netco.com Wed May 6 15:07:52 1998
From: jmarckel at netco.com (jeff marckel)
Date: Mon Jun 7 17:01:08 2004
Subject: unsubscribe
Message-ID: <35505F60.547F50CF@netco.com>
unsubscribe
--
Jeff Marckel -- Lead Software Engineer, WAM!BASE
jmarckel@netco.com --or-- 612.886.5642
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mintert at irb.informatik.uni-dortmund.de Wed May 6 15:23:48 1998
From: mintert at irb.informatik.uni-dortmund.de (Stefan Mintert)
Date: Mon Jun 7 17:01:08 2004
Subject: SGML declaration for XML
Message-ID: <199805061323.PAA15216@brown.informatik.uni-dortmund.de>
Hi!
which is the correct SGML declaration for XML?
I found three:
1a)
I am new to SGML/SMIL, and I find myself wondering if an SMIL editor would
be a good choice of commercial products to target. Do I need a wake-up
call? I am imagining a specialized XML editor that includes temporal
attributes and a frame by frame like content view.
If the project is a go, then I would have moderate development funds to
work with.
I am aware there are key individuals on this list who are well established
in such technology, and who are already working on their own agendas. Could
we cooperate for mutual benefit?
I would be interested in knowing more about desired product features,
design issues, foreseeable risks, and whether anyone would be interested in
participating with development and to what extent.
Regards,
Rolande Kendal
kendal@interlog.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 6 16:22:39 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:08 2004
Subject: ANN: Tim Bray on TechNetcast, Transcript Released
References: <3.0.5.32.19980505235220.00798ca0@mail.bettynet.com>
Message-ID: <354FF09E.EDF19719@technologist.com>
This transcript is really excellent stuff. Tim has hit the nail on the
head about many issues. It should be required reading for people in the
media.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 6 16:24:13 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:08 2004
Subject: Parents and children (was RFD: comp.text.xml)
References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com> <354FE614.2958E331@allette.com.au>
Message-ID: <354FF0A0.C1BF555F@technologist.com>
Marcus Carr wrote:
>
> That said, the parent/child analogy is entirely appropriate - the child has the
> responsibility for forging its own way in a different world than the parent grew
> up in,
You can call XML a child of SGML, or a new iteration of it. One view
establishes a separation and the other eliminates it. Neither has any
particular basis in reality: they are both rhetorical views.
Consider:
C++ (as a random example) must also forge its own way in a different world
than previous iterations of it grew up in. Unlike XML, it wasn't renamed
as it was adapted for new environments and tasks. Nevertheless, it went
from being a Unix-centric front-end to C into a massive programming
language used on every platform with compilers that border on artificial
intelligence. Is modern C++ a "child" of that older one? Or is it just a
new version? Stroustrop once told me that if he were going to update the
ARM, he would essentially have to rewrite it from scratch. The differences
between C++ in 1986, 1990, 1993 and 1998 are much more severe than those
between "old SGML" and XML.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Wed May 6 16:30:00 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:08 2004
Subject: SGML declaration for XML
Message-ID: <002401bd78fb$8c90b4a0$a10b4ccb@NT.JELLIFFE.COM.AU>
From: Stefan Mintert
>What is the correct SGML declaration for XML?
The correct one is:
tags,
otherwise just use for EMPTY element types.
"ISO 8879:1986 (ENR)" is transitional only.
Finally tags, otherwise just use for
EMPTY element types.
>(2):
> a Name may start (NAMESTRT) with a ':'
All uses of ":" are still experimental. Starting a name with ":" would
presumably
be done to implement some scoping. In general, XML has avoided anything
which makes parsing or understanding a name depend on context--it has a
terrible complicating effect. (Note however, that attribute and data values
are full of context-dependent connotations.)
My understanding is that the namespace proposal is currently not
intending to allow ":" to prefix names, but you should ask the WG people
for some confirmation: (Tim?)
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Wed May 6 16:52:47 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:08 2004
Subject: Architectural Forms, separation of formatting and loose-leaf management
Message-ID: <002c01bd78fe$bd3e87a0$a10b4ccb@NT.JELLIFFE.COM.AU>
Here are three random things which may be useful to consider.
1) The first is that DSSSL allows you to have external functions. So even though DSSSL itself has no way to query the pagination system, DSSSL does allow you to stick in your own queries or functions. You can do all sorts of tricks with these. I dont know to what extent JADE supports this, though. One trouble with stream-based SGML processors is that they often have an output buffer (or are in a pipe) so unless you can flush the output buffers, your SGML processor may be left stranded if it waits for some feedback from a downstream program.
A DSSSL system built on top of a general purpose Scheme would be most likely to cope with feedback from layout engines. Tony Graham of the DSSSL list would be a good contact in this regard.
2) People often put pagination information in processing instructions. Or the information can be kept in an external database with, for example, HyTime locators. If you can decide in advance to only break pages on paragraph boundaries, then you can piggyback the pagination information on top of element markup.
3) If you find you have many of these concurrent structures, you may opt for "point markup", which is rather extreme, and would be an interesting challenge for some stream-based processors. In point markup, your main text is just marked up using
Then you have as separate element trees for each kind of structure: these trees probably contain no character data of their own, just IDREFs to the start and end of their range. In this way you can represent concurrent, overlapping hierarchies in SGML. For example:
]>
here is some
data of no interest.
This structure has the advantage of neatness, and provides a lot of modeling power
for just one extra level of indirection. If you used HREF rather than REFID, you can use
external point markup too.
The effect, of course, is to have concurrently
here is some
data of no interest.
and
here is some
data of no interest.
Rick Jelliffe
Author, "The XML & SGML Cookbook", out in May from Prentice Hall.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/36137d71/attachment.htm
From ricko at allette.com.au Wed May 6 16:56:33 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:08 2004
Subject: SGML declaration for XML
Message-ID: <003301bd78ff$64eba3c0$a10b4ccb@NT.JELLIFFE.COM.AU>
From: Rick Jelliffe
>My understanding is that the namespace proposal is currently not
>intending to allow ":" to prefix names, but you should ask the WG people
>for some confirmation: (Tim?)
Oops I mean I believe they are not allowing ":" to *start* names.
red:dog = OK
:dog = not OK
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ray at guiworks.com Wed May 6 17:34:19 1998
From: ray at guiworks.com (Ray)
Date: Mon Jun 7 17:01:08 2004
Subject: Should I develop an SMIL editor?
In-Reply-To: <3.0.32.19980506101721.0181b100@interlog.com> from Rolande Kendal at "May 6, 98 10:17:27 am"
Message-ID: <199805061532.JAA03075@coldsnap.guiworks.com>
>
> I am new to SGML/SMIL, and I find myself wondering if an SMIL editor would
> be a good choice of commercial products to target. Do I need a wake-up
> call? I am imagining a specialized XML editor that includes temporal
> attributes and a frame by frame like content view.
I think like anything, you need to define yourself a niche and attack it.
How are you going to differentiate yourself?
Remember, the basic functions of editing and playback are going to
in products from Real Networks, Adobe, Macromedia, etc. But who knows,
the big guys might trivially support it as an import/export format
just to say "I support another feature"
On the other hand, I don't think "just an SMIL editor" is a commercial
product, because it's too simple. It has to be combined with something,
like an existing audio/video authoring suite that is being converted
to support the web, etc.
If something is not complicated to implement, and easy to author
without a tool, it is probably a good candidate for freeware.
I was toying with the idea myself of implementing an SMIL player using the
Java Media Framework, but I've got too many projects on my lap already. :)
Now, I think a MathML editor/displayer has potential. :) For instance,
a tool that could convert LaTeX to MathML would be valuable, and
perhaps could export from MathML to Mathematica/Maple.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From bckman at ix.netcom.com Wed May 6 18:40:22 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun 7 17:01:09 2004
Subject: Tim Bray on TechNetcast, Transcript Released
Message-ID: <01bd7926$e8915a40$8d431ecc@uspppBckman>
I have just finished reading the transcript. It makes great reading,
especially for those who have been following the "parent's and children"
saga!!
Frank Boumphrey
http://www.hypermedic.com/style/index.htm
-----Original Message-----
From: Philippe Lourier
To: xml-dev@ic.ac.uk
Date: Tuesday, May 05, 1998 8:56 PM
Subject: ANN: Tim Bray on TechNetcast, Transcript Released
>
>The transcript of Tim Bray's appearance on
>DDJ TechNetcast has been posted on our
>website,
>
>www.technetcast.com
>
>Thanks to all who provided comments and
>suggestions
>
>PL
>Dr. Dobb's TechNetcast
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Wed May 6 19:38:34 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:09 2004
Subject: Parents and children (was RFD: comp.text.xml)
Message-ID:
>People will blame the tool (XML), but the enemy is text itself.
That says it all, though I'd expand the enemies list to include many more
kinds of data.
During the excitement this weekend over these issues, I wrote an essay that
more completely describes my take on these issues. I'm still tinkering, but
the basic framework is there. You can check it out, if you like, at:
http://members.aol.com/simonstl/xml/lettinggo.htm
Comments and suggestions are welcome, though this list may not be the right
place for that. The document has already been strengthened by a number of
assaults; just try to keep them friendly.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From crism at ora.com Wed May 6 20:02:50 1998
From: crism at ora.com (Chris Maden)
Date: Mon Jun 7 17:01:09 2004
Subject: SGML declaration for XML
In-Reply-To: <199805061323.PAA15216@brown.informatik.uni-dortmund.de> (message
from Stefan Mintert on Wed, 06 May 1998 15:23:23 +0200)
Message-ID: <199805061801.OAA20446@ruby.ora.com>
[Stefan Mintert]
> which is the correct SGML declaration for XML?
>
> I found three:
>
> 1a) "ISO 8879:1986 (ENR)"
>
> 1b) "ISO 8879:1986 (WWW)"
>
>
> both in http://www.w3.org/TR/NOTE-sgml-xml-971215
> Comparison of SGML and XML
>
> 2) "ISO 8879:1986 (WWW)"
>
> in http://www.sgmlsource.com/8879rev/n1955.htm
> ISO 8879 TC2
XML is from the W3C. Trust the resources from there. The "SGML
Declaration for XML" in the corrigendum for SGML is a sample. It is
in no way authoritative with respect to XML.
> As far as I know (1b) is correct, since XML 1.0 reads:
>
> [5] Name ::= (Letter | '_' | ':') (NameChar)*
>
> which means, a Name may start (NAMESTRT) with a ':'
That's correct, in pre-namespace XML. The namespace specification
will impose additional constraints which a namespace-aware processor
would enforce.
-Chris
--
http://www.oreilly.com/people/staff/crism/ +1.617.499.7487
90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Wed May 6 20:13:56 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:09 2004
Subject: Parents and children (was RFD: comp.text.xml)
Message-ID: <3.0.32.19980506131034.00dfbcd4@postoffice.swbell.net>
A non-text attachment was scrubbed...
Name: not available
Type: text/enriched
Size: 5284 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/e9ae16c6/attachment.bin
From tbray at textuality.com Wed May 6 20:43:21 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506112553.009aaad0@pop.intergate.bc.ca>
At 01:10 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>>>>
He also makes a clear distinction between use domains that require full strength SGML (and can therefore afford to implement its use) and use domains that do not. This is a key message:
XML->full SGML represents a continuum of cost/value that you can choose any point along.
<<<<
The only problem being that we lack the industrial experience
that would tell us at what point XML runs out of steam and you
need SGML extras. I must say, that is one of the #1 questions
I get at customer sites, and I just don't have a good answer
yet. It's not easy; just because they're totally web-oriented
doesn't mean they don't need SGML, and just because they're
currently using the &-connector doesn't mean they couldn't
succeed with just XML. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Wed May 6 21:05:27 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506140210.00de98bc@postoffice.swbell.net>
At 11:26 AM 5/6/98 -0700, Tim Bray wrote:
>At 01:10 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>>>>>
>He also makes a clear distinction between use domains that require full
strength SGML (and can therefore afford to implement its use) and use
domains that do not. This is a key message:
>XML->full SGML represents a continuum of cost/value that you can choose
any point along.
><<<<
>The only problem being that we lack the industrial experience
>that would tell us at what point XML runs out of steam and you
>need SGML extras. I must say, that is one of the #1 questions
>I get at customer sites, and I just don't have a good answer
>yet. It's not easy; just because they're totally web-oriented
>doesn't mean they don't need SGML, and just because they're
>currently using the &-connector doesn't mean they couldn't
>succeed with just XML. -Tim
I say start with XML by itself and see how far you get--that's one of the
benefits of XML being SGML--you can slide along the continuum at will at
minimal cost. By the time you get a pilot project in place, you'll
probably know.
The main determiners seem to be:
1. How complex are my DTD development and maintenance requirements?
XML DTDs are essentially unmaintainable because of the limitations on the
use of parameter entities. This means you either use some non-DTD syntax
(e.g., your favorate schema scheme) and generate XML declaration sets as
needed or use normal SGML and generate XML documents as needed. The latter
is easier to do quickly because there are free tools to do most or all of
the work (e.g., SX, etc.). The former might be a better solution in the
long term because you can do more clever things.
2. What are my authoring requirements?
If you need authoring beyond text editing, your only real choices today are
SGML editors with XML support (e.g., ADEPT*Editor's save as XML feature).
But even here, most tools should be able to handle XML directly (possibly
with a little fiddling).
If you need text-based authoring that depends on markup minimization, and
such use environments do exist, then you have no choice but to author in
SGML and generate XML.
Downstream processes (transforms, production) are largely insensitive to
the use of XML or SGML because they are not dependent on syntax details but
on information semantic (element types rather than omitted start tags).
This analysis does not address application issues like the use of XLink,
XPointers and XSL. In general, XPointers are not useful for authoring
because they are too rigid and lack any form of indirection. XSL is of
course not yet defined to the point where you would be safe using it in
production. As rule, the XML add-on standards are designed for Web-based
delivery only and do not meet the requirements of authoring and archiving.
This is because authoring and archiving impose requirements that delivery
either does not impose or cannot tolerate (such as indirection or
dependence on arbitrary queries for indirection). Fortunately, all of the
SGML add-on standards can be applied to XML documents as well as to
full-SGML documents, so you can mix and match as you need to.
Cheers,
Eliot
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Wed May 6 21:21:28 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506121625.00915100@pop.intergate.bc.ca>
At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>XML DTDs are essentially unmaintainable because of the limitations on the
>use of parameter entities.
Hmm, but those limitations really only apply to the internal subset. Am
I missing something? (Mind you, there are those who say that the absence
of exclusion exceptions makes 'em unmaintainable).
As for the rest of Eliot's remarks, right on. In particular, the idea that
you should start with XML see how far that gets you. It's risk-free.
-Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Wed May 6 21:29:50 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net>
At 12:20 PM 5/6/98 -0700, Tim Bray wrote:
>At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>>XML DTDs are essentially unmaintainable because of the limitations on the
>>use of parameter entities.
>
>Hmm, but those limitations really only apply to the internal subset. Am
>I missing something? (Mind you, there are those who say that the absence
>of exclusion exceptions makes 'em unmaintainable).
The inability to use parameter entities inside of declarations is the
problem--unless I've misunderstood the restriction.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From larsga at step.de Wed May 6 21:42:20 1998
From: larsga at step.de (Lars Marius Garshol)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net>
Message-ID: <3550BBC5.71CF5C5C@step.de>
W. Eliot Kimber wrote:
>
> The inability to use parameter entities inside of declarations is the
> problem--unless I've misunderstood the restriction.
You have. (Thankfully. Reassuring to see that you can be wrong about
something. :)
is entirely valid in the external subset, but not in the internal. This
is the relevant part of the spec (from section 2.8):
"Well-Formedness Constraint: PEs in Internal Subset
In the internal DTD subset, parameter-entity references can
occur only where markup declarations can occur, not within
markup declarations. (This does not apply to references that
occur in external parameter entities or to the external
subset.)"
I still haven't found a sensible way to implement this in the external
subsets, though. :-(
--Lars M.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Wed May 6 21:52:32 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506144723.006aa4f0@postoffice.swbell.net>
At 02:26 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>At 12:20 PM 5/6/98 -0700, Tim Bray wrote:
>>At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote:
>>>XML DTDs are essentially unmaintainable because of the limitations on the
>>>use of parameter entities.
>>
>>Hmm, but those limitations really only apply to the internal subset. Am
>>I missing something? (Mind you, there are those who say that the absence
>>of exclusion exceptions makes 'em unmaintainable).
>
>The inability to use parameter entities inside of declarations is the
>problem--unless I've misunderstood the restriction.
Tim pointed out that this restriction is only in the internal subset, not
the external subset. I had forgotten this detail.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Wed May 6 21:52:38 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980506144910.006a00cc@postoffice.swbell.net>
At 09:36 PM 5/6/98 +0200, Lars Marius Garshol wrote:
>W. Eliot Kimber wrote:
>>
>> The inability to use parameter entities inside of declarations is the
>> problem--unless I've misunderstood the restriction.
>
>You have. (Thankfully. Reassuring to see that you can be wrong about
>something. :)
I'm sure I voted for consistency, and thus against allowing them in the
external subset, but such discussions are lost to my memory. I am happy to
stand corrected on this issue.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From larsga at step.de Wed May 6 22:07:21 1998
From: larsga at step.de (Lars Marius Garshol)
Date: Mon Jun 7 17:01:09 2004
Subject: #cdata?
Message-ID: <3550C1AC.35D27450@step.de>
(I post this here since it's XML-relevant and I know at least some of
the XPointer/XLink WG members read this list.)
I'm currently writing a toy XPointer implementation in Python (which
will quite possibly become serious later) and have to say I'm impressed
with the XPointer specification. Very clear descriptions and a nice
syntax.
However, I'm rather surprised that the 19980303 WD allows pointers to
distinguish nodes of type #cdata, that is, CDATA marked sections, so
that
root().child(3,#cdata)
points to the third CDATA section that's a direct child of the document
element.
The reason I'm surprised that this is in the WD is that to support this
the XPointer implementation will have to ride on top of a parser that
reports CDATA sections as a distinct syntactic construct, instead of
classing it as text, which is the usual solution. SAX does not do this,
nor does the DOM level 1 (WD 19980416) AFAICS.
I also have problems seeing the utility of this feature.
Can anyone comment on on whether it will be an XPointers feature in
future WDs and if so why?
--Lars M.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 6 22:52:14 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
Message-ID: <3550C7DD.AC2641F8@technologist.com>
"The similarities between the two languages, in syntax, origin, and
possible application, have led several members of the community to deny
any separation between the two markup languages. From their perspective,
XML is SGML - less SGML perhaps, but still SGML. Consortia, publications,
newsgroups, and mailing lists currently mix discussion of XML and SGML,
blurring the distinctions further."
I think that the inaccuracies of English are more at fault than anything,
but I don't think that the two statements above belong together. There is
no separation between XML and SGML, because every application of SGML has
always, and will always, live on a continuum from simplicity (now
explicitly embodied in XML) and sophistication (I'm tempted to say this is
embodied in HyTime, but things like RDF also show major signs of
sophistication and complexity).
But the fact that there is no separation does not imply that there should
not be no separate publications, newsgroups and mailing lists for XML.
There are magazines/newsgroups/consortia targeted towards beginning
webmasters and gurus, OO dabblers and UML masters. As the easy form of
SGML, XML deserves magazines, books, classes etc.
I reject the parent/child analogy because it would imply, for instance,
that RDF belongs in a magazine on XML and not SGML, and "an intro to
architectural forms" belongs in a magazine on SGML but not XML. The
reverse is true. RDF applies at least as much to SGML as it does to XML
(perhaps more, because it is in the complexity class of HyTime, not HTML).
Architectural forms will eventually apply to XML as much as to SGML unless
someone comes up with something better. You couldn't really come up with
something much simpler that would do the same job...but you could perhaps
come up with something technically better.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cso at vignette.com Wed May 6 23:21:59 1998
From: cso at vignette.com (Conleth O'Connell)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
In-Reply-To: <3550BBC5.71CF5C5C@step.de>
References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net>
<3550BBC5.71CF5C5C@step.de>
Message-ID: <199805062038.PAA15561@zeus.vignette>
Lars Marius Garshol writes:
> W. Eliot Kimber wrote:
> >
> > The inability to use parameter entities inside of declarations is the
> > problem--unless I've misunderstood the restriction.
>
> You have. (Thankfully. Reassuring to see that you can be wrong about
> something. :)
>
>
>
> is entirely valid in the external subset, but not in the internal. This
> is the relevant part of the spec (from section 2.8):
>
> "Well-Formedness Constraint: PEs in Internal Subset
>
> In the internal DTD subset, parameter-entity references can
> occur only where markup declarations can occur, not within
> markup declarations. (This does not apply to references that
> occur in external parameter entities or to the external
> subset.)"
>
> I still haven't found a sensible way to implement this in the external
> subsets, though. :-(
It depends on your intent, but you may have use yet another level of
indirection.
Let's say I have DTD foo located at http://mycom.com/f.dtd
The XML instance might look like:
]>
This is valid because the internal subset entity declaration is resolved
before the entity declaraition in the external subset.
The area I got tripped up was in the Attribute declarations:
- only one element can be declared in each ATTLIST
- it is more difficult to declare and use common attributes
This requires an intervening external subset to declare and use the
entities. Too many years on DocBook I guess |-)
Cheers,
Con
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Thu May 7 00:51:59 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:09 2004
Subject: XML & SGML
References: <3.0.32.19980506112553.009aaad0@pop.intergate.bc.ca>
Message-ID: <3550E923.8CD@hiwaay.net>
Tim Bray wrote:
> The only problem being that we lack the industrial experience
> that would tell us at what point XML runs out of steam and you
> need SGML extras. I must say, that is one of the #1 questions
> I get at customer sites, and I just don't have a good answer
> yet. It's not easy; just because they're totally web-oriented
> doesn't mean they don't need SGML, and just because they're
> currently using the &-connector doesn't mean they couldn't
> succeed with just XML. -Tim
The point at which you can't tell the difference is
the point of success. In practice, tools will know.
Now there will be that awkward problem with
reserved words and PIs.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Thu May 7 01:09:41 1998
From: cbullard at hiwaay.net (len bullard)
Date: Mon Jun 7 17:01:09 2004
Subject: Parents and children (was RFD: comp.text.xml)
References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com>
Message-ID: <3550ED44.4894@hiwaay.net>
Paul Prescod wrote:
>
> It wasn't so long ago that we were having a similarly pessimistic
> conversation about web standards (at that time, HTML and CSS) and you
> admitted that despite all of the setbacks, it was a good time for
> hypertext. That is more true today than ever. XML may still be due for a
> backlash, but lights are going on in people's heads all over the place. As
> long as the pattern is two steps forward, one step back, we are still
> making progress.
>
Yes. We have more toys, better toys and
some pretty neat content. I am looking forward to playing
with SMIL a la RealMedia and VRML. So, will I use XML?
You bet. After using HTML so much, I hope i can still
remember how to use another DTD. ;-)
Just to be schizophrenic, on the VRML list I have argued
for XML and the use of RealMedia streaming technology. It
isn't inconsistent. RealMedia offers us working streaming
now when we need it. RA provides a broadcast format; the
archival format is still wav, etc., so my lifecycle/standards
needs are met and my performance/runningCode needs are met.
With the content hat on, I am very pleased. Two weeks ago,
the world's first virtual theatre production was broadcast.
That is something I've been waiting for for a LONNNNG time.
Cool.
Standards are nice to have. Running code is nice to have.
We just have to keep both priorities up front and keep talking
openly. That way, we measure and meet all of our needs.
The game is a long one. Play to that.
Simon: Beers together sometime. Even if the arguments look
fierce, at the end of the day, beers together. After awhile
one finds it is like being in bands and that once one has
done it long enough, other musicians are the best company
after 2AM even if before 2AM you were cutting heads.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Thu May 7 01:32:46 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:10 2004
Subject: XML & SGML
References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net>
<3550BBC5.71CF5C5C@step.de> <199805062038.PAA15561@zeus.vignette>
Message-ID: <3550F2F9.CEDC8489@technologist.com>
Conleth O'Connell wrote:
>
> The XML instance might look like:
>
>
> ]>
>
>
Note that this sort of trickery is poorly supported by SGML editors. They
like to think of a DTD as a single immutable thing that spans documents
(as users do). But technically, DTDs are really just properties of
documents, and documents can fiddle with them as you demonstrate above. I
think that I would advise against depending on this sort of fiddling until
we see whether XML editors are smarter (or at least more compliant) in
this regard than their old-SGML counterparts.
This one feature probably makes implementation of an efficient, friendly
XML editor harder than most of the features that were eliminated from SGML
(omittag, shorttag and rank put together are comparatively trivial...).
The other hard feature is entity references in attribute values (yipes!
try to make a simple, friendly MFC dialog box that handles *that*
properly).
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Thu May 7 02:23:42 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:10 2004
Subject: XML & SGML
References: <3550C7DD.AC2641F8@technologist.com>
Message-ID: <3550F9FA.7E5CD761@technologist.com>
Paul Prescod wrote:
>...
> But the fact that there is no separation does not imply that there
> should not be no separate publications, newsgroups and mailing lists
> for XML.
Egad! I meant: "Even though there need be no separation between XML and
other uses of SGML (they all live on a continuous complexity continuum),
there could still be separate publications, newsgroups and mailing
lists for XML."
Re: Consortia: I'm not sure that XML consortia make sense. XML usage will
soon be so diverse that I'm not sure that various "XML" companies would
have much in common to discuss. It strikes me as somewhat like an
"HTTP" consortia. SGML was already showing signs of that problem despite
the community's emphasis on document processing. XML will have
succeeded when people choose not to get together and talk about XML,
but rather about XML-based OO protocols, XML-document processing,
XML-knowledge representation and so forth.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Thu May 7 04:10:45 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:10 2004
Subject: XML & SGML
Message-ID:
Paul Prescod wrote:
>XML will have
>succeeded when people choose not to get together and talk about XML,
>but rather about XML-based OO protocols, XML-document processing,
>XML-knowledge representation and so forth.
Tim Bray compares it to ASCII. Does anyone discuss ASCII anymore? Once in a
while, maybe, in the context of Unicode or other character encodings that are
supplanting it. You're completely right on this one. XML should become
invisible and well-understood as it becomes ubiquitous; the problems it solves
should be the focus of the discussion.
Len Bullard wrote:
>Beers together sometime. Even if the arguments look
>fierce, at the end of the day, beers together.
I think we could all use more beers together. There will always be arguments,
but there will always be (hopefully) beer. Someday I'll actually make it to
one of these conferences, and we'll have beer.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From trafford at interlog.com Thu May 7 04:22:57 1998
From: trafford at interlog.com (Ben Trafford)
Date: Mon Jun 7 17:01:10 2004
Subject: SP and Expat
Message-ID: <35511CED.9A8E678@interlog.com>
I've got a question: how does Expat's parsing differ from the XML
parsing available in SP 1.3?
--->Ben Trafford
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Thu May 7 05:14:51 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:10 2004
Subject: SP and Expat
References: <35511CED.9A8E678@interlog.com>
Message-ID: <355126D6.DB98086E@technologist.com>
Ben Trafford wrote:
>
> I've got a question: how does Expat's parsing differ from the XML
> parsing available in SP 1.3?
Smaller. Faster. No validation.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"Perpetually obsolescing and thus losing all data and programs every 10
years (the current pattern) is no way to run an information economy or
a civilization." - Stewart Brand, founder of the Whole Earth Catalog
http://www.wired.com/news/news/culture/story/10124.html
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Thu May 7 10:36:22 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:10 2004
Subject: XML & SGML
In-Reply-To: <3550F9FA.7E5CD761@technologist.com>
References: <3550C7DD.AC2641F8@technologist.com>
Message-ID: <3.0.1.16.19980507083126.08c78824@pop3.demon.co.uk>
[many people] wrote:
[... interesting material snipped ...]
Though deliberately neutral about the ethics and validity of XMLogenesis,
it's very important that the contributions of the *people* involved is
highlighted and valued. I think I can take an appropriately dispassionate
view of SGML, not being a practitioner.
I have found the commitment of the WG members to the XML process and their
technical ability to be outstanding. I have been privileged on XML-SIG to
see the (often weekly) reports from the WG and the frequent list of
questions that were addressed to the SIG. What may not be generally
realised is that the WG *worked very hard indeed at the nitty-gritty as
well as the strategy*. When the XML process ran into problems
someone/subgroup on the WG would be asked to take the problem away and come
back with a proposal that addressed the queries raised by the SIG. I
remember several summaries that, after a really intractable discussion,
were a delight to read.
During the whole process there was never a hint of vested interests.
Occasionally there is the sort of statement - " I don't think my colleagues
would find it easy to use this", "We do this, and it works for us", etc.
But not "FooCorp would see X as giving away market advantage". And,
although I have no knowledge of the companies involved, I suspect their has
been enlightened managerial oversight in allowing so much effort to be put
into this process. I have no doubt it will amply repay this investment,
since XML has to be a corporate philosophy as well as a technology.
There is no doubt that we owe a great deal to the XML-WG (and related
groups) and I have found the process to be wholly admirable. I am trying to
champion it in my own vertical area (chemistry) as an efficient, creative,
democratic and open process. It's probably impossible to combine all of
these, but most current efforts in chemical informatics could learn a great
deal from it.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From yosr.hmaied at inria.fr Thu May 7 16:04:16 1998
From: yosr.hmaied at inria.fr (Yosr HMAIED)
Date: Mon Jun 7 17:01:10 2004
Subject: parser for xml-data?
Message-ID: <3551BF43.2F3@inria.fr>
Do you know parser for xml-data?
--
Yosr HMAIED
mailto:yosr.hmaied@inria.fr
tel :01 39 63 51 71
Projet Rodin
INRIA Rocquencourt
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at dvs1.informatik.tu-darmstadt.de Thu May 7 16:24:31 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun 7 17:01:10 2004
Subject: ANY
Message-ID: <199805071337.PAA27616@berlin.dvs1.tu-darmstadt.de>
The definition of ANY in the spec seems to be limited to the following sentence
(section 3):
"The declaration matches ANY, and the types of any child elements have been
declared."
From "child elements," I assume the following:
1) An element of type ANY can have zero or more children.
2) An element of type ANY cannot contain PCDATA. That is, it cannot have a
mixed content model.
Is this correct? Thanks.
-- Ron Bourret
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From larsga at step.de Thu May 7 16:35:26 1998
From: larsga at step.de (Lars Marius Garshol)
Date: Mon Jun 7 17:01:10 2004
Subject: ANY
References: <199805071337.PAA27616@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3551C561.47C36B2C@step.de>
Ron Bourret wrote:
>
> From "child elements," I assume the following:
>
> 1) An element of type ANY can have zero or more children.
Correct.
> 2) An element of type ANY cannot contain PCDATA. That is, it cannot have a
> mixed content model.
Wrong.
lots of text and here
is valid XML.
--Lars M.
PS: This seems to be a candidate for a FAQ, a specification
clarification,
as well as a good argument for comp.text.xml
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Thu May 7 16:50:50 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:10 2004
Subject: ANY
Message-ID: <3.0.32.19980507074659.00b7ce50@pop.intergate.bc.ca>
At 03:37 PM 5/7/98 +0200, Ron Bourret wrote:
>From "child elements," I assume the following:
>
>1) An element of type ANY can have zero or more children.
>2) An element of type ANY cannot contain PCDATA. That is, it cannot have a
>mixed content model.
>
>Is this correct? Thanks.
No - the spec is less than explicit, but since it doesn't rule out
text content, it is allowed. The annotated spec at http://www.xml.com
has some further elucidation on this subject. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From elm at arbortext.com Thu May 7 18:55:39 1998
From: elm at arbortext.com (Eve L. Maler)
Date: Mon Jun 7 17:01:10 2004
Subject: #cdata?
In-Reply-To: <98May6.160827edt.26886@thicket.arbortext.com>
Message-ID: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com>
At 04:01 PM 5/6/98 -0400, Lars Marius Garshol wrote:
>(I post this here since it's XML-relevant and I know at least some of
>the XPointer/XLink WG members read this list.)
>
>I'm currently writing a toy XPointer implementation in Python (which
>will quite possibly become serious later) and have to say I'm impressed
>with the XPointer specification. Very clear descriptions and a nice
>syntax.
Thank you. :-) I hope you'll share future news of your implementation as
you go along.
>However, I'm rather surprised that the 19980303 WD allows pointers to
>distinguish nodes of type #cdata, that is, CDATA marked sections, so
>that
>
>root().child(3,#cdata)
>
>points to the third CDATA section that's a direct child of the document
>element.
>
>The reason I'm surprised that this is in the WD is that to support this
>the XPointer implementation will have to ride on top of a parser that
>reports CDATA sections as a distinct syntactic construct, instead of
>classing it as text, which is the usual solution. SAX does not do this,
>nor does the DOM level 1 (WD 19980416) AFAICS.
>
>I also have problems seeing the utility of this feature.
>
>Can anyone comment on on whether it will be an XPointers feature in
>future WDs and if so why?
It was decided to add this back in December. However, several WG members
have since had a change of heart. So I have a feeling that it may not
survive.
My own belief is that having the "string" location term is sufficient, and
sufficiently robust, to do any addressing you'd want to do into a CDATA
section.
If we do want to allow addressing into some construct that the DOM doesn't
support, it's incumbent on the XLink side to request new functionality from
the DOM folks. In other words, we're not artificially going to restrict
ourselves to what the DOM has today. This approach was actually first
suggested by Lauren Wood (the DOM chair) to ensure that the dependencies
flow in the right direction.
Eve
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From sudar at pspl.co.in Fri May 8 07:05:32 1998
From: sudar at pspl.co.in (Sudarshan Purohit)
Date: Mon Jun 7 17:01:10 2004
Subject: parser for xml-data?
References: <3551BF43.2F3@inria.fr>
Message-ID: <355292B9.A2E581A1@pspl.co.in>
Yosr HMAIED wrote:
> Do you know parser for xml-data?
I think this is a good place to talk of the project I'm doing right now.
Basically, it's a translator between an XML-Data document and any
ODBC interface. This involves :
1. Taking an input XML-Data file and outputting a series of SQL
statements that result in the same information being stored in an RDBMS.
2. Using an ODBC Recordset interface to get an equivalent XML-Data
file. ( I'm still working out this one)
The first part involves parsing the XML file, so I might be able to help out
somewhere, Yosr.
While we're on the topic, I'd like to ask a few things.
1. XML-Data stores Data as "entities". This implies that we've got
a more abstract representaion of data than what would have been
possible through relational databases. For example, we've got this
"ONEORMORE,ZEROORMORE" occurence attribute for child elements.
Implementing this in an RDBMS would mean creating another linked
table for that data item. The same goes for complex data types.
Am I on the right track?
2. If we're creating XML specifically for this purpose, it'll have to
be in the format
....
....
.....
This restrictive structure, of course, means that you end up
not using this kind of document for anything else. Am I right
in thinking that this has to be done?
3. The XML-Data specification says that it's open-ended
and new types of entities may be added. I'm assuming this means
I'm able to add lines like
or
into my element delcarations, which will be ignored by other
parsers than mine. Am I right ?
I came rather late into the discussion, so it's possible that
this stuff may have been figured out already. If so, tell me.
Thanks,
Sudarshan 1030hrs.
--
------------------------------------------------------
The only difference between a rut and a grave is
the depth.
------------------------------------------------------
Sudarshan "Connection Machine" Purohit
Persistent Systems Private Limited, Pune
------------------------------------------------------
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From maillist at chris.hubick.com Fri May 8 07:41:02 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun 7 17:01:11 2004
Subject: Character Encoding Detection
Message-ID:
I am new to Character Encodings, and am trying to implement them
for my XML parser.
As I understand it, UCS has two flavors, UCS-2 and UCS-4, either of which
can optionally have a UCS transformation applied to them. It is my
understanding that you could author an XML document in either of these,
without applying a transformation.
The UTF-16 spec at:
http://www.stonehand.com/unicode/standard/wg2n1035.html
states:
"In UTF-16, any UCS character from the BMP shall be represented by
its UCS-2 coded representation."
Now in UCS-2:
'<' is 00 3C
'?' is 00 3f
So the start of a UCS-2 or UTF-16 encoded XML document would be 00 3C 00
3F
In the section on autodetection of character encodings the XML spec
states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and thus,
strictly speaking, in error)"
My question is, why is this an error rather than a perfectly
acceptable untransformed UCS-2 document?
---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From larsga at step.de Fri May 8 09:28:46 1998
From: larsga at step.de (Lars Marius Garshol)
Date: Mon Jun 7 17:01:11 2004
Subject: #cdata?
References: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com>
Message-ID: <3552B2DD.ADB174D7@step.de>
Eve L. Maler wrote:
>
> I hope you'll share future news of your implementation as
> you go along.
With pleasure. :)
I'll post again with more information and feedback, when I've got enough
of
it covered to make a release.
> It was decided to add this back in December. However, several WG members
> have since had a change of heart. So I have a feeling that it may not
> survive.
That would make at least one XML hacker slightly happier.
> My own belief is that having the "string" location term is sufficient, and
> sufficiently robust, to do any addressing you'd want to do into a CDATA
> section.
That's a belief I share.
> If we do want to allow addressing into some construct that the DOM doesn't
> support, it's incumbent on the XLink side to request new functionality from
> the DOM folks.
Then I see something that seems highly important: a method in the
Document
interface for returning the element with a given ID. Something like
this:
Element getElementWithID(in wstring id);
You've very likely already thought of and discussed this, so it would be
nice to hear what is likely to happen in this area. (I could subclass
the
Document object and the DOM builder in the Python DOM implementation to
get
this information, but it seems pointless if the DOM is going to include
it.)
--Lars M.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Fri May 8 10:52:08 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:11 2004
Subject: [xX][mM][lL] is reserved
References: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com>
Message-ID: <3552B03E.616451C5@jclark.com>
Chris Lovett wrote:
>
> Spec issue:
>
> Section 2.3 paragraph 3:
>
> "A Name is a token beginning with a letter or one of a few
> punctuation characters, and continuing with letters, digits, hyphens,
> underscores, colons, or full stops, together known as name characters.
> Names beginning with the string "xml", or any string which wouild match
> (('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved for standardization in
> this or future versions of this specification".
>
> Currently MSXML disallows "xml" from being used as a "namespace", for
> example: "xml:foo" since this seems to be the way the XML spec is heading
> with "xml:space" and "xml:lang". So is the above paragraph just a little
> too restrictive and should it include the colon ? If not then the following
> very common XML example is illegal:
>
>
> This is a test.
>
>
> Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that
> "xmlu" is an illegal name, but James Clark's expat seems to allow this ??
>
> Can the XML spec gurus clarify this paragraph ? Thanks.
I would say this is an "error" as defined in 1.2 of the spec but not a
fatal error nor a violation of a validity constraint.
Your processor is therefore free to ignore it or to detect and report
it. I think it's probably best to report this only at user option. If
tools blow up when they see such names, it's going to cause problems if
future versions of XML make use of these names. For example, I don't
want my XML processor to complain because I'm using an xml:stylesheet
PI, although it would be nice to have a "pedantic" option to tell me
that this is strictly an error (until such time as this gets licensed by
the XML spec).
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 11:47:57 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun 7 17:01:11 2004
Subject: parser for xml-data?
Message-ID: <199805080934.LAA00769@berlin.dvs1.tu-darmstadt.de>
Sudarshan Purohit wrote:
> I think this is a good place to talk of the project I'm doing right now.
> Basically, it's a translator between an XML-Data document and any
> ODBC interface. This involves :
> 1. Taking an input XML-Data file and outputting a series of SQL
> statements that result in the same information being stored in an RDBMS.
I'm working on a similar project (moving data from XML to relational databases
and vice versa).
Do you really mean you are moving data from an XML-Data document? XML-Data is
an XML language for describing metadata, including DTDs. As such, the data in
an XML-Data file is metadata, not data. This is suitable if you want to create
tables in the RDBMS that match the XML-Data schema. However, an XML-Data file
won't give you any data to put in those tables.
By the way, there was a question the other day about how one might actually
deploy XML-Data as a substitute for a DTD. Was this ever answered?
> 2. Using an ODBC Recordset interface to get an equivalent XML-Data
> file. ( I'm still working out this one)
> The first part involves parsing the XML file, so I might be able to help
out
> somewhere, Yosr.
>
> While we're on the topic, I'd like to ask a few things.
> 1. XML-Data stores Data as "entities". This implies that we've got
> a more abstract representaion of data than what would have been
> possible through relational databases. For example, we've got this
> "ONEORMORE,ZEROORMORE" occurence attribute for child elements.
> Implementing this in an RDBMS would mean creating another linked
> table for that data item. The same goes for complex data types.
> Am I on the right track?
You are correct. XML describes an object model, so the problem is roughly the
same as moving object data to and from an RDBMS -- you might want to look at
some papers on the subject. There is an excellent white paper introducing the
subject at:
http://www.ontos.com/mapcon.htm
Another good paper is:
Shekar Ramanathan, Julia E. Hodges: Extraction of Object-Oriented Structures
from Existing Relational Databases, SIGMOD Record, Vol. 26, Number 1, March
1997, pp. 59-64.
Postscript: http://bunny.cs.uiuc.edu/sigmod/sigmod_record/9703/rama.ps
The only major difference I have found so far for XML is that the elements in
XML documents are ordered, while data members in OO programming languages are
not.
>
> 2. If we're creating XML specifically for this purpose, it'll have to
> be in the format
>
>
> ....
>
>
> ....
>
>
>
> .....
>
>
>
>
>
>
> This restrictive structure, of course, means that you end up
> not using this kind of document for anything else. Am I right
> in thinking that this has to be done?
Again, this makes sense if you are storing schema information, but I'm not sure
that that is what you want. The problem is one of creating a mapping from the
DTD (expressed as a DTD, XML-Data, or another competing schema language) to
tables/columns and then using that mapping to move data from an XML file (which
uses that DTD) to the RDBMS. You can certainly do this for a large number of
DTDs -- I'm not yet convinced it is possible for all.
>
> 3. The XML-Data specification says that it's open-ended
> and new types of entities may be added. I'm assuming this means
> I'm able to add lines like
>
>
> or
> into my element delcarations, which will be ignored by other
> parsers than mine. Am I right ?
>
> I came rather late into the discussion, so it's possible that
> this stuff may have been figured out already. If so, tell me.
>
> Thanks,
>
> Sudarshan 1030hrs.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From sudar at pspl.co.in Fri May 8 13:27:06 1998
From: sudar at pspl.co.in (Sudarshan Purohit)
Date: Mon Jun 7 17:01:11 2004
Subject: parser for xml-data?
References: <199805080934.LAA00769@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3552EC2D.7BDC0297@pspl.co.in>
Ron Bourret wrote:
> Do you really mean you are moving data from an XML-Data document? XML-Data is
> an XML language for describing metadata, including DTDs. As such, the data in
> an XML-Data file is metadata, not data. This is suitable if you want to create
> tables in the RDBMS that match the XML-Data schema. However, an XML-Data file
> won't give you any data to put in those tables.
Yes, you're right. My mistake. I'll be using the schema to create the tables,
then adding rows to this from other (XML) files.
> By the way, there was a question the other day about how one might actually
> deploy XML-Data as a substitute for a DTD. Was this ever answered?
I don't think so. What I'm doing at the moment is ignoring any other schema
that appears in the data file itself. So in a sense, I'm actually using the
XML-Data file as a DTD. I'm also not yet clear on whether and how the data file
refers to the schema. In my program the user will have to specify them both.
> Again, this makes sense if you are storing schema information, but I'm not sure
> that that is what you want. The problem is one of creating a mapping from the
> DTD (expressed as a DTD, XML-Data, or another competing schema language) to
> tables/columns and then using that mapping to move data from an XML file (which
> uses that DTD) to the RDBMS. You can certainly do this for a large number of
> DTDs -- I'm not yet convinced it is possible for all.
Um, yes, actually the end result of my project should be an XML web page that
is updated whenever a change is made in a company's database. I think I'll be
storing schema info at the same location so that the data is more generally
accessible.
Thanks ,
Sudarshan 1700hrs.
PSPL,Pune 8th may 98
------------------------------------------------------
The only difference between a rut and a grave is
the depth.
------------------------------------------------------
Sudarshan "Connection Machine" Purohit
Persistent Systems Private Limited, Pune
------------------------------------------------------
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 14:27:54 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun 7 17:01:11 2004
Subject: Attribute type NMTOKEN
Message-ID: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de>
Why does the attribute type NMTOKEN exist and how is it intended to be used? My
best guess is that it is the same as an enumerated type, except that the
enumeration is not actually specified in the DTD, perhaps because it is meant to
be open-ended or is too big to list, such as all words in the English language.
Is this true?
-- Ron Bourret
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 14:50:23 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:11 2004
Subject: XML & SGML
References: <3.0.32.19980506121625.00915100@pop.intergate.bc.ca>
Message-ID: <3552FF7D.33CA5886@technologist.com>
Tim Bray wrote:
>
> As for the rest of Eliot's remarks, right on. In particular, the idea that
> you should start with XML see how far that gets you. It's risk-free.
I think that this was always prudent. You, too, Tim. IIRC, the OED didn't
have a DTD, for example. It was essentially XML.
Varying levels of tool support for different features created a defacto
simple SGML/full SGML split, and it was very expensive to start walking up
the complexity ladder because tool support started to drop away. "Sorry,
I've got this really neat DTD for you, but you'll have to buy all new
software...and it isn't written yet." Some features in XML are already a
little ways up that ladder.
I think that Eliot, on the other hand, felt it was his personal duty to
use every SGML feature possible so that he would have an excuse to tell
vendors that their products were broken. :) That is good for the rest of
us, because he kept raising the bar, but it must have been very painful
out there on the bleeding edge. I think I'll continue to duck in behind
him.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 14:51:52 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun 7 17:01:11 2004
Subject: parser for xml-data?
Message-ID: <199805081244.OAA01325@berlin.dvs1.tu-darmstadt.de>
Sudarshan Purohit wrote:
> > By the way, there was a question the other day about how one might actually
> > deploy XML-Data as a substitute for a DTD. Was this ever answered?
>
> I don't think so. What I'm doing at the moment is ignoring any other schema
> that appears in the data file itself. So in a sense, I'm actually using the
> XML-Data file as a DTD. I'm also not yet clear on whether and how the data
file
> refers to the schema. In my program the user will have to specify them both.
One possibility is that something in your XML document, such as an attribute at
the root, would refer to the XML document containing the XML-Data definition of
your grammar:
...
Another (uglier) possibility is that you use namespaces: the XML-Data namespace
and the namespace your XML-Data data defines. I haven't looked enough at either
the namespaces or XML-Data specs to be sure how this would work, but it seems
the object structure might be something like:
Root object
XML-Data root
XML-data data...
Your data root
Your data...
In either case, a validating parser would need to know that it used the XML-Data
schema to validate the XML-Data object and the schema defined by the XML-Data
data to validate your object, which is essentially what you're doing.
Any hints from the XML-Data folks?
-- Ron Bourret
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 15:17:43 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:11 2004
Subject: SDD bogus
Message-ID: <3553061D.BF3A3461@technologist.com>
Is the standalone document declaration bogus and perhaps dangerous? The
whole feature strikes me as over-complicated and over-specific for a
language like XML, but I'm aware of the historical processes that gave
rise to it.
My understanding of a typical usage scenario goes like this: a sender
creates a document. It creates it specifically so that it will be
standalone. It validates that this is the case (while it validates
everything else) and then it sends it to the receiver who hopes to consume
it without validating it. Things already strike me as a little bizarre,
because if your protocol is designed such that the consumer trusts the
receiver, then couldn't the SDD be implied in your out-of-band agreement?
Further, what do you do if the SDD is other than you expect? Halt the
parse and start again with a validating processor?
But that's not what I'm concerned about. I'm concerned because I believe
this to be a valid XML document:
]>
In my opinion, section 5.1 will require the non-validating parser to skip
the attribute list declaration, even if memo.dtd is an empty file.
The receiver has no way of knowing that this case has occured if it uses a
"standard parser" (since XML's semantics are, for the moment at least,
imprecisely specified, I only know what that means intuitively ... SAX,
Lark, Expat, etc. would not give you enough information to detect this
case).
This to me suggest that applications cannot trust the SDD and it must
therefore be presumed to be meaningless.
But I'm glad to be proven wrong. Despite its reputation to the contrary,
XML is intricate and deep and I may have missed something important.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 15:27:42 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:11 2004
Subject: SDD again
Message-ID: <3553086A.3494D7F9@technologist.com>
Let me risk another step into the language courtroom. Validating parsers
must always read the whole DTD. So the SDD is only for non-validating
parsers. Non-validating parsers do not read element type declarations. So
what is the point of this line:
"The standalone document declaration must have the value "no" if any
external markup declarations contain declarations of:"
...
"element types with element content, if white space occurs directly within
any instance of those types."
First, why does a non-validating parser care about element/mixed content?
It has no responsibility to do any marking of insignificant whitespace
anyhow. Second, if there is no class of processor that can reliably
reproduce the intended parse tree without reading the whole DTD, then
doesn't that significantly weaken the utility (okay, "purity") of the SDD?
Even if I am wrong on the last point, it seems that it does not do what it
is supposed to do properly.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simpson at polaris.net Fri May 8 15:50:42 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun 7 17:01:11 2004
Subject: Attribute type NMTOKEN
In-Reply-To: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net>
At 02:13 PM 5/8/98 +0200, Ron Bourret wrote:
>Why does the attribute type NMTOKEN exist and how is it intended to be
used? My
>best guess is that it is the same as an enumerated type, except that the
>enumeration is not actually specified in the DTD, perhaps because it is
meant to
>be open-ended or is too big to list, such as all words in the English
language.
>Is this true?
As I understand it (would appreciate being corrected if wrong), NMTOKEN
essentially applies a constraint on the form of the attribute's value (i.e.
value must be a valid name token -- start with either a letter or an
underscore) without constraining the *specific* value. So yes, your
"open-ended or... too big to list" assumption is correct, with the
qualification that a type of NMTOKEN would not allow a value such as
"1ABC," "//ENGLISH," or " WHITESPACE."
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Fri May 8 15:53:14 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:11 2004
Subject: SDD bogus
Message-ID: <3.0.32.19980508065011.00b3b520@pop.intergate.bc.ca>
At 09:18 AM 5/8/98 -0400, Paul Prescod wrote:
>But that's not what I'm concerned about. I'm concerned because I believe
>this to be a valid XML document:
>
>
>
>
>]>
>
>
>In my opinion, section 5.1 will require the non-validating parser to skip
>the attribute list declaration, even if memo.dtd is an empty file.
Welll, it can't be valid if memo.dtd is an empty file, because you
don't have anywhere. But yes, 5.1 suggests the
attribute default shouldn't be used.
>The receiver has no way of knowing that this case has occured if it uses a
>"standard parser"
If the sender is stupid enough to send something like this to a
non-validating parser, he gets what he deserves. If it's a validating
parser, then of course the emptiness of memo.dtd will be detected.
> (since XML's semantics are, for the moment at least,
>imprecisely specified, I only know what that means intuitively ... SAX,
>Lark, Expat, etc. would not give you enough information to detect this
>case).
Huh?
>This to me suggest that applications cannot trust the SDD and it must
>therefore be presumed to be meaningless.
You do raise a good question; it would seem that standalone='true'
*ought* to mean that the rule of 5.1 about the effect of
external PE refs could be ignored. Hmmmm -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 8 15:57:54 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:11 2004
Subject: SDD bogus
In-Reply-To: <3553061D.BF3A3461@technologist.com>
References: <3553061D.BF3A3461@technologist.com>
Message-ID: <199805081351.JAA00618@unready.microstar.com>
Paul Prescod writes:
> Is the standalone document declaration bogus and perhaps dangerous?
Yes and yes.
The problem, I think, came from the mistaken idea that people
(i.e. desperate Perl hackers) would write custom parsers for each XML
application (like RDF), and that these people would not want to deal
with seemingly difficult problems like external entity resolution.
In the end, as one might have predicted, there is an impressive range
of free XML processors available in several different programming
languages: someone writing an RDF tool does not need to worry about
the character and entity level of XML at all, and can work with XML
easily through a more abstract interface such as the DOM or SAX.
So, we should let the authors decide -- if an author creates a
document referencing external entities (including an external DTD
subset), then the XML parser should handle them; if the author does
not want to use external entities, then she can simply avoid
referencing any.
As many XML parser writers have shown, resolving external entities is
one of the easiest parts of XML (especially in higher-level languages
like Java or Perl, and, I presume, Python). Allowing parsers to skip
external entities -- rather than simplifying XML -- ended up making it
much more complicated, and as you point it, the standalone declaration
really doesn't help things.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Fri May 8 16:30:49 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:11 2004
Subject: XML & SGML
Message-ID: <3.0.32.19980508092316.007091a4@postoffice.swbell.net>
At 08:50 AM 5/8/98 -0400, Paul Prescod wrote:
>I think that Eliot, on the other hand, felt it was his personal duty to
>use every SGML feature possible so that he would have an excuse to tell
>vendors that their products were broken. :) That is good for the rest of
>us, because he kept raising the bar, but it must have been very painful
>out there on the bleeding edge. I think I'll continue to duck in behind
>him.
I'm still trying to staunch the bleeding.
But seriously, I use those features of SGML that offer value to my and my
clients' applications. Most of the features of SGML were invented for a
good reason to solve real problems that people have. And, like any
sufficiently general system, facilities designed for one purpose turn out
to be useful for other, unanticipated purposes. For example, the simple
LINK feature, which is really hopeless at its intended task, is ideal for
adding architectural mappings to documents without disturbing the original
DTD declarations.
I have never asked for facilities that aren't useful. For example, I have
never berated a tool supplier for not supporting CONCUR, RANK, DATATAG, or
explicit LINK.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 8 17:07:54 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:12 2004
Subject: SDD again
In-Reply-To: <3553086A.3494D7F9@technologist.com>
References: <3553086A.3494D7F9@technologist.com>
Message-ID: <199805081500.LAA00857@unready.microstar.com>
Paul Prescod writes:
> Let me risk another step into the language courtroom. Validating
> parsers must always read the whole DTD. So the SDD is only for
> non-validating parsers. Non-validating parsers do not read element
> type declarations. So what is the point of this line:
Your first premise is correct, but your second one is not. The spec
states that a validating parser must use the whole DTD; it does not
state that a non-validating parser may not use the DTD. AElfred, for
example, reads the DTD well enough that it can even flag ignorable
whitespace base on an element type's content model, but it is
non-validating.
That said, I still agree that the standalone declaration is wrong.
Perhaps some day, if there's an XML 1.1, we can think about fixing it.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 8 17:11:17 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:12 2004
Subject: SAX: Final Requests?
Message-ID: <199805081508.LAA01003@unready.microstar.com>
Could anyone else who has experimented with implementing SAX 1.0gamma
(either in a parser or in an application) please let me know about any
bugs or other serious problems in the interfaces, implementation, or
documentation?
Thank you again to all of you who have already sent in comments,
suggestions, and bug reports.
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Fri May 8 17:19:08 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:12 2004
Subject: SDD bogus
Message-ID: <3.0.32.19980508081709.00b57c60@pop.intergate.bc.ca>
At 09:51 AM 5/8/98 -0400, David Megginson wrote:
>As many XML parser writers have shown, resolving external entities is
>one of the easiest parts of XML
Yes, but as is well-documented, difficulty is *not* the reason we made
their processing optional by non-validating processors. The prime mover
behind this decision was a passionate presentation from Jean Paoli
explaining that the auto-include semantic of parsed entities is just
*wrong* for web browsers. I've attached an explanation of why at the
end of this message, but if you want to see it context, go to section
4.4.3 of the annotated spec and click on the "H".
Having said that, Paul did raise a valid concern about the SDD (too bad
this issue wasn't pointed out before the spec was frozen). Having said
*that*, I think, for reasons that are on the record in the same place,
that the problem the SDD exists to solve will essentially never
arise in real operational scenarios anyhow. -Tim
=================
>From the annotated spec at http://xml.com/axml/axml.html
Why Are External Entities
Included Optionally?
In discussion of external entities,
we realized that the semantics of
external text entities (compulsory
inclusion at the point where they
are encountered) are deeply
incompatible with the desired
behavior of Web browsers.
Consider the following example of
the beginning of an XML
document:
]>
Netscape today
announced that &NSA;. In
response, Microsoft
issued the following
statement: &MSA;.
...
A Web browser is typically
making an aggressive effort to
display text to the user as soon as
possible, in parallel with fetching it
from the network. In the example
above, if a browser were required
to fetch and process all external
entities, it could only display the
first four words before starting
another network fetch operation.
To make things worse, bear in
mind that the replacement text for
the entity NSA could well include
other external entities which in
turn would need to be fetched.
This type of situation is
unacceptable. Hence the rule that
non-validating parsers need not
fetch external entities if they don't
want to.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lauren at sqwest.bc.ca Fri May 8 18:43:12 1998
From: lauren at sqwest.bc.ca (Lauren Wood)
Date: Mon Jun 7 17:01:12 2004
Subject: #cdata?
In-Reply-To: <3552B2DD.ADB174D7@step.de>
References: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com>
Message-ID: <199805081642.JAA28647@sqwest.bc.ca>
At 08/05/98 12:23 AM , Lars Marius Garshol wrote:
>Then I see something that seems highly important: a method in the
>Document
>interface for returning the element with a given ID. Something like
>this:
>
> Element getElementWithID(in wstring id);
>
>You've very likely already thought of and discussed this, so it would be
>nice to hear what is likely to happen in this area. (I could subclass
>the Document object and the DOM builder in the Python DOM implementation to
>get this information, but it seems pointless if the DOM is going to include
>it.)
The DOM will include something like this (it's even listed expressly in our
requirements!) but it won't be in Level 1, since we're concentrating on
really basic functionality in Level 1.
Lauren
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From crism at ora.com Fri May 8 20:11:47 1998
From: crism at ora.com (Chris Maden)
Date: Mon Jun 7 17:01:12 2004
Subject: Character Encoding Detection
In-Reply-To:
(message from Chris Hubick on Thu, 7 May 1998 22:38:08 +0000 (GMT))
Message-ID: <199805081810.OAA18622@ruby.ora.com>
[Chris Hubick]
> In the section on autodetection of character encodings the XML spec
> states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and
> thus, strictly speaking, in error)"
>
> My question is, why is this an error rather than a perfectly
> acceptable untransformed UCS-2 document?
The XML spec states, by fiat, in 4.3.3, that "Entities encoded in
UTF-16 must begin with the Byte Order Mark". So the reason the
example is an error is because the spec says so.
UCS-2 is identical to UTF-16, and so it is subject (presumably) to the
same rule.
As a side note, I was unsure until just now whether they were
equivalent, but I finally found ISO 10646-1 clause 8:
Plane 00 of Group 00 shall be the Basic Multilingual Plane (BMP).
The BMP can be used as a two-octet coded character set in which
case it shall be called UCS-2.
From:
Linkname: ISO/IEC 10646-1 including AMD 1 thru 4
URL: http://wwwold.dkuug.dk/JTC1/SC2/WG2/docs/N1396.doc
-Chris
--
http://www.oreilly.com/people/staff/crism/ +1.617.499.7487
90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cmatei at agora.ro Fri May 8 20:27:57 1998
From: cmatei at agora.ro (Crystian)
Date: Mon Jun 7 17:01:12 2004
Subject: Comment and PI in a Mixed or children elements...
Message-ID: <01BD7AC8.56166CC0@jackson.agora.ro>
Hi,
Can be comments or Processing Instructions in a element that his declaration matches Mixed or children?
For example
] >
is valid?
Thank you in advance!
Crystian
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 21:19:18 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:12 2004
Subject: SDD bogus
References: <3.0.32.19980508081709.00b57c60@pop.intergate.bc.ca>
Message-ID: <35535AE5.FFBAE939@technologist.com>
Tim Bray wrote:
>
> Having said that, Paul did raise a valid concern about the SDD (too bad
> this issue wasn't pointed out before the spec was frozen).
Yes, I want to point out to those who do not know the dynamics here that I
use the word "bogus" because I was in the SIG and it is as much my fault
as anyone's that it got through. Were I talking about someone else's spec.
I would be more tactful.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 21:34:59 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:12 2004
Subject: SDD bogus
References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com>
Message-ID: <35535E95.3B3F5B23@technologist.com>
David Megginson wrote:
>
> In the end, as one might have predicted, there is an impressive range
> of free XML processors available in several different programming
> languages: someone writing an RDF tool does not need to worry about
> the character and entity level of XML at all, and can work with XML
> easily through a more abstract interface such as the DOM or SAX.
That's a dangerous heresy, though one that I've promoted myself. If we
presume that programmers are going to work through parsers, then why
couldn't we leave GI's out of end tags and make XML substantially less
verbose (qualitatively at least)? Anyhow, many people argue with some
justification that regexp-based processing of the source files will still
be very important and popular. I'm not convinced that the cost/benefit
ratio is right, if we win over the awk hackers and annoy the document
authors, but we will see.
> So, we should let the authors decide -- if an author creates a
> document referencing external entities (including an external DTD
> subset), then the XML parser should handle them; if the author does
> not want to use external entities, then she can simply avoid
> referencing any.
Although I agree with Tim that this is a separate issue, I agree with you
that external entities should always be processed. It seems strange to me
to put responsibility for managing performance in the hands of the parser.
The parser writer has no information about the performance characteristics
of the entity vs. the importance of the data. Ignoring content should not
be the parser's perogative.
Consider Tim's example:
> Netscape today
> announced that &NSA;. In
> response, Microsoft
> issued the following
> statement: &MSA;.
As an author, I would be pretty damn pissed off if a browser presented the
document that way, even if it uses nice icons for the entities. What does
the document mean with half of its text missing? If I've used external
entities in that way, then I have probably decided to do it for a good
reason. If I wanted a text inclusion that could be downloaded after a
mouseclick, that sounds like a special behaviour that could be
accomplished through XLink.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Fri May 8 21:43:53 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:12 2004
Subject: SDD again
References: <3553086A.3494D7F9@technologist.com> <199805081500.LAA00857@unready.microstar.com>
Message-ID: <355360A0.1C6900A6@technologist.com>
David Megginson wrote:
>
> Your first premise is correct, but your second one is not. The spec
> states that a validating parser must use the whole DTD; it does not
> state that a non-validating parser may not use the DTD. AElfred, for
> example, reads the DTD well enough that it can even flag ignorable
> whitespace base on an element type's content model, but it is
> non-validating.
You are right. I guess when I think of parsers, I prefer to think of them
as interchangable within a class, so I would be leery about trusting
features that only overachievers like you implement. :)
> That said, I still agree that the standalone declaration is wrong.
> Perhaps some day, if there's an XML 1.1, we can think about fixing it.
Probably better to deprecate it. And as soon as possible!
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From maillist at chris.hubick.com Fri May 8 22:06:31 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun 7 17:01:12 2004
Subject: Character Encoding Detection
In-Reply-To: <199805081810.OAA18622@ruby.ora.com>
Message-ID:
Thanks Chris! That led me down the path of enlightenment.
On Fri, 8 May 1998, Chris Maden wrote:
> UCS-2 is identical to UTF-16, and so it is subject (presumably) to the
> same rule.
Ahh, I hadn't read the details of UTF-16 encoding. So, as I
understand it now, UTF-16 is only applicable to UCS-4 (not UCS-2 ??)
documents, because it transforms a UCS-4 document into a UCS-2 document
that uses a reserved code range (rows D8-DB and DC-DF of the BMP) to
represent the extra characters (where only UCS-4 chars < 0011 0000 can
be mapped). A normal UCS-2 document would not use these reserved codes
(??), so in that respect a UTF-16 encoded UCS-4 document is not identical
to a "native" UCS-2 document. What this means is that you can only tell a
UTF-16 document from a UCS-2 document by the encoding declaration, or by
encountering a character in the reserved UCS-2 range for UTF-16 encoding.
And both UCS-2 and UTF-16 docs must start with a byte order mark. Is this
byte order mark "#xFEFF" specific to XML documents, or is it part of UCS?
> As a side note, I was unsure until just now whether they were
> equivalent, but I finally found ISO 10646-1 clause 8:
>
> Plane 00 of Group 00 shall be the Basic Multilingual Plane (BMP).
> The BMP can be used as a two-octet coded character set in which
> case it shall be called UCS-2.
Without reading the whole document (uhg, I see it coming) I didn't
think this statement had anything to do with UTF, I thought it was just
explaining how UCS-2 is a subset of UCS-4.
Though I think the following does state what you say:
"UCS Transformation Format 16 (UTF-16)" at
http://www.stonehand.com/unicode/standard/wg2n1035.html
> The following method transforms the coded representation of over a
> million graphic characters of UCS-4 into a form that is compatible with
> the two-octet BMP form of UCS-2 (section 14.1). This permits the
> coexistence of those UCS-4 characters within coded character data that
> is in accordance with UCS-2.
>
> In UTF-16 each graphic character from the UCS-2 repertoire retains its
> UCS-2 coded representation. In addition, the coded representation of any
> character from a single contiguous block of 16 Planes in Group 00
> (1,048,576 code positions) is transformed to pairs of two-octet
> sequences, where each sequence corresponds to a cell in a single
> contiguous block of 8 Rows in the BMP (2,048 code positions). These
> codes are reserved for the use of this transformation method, and shall
> not be allocated for any other purpose.
As for the goal of it being relatively easy for the desperate Perl
hacker, or myself, the desperate Java hacker, to code an XML parser...it's
not that bad. It helps if you come into it with prior knowledge of stuff
like character encodings. I think all I would have changed in the spec is
to not allow PE's _within_ markup declarations in the external DTD, I
still have to sort that out if I ever want to make my parser validating.
I might also have been tempted to not allow recursive PE's. They could
have left it out for 1.0 and added it for 1.1, but now you have to be
backward compatible. I think if you find yourself needing that kind of
thing, SGML might be the language for you.
---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jarle.stabell at dokpro.uio.no Fri May 8 22:25:25 1998
From: jarle.stabell at dokpro.uio.no (Jarle Stabell)
Date: Mon Jun 7 17:01:12 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no>
Paul Prescod wrote:
> If we
> presume that programmers are going to work through parsers, then why
> couldn't we leave GI's out of end tags and make XML substantially less
> verbose (qualitatively at least)? Anyhow, many people argue with some
> justification that regexp-based processing of the source files will still
> be very important and popular. I'm not convinced that the cost/benefit
> ratio is right, if we win over the awk hackers and annoy the document
> authors, but we will see.
(those who have found the discussion about short end tags tiresome
years/months ago, please forgive me and don't read any further.)
I would love to see empty end tags making it into the standard in the
future. In many cases, one only marks up single words, and then empty end
tags would justify having longer and more descriptive GI's, when forced to
write both start and end tags fully, one may be too tempted to use
"cryptic"/abbreviated GI's.
Example:
Let's say your're making a markup language for documentating source code,
which would be embedded inside comments in the source (like JavaDoc). Then
you would prefer the tags to be as "silent" as possible during
development/maintaince of the source code itself.
Compare:
1. The method M1 of the fantastic
Class1 can be used in situation
X.
to the "thinner":
2. The method M1> of the fantastic Class1> can
be used in situation X>.
I think variant 2 is faster to read than variant 1, and you don't have to
check the end-tags for misspellings.
The argument that compressing reduces/eliminates the size advantage of
documents with empty end tags often doesn't apply, the document will often
be stored uncompressed on users hard-disks, in databases and in memory.
Cheers,
Jarle Stabell
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From crism at ora.com Fri May 8 22:27:46 1998
From: crism at ora.com (Chris Maden)
Date: Mon Jun 7 17:01:13 2004
Subject: Attribute type NMTOKEN
In-Reply-To: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net>
(simpson@polaris.net)
Message-ID: <199805082025.QAA23192@ruby.ora.com>
[John E. Simpson]
> As I understand it (would appreciate being corrected if wrong),
> NMTOKEN essentially applies a constraint on the form of the
> attribute's value (i.e. value must be a valid name token -- start
> with either a letter or an underscore) without constraining the
> *specific* value. So yes, your "open-ended or... too big to list"
> assumption is correct, with the qualification that a type of NMTOKEN
> would not allow a value such as "1ABC," "//ENGLISH," or
> " WHITESPACE."
A possible misunderstanding here: a NMTOKEN itself can't contain
whitespace, but the attribute value specification, the quoted thing
after the equals sign, can: so
*is* a valid NMTOKEN attribute specification. The NMTOKEN in this
example is only "WHITESPACE", though; the leading (and trailing)
whitespace is trimmed.
-Chris
--
http://www.oreilly.com/people/staff/crism/ +1.617.499.7487
90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Fri May 8 22:44:53 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID: <3.0.32.19980508154119.0077c1e8@postoffice.swbell.net>
At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote:
>I would love to see empty end tags making it into the standard in the
>future. In many cases, one only marks up single words, and then empty end
>tags would justify having longer and more descriptive GI's, when forced to
>write both start and end tags fully, one may be too tempted to use
>"cryptic"/abbreviated GI's.
If you're going to ask for empty end tags for this, then why not go all the
way and reinstate NET-enabled start tags?
asd asdfasd
Markup minimization is a slippery slope--in for a penny, in for a pound.
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 8 22:46:58 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:13 2004
Subject: SDD bogus
In-Reply-To: <35535E95.3B3F5B23@technologist.com>
References: <3553061D.BF3A3461@technologist.com>
<199805081351.JAA00618@unready.microstar.com>
<35535E95.3B3F5B23@technologist.com>
Message-ID: <199805082043.QAA02512@unready.microstar.com>
Paul Prescod writes:
> David Megginson wrote:
> >
> > In the end, as one might have predicted, there is an impressive range
> > of free XML processors available in several different programming
> > languages: someone writing an RDF tool does not need to worry about
> > the character and entity level of XML at all, and can work with XML
> > easily through a more abstract interface such as the DOM or SAX.
>
> That's a dangerous heresy, though one that I've promoted myself. If we
> presume that programmers are going to work through parsers, then why
> couldn't we leave GI's out of end tags and make XML substantially less
> verbose (qualitatively at least)?
It's a matter of finding the right balance: XML must be simple enough
to parse that there is a healthy competition among parser writers (and
thus, a better quality and larger choice of tools for application
writers), but simple enough to write that authors (including
developers of tools that generate XML) are willing to learn and use
it. SGML, with its bizarre tag-omission rules, delimiter-in-context
recognition, shortrefs, etc., gave too much away to the authors, and
as a result, there were very few good SGML parsers implemented (and
never a one in Java, although many of us tried).
> > So, we should let the authors decide -- if an author creates a
> > document referencing external entities (including an external DTD
> > subset), then the XML parser should handle them; if the author does
> > not want to use external entities, then she can simply avoid
> > referencing any.
>
> Although I agree with Tim that this is a separate issue, I agree with you
> that external entities should always be processed. It seems strange to me
> to put responsibility for managing performance in the hands of the parser.
> The parser writer has no information about the performance characteristics
> of the entity vs. the importance of the data. Ignoring content should not
> be the parser's perogative.
I'd take one step towards Tim's side, and allow the _application_ to
specify whether (and which) external entities may be included, since
they may have special knowledge that the parser lacks; you are quite
right, though, that it should not be up to the parser, and that all
parsers should be able to include external entities. In XML 1.1, I'd
like to see one of those "at user discretion" clauses here.
> Consider Tim's example:
>
> > Netscape today
> > announced that &NSA;. In
> > response, Microsoft
> > issued the following
> > statement: &MSA;.
>
> As an author, I would be pretty damn pissed off if a browser
> presented the document that way, even if it uses nice icons for the
> entities. What does the document mean with half of its text
> missing? If I've used external entities in that way, then I have
> probably decided to do it for a good reason. If I wanted a text
> inclusion that could be downloaded after a mouseclick, that sounds
> like a special behaviour that could be accomplished through XLink.
Absolutely correct: an entity is a fundamental chunk of a document,
not a link (web heads can think of it somelink sort-of like a
client-side include). I'm sorry to see that this kind of
misunderstanding arose as recently as last fall, especially since work
was already well underway on XLink (aka XLL aka XML-Link), so (as you
mention) people could see what kind of mechanisms would be available
for linking to embedded information.
In any case, couldn't a browser render around the entity references,
then adjust the rendition as the entities were resolved, as HTML
browsers generally do with images?
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Fri May 8 22:52:05 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID: <3.0.32.19980508134903.00b49e30@pop.intergate.bc.ca>
At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote:
>I would love to see empty end tags making it into the standard in the
>future.
Speaking as one of those who opposed this, now I agree with you.
Among other things, perl is going to have a built-in parser so no
matter how D the PH is, this problem is just invisible. But it
was a good decision at the time. -T.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Bryan_Gilbert at pml.com Fri May 8 22:58:54 1998
From: Bryan_Gilbert at pml.com (Bryan Gilbert)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID:
> -----Original Message-----
> From: W. Eliot Kimber [SMTP:eliot@isogen.com]
> Sent: Friday, May 08, 1998 1:41 PM
> To: xml-dev@ic.ac.uk
> Subject: Re: A little wish for short end tags (Was: RE: SDD
> bogus)
>
> At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote:
> >I would love to see empty end tags making it into the standard in the
>
> >future. In many cases, one only marks up single words, and then empty
> end
> >tags would justify having longer and more descriptive GI's, when
> forced to
> >write both start and end tags fully, one may be too tempted to use
> >"cryptic"/abbreviated GI's.
>
> If you're going to ask for empty end tags for this, then why not go
> all the
> way and reinstate NET-enabled start tags?
>
> asd asdfasd
> marked
>
> Markup minimization is a slippery slope--in for a penny, in for a
> pound.
>
> Cheers,
>
> E.
> --
[BG] Why is it a slippery slope? Why go all the way and do
these NET-enabled things. (They look gross.)
My interest is supporting XML on embedded platforms where
ROM, RAM, CPU, are all very limited. XML combined
with XSL will be much better than HTML documents for this
application but lecc verbosity would be good too.
For those who need cryptic tags see the example DTDs that
John Tigue (DataChannel) developed for the WebBRoker project.
He presents a verbose DTD and then a terse version. Both
use preliminary XML namespaces to scope the elements.
The namespace helps document the terse version as
does the comments in the DTD.
http://www.datachannel.com/xml/dtd/index.html
Bryan
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simpson at polaris.net Fri May 8 23:03:05 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun 7 17:01:13 2004
Subject: Attribute type NMTOKEN
In-Reply-To: <199805082025.QAA23192@ruby.ora.com>
References: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net>
Message-ID: <3.0.3.32.19980508170242.008c5870@nexus.polaris.net>
At 04:25 PM 5/8/98 -0400, Chris Maden wrote:
>[John E. Simpson]
>> As I understand it (would appreciate being corrected if wrong),
>> NMTOKEN essentially applies a constraint on the form of the
>> attribute's value (i.e. value must be a valid name token -- start
>> with either a letter or an underscore) without constraining the
>> *specific* value. So yes, your "open-ended or... too big to list"
>> assumption is correct, with the qualification that a type of NMTOKEN
>> would not allow a value such as "1ABC," "//ENGLISH," or
>> " WHITESPACE."
>
>A possible misunderstanding here: a NMTOKEN itself can't contain
>whitespace, but the attribute value specification, the quoted thing
>after the equals sign, can: so
>
>*is* a valid NMTOKEN attribute specification. The NMTOKEN in this
>example is only "WHITESPACE", though; the leading (and trailing)
>whitespace is trimmed.
Thanks for that clarification, Chris!
Best regards, | It's no disgrace t'be poor,
John E. Simpson | but it might as well be.
simpson@polaris.net | -- "Kin" Hubbard
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From russ at thetrip.com Fri May 8 23:33:34 1998
From: russ at thetrip.com (Russ Heithoff)
Date: Mon Jun 7 17:01:13 2004
Subject: Attribute type NMTOKEN
Message-ID: <01BD7A96.6843D180.russ@thetrip.com>
Also, if I remember right, a NMTOKEN can start with Name characters. This
is in contrast to a Name which must start with Name Start characters
followed by characters. In otherwords 1ABC is a legal NMTOKEN but not a
legal NAME.
Russ
-----Original Message-----
From: John E. Simpson [SMTP:simpson@polaris.net]
Sent: Friday, May 08, 1998 3:25 PM
To: xml-dev@ic.ac.uk
Subject: Re: Attribute type NMTOKEN
At 04:25 PM 5/8/98 -0400, Chris Maden wrote:
>[John E. Simpson]
>> As I understand it (would appreciate being corrected if wrong),
>> NMTOKEN essentially applies a constraint on the form of the
>> attribute's value (i.e. value must be a valid name token -- start
>> with either a letter or an underscore) without constraining the
>> *specific* value. So yes, your "open-ended or... too big to list"
>> assumption is correct, with the qualification that a type of NMTOKEN
>> would not allow a value such as "1ABC," "//ENGLISH," or
>> " WHITESPACE."
>
>A possible misunderstanding here: a NMTOKEN itself can't contain
>whitespace, but the attribute value specification, the quoted thing
>after the equals sign, can: so
>
>*is* a valid NMTOKEN attribute specification. The NMTOKEN in this
>example is only "WHITESPACE", though; the leading (and trailing)
>whitespace is trimmed.
Thanks for that clarification, Chris!
Best regards, | It's no disgrace t'be poor,
John E. Simpson | but it might as well be.
simpson@polaris.net | -- "Kin" Hubbard
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Fri May 8 23:45:01 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:13 2004
Subject: Comment and PI in a Mixed or children elements...
In-Reply-To: <01BD7AC8.56166CC0@jackson.agora.ro>
Message-ID: <3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk>
Hi Crystian,
At 21:29 08/05/98 +-300, Crystian wrote:
>Hi,
>
>Can be comments or Processing Instructions in a element that his
declaration matches Mixed or children?
[... question snipped ...]
When I have questions like this I generally tend to see what happens with a
parser. The parser writers have spent a *lot* of time over the spec and the
current generation of parsers are very high quality. It's worth installing
2-3 and seeing if they all agree.
There are a (very) few places where some parsers don't match the current
spec (especially those which are pre-XML1.0). But they (the parsers and
their authors) are much smarter than me. So I often use a computer to 'do
my thinking for me'
P.
When SAX1.0 is released JUMBO2.0 will be able to support a number of
SAX-compliant parsers under it. It depends on the timescale that the
authors are able to manage. You can then compare how each parser behaves.
The areas of concern that use of multiple parsers will highlight are those
whose semantics are fuzzy. PaulP highlighted one - the question of what a
parser should do with SDD+internalSubset+externalSubset. This is an area
where it is likely that different parser writers might make different
implementations..
BTW I didn't answer the question, did I? :-) I think you will find it is
yes, but trust a parser and not me.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Fri May 8 23:55:16 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:13 2004
Subject: Comment and PI in a Mixed or children elements...
In-Reply-To: <3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk>
References: <01BD7AC8.56166CC0@jackson.agora.ro>
<3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk>
Message-ID: <199805082153.RAA02861@unready.microstar.com>
Peter Murray-Rust writes:
> BTW I didn't answer the question, did I? :-) I think you will find it is
> yes, but trust a parser and not me.
By all means, run problems like this through a good parser (don't use
AElfred for this purpose, since it is too tolerant of errors).
In the end, however, the only trustworthy source is the grammatical
productions in the spec (together with constraints and other normative
text):
[39] element ::= EmptyElemTag | STag content ETag
[43] content ::= (element | CharData | Reference | CDSect |
PI | Comment)*
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From eliot at isogen.com Fri May 8 23:58:45 1998
From: eliot at isogen.com (W. Eliot Kimber)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net>
At 01:53 PM 5/8/98 -0700, Bryan Gilbert wrote:
> [BG] Why is it [minimization] a slippery slope?
Because if you can make an argument for empty end tags, you can probably
make an argument for NET-enabled end tags, and minimized attribute
specifications, and omitted tags, and so on and so on.
Just because *your* requirements make it obvious that empty end tags are
enough, that doesn't mean that everybody's do. Any decision short of no
minimization or full minimization will be an arbitrary one that will cause
those whose requirements are not met to wonder why. There is always full
SGML, after all.
The only argument I could see for empty tags involves the fact that you
don't need explicit declarations in order to use it. But the same argument
applies to NET-enabled start tags, so that razor would't help here.
[NOTE: I'm using NET-enabled start tags as an example to make a point. I'm
really aggitating for their inclusion in XML--they're handy, but not that
much more handy than empty end tags, saving only one keystroke.]
Cheers,
E.
--
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004
www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From sjd at eps.inso.com Fri May 8 23:58:46 1998
From: sjd at eps.inso.com (Steven DeRose)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
In-Reply-To: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no>
Message-ID: <3.0.5.32.19980508175003.00840e00@mailhost.eps.inso.com>
At 10:38 PM 05/08/98 +0200, Jarle Stabell wrote:
>2. The method M1> of the fantastic Class1> can
>be used in situation X>.
>
>I think variant 2 is faster to read than variant 1, and you don't have to
>check the end-tags for misspellings.
True, you don't have to check them; but the often forgotten corrolary is
that you also *can't* check the end-tag for misspellings if you go that route.
So if the data is erroneous you are far less likely to detect it *at all*,
making for truly nasty debugging. This is an ancient information-theoretic
tradeoff: you can always save space, but the more you save, the less chance
you have of detecting errors. This is because when you reduce redundancy,
you increase the % of all possible bit sequences that are syntactically
correct.
For example, imagine trying to communicate in a noisy room if every
possible sequence of sounds was a legitimate English word. Or imagine
programming in a language where every possible byte sequence is a
syntactically correct program (APL and raw machine code are the only
approximations I can think of to that -- guess why).
>
>The argument that compressing reduces/eliminates the size advantage of
>documents with empty end tags often doesn't apply, the document will often
>be stored uncompressed on users hard-disks, in databases and in memory.
Sorry, but with Win98 rumored to demand 64MB of RAM just to run and with
Moore's Law applying to memory prices, I can't muster much enthusiasm for
an argument that it is too costly to shave bytes on markup. If you had to
put *ten* full tags on every element you'd hardly ever notice any impact
except on a 747 manual, and anything that big can't be handled practically
in raw unparsed form anyway. I did a lot of statistics on this a few years
ago: a fully-marked-up file with no minimization is still wayyyyy smaller
than the equivalent word processor file in typical systems, so what's the
big deal?
I agree it would be handy when typing XML by hand or reading it raw. But it
is not without adverse consequences too. I'd rather see better editing
tools so I don't even have to know about such details.
Steven J. DeRose, Ph.D.
Visiting Chief Scientist, STG | Chief Scientist
Adjunct Associate Professor | Inso Corp. EPS
Steven_DeRose@brown.edu | sjd@eps.inso.com
401-863-3690 | 401-752-4438
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Bryan_Gilbert at pml.com Sat May 9 00:14:30 1998
From: Bryan_Gilbert at pml.com (Bryan Gilbert)
Date: Mon Jun 7 17:01:13 2004
Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE:
SDD bogus))
Message-ID:
From: Steven DeRose [SMTP:sjd@eps.inso.com]
Sent: Friday, May 08, 1998 2:50 PM
To: xml-dev@ic.ac.uk
Subject: Re: A little wish for short end tags (Was: RE:
SDD bogus)
Sorry, but with Win98 rumored to demand 64MB of RAM just to run
and with
Moore's Law applying to memory prices, I can't muster much
enthusiasm for
an argument that it is too costly to shave bytes on markup. If
you had to
Work for a while in a system that has 1/2 MB ROM,
maybe 1MB RAM, and a CPU that is managing
intense calculations and I/O.
Am I alone? Or are there others out there with similar
constraints who want to use XML? I'd like to hear from
you.
Thanks
Bryan Gilbert, B.Sc. M.Sc.
Systems Engineer, ION Modules Team,
Power Measurement Ltd
2195 Keating Cros Road, Saanichton, BC, Can, V8M 2A5
Phone: (250) 652-7100 ext 7570
Fax: (250) 652-0411
email: (mailto:bryan_gilbert@pml.com)
WWW: (http://www.pml.com)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From adam at cyber-guru.com Sat May 9 00:26:35 1998
From: adam at cyber-guru.com (Adam M. Donahue)
Date: Mon Jun 7 17:01:13 2004
Subject: EBNF question
In-Reply-To: <199805082025.QAA23192@ruby.ora.com>
References: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> (simpson@polaris.net)
Message-ID: <9805082226.AA14674@acf2.nyu.edu>
I just want to clear something up about reading the XML
productions. Am I correct in the following assumptions:
A character or sequence of characters between single quotation
marks, e.g., 'ABC' is to be represented literally. There must be no
space characters between the quotation marks and the entity in the
production. So def ::= ' ABC ' would not be legal for representing
" ABC " .. you'd need to use SP 'ABC' SP. However, when spaces
do exist between the quotation marks, then the name(s) inside are
to be treated as productions and the *quotation marks* as literals.
For example:
abc ::= 'ABC' ; literally ABC
lit ::= ' abc ' ; literally ' ABC '
lit ::=' ABC ' ; undefined?
Is this correct?
Adam
mailto:adam@cyber-guru.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 9 00:34:48 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:13 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
In-Reply-To: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no>
Message-ID: <3.0.1.16.19980508222800.747792f8@pop3.demon.co.uk>
At 22:38 08/05/98 +0200, Jarle Stabell wrote:
>Paul Prescod wrote:
>> If we
>> presume that programmers are going to work through parsers, then why
>> couldn't we leave GI's out of end tags and make XML substantially less
>> verbose (qualitatively at least)? Anyhow, many people argue with some
>> justification that regexp-based processing of the source files will still
>> be very important and popular. I'm not convinced that the cost/benefit
>> ratio is right, if we win over the awk hackers and annoy the document
>> authors, but we will see.
>
>(those who have found the discussion about short end tags tiresome
>years/months ago, please forgive me and don't read any further.)
>
>I would love to see empty end tags making it into the standard in the
>future. In many cases, one only marks up single words, and then empty end
[... and others agreed/disagreed ...]
This is not, of course, a new discussion - it has been vigorously debated
on XML-SIG/WG and there were differences of opinion just as there are here.
At this stage of XML it's important to agree that we should all adhere to
XML1.0 *completely* whatever we feel its merits and demerits to be. The
full spec has only been out ca 2 months and we are seeing and will continue
to see a large amount of high quality software - it's critical that this
conforms. When there has been more experience of what level of tools are
actually used and what quality of documents are actually produced it may be
time for revisions.
The minimum-minimization principle is extremely valuable if there are a
significant number of documents that are prone to well-formedness errors. I
still create - and will continue to create - XML files by hand. I also
write programs to create XML files. Both of these are error-prone processes
:-) and it's extremely valuable to be able to locate the precise point of
error.
Using XML-DEV to push for immediate revision of the spec is probably
unlikely to be a very productive process. Using it to highlight unnecessary
- and therefore omissible - constructs (e.g. SDD) or areas of semantic
fuzziness is probably more likely to be useful.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Sat May 9 02:23:50 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:14 2004
Subject: Character Encoding Detection
Message-ID: <001a01bd7ae0$f8d2a7e0$a20b4ccb@NT.JELLIFFE.COM.AU>
From: Chris Hubick
>Now in UCS-2:
> '<' is 00 3C
> '?' is 00 3f
>
>So the start of a UCS-2 or UTF-16 encoded XML document would be 00 3C 00
>3F
Not always. Because when storing a double-octet (i.e. two 8-bit bytes)
different
CPUs store them in different byte order: big-endian and little-endian. So if
you
read a big-endian file as a sequence of bytes you will get 003c003f,
but if you read a little-ending file as a sequence of bytes you will get
3c003f00.
This is why a binary file from an Intel machine will typically need to be
byte-swapped for use on a Motorola 68K machine. I believe the Motorola
PowerPC CPU has an instruction to switch between little-endian and
big-endian
reading modes. When you are storing 4 octet data there are 4 possibile
arrangements, but in fact only 3 seem to have been used: PDP-11 use one,
Motorola used another, Intel used another.
To cope with this, network protocols and binary file formats will usually
specify a particular byte order for storing or transfering 16 or 32 bit
data.
As an alternative, Unicode (UTF-16) reserves two characters near the high
end
of the codespace. From these characters, it is possible to determine whether
the file is saved as big-endian or little endian. Unicode software will then
resequence
incoming bytes correctly, to construct the 16-but values.This is called the
Byte Order Mark (BOM).
The XML character encoding strategy is to encourage you, wherever possible,
to make sure that all the information needed to parse your document can be
marked-up in the document. Operating systems typically just provide a "text"
type (.txt, Mac "TEXT") under the assumption, which was OK in the West when
everyone had standalone computers, that all "text" on a computer would use
the
same character set or encoding. So, because operating systems and protocols
fail in this key regard, XML provides its heuristic for determining text:
this heuristic
is reliable, in the sense that if you use it, your data is clearly
labelled--it takes the
guess work out of character detection.
The heuristic is basically this:
1) If the operating system or transport protocol tells you the character
set, and if
it is reliable, use that.
2) If there is a BOM, then your data is Unicode.
3) Otherwise, use the XML declarations (i.e., the ) to
determine the encoding (this is a little bit complex, but straightforward)
4) Otherwise, it is UTF-8 (7-bit ASCII conforms to UTF-8 already).
The draft RFC for the MIME type text/xml will be made public soon. It gives
some more policy on these issues.
>In the section on autodetection of character encodings the XML spec
>states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and thus,
>strictly speaking, in error)"
>
> My question is, why is this an error rather than a perfectly
>acceptable untransformed UCS-2 document?
>
>
The reason that the encoding declaration and BOM are not mandatory was
to allow legacy documents and software systems to be used. But there seems
to be a general concensus with the XML SIG/WG people that all new XML
documents should have either a BOM or an explicit XML declaration
(with the encoding attribute).
As part of the XML-development process, some of us asked the question:
"What are the intrinsic properties of text?", with the idea that if
something was
intrinsic ("prime metadata") then XML should provide a standard way of
representing it. The things we came up with were notation (i.e. XML,
including
version and treatment of white-space xml:space), encoding, language
(i.e., xml:lang). All these things are fundamental to any parsing of text
files
in a world wide web. (I also think that written-script is also an intrinsic,
but
the ISO standard for script codes was not finished and the RFC 1766
language format was flexible enough for most needs, it was thought.)
Software developers should rise to this challenge. When you write out an XML
file, make sure you write out the appropriate XML header: Don't treat it as
an option but as a necessity. And if you are writing UTF-16/Unicode tools,
always write out the BOM as the first character. With XML we have the chance
to not get out of the character set and encoding maze: not by being forced
to use a single encoding, but by disconnecting encoding from document
character set (i.e. ISO 10646) and by clearly labelling which encoding is
being used.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Sat May 9 02:37:29 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:14 2004
Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus))
Message-ID: <003501bd7ae2$dd719d10$a20b4ccb@NT.JELLIFFE.COM.AU>
If you are worried about file size, then compress your files.
Tags, being short strings commonly used, compress really nicely with
the common compression algorithms out there.
I once did an experiment (to confirm one that Gavin Nicol had done)
using a document with several thousand lines, each with one start-tag,
end-tag pair and no content. I tried compressing this file with
* no minimization
* short end-tags
* end-tag ommission
The uncompressed file was something like 50K. The compressed files
differed by only a few 100bytes from each other. The gains from short
end-tags did not carry over into the compressed versions, and the
compressed versions were so much smaller there seemed little
contest.
Having short-tag ommision can only compress a document by less
than 50% at an improbable maximum (in the case of a document with
not data, not attributes, no white-space, and incredibly long GIs).
Compression is a far better approach.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Sat May 9 02:56:16 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:01:14 2004
Subject: parser for xml-data?
Message-ID: <006e01bd7ae5$7a49ac70$a20b4ccb@NT.JELLIFFE.COM.AU>
From: Ron Bourret
> The only major difference I have found so far for XML is that the elements
in
> XML documents are ordered, while data members in OO programming languages
are
> not.
There is an error in the (January 5?) XML-data report, in the very first
example.
It gives the clear impression that XML-data does not constrain sequence for
the element types it describes. In the example, the declarations for the
element
types which can appear as the content of an element types are given in one
order, but the instance has them in another order. (A Microsoft
representative
pointed this out to me: I dont know why they haven't just reissued the note,
since it
is a fairly cricitical point for implementors.)
Also note that the usage of ISO 8601 date formats seems to be wrong.
ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and not 19980509,
last time I looked.
If anyone is thinking of implementing XML-data, I suggest they befriend the
authors, because the report misses out on several key issues. (I have
previously
mentioned that is does not seem to make clear whether you can have an
XML-data schema as part of a document, or whether it must be external. If it
is internal, can it describe the document's root element? I suppose a close
reading of the XML-data text might help, but it is not clear to me after
dozens of readings, but I do not claim to be particularly brilliant in this
area.)
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alex at veosystems.com Sat May 9 03:18:07 1998
From: alex at veosystems.com (alex@veosystems.com)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD again
Message-ID: <19980509011757.19901.qmail@veosystems.com>
A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 1866 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/a6a71159/attachment.bat
From tbray at textuality.com Sat May 9 03:26:53 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD again
Message-ID: <3.0.32.19980508182528.00b5c2b0@pop.intergate.bc.ca>
At 06:17 PM 5/8/98 -0700, alex@veosystems.com wrote:
>Currently, validating parsers are the only real *safe* bet.
Whatever the limitations of the SDD, this statement is simply
false. There are many useful applications for non-validating
processors; in fact, I suspect, a very large majority of XML
apps. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alex at veosystems.com Sat May 9 03:52:23 1998
From: alex at veosystems.com (alex@veosystems.com)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD again
In-Reply-To: <3.0.32.19980508182528.00b5c2b0@pop.intergate.bc.ca> from "Tim Bray" at May 8, 98 06:25:30 pm
Message-ID: <19980509015211.20167.qmail@veosystems.com>
A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 1934 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/5ff16937/attachment.bat
From jnava at Adobe.COM Sat May 9 05:29:43 1998
From: jnava at Adobe.COM (Joel A. Nava)
Date: Mon Jun 7 17:01:14 2004
Subject: FW: IDL in XML or SGML
Message-ID: <001801bd7afa$87e20070$b83f2099@moebius.corp.adobe.com>
I hope this is the first time this has come through, as the email was
bounced from the Mail Delivery System [Mailer-Daemon@ic.ac.uk] the last 2
times I sent it. I am guessing that it was because I was subscribed to the
digest and not the regular list. So, I am back on the regular list now.
Joel
-----Original Message-----
From: Joel A. Nava [mailto:jnava@adobe.com]
Sent: Thursday, May 07, 1998 10:32 PM
To: xml-dev@ic.ac.uk
Cc: Joel A. Nava
Subject: IDL in XML or SGML
I am forwarding the question of a colleague who is not on this list. I would
appreciate being cc'ed, as I am getting xml-dev in digest form.
Thanks,
Joel
=================================================================
Is anyone aware of an XML application which encodes an Interface Description
Language, such as for DCE RPC, COM or CORBA? I'm particularly looking for
an XML (or even SGML) application of encoding the Microsoft IDL (MIDL).
I'd appreciate pointers.
Thanks,
Perry
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sat May 9 06:33:03 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:14 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508154119.0077c1e8@postoffice.swbell.net>
Message-ID: <3553DCB0.50431ADB@technologist.com>
W. Eliot Kimber wrote:
>
> Markup minimization is a slippery slope--in for a penny, in for a pound.
I don't think that slippery slope arguments are interesting in technical
discussions. You and I are smart enough to hammer out reasonable
compromises. XML is full of such reasonable compromises. XML *is* such a
compromise.
One could have made the argument that as soon as you allowed element type
declarations (for example) you are inevitably on the slope down to
including every kind of declaration and pseudo-declaration SGML allows. Or
once you make one concession to bandwidth (e.g. the external entity
optionality) you are going to end up making all such concessions (of which
markup minimization would be one). Or once you made one concession to easy
parsability (fixed delimiters), you are going to end up making them all
(but we allowed alternate literal string delimiters). Etc. etc. etc.
But of course reasonable people do not (and did not) go that route. We
weighed the costs and benefits of *each* feature more or less
individually, and kept some and threw out some. I believe that this
process works just as well for minimization. Just because one language
went overboard does not mean that all future ones must shun minimzation as
if it were heroine.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Sat May 9 06:36:37 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD again
Message-ID: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca>
At 06:52 PM 5/8/98 -0700, alex@veosystems.com wrote:
>No, it is not false. I hightlighted the word 'safe'. If you absolutely
>*must* know that everything was read and interpreted correctly, you *must*
>use a validating parser. There are many applications where this is not
>an absolute requirement and, thus, you may use a well-formed parser.
No. This claim is without technical merit and I cannot let it pass
unchallenged. It is trivially possible to achieve correctness and 100%
unquestioned reliability without the use of a validating processor. If you
either (a) construct your DTD so that standalone='true' or (b) don't have
external markup declarations, then you can have NO DOUBT WHATSOEVER that the
document is being parsed correctly, for any sane denotation or connotation of
"correctly". So please stop making this demonstrably false assertion.
Not only is your premise false in theory, it is vacuous in practice. If you
think, for any real-world application, that its validation against some DTD
guarantees "correctness" in any nontrivial sense, then I don't want to
go anywhere near your software. Validity is a highly specific claim, one
which is of great utility in many applications, but it does not equate to
having "safety" or "correctness". Equally, lack of validity does not
equate to lacking "safety" or "correctness".
>In addition, there are some applications that need some level of guarantee
>about whether external declaration subsets will be read and honored. It is
>this class of applications that we cannot address today with the current
>definition of well-formed.
This statement is correct, except for the unnecessary temporizing about
"some level of guarantee". "Guarantee" is a binary condition; if you need
a guarantee that the organization to which you are sending information
will have external declarations read, then you need to specify the use of
a validating processor at that end. If not, then not.
But please don't equate this particular guarantee with general concepts
of "safety" or "correctness" - doing so gives the impression that the use of
documents which are merely well-formed is in some way sloppy or irresponsible;
such a claim is fatuous and very, very, very unhelpful.
-Tim Bray
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Sat May 9 06:48:41 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD bogus
References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com>
Message-ID: <3553D056.5D6D5087@jclark.com>
David Megginson wrote:
> As many XML parser writers have shown, resolving external entities is
> one of the easiest parts of XML
It depends what sort of API your parser has. If it has an API like SAX
(or Aelfred or XP) where the parser makes calls to get input, it is
indeed easy. But that sort of API is not what is needed for convenient
integration in Web browsers (at least not for Netscape). They need an
API like expat's where the parser never blocks waiting for input, but
instead the browser makes calls on the parser giving it input as it
becomes available. Expat doesn't support parsing of external entities
at all at the moment. I suspect it would be possible to make it support
external general entities, and I'll probably do that at some point. It
wouldn't be practical to support external parameter entities or external
DTD subsets without switching to a different kind of API which be less
convenient for Web browser integration.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jjc at jclark.com Sat May 9 06:49:19 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD bogus
References: <3553061D.BF3A3461@technologist.com>
Message-ID: <3553DD4F.D3CAE0C6@jclark.com>
Paul Prescod wrote:
> I'm concerned because I believe
> this to be a valid XML document:
>
>
>
>
> ]>
>
>
> In my opinion, section 5.1 will require the non-validating parser to skip
> the attribute list declaration, even if memo.dtd is an empty file.
This is very good point.
Your example isn't quite right: the entity must be referenced. Also a
non-validating parser only has to skip the ATTLIST declaration if it
skips the entity reference. Apart from this, your interpretation of 5.1
is the obvious one. Expat behaves consistently this.
I think this is a serious problem, because it breaks the principle that
if you declare your document as standalone=yes and validate it, then you
will get the same result when you parse it with any non-validating
parser (which to me is the point of the SDD).
I think a bit of creative interpretation would be in order. Section 5.1
says:
[Non-validating processors] must not process
entity declarations or attribute-list declarations encountered after a
reference to a parameter entity that is not read, since the
entity may have contained overriding declarations.
The "since" clause is false when standalone=yes, so I think this can
fairly be said to be an inconsistency in the spec (rather than simply a
poor design choice), which should be resolved by not applying this
requirement when standalone=yes.
The other way to fix this would be to tweak the definition of standalone
to say that declarations after the first reference to an external
parameter entity count as external for the purposes of determining
whether the document is standalone.
This is clearly needs to be fixed one way or the other.
> Despite its reputation to the contrary,
> XML is intricate and deep and I may have missed something important.
Yes. Entities and the SDD are both tricky: the interaction of the two is
particularily so.
James
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sat May 9 07:00:09 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:14 2004
Subject: SDD bogus
References: <3553061D.BF3A3461@technologist.com>
<199805081351.JAA00618@unready.microstar.com>
<35535E95.3B3F5B23@technologist.com> <199805082043.QAA02512@unready.microstar.com>
Message-ID: <3553E309.ED168257@technologist.com>
David Megginson wrote:
>
> It's a matter of finding the right balance: XML must be simple enough
> to parse that there is a healthy competition among parser writers (and
> thus, a better quality and larger choice of tools for application
> writers), but simple enough to write that authors (including
> developers of tools that generate XML) are willing to learn and use
> it.
I agree 100%. I just believe that XML does NOT have balance.
> SGML, with its bizarre tag-omission rules, delimiter-in-context
> recognition, shortrefs, etc., gave too much away to the authors, and
> as a result, there were very few good SGML parsers implemented (and
> never a one in Java, although many of us tried).
This strikes me as similar to Eliot's argument. We've gone too far in the
past so let's not go anywhere near there again. Rather, the opposite
should be true: we know from experience the limits of what is reasonable
and should standardize it for all of the same reasons we standardized many
other good practices in XML.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From andrewl at microsoft.com Sat May 9 07:19:46 1998
From: andrewl at microsoft.com (Andrew Layman)
Date: Mon Jun 7 17:01:15 2004
Subject: Less verbose XML (Was: A little wish for short end tags (Was:
RE: SDD bogus))
Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA177@red-msg-08.dns.microsoft.com>
This is not to debate the value of short end tags. That's been debated.
But just as a technical matter, I also ran some tests a while ago, and found
that files with short end tags compress 5 to 10 percent better than those
with full end tags. I believe the reason is that with full end tags there
are more unique pairs of end/start, while with short end tags there are only
as many end/start pairs as types of start tag. So a compression scheme that
first shortened end tags, then applied a standard compression would be more
efficient size-wise.
-----Original Message-----
From: Rick Jelliffe [mailto:ricko@allette.com.au]
Sent: Friday, May 08, 1998 5:39 PM
To: 'XML-DEV'
Subject: Re: Less verbose XML (Was: A little wish for short end tags
(Was: RE: SDD bogus))
If you are worried about file size, then compress your files.
Tags, being short strings commonly used, compress really nicely with
the common compression algorithms out there.
I once did an experiment (to confirm one that Gavin Nicol had done)
using a document with several thousand lines, each with one start-tag,
end-tag pair and no content. I tried compressing this file with
* no minimization
* short end-tags
* end-tag ommission
The uncompressed file was something like 50K. The compressed files
differed by only a few 100bytes from each other. The gains from short
end-tags did not carry over into the compressed versions, and the
compressed versions were so much smaller there seemed little
contest.
Having short-tag ommision can only compress a document by less
than 50% at an improbable maximum (in the case of a document with
not data, not attributes, no white-space, and incredibly long GIs).
Compression is a far better approach.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alex at veosystems.com Sat May 9 07:55:53 1998
From: alex at veosystems.com (alex@veosystems.com)
Date: Mon Jun 7 17:01:15 2004
Subject: SDD again
In-Reply-To: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca> from "Tim Bray" at May 8, 98 09:35:05 pm
Message-ID: <19980509055543.21260.qmail@veosystems.com>
A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 4729 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/2e364cef/attachment.bat
From Jon.Bosak at eng.Sun.COM Sat May 9 09:30:31 1998
From: Jon.Bosak at eng.Sun.COM (Jon Bosak)
Date: Mon Jun 7 17:01:15 2004
Subject: NOTE: Stylesheets and XML
Message-ID: <199805090728.AAA25990@boethius.eng.sun.com>
At the request of the W3C XML WG, James Clark has put into the form of
a Note a proposal made some time ago for a simple method for
associating stylesheets with XML documents. This method closely
parallels the mechanism used for the same purpose in HTML 4.0. It is
not an official recommendation, but the proposal has already been
successfully implemented and should serve well for the first
experiments with XML/CSS browsers. See
http://www.w3.org/TR/NOTE-xml-stylesheet
Jon Bosak
Chairman, XML WG
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Sat May 9 10:39:21 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:15 2004
Subject: SDD again
In-Reply-To: <19980509055543.21260.qmail@veosystems.com>
References: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca>
Message-ID: <3.0.1.16.19980509082154.7d6fb61e@pop3.demon.co.uk>
At 22:55 08/05/98 -0700, [several people] wrote:
[... important and occasionally emotive discussion of SDD snipped...]
This discussion exemplifies one of the roles we hoped XML-DEV could address
- that different people could make different semantic assumptions from the
spec. I am not surprised that this particular problem has surfaced because
the distinction between validity and well-formedness is new and because it
has been clear that it is a difficult one to formalise. In the current
situation it would appear that different parser-creators can create code
which can sometimes produce different parse trees from the same document.
[Please correct me if I'm wrong.] These writers are not incompetent - they
take different interpretations of what the spec requires to be done and
under what circumstances.
If - as appears - we have a spec which is fuzzy in places then we it would
be extremely useful to know - dispassionately - where the problems lie. If
they can be identified then we might be able to:
- agree on a common course of action and/or
- agree that the document creator could encode her wishes in some way and/or
- agree that the parserwriter could make various options available to the
reader/client (e.g. through commandline switches).
Some validating parsers (e.g. DXP) already have commandline switches (e.g.
-v for validate) and I suspect that there will be increasing pressure to
add more ('ignore standalone declaration' for example). If we could
systematise these it could be one way to reduce unexpected effects of parsing.
[But I'm not the right person to offer :-)]
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alex at veosystems.com Sat May 9 16:05:43 1998
From: alex at veosystems.com (alex@veosystems.com)
Date: Mon Jun 7 17:01:15 2004
Subject: SDD again
In-Reply-To: <3.0.1.16.19980509082154.7d6fb61e@pop3.demon.co.uk> from "Peter Murray-Rust" at May 9, 98 08:21:54 am
Message-ID: <19980509140520.22138.qmail@veosystems.com>
A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 3056 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/bbdc29ad/attachment.bat
From simpson at polaris.net Sat May 9 16:26:03 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun 7 17:01:15 2004
Subject: Attribute type NMTOKEN
In-Reply-To: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net>
References: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3.0.3.32.19980509102520.008be510@nexus.polaris.net>
At 09:50 AM 5/8/98 -0400, in a stuporous moment I wrote:
>a type of NMTOKEN would not allow a value such as
>"1ABC," "//ENGLISH," or " WHITESPACE."
At 04:25 PM 5/8/98 -0400, Chris Maden clarified:
>A possible misunderstanding here: a NMTOKEN itself can't contain
>whitespace, but the attribute value specification, the quoted thing
>after the equals sign, can: so
>
>*is* a valid NMTOKEN attribute specification. The NMTOKEN in this
>example is only "WHITESPACE", though; the leading (and trailing)
>whitespace is trimmed.
Then at 03:31 PM 5/8/98 -0600, Russ Heithoff further clarified:
>Also, if I remember right, a NMTOKEN can start with Name characters. This
>is in contrast to a Name which must start with Name Start characters
>followed by characters. In otherwords 1ABC is a legal NMTOKEN but not a
>legal NAME.
Sheesh -- 2 flubs out of 3 swings at it!
So to summarize:
(1) The purpose of the NMTOKEN attribute type is to constrain the content
of attribute values by their *form* (rather than by their actual value, as
would be done with an enumeration). (I think this general statement was
true even though I muffed the examples!)
(2) The *form* of an attribute value declared as type NMTOKEN is such that
it may contain only valid NAME characters. The valid name characters are
"letters" (alphabetics as expressed in ASCII Latin, ISO Latin, and so on);
digits (also including ASCII and various international sets); limited
punctuation (full stop (.), hyphen (-), colon (:)); and various combining
and extending characters required for internationalization. Internal
whitespace is not allowed in NMTOKEN-type attribute values. [Constraints in
item (2) established by productions #4, 7, and 84-89 in REC-xml-19900210.]
(3) Leading and trailing whitespace is trimmed/ignored before determining
whether the value is valid for a NMTOKEN-type attribute.
Thanks to Chris (again) and Russ for correcting my examples.
Best regards, | It's no disgrace t'be poor,
John E. Simpson | but it might as well be.
simpson@polaris.net | -- "Kin" Hubbard
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From robin at ACADCOMP.SIL.ORG Sat May 9 20:03:42 1998
From: robin at ACADCOMP.SIL.ORG (Robin Cover)
Date: Mon Jun 7 17:01:15 2004
Subject: XML conformance and test suites
Message-ID: <199805091810.NAA03382@ACADCOMP.SIL.ORG>
Alex Milowski said:
> Such a test suite and conformance to it needs to be governed by some
> standards and/or consortium. OASIS springs to mind...
OASIS (formerly SGML Open) has set up an XML Conformance Subcommittee,
currently under the direction of G. Ken Holman. Ken overviewed the
work plans of the subcommittee at the Seattle XML Conference, and
slides from his presentation are to be posted on the OASIS web site.
See other referrences/hints in the "XML Conformance" section of the
SGML/XML Web Page:
http://www.sil.org/sgml/xml.html#conformance
Robin Cover
-------------------------------------------------------------------------
Robin Cover Email: robin@acadcomp.sil.org
6634 Sarah Drive
Dallas, TX 75236 USA >>> The SGML/XML Web Page <<<
Tel: +1 (972) 296-1783 (h) http://www.sil.org/sgml/sgml.html
Tel: +1 (972) 708-7346 (w)
FAX: +1 (972) 708-7380
=========================================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Sun May 10 01:13:30 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:01:15 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net>
Message-ID: <3554E2E6.6E13B796@allette.com.au>
W. Eliot Kimber wrote:
> Just because *your* requirements make it obvious that empty end tags are
> enough, that doesn't mean that everybody's do. Any decision short of no
> minimization or full minimization will be an arbitrary one that will cause
> those whose requirements are not met to wonder why.
Agreed. It would only be a matter of time before someone asked for:
paragraph 1>
<>paragraph 2
<>paragraph 3>
to be made legal, (as it is in SGML) for exactly the same reasons as were
originally proposed.
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From adam at cyber-guru.com Sun May 10 01:55:58 1998
From: adam at cyber-guru.com (Adam M. Donahue)
Date: Mon Jun 7 17:01:15 2004
Subject: EBNF again...
In-Reply-To: <3554E2E6.6E13B796@allette.com.au>
Message-ID: <9805092355.AA09694@acf2.nyu.edu>
So no one out there knows the answer to my question?
Adam
mailto:adam@cyber-guru.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sun May 10 03:02:58 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:15 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au>
Message-ID: <3554FCF0.D08AB8FB@technologist.com>
Marcus Carr wrote:
>
> Agreed. It would only be a matter of time before someone asked for:
Thankfully, whatever language the conversation is conducted in is likely
to have the word "no" in it. We've got quite proficient at using the
English version in the XML effort, which is why slippery slope arguments
strike me as bizarre. XML is precisely about achieving compromises --
about walking halfway down the slope without losing your balance.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From SimonStL at classic.msn.com Sun May 10 03:17:20 1998
From: SimonStL at classic.msn.com (Simon St.Laurent)
Date: Mon Jun 7 17:01:15 2004
Subject: NOTE: Stylesheets and XML
Message-ID:
Jon Bosak wrote:
>At the request of the W3C XML WG, James Clark has put into the form of
>a Note a proposal made some time ago for a simple method for
>associating stylesheets with XML documents. This method closely
>parallels the mechanism used for the same purpose in HTML 4.0. It is
>not an official recommendation, but the proposal has already been
>successfully implemented and should serve well for the first
>experiments with XML/CSS browsers. See
>
> http://www.w3.org/TR/NOTE-xml-stylesheet
After taking a look at the note, I have a concern.
The note proposes
as the equivalent for the HTML
(There are, of course, many more examples...)
The same information, for the most part, is contained in both statements, and
a processing instruction ducks the issues raised by using an element to
implement the link. Still, I'm wondering if XML isn't already becoming loaded
down with different ways to reference external material.
External entities provide one mechanism for including material. The DOCTYPE
declaration provides another mechanism. This processing instruction is yet
another mechanism. XLink provides another set of mechanisms which at least
'transclude' information if not 'include' it.
I realize that these things are are connecting material of different types in
different contexts, but it seems too many mechanisms are providing similar
functionality in different contexts.
I'm definitely _not_ saying that this way of implementing stylesheets is a bad
idea - it's probably the most convenient way to do it in many circumstances.
I'm hoping strongly that we don't find any other needs that require another
reference of a different type.
Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer / Cookies
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From bckman at ix.netcom.com Sun May 10 03:31:46 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun 7 17:01:15 2004
Subject: EBNF again...
Message-ID: <01bd7bcd$3ee43fe0$2facdccf@uspppBckman>
Adam,
I'm not quite sure I followed the gist of your original question, but
the following are legitimate characters in XML
Sect 2.2
Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] |
[#x10000-#x10FFFF]
and as you can see the ASCII equivelents of 9,10,13,and 32(i.e. White
space) are all called legitimate characters.
On the other hand if you look at name char, and check out the letter,
the digit, the CombiningChar and the extender productions you will see that
they do not include whitespace.
[4] NameChar ::= Letter | Digit | '.' | '-' | '_' | ':' | CombiningChar |
Extender
Whether you can include white space or not depends on the notation.
On the other hand the literal string " abc " is ASC(32) +ASC(97) +ASC(98)
+ASC(99) + ASC(32)
Does this answer your question?
Frank
-----Original Message-----
From: Adam M. Donahue
To: xml-dev@ic.ac.uk
Date: Saturday, May 09, 1998 4:59 PM
Subject: EBNF again...
>So no one out there knows the answer to my question?
>
>Adam
>
>mailto:adam@cyber-guru.com
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Mon May 11 00:01:11 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:01:15 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com>
Message-ID: <35562374.DF2B9B4A@allette.com.au>
Paul Prescod wrote:
> Marcus Carr wrote:
>
> > Agreed. It would only be a matter of time before someone asked for:
>
> Thankfully, whatever language the conversation is conducted in is likely
> to have the word "no" in it. We've got quite proficient at using the
> English version in the XML effort, which is why slippery slope arguments
> strike me as bizarre. XML is precisely about achieving compromises --
> about walking halfway down the slope without losing your balance.
I'd be interested to hear how you would justify allowing short end tags and not
short start tags; after all, they're trying to accomplish exactly the same thing,
so any argument for one surely supports the other. Without adequate justification
for short end tags and against short start tags, any decision will appear to be
arbitrary rather than a balanced compromise. This is what the slippery slope
scenario means to me; your "halfway" won't be everyone elses.
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cys at arbortext.com Mon May 11 01:21:14 1998
From: cys at arbortext.com (Cynthia Lenora Shern)
Date: Mon Jun 7 17:01:15 2004
Subject: Is anybody working on ...
Message-ID: <98May10.191726edt.26887@thicket.arbortext.com>
This makes sense to me too.
And, do you envision this particular family of markup residing internal or
external to XML instances (as a perferred model)?
At 03:13 PM 4/29/98 -0400, Paul Prescod wrote:
>I would think that in the case where the formatting is an intrinsic part
>of the meaning of a document, it should be represented in the markup just
>like everything else. XML encourages you to separate formatting from the
>abstraction, but it does not require you to. As someone else pointed out,
>Precision Graphics Markup Language is an XML-based language designed
>explicitly for formatting -- but it is still XML.
>
> Paul Prescod - http://itrc.uwaterloo.ca/~papresco
>
>"Perpetually obsolescing and thus losing all data and programs every 10
>years (the current pattern) is no way to run an information economy or
>a civilization." - Stewart Brand, founder of the Whole Earth Catalog
>http://www.wired.com/news/news/culture/story/10124.html
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
>
Cynthia Shern (cshern@arbortext.com)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Mon May 11 01:23:43 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:15 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au>
Message-ID: <3556370D.FC18BF6E@technologist.com>
Marcus Carr wrote:
>
> I'd be interested to hear how you would justify allowing short end tags and not
> short start tags;
Simple: short end tags are trivial to understand, trivial to parse,
non-obfuscatory and context-free. They also have plenty of precedent in
many other languages. Short start tags behave differently depending on
context, and thus are not trivial to understand, are not conducive to good
document maintenance and are not implemented in any non-SGML language I
know of.
> after all, they're trying to accomplish exactly the same thing,
> so any argument for one surely supports the other. Without adequate justification
> for short end tags and against short start tags, any decision will appear to be
> arbitrary rather than a balanced compromise.
XML is a relatively arbritrary mix of SGML's most interesting features. I
certainly think that the decision to include some things and exclude
others seems arbitrary. C'est la vie. That's how you get a workable
subset.
> This is what the slippery slope scenario means to me; your "halfway" won't
> be everyone elses.
I didn't claim it had to be, nor should be. That's why the XML working
group holds votes to decide the outcome of decisions. Each person chooses
their halfway and the majority rules. This situation is no different than
any other.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cys at arbortext.com Mon May 11 01:41:56 1998
From: cys at arbortext.com (Cynthia Lenora Shern)
Date: Mon Jun 7 17:01:15 2004
Subject: Is anybody working on ...
Message-ID: <98May10.193809edt.26885@thicket.arbortext.com>
I (questioner) am wondering whether, the markup in question wouldn't be
used to drive multiple processes or presentations.
So I guess I'm assuming that if (for example) formatting needs to
"provide_for" references to objects (graphics, sound, video, code, and
"other_stuff_to_be_described"); and if these referenced objects would be
processed differently, or not all, or multiply, depending_on_the_processor
and media the processor was presenting the data to; then would an external
file would allow for more portability of instances?
08:44 PM 4/29/98 -0400, Paul Prescod wrote:
>James K. Tauber wrote:
>>
>> > Is anybody working on creating an abstration for preserving formatting
>> > information inside or along with SGML documents, in cases where format
>> > really is part_of the information.
>>
>> I'm wondering if something like this could be done with the Precision
>> Graphics Markup Language (see http://www.jtauber.com/xml/appl/pgml.html)
>> with out-of-line links asserting relationships between the precise
>> formatting expressed in PGML and the document in a more
application-specific
>> (eg survey) DTD.
>
>It isn't clear to me why information that the questioner feels is
>intrinsic to the document should be linked instead of embedded.
>
> Paul Prescod - http://itrc.uwaterloo.ca/~papresco
>
>"Perpetually obsolescing and thus losing all data and programs every 10
>years (the current pattern) is no way to run an information economy or
>a civilization." - Stewart Brand, founder of the Whole Earth Catalog
>http://www.wired.com/news/news/culture/story/10124.html
>
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
>
Cynthia Shern (cshern@arbortext.com)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cys at arbortext.com Mon May 11 02:06:21 1998
From: cys at arbortext.com (Cynthia Lenora Shern)
Date: Mon Jun 7 17:01:15 2004
Subject: Is anybody working on ...
Message-ID: <98May10.200235edt.26885@thicket.arbortext.com>
Exactly - semantically significant is what I'm searching for here.
In my first message I gave the example of kindergarten through grade 12
text books and marketing surveys as two examples that might require such
types of abstractions.
For K-12 the markup would probably have some sort of pedagogical
significance. The marketing survey type might have some sort of statistical
or perhaps psychological semantic.
I think they are probably XML application specific. And probably some
extension or addendum to an existing document type definition.
So, based on what is hopefully more clarification, is anyone aware of some
markup models that are addressing this type information.
At 09:19 PM 4/29/98 -0400, David Megginson wrote:
>Paul Prescod writes:
>
> > Still, it isn't wrong to use Java in a way that is tied to a
> > particular platform nor to use SGML in a way that is tied to a
> > particular formatter. I think that we agree on that central point.
>
>I don't think that that's the suggestion (though I agree that it would
>be a possible application of SGML/XML). Rather, the suggestion is
>that people may need to encode physical information about a text when
>that information is required for useful processing, possibly with a
>wide range of XML processing tools. More controversially, you could
>say that the formatting information is semantically-significant.
>
>
>All the best,
>
>
>David
>
>--
>David Megginson ak117@freenet.carleton.ca
>Microstar Software Ltd. dmeggins@microstar.com
> http://home.sprynet.com/sprynet/dmeggins/
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
>
Cynthia Shern (cshern@arbortext.com)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Mon May 11 08:10:47 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:16 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
In-Reply-To: <35562374.DF2B9B4A@allette.com.au>
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net>
<3554E2E6.6E13B796@allette.com.au>
<3554FCF0.D08AB8FB@technologist.com>
<35562374.DF2B9B4A@allette.com.au>
Message-ID: <199805110611.CAA00265@unready.microstar.com>
Paul Prescod has quite rightly objected to a simplistic slippery-slope
argument against short end tags. I have thought of what might be a
stronger argument: as an (unstated?) design principle, XML provides
exactly one alternative for nearly every markup item. For example,
the following are some of the possible start tags in SGML (the
second-last one is an example of a shortref):
""
""
"|"
""
The following are some of the possible end tags in SGML (the
second-last one, again, is a shortref):
""
"/"
">"
"|"
""
In XML, start tags with no attributes look like "" and end tags
with no attributes look like "". That saves a chapter or so
from every XML text, a month or so from each parser-writer's calendar,
a few days from each intro to XML course, etc. etc.
Of course, any good pedant will point out that "" (with trailing
whitespace)is a sort-of variant in XML, and that XML does provide what
might be called variant or redundant features (such as the choice of
"'" or '"' as literal delimiters, and pre-declared entities like
"&" where "&" would serve as well).
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Mon May 11 10:41:28 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:01:16 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com>
Message-ID: <3556B982.2A0C2B68@allette.com.au>
Paul Prescod wrote:
> > I'd be interested to hear how you would justify allowing short end tags and not
> > short start tags;
>
> Simple: short end tags are trivial to understand...
> Short start tags behave differently depending on context, and thus are not trivial to
> understand,
If you mean that they behave differently based on whether OMITTAG is set to 'yes' or
'no', that wouldn't be an issue for XML.
> are not conducive to good document maintenance
I couldn't agree more, but short end tags can be abused in the same way.
> and are not implemented in any non-SGML language I know of.
Me either, though I'm not sure if that matters.
I don't want short start tags, I don't like short start tags and I have never
(intentionally) used short start tags, but unless I have misunderstood you, the reasons
you have given above don't seem sufficient basis for voting them down while supporting
short end tags. If (as I'm sure most of us believe) these issues are decided primarily
on the basis of intelligent discourse rather than gut feeling, it would appear that
there is not yet enough to separate short start tags from short end tags. If gut
feeling does come into it, then so be it - I can accept that, I just wasn't aware that
was the way the process works.
This is getting a mile off topic...
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From M.H.Kay at eng.icl.co.uk Mon May 11 11:27:42 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun 7 17:01:16 2004
Subject: Attribute type NMTOKEN
Message-ID: <002e01bd7cbf$2eba4900$1e09e391@mhklaptop.bra01.icl.co.uk>
>A possible misunderstanding here: a NMTOKEN itself can't
contain
>whitespace, but the attribute value specification, the
quoted thing
>after the equals sign, can: so
>
>*is* a valid NMTOKEN attribute specification. The NMTOKEN
in this
>example is only "WHITESPACE", though; the leading (and
trailing)
>whitespace is trimmed.
You're right of course, but I had a devil of a job
discovering why! This is one example of the spec being
terribly unhelpful: it says quite clearly (in section
3.3.1): "Values of type NMTOKEN must match the Nmtoken
production...", and then qualifies this in section 3.3.3, by
saying in effect: "the value I was talking about in section
3.3.1 isn't the "attribute value" defined in section 3.1 as
you might have thought, rather it's the "normalized
attribute value" defined as follows..."
Next time round could we please have a bit more rigour in
the spec!?
Mike Kay
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From M.H.Kay at eng.icl.co.uk Mon May 11 11:39:35 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun 7 17:01:16 2004
Subject: ISO Date (was: parser for xml-data?)
Message-ID: <006d01bd7cc0$cf7bbd00$1e09e391@mhklaptop.bra01.icl.co.uk>
Rick Jelliffe:
>Also note that the usage of ISO 8601 date formats seems to
be wrong.
>ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and
not 19980509,
>last time I looked.
It allows both. The first form is preferred where the text
is intended to be human-readable, but the second is
permitted where it is primarily for machine use. [That's
from memory, I don't have the spec to hand].
Mike Kay
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From edz at bsn.com Mon May 11 13:51:26 1998
From: edz at bsn.com (Edward C. Zimmermann)
Date: Mon Jun 7 17:01:16 2004
Subject: ISO Date (was: parser for xml-data?)
In-Reply-To: <006d01bd7cc0$cf7bbd00$1e09e391@mhklaptop.bra01.icl.co.uk> from "Michael Kay" at May 11, 98 10:40:18 am
Message-ID: <199805111151.NAA07681@gils.bsn.com>
>
> Rick Jelliffe:
> >Also note that the usage of ISO 8601 date formats seems to
> be wrong.
> >ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and
> not 19980509,
> >last time I looked.
>
> It allows both. The first form is preferred where the text
> is intended to be human-readable, but the second is
> permitted where it is primarily for machine use. [That's
> from memory, I don't have the spec to hand].
ISO date formats:
YYYYMMDDTHHMMSS[Z]
YYYYMMDDTHH:MM:SS[Z]
YYYY-MM-DDTHHMMSS[Z]
YYYY-MM-DDTHH:MM:SS[Z] (eg. 1996-07-16T13:19:51Z)
Z := UTC (aka GMT) else unspecified (resp. local)
>
> Mike Kay
>
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
--
______________________
Edward C. ZimmermannBasis Systeme netzwerk/Munich
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Mon May 11 13:52:40 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:16 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> <3556B982.2A0C2B68@allette.com.au>
Message-ID: <3556E683.EC0BE1F1@technologist.com>
Marcus Carr wrote:
>
> Me either, though I'm not sure if that matters.
> I don't want short start tags, I don't like short start tags and I have never
> (intentionally) used short start tags, but unless I have misunderstood you, the reasons
> you have given above don't seem sufficient basis for voting them down while supporting
> short end tags. If (as I'm sure most of us believe) these issues are decided primarily
> on the basis of intelligent discourse rather than gut feeling, it would appear that
> there is not yet enough to separate short start tags from short end tags. If gut
> feeling does come into it, then so be it - I can accept that, I just wasn't aware that
> was the way the process works.
Call it gut feeling. Call it years of experience. Call it common sense.
Call it a simple matter of drawing a line somewhere just for the sake of
drawing the line. It doesn't matter. In short: the perceived and observed
usability and usefulness of short-end tags is high and that of short
start-tags is low for a variety of reasons. If nobody on the list
disagrees with that fundamental point then I don't really see why we
should prepare an argument against *just in case*. Nobody wants short
start tags, and even if they did, the working group wouldn't put them in.
I'm sure a simple poll would reveal unanimous agreement on that issue.
It's really that simple.
Markup language design is not at all precise, and is influenced much more
strongly by subjective factors than by definitive arguments. I expect that
designing languages to manage the interface between human beings and
computers can only be done subjectively.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rajesh_n1 at verifone.com Mon May 11 16:16:07 1998
From: rajesh_n1 at verifone.com (Rajesh N)
Date: Mon Jun 7 17:01:16 2004
Subject: freewares..
Message-ID: <3.0.5.32.19980511194649.00808cf0@blr-nt-mail1.verifone.com>
Hi all,
could anyone give me some pointers regarding the availability
of any freewares with/without source code for the following..
i) An XML parser written in C++
ii) An XML parser written in C but should be DTD aware
iii) Any XML generator API that uses a DTD while building an
XML doc ? (ie, performs validations during the manipulation
operations of the tree nodes )
thanks in advance,
Rajesh N.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alaly at inlink.com Mon May 11 16:56:19 1998
From: alaly at inlink.com (Michael Alaly)
Date: Mon Jun 7 17:01:16 2004
Subject: XML parser in C++
Message-ID: <03d701bd7ced$502901c0$06001cac@mike>
I believe that Microsoft released a C++ parser (no code) as well as a Java Parser which included code. They can both be found at www.microsoft.com/xml
hope this helps,
Michael Alaly
alaly@inlink.com | 314.878.6474
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980511/331869c4/attachment.htm
From Bryan_Gilbert at pml.com Mon May 11 17:48:04 1998
From: Bryan_Gilbert at pml.com (Bryan Gilbert)
Date: Mon Jun 7 17:01:16 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
Message-ID:
Yes don't forget that XML is meant
for Humans too. Short end tags are a pain
when there is lots of intermediate text.
C programmers are familar with this
when they have to deal with nested
#ifdef #endif statements: the #endif part
is a short end tag.
For non-C programmers consider trying
to find out which start tag > belongs
to after you have read pages and pages
of text.
So the best solution is the current solution: each
start tag has a complete matching end tag. IMHO
Bryan Gilbert
PS Formerly I wanted short end tags for embedded
processors but human readability is more important
to me.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From maillist at chris.hubick.com Mon May 11 21:13:09 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun 7 17:01:16 2004
Subject: SAX Parsers - standalone
Message-ID:
Up until now I have been using MSXML for my project at work. I am
now converting to use SAX. I have documents with the following:
The thing is, right now, "/dtd/JDF.dtd" doesn't exist. This is a
browsing application, and I don't want to do validation. And I need to
feed the parser using an input stream. Using MSXML I can do
"document.setLoadExternal(false);" before I call load. With SAX, I
thought I would just be able to return null in my entityResolver, and I
would achieve the same functionality. Aelfred throws "no protocol:
/dtd/JDF.dtd".
What can I do to keep the parser from looking for external
documents? Since all the documents are standalone, I could remove the
DOCTYPE's from each, but I may want to do validation under some
circumstances in the future, and would like to avoid this if possible. I
also use the SystemLiteral to identify the document type (I guess I
could use the root element name = JDF).
Much Thanks!
---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From distobj at acm.org Mon May 11 22:08:52 1998
From: distobj at acm.org (Mark Baker)
Date: Mon Jun 7 17:01:16 2004
Subject: Less verbose XML (Was: A little wish for short end tags
(Was: RE: SDD bogus))
In-Reply-To:
Message-ID: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca>
At 03:09 PM 8/5/98 -0700, Bryan Gilbert wrote:
>Work for a while in a system that has 1/2 MB ROM,
>maybe 1MB RAM, and a CPU that is managing
>intense calculations and I/O.
>
>Am I alone? Or are there others out there with similar
>constraints who want to use XML? I'd like to hear from
>you.
I'm developing for small devices. And I must say, as a developer, it
certainly keeps you honest.
But I don't have a burning desire for short end tags.
At Beduin, we've learned the hard way about how difficult it is to render
HTML because some tags don't require end tags (in combo with the general
crappy state of HTML generators). I'd guess that our parser is twice as
large as it needed to be because of these problems. With short end tags,
in order for us to render XML, we'd have to have a similar amount of code
to render broken XML. So IMHO, short end tags would make my code larger
(though runtime memory requirements *might* drop - not sure).
BTW, while on the topic of verbosity, I thought the following position
paper by Rohit Khare for the "Future of HTML" conference might be of interest;
http://xent.ics.uci.edu/FoRK-archive/apr98/0465.html, "Requirements for
Interactive Access to HTML and XML Documents", aka "YML Requirements";
"XML has been accused of a great many sins, but brevity is not among them.
>From the 'waste' of to non-normative whitespace,
many a character can be considered extraneous. However, rather than
fragment off into new archival formats using compact end-tags,
rediscovering markup minimization, or paren-based S-expressions, we're best
off approaching the Shannon limit directly rather than hacking around it."
Well said.
MB
--
Mark Baker. CTO, Beduin Communications Corp
Ottawa, Ontario, CANADA http://www.beduin.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Tue May 12 00:04:11 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:01:16 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> <3556B982.2A0C2B68@allette.com.au> <3556E683.EC0BE1F1@technologist.com>
Message-ID: <355775A9.9358FC20@allette.com.au>
Paul Prescod wrote:
> Markup language design is not at all precise, and is influenced much more
> strongly by subjective factors than by definitive arguments. I expect that
> designing languages to manage the interface between human beings and
> computers can only be done subjectively.
Thanks for the explanation. As one with a passing interest in mathematics, I probably just
assumed that more formal approaches would be employed in the first instance, but never having
participated in such an endeavour there isn't any basis for that. I'm sure an element of
subjectivity is both expeditious and valuable when faced with some of these issues.
--
Regards
Marcus Carr email: mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email: info@allette.com.au
Level 10, 91 York Street www: http://www.allette.com.au
Sydney 2000 NSW Australia phone: +61 2 9262 4777
fax: +61 2 9262 4774
_______________________________________________________________
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Tue May 12 02:46:51 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:16 2004
Subject: Less verbose XML
In-Reply-To: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca>
References:
<3.0.2.32.19980511160727.00724810@pop1.sympatico.ca>
Message-ID: <199805120047.UAA00374@unready.microstar.com>
Mark Baker writes:
> At Beduin, we've learned the hard way about how difficult it is to
> render HTML because some tags don't require end tags (in combo with
> the general crappy state of HTML generators). I'd guess that our
> parser is twice as large as it needed to be because of these
> problems. With short end tags, in order for us to render XML, we'd
> have to have a similar amount of code to render broken XML. So
> IMHO, short end tags would make my code larger (though runtime
> memory requirements *might* drop - not sure).
I think that Mark is quite right. Just for interest, here are some
statistics, using Jon Bosak's ot.xml (with CRLF line-ends, and with a
different XML declaration at the top):
With full end tags: 3,880,187 bytes
With short end tags: 3,773,844 bytes (2.7% saving)
Here are the same files compressed with gzip -9 (savings are relative
to the uncompressed version with full end tags):
With full end tags: 994,835 bytes (74.4% saving)
With short end tags: 988,976 bytes (74.5% saving)
All the best,
David
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ak117 at freenet.carleton.ca Tue May 12 07:30:56 1998
From: ak117 at freenet.carleton.ca (David Megginson)
Date: Mon Jun 7 17:01:16 2004
Subject: Announcement: SAX 1.0 is finished
Message-ID: <199805120530.BAA03034@unready.microstar.com>
I am happy to announce public release 1.0 of SAX (the Simple API for
XML), a collaborative project of the members of the XML-DEV discussion
group. The first release of SAX is in Java, but versions in other
programming languages may follow. SAX is free for both commercial and
non-commercial use.
SAX is a common, event-based API for parsing XML documents. The first
draft of SAX was supported by IBM's XML for Java, DataChannel's DXP,
James Clark's XP, and Microstar's AElfred parsers. Support for Tim
Bray's Lark parser and Microsoft's MSXML parser was provided through
third-party drivers.
SAX fills the same role for XML that the JDBC fills for SQL: with SAX,
a Java application can work with any XML parser, as long as the parser
has a SAX 1.0 driver available. SAX 1.0 drivers for the major parsers
will be appearing shortly.
For more information, or to download SAX 1.0 with two sample drivers
(Lark and MSXML), please visit the following URL:
http://www.megginson.com/SAX/
Thanks and congratulations to everyone,
David Megginson
--
David Megginson ak117@freenet.carleton.ca
Microstar Software Ltd. dmeggins@microstar.com
http://home.sprynet.com/sprynet/dmeggins/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From akitoshi.yoshida at sap-ag.de Tue May 12 10:45:45 1998
From: akitoshi.yoshida at sap-ag.de (Akitoshi Yoshida)
Date: Mon Jun 7 17:01:16 2004
Subject: Less verbose XML (Was: A little wish for sh
Message-ID: <199805120831.KAA27617@hs2114.wdf.sap-ag.de>
standard text compression algorithms can generally
compress files with short end-tags than those with
full end-tags because these standard algorithms
can't build the right source model for XML (in practice).
to increase the compression factor, one should
model the input source more accurately using the
XML specs and the DTD of the input file.
but in this case, encoding/decoding needs more work.
regards,
aki yoshida
-----Original Message-----
From: Andrew Layman
This is not to debate the value of short end tags. That's been debated.
But just as a technical matter, I also ran some tests a while ago, and found
that files with short end tags compress 5 to 10 percent better than those
with full end tags. I believe the reason is that with full end tags there
are more unique pairs of end/start, while with short end tags there are only
as many end/start pairs as types of start tag. So a compression scheme that
first shortened end tags, then applied a standard compression would be more
efficient size-wise.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rajesh_n1 at verifone.com Tue May 12 12:11:14 1998
From: rajesh_n1 at verifone.com (Rajesh N)
Date: Mon Jun 7 17:01:16 2004
Subject: Qn. reg. Expat parser
Message-ID: <3.0.5.32.19980512154051.007c5550@blr-nt-mail1.verifone.com>
Hi all,
Does the James Clark's expat parser use external DocType
declarations , what i mean is something like this
Does it do parameter entity substitution if it does and also
General Entity substitution?? So far i have been unable to observe this
using the expat parser!!!
Any help ?
Thanks,
Rajesh N.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From robin at ACADCOMP.SIL.ORG Tue May 12 17:02:15 1998
From: robin at ACADCOMP.SIL.ORG (Robin Cover)
Date: Mon Jun 7 17:01:17 2004
Subject: SGML/XML and literate programming
Message-ID: <199805121509.KAA05497@ACADCOMP.SIL.ORG>
Today I created a document entitled "SGML/XML and Literate Programming"
as part of the SGML/XML Web Page. It is intended to be a collection of
references to literature programming (publications and ongoing work)
within the context of descriptive markup language applications.
See: http://www.sil.org/sgml/xmlLitProg.html
and the rationale for this new section, in the "What's New" page,
http://www.sil.org/sgml/sgmlnew.html
Readers subscribed to this list are invited to contact me via email
to report on other candidate URLs or bibliographic references. I
have not launched a major investigation to ascertain what activity
might now be underway, or what references to previous work might be
available.
So: your comments/corrections/additions are welcome.
Best wishes,
Robin Cover
-------------------------------------------------------------------------
Robin Cover Email: robin@acadcomp.sil.org
6634 Sarah Drive
Dallas, TX 75236 USA >>> The SGML/XML Web Page <<<
Tel: +1 (972) 296-1783 (h) http://www.sil.org/sgml/sgml.html
Tel: +1 (972) 708-7346 (w)
FAX: +1 (972) 708-7380
=========================================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From James.Anderson at mecom.mixx.de Tue May 12 17:24:34 1998
From: James.Anderson at mecom.mixx.de (james anderson)
Date: Mon Jun 7 17:01:17 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References:
Message-ID: <35586A0E.4A785960@mecom.mixx.de>
maybe i've been writing lisp too long, and i've even come to trust my text
editor's ability to balance parens, but even before the days of emacs and co.,
reading (and indenting, if need be) for balancing became second nature.
i won't repeat paul graham's argument from "on lisp" here, but it came down to
"you end up reading shapes, not parentheses. if you get to need to count the
parentheses, the cause is lost anyway..."
besides, the editors i've seen for c and java these days not only blink
braces, they even display them in pretty colors.
shouldn't one expect the same of xml editors - to the extent that they're not wysiwyg?
Bryan Gilbert wrote:
>
> Yes don't forget that XML is meant
> for Humans too. Short end tags are a pain
> when there is lots of intermediate text.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From James.Anderson at mecom.mixx.de Tue May 12 17:26:17 1998
From: James.Anderson at mecom.mixx.de (james anderson)
Date: Mon Jun 7 17:01:17 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com>
Message-ID: <35586A2C.C77EF45E@mecom.mixx.de>
although i agree with mr prescod's intent here, the details are not quite complete
Paul Prescod wrote:
>
> Marcus Carr wrote:
> >
> > I'd be interested to hear how you would justify allowing short end tags and not
> > short start tags;
>
> Simple: short end tags are trivial to understand, trivial to parse,
> non-obfuscatory and context-free. They also have plenty of precedent in
----------------------^?
> many other languages. Short start tags behave differently depending on
> context, and thus are not trivial to understand, are not conducive to good
> document maintenance and are not implemented in any non-SGML language I
> know of.
>
they both depend on state. and although i would suggest, off the top of my
head, that the amount of state required for short end tags is less than that
required for short start tags, that all depends on the storage semantics
implicit in ones xml parser/processor.
in my case, this relation holds and there would be two clear steps involved-
one notably larger than the other and a slippery slope by no means.
on the hand, given the absence of such a semantics for xml in general, any
consideration of the implications of minimization find themselves not on a
slipery slope but rather in an amorphous swamp...
bye for now,
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From bsteele at tdiinc.com Tue May 12 19:04:22 1998
From: bsteele at tdiinc.com (Bob Steele)
Date: Mon Jun 7 17:01:17 2004
Subject: Less verbose XML
References:
<3.0.2.32.19980511160727.00724810@pop1.sympatico.ca> <199805120047.UAA00374@unready.microstar.com>
Message-ID: <355880D2.94F60EA@tdiinc.com>
David Megginson wrote:
> Here are the same files compressed with gzip -9 (savings are relative
> to the uncompressed version with full end tags):
>
> With full end tags: 994,835 bytes (74.4% saving)
> With short end tags: 988,976 bytes (74.5% saving)
I suggest you also experiment with DTD's designed for the data transfer of
objects. With tall inheritance hierarchies in my samples, my generated XML
documents grow very large and very repetitive. This is exactly what compression
algorithms such as LZW are designed to eliminate. I have found compressions as
high as 700%. This does not mean that the resulting compressed documents are
small but it does help make practical the transfer of data in XML.
Note: The size of the document is influenced by my choice of element over XML
attribute encoding for simple types. This was done for symmetry.
bob
--
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From michael at textscience.com Tue May 12 20:58:51 1998
From: michael at textscience.com (Michael Leventhal)
Date: Mon Jun 7 17:01:17 2004
Subject: Announcement of a book for XML developers
References: <3.0.5.32.19970326203913.008ff790@192.168.1.1>
Message-ID: <35589860.95ED3C17@shell1.aimnet.com>
We have written a book directed to XML developers. A description of
it and the table of contents can be found below. The book is available
at Amazon and should be appearing in bookstores. It will also be
available in the GCA bookstore at the SGML/XML conference in Paris
next week.
Michael Leventhal
-----------------------------------------------------------------------
Designing XML Internet Applications
Michael Leventhal, David Lewis, Matthew Fuchs with contributions
from Stuart Culshaw and Gene Kan
582 pages, CD-ROM included.
Published by Prentice Hall PTR in the Charles F. Goldfarb Series
on Open Information Management, ISBN 0-13-616822-1.
"Designing XML Internet Applications" is divided into five parts.
In the first part we will introduce you to the XML universe.
Here you will find a discussion of the role of XML in the internet
and a quick-start on the XML recommendation and XML tools. We
don't assume prior knowledge of either XML or SGML but our task
here is not to provide an extended tutorial or reference on the
language syntax. What we do do is develop the perspective of the
XML internet application designer and provide any background
that is needed to comprehend the subsequent chapters.
The next three parts consist of a series of projects using XML
in actual internet applications. Working through the projects
the reader will gain concrete experience in the design of XML
applications, DTDs, and programming. We also delve into
standards related to XML and the internet wherever relevant.
The first project spans five chapters as the construction of
several types of components is involved including a bulletin
board, forms processing tools, a search engine, and
transformation filters.
Most of the work is done in Perl and the approach is less
rigorous than that used in subsequent projects. Our
intention here is to introduce XML programming in the most
simple and "exposed" form possible.
We have chosen to use Perl in this first part for various
reasons. It is the closest thing we know of to a lingua
franca for internet programmers, it is extremely compact
allowing us to construct complete examples in relatively
few lines of code, and, most significantly, Perl is the
most versatile XML scripting language.
The second project implements SGML/XML email and digs into
the topics of entity management, catalogs, MIME, and full-
scale SGML/XML parsing. Code is presented in Perl and
C++.
Lest the reader think we are Perl bigots the third project
plunges us into Java and XML, building an application based on
the Document Object Model and making use of a Java XML
parser API. Java is the language in which most of the new
XML internet infrastructure is being built.
The fifth and final section of the book takes a rigorous,
formal look at the role of XML in software architectures
and agents based on the paradigm of negotiation.
Full source code for all the projects has been included on
the CD-OM as have all the public domain tools used in the
book.
TABLE OF CONTENTS
PART I Internets, XML, and Tools
Chapter 1 Internets
1.1 Introduction
1.2 Why XML
1.3 Structure of the Book
1.4 Let's Talk: Internets are for Communicating
1.5 The Velocity of Information
1.6 Into the Smart Network
1.7 Current Approaches-Can the Web Help?
HTML - Java
1.8 Where the Web Needs Help
1.9 Beyond the Traditional Document
1.10 Toward the Active Document
1.11 Down to the Nitty-Gritty
1.12 What Do We Do with Documents?
1.13 DTDs and Content Specifications-A Short Excursion
Search - Retrieve - Store - Send/Receive -
Import/Export - Type Transformation -
Hyperlinking - Compare - Interpret - Define -
Create
1.14 Conclusion
Chapter 2 XML, Data, and Documents
2.1 XML-What It Is, What It Does, SGML Ancestry
SGML Essential Description - The XML Subset
and HTML - How XML Simplifies SGML - Valid
versus Well-Formed XML
2.2 XML and Data-Driven Architecture
2.3 XML and Documents
Using XML to Deliver Large and Complex
Documents Efficiently - Taming the Chaos -
Production of Multiple Information Products
from a Single Source - Reuse and Preservation -
Information Interchange Standards - Cost of
Production - Safety, Regulatory, and Other
Legal Documentation Requirements, Advanced
Hypertext, Collaborative Authoring?, Advanced
Information Management-Connecting Databases
Chapter 3 XML and SGML Tools
3.1 Tool Information Sources
The SGML/XML Web Page - The Whirlwind Guide
to SGML & XML Tools and Vendors - SGML Buyer's
Guide
3.2 Evolution of SGML and XML Tools
3.3 Software
Parsers - Programming Languages - Browsers -
Search Engines - Document and Component Management -
Authoring - Conversion, Capture, and Export of XML -
DTD Design Tools - Down Conversion from XML -
HyTime
3.4 DTDs as Development Resources
Part II Perl and XML
Chapter 4 The Desperate Perl Hacker and Internet
Applications: Overview
4.1 Apropos of Perl and the Desperate Perl Hacker
Java Versus Perl - Perl and XML Compliance
4.2 System Components
Applications - Functionality - Software
4.3 System Operation
Chapter 5 An XML Bulletin Board
5.1 Overview
5.2 XML Document Types
Messages - Password Docuemnt - User State Documents
5.3 Reading and Writing XML in the Bulletin Board
Writing Messages - Reading Messages - Password and
User State Documents - Transformation from XML to
HTML
Chapter 6 An XML Contact Database
6.1 Overview
6.2 XML Data Formats
6.3 Reading and Writing XML in the Customer Database
XMLForms - Using an XML Editor and CSS to Create
Contact Database Records -
Chapter 7 Structure-based Search
7.1 Overview: Structure- and Property-Driven Search
Search Tools in Internet Applications
7.2 sgrep's Query Language
A Web Interface to sgrep
Chapter 8 Type Transformation, Import, and Export
8.1 Overview
Import, Type Transformation, Export
8.2 Approaches to Transformation
Event Stream, Groves, DSSSL/XSL Transformation
8.3 Event Stream Transformation with Perl
Core Routine - Element-in-Context Subroutines? -
Generation of Subroutine Stubs - Bulletin Board
Type Transformation - Contact Database Type
Transformation - Bulletin Board Export to RTF -
Delayed Output and Forward References
Part III XML/SGML E-mail
Chapter 9 XML E-mail
9.1 Overview
9.2 Why XML/SGML is Hard to E-mail
9.3 Entity Catalogs
Entity Catalog Structure - Catalog Entry
Syntax - Building an E-mail Message from a
Catalog
9.4 MIME
Parts of a MIME message - Handling MIME
Messages
9.5 Building the SGMaiL Agent
9.6 The Sending Agent
Modifying SP for "createCatalog"
9.7 Parsing the Catalog and Creating the E-mail
Message
9.8 The Receiving Agent
9.9 Conclusion
Part IV XML and Java - Parsers and APIs
Chapter 10 XML Parsers and Application Programmer
Interfaces
10.1 Introduction
10.2 Parser Capabilities and Applications
Well-formedness and Validity Verification -
Document Instance Decomposition - Parser
Applications
10.3 XML Parser Interfaces
Command-Line and ESIS Event Stream
Interfaces - Event Callback Interfaces -
Object Model Interfaces
10.4 Sample XML Parsers
NXP XML Parser - AElfred XML Parser
10.5 The SAX Event Callback API
10.6 The W3C Document Object Model andAPI
Support and Implementation - Acquiring
Specifications - Overview of the W3C's
DOM Level-1 Interface
10.7 Sample DOM Implementation
DOM Interface Definition - DOM Interface
Implementations - Intergrating the DOM
Implementation - The XSpecViser Application
10.8 Chapter Summary
Part V Future - Agents and all that
Chapter 11 Input Gathering and Negotiation using XML
11.1 Introduction
The Ubiquitous Problem of Input Gathering -
Input Gathering and Negotiation - Negotiation
Processes from 20,000 Feet
11.2 Negotiation and Language-Agent Architectures
Negotiation Problem Specification -
Specification Problems and Language-Agent
Architecture - Optimal Specification
Environments
11.3 Description of Negotiation Problems
Negotiation Problem Output and Structure -
Negotiation Problem Output-Agreements -
Recurring Negotiation Problems - Output
Specification Languages - Constraints on
Valid Agreement - Practical Enforcement of
Output Constraints - Examples from Current
Systems Practice
11.4 Manufacturing Negotiator Behavior
Overview of Information Used by a
Negotiator - Introductory and Ancillary
Information - Information about Negotiable
Variables - Negotiation Structure -
Output Specification Language and Instance
Generation
11.5 Chapter Summary
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From M.H.Kay at eng.icl.co.uk Tue May 12 21:05:19 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun 7 17:01:17 2004
Subject: Announcing SAXON
Message-ID: <001b01bd7dd9$0337b680$0c11e391@mhklaptop.bra01.icl.co.uk>
With SAX now stable, I feel ready to expose SAXON to the
world.
SAXON is a Java package that provides a layer of services on
top of SAX. These services include:
* support for specific element handlers for each element
type, allowing your application to be more modular
* providing context information about each element, saving
your application from having to maintain this itself
* management of multiple output streams linked to particular
elements in the input document, useful when you are
splitting an XML document or loading the data into a
database
* an ActiveX interface to make it easy to do an HTML
rendition of an XML document from server-side VBScript code
on an ASP page
I have used SAXON to do a variety of XML->XML and XML->HTML
transformations. It includes a number of standard element
handlers that are convenient for doing common HTML
renditions as well as more advanced functions such as
auto-numbering and sorting.
More details, and downloadable software, on
http://home.iclweb.com/icl2/mhkay/saxon.html
Why "SAXON"? Well, because it's a layer on top of SAX. (And
originally it was closely linked to AElfred)...
Have fun.
Mike Kay, ICL
M.H.Kay@eng.icl.co.uk
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From alain at cabinfo.com Tue May 12 23:24:22 1998
From: alain at cabinfo.com (Alain DESEINE)
Date: Mon Jun 7 17:01:17 2004
Subject: innovation Partners and CEI announce IRIS XML EDITOR beta 1
Message-ID: <3558BD6E.3D86@cabinfo.com>
innovation Partners and CEI announce IRIS XML EDITOR beta 1
The first release (beta 1) of IRIS XML EDITOR is now available for
download. This first beta allow
the user to create XML projects based on a DTD generated (saved) with
our IRIS XML DTD GENERATOR
software. Futures version will permit to open DTDs that was not saved
with IRIS XML DTD GENERATOR.
So, if you have provide additionnal informations (like Hint,
Description, Toolbar, and icon) in IRIS
XML DTD GENERATOR, you'll get in IRIS XML EDITOR dynamics toolbar that
contains buttons which can
insert easily elements you describe in your DTD. You'll get dynamic
element wizzards too, if you
provide some attributes for elements. Obviously, IRIS XML EDITOR is a
full XML oriented editor
with many improved editing functionnalities like :
- Toggle between Upper and lower case with shift + F3
- Glossaries
- Full text search and replace
- Intelligent inserting of element
- multi pages editing windows to edit large collections of files
- etc.
You can download IRIS XML EDITOR from the following URL.
http://www.cabinfo.com/download.htm
We currently working on improving support of other DTDs (those not saved
with IRIS XML DTD GENERATOR,
please see the readme.txt file in the distribution for more informations
on how to use others
DTDs). We are also working on an internal XSL engine for visualisation
purpose. All this work will
appear in the next beta version (beta 2 or beta 3).
Well known in FRANCE for his HTML EDITOR ODYSSEE ( see
http://www.mondial-telecom.com/odyssee ),
Innovation Partners has start to think about XML many months ago. This
thought tell us to develop
3 XML software. The first one, IRIS XML DTD GENERATOR, is a tool for
building XML DTDs. The second
is IRIS XML EDITOR for editing XML files. And the third one is a XSL
stylesheet EDITOR for
presenting and processing XML files. These 3 softwares can naturally
work together, building a complete
sofware suite for XML editing. For limitations, supported
functionnalities, and bugs of beta
versions, please see the readme file of the downloaded software. Now
lets have a look to the
DTD functionnalities.
IRIS XML DTD GENERATOR
- Editing existing DTDs
- Creating DTDs
- Logical and Text view mode for editing DTDs
- Wizzard for creating ELEMENTS both in text an logical view
- Wizzard for creating ATTRIBUTES both in text an logical view
- Wizzard for creating ENTITIES both in text an logical view
- Wizzard for creating COMMENTS both in text an logical view
- Support of additionnals informations for IRIS XML EDITOR (saved as
comment in the DTD)
- Generation of XML DATA schemas
- Find an replace both in logical and text view
- Wizzard for creating xml standard attributes (like xml:lang for
example)
- support of XLL linking attributes
- Glossaries
- Printing a DTD
- multi language software (currently French and English only - You can
change the language in the file menu)
- Automatic save capabilities
- backup on save capabilities
- MDI environment
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
IRIS XML EDITOR
- Creating XML files according to a DTD generated by IRIS XML DTD
GENERATOR, or generated by something else.
- Project notion for managing XML file collections (like in the ODYSSEE
HTML EDITOR).
- glossaries
- Dynamic toolbars with generated icons that can insert ELEMENTS,
ATTRIBUTES, or ENTITIES you define in your DTD
- Dynamic Wizzard generated automaticly for ELEMENTS with attributes,
that ask user for informations.
- Text view for editing XML files
- Hierachical view for for editing the XML file (the tree object flow
view)
- Visualisation view for seeing the rendering of the XML file :
- If an XSL file is specified in your XML document these one is use to
generate the
visualisation view.
- if none is provide, a minimal built in style sheet is use to render
your document.
- Automatic save capabilities
- backup on save capabilities
- XML document printing
- XML Visualisation view printing
- FTP publication of XML files
- Specific Wizzard adding capabilities through an Standardize DLL API
(like in ODYSSEE HTML EDITOR)
- Friend application capabilities
- multi language software (currently French and English only - You can
change the language in the file menu)
- Find an replace both in tree and text view
- Insertion of XML Standard Attributes (like xml:lang)
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
IRIS XSL STYLER
- Creating XSL stylesheet
- full support of XSL proposal and future ones.
- glossaries
- Text view for editing XSL files
- Hierachical view for for editing the XSL rules
- Import a DTD to re-use its elements, attributes, etc. in the style
sheet
- Visualisation view for seeing the rendering produce with the XSL file
:
- You can provide your own XML file to see the rendering.
- if none is provide, a minimal built in XML file based on the
target-elements is used.
- Automatic save capabilities
- backup on save capabilities
- XSL document printing
- XSL Visualisation view printing
- Specific Wizzard adding capabilities through an Standardize DLL API
(like in ODYSSEE HTML EDITOR)
- Friend application capabilities
- multi language software (currently French and English only - You can
change the language in the file menu)
- Find an replace both in tree and text view
NOTE : All of these functionnalities are not implemented in the beta 1
release. This list describe
what the software can be for the version 1.00 release. Some of these
functionnalities will perharps
disappear for the final release.
Currently, only the IRIS XML DTD GENERATOR beta 1 and IRIS XML EDITOR
beta 1 are available.
IRIS XSL STYLER beta 1 will be available in June. All theses software
are commercial, so
they are copyright. The beta version of these software are freely usable
for evaluation purpose,
bugs report, and functionnality discussions. Once the final release
(Version 1.00) launch up, you
***MUST*** register the software if you want to use it.
For Bugs reports, Functionnalities discussion, technical questions, XML
and XSL discussion, you
can use the DTD GENERATOR mailling list, for details about the mailling
list, please see the readme
file include in the beta 1 distribution. You can download the Beta 1
distribution of IRIS XML
EDITOR at :
http://www.cabinfo.com/download.htm
Paris the May 4th 1998
Alain DESEINE, Technical contact.
********************************************************************
Innovation Partners
Commercial contact : M. Emmanuel CHAMBON innovationpartners@hol.fr
Technical contact : M. Alain DESEINE alain@cabinfo.com
http://www.mondial-telecom.com/odyssee
http://www.cabinfo.com
Commercial department
148, rue Boucicaut
92260 Fontenay-aux-Roses
Tel : 33 01 43 50 95 25 or 33 06 09 80 10 16 (GSM)
Postal Address :
9, bis rue des Besnards
92260 FONTENAY AUX ROSES
FRANCE
Tel : 33 01 41 41 00 89 or 33 01 41 41 91 91 or 33 06 09 80
10 16 (GSM)
Fax : 33 01 41 41 02 34
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Tue May 12 23:46:31 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:17 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
In-Reply-To: <355775A9.9358FC20@allette.com.au>
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net>
<3554E2E6.6E13B796@allette.com.au>
<3554FCF0.D08AB8FB@technologist.com>
<35562374.DF2B9B4A@allette.com.au>
<3556370D.FC18BF6E@technologist.com>
<3556B982.2A0C2B68@allette.com.au>
<3556E683.EC0BE1F1@technologist.com>
Message-ID: <3.0.1.16.19980512195759.09575568@pop3.demon.co.uk>
At 08:03 12/05/98 +1000, [lots of people] wrote:
[...lots of stuff about how nice it would/wouldn't be to have short end,
short start-tags, other revisions to the spec, etc... all snipped...]
Just in case any newcomer to XML-DEV is confused, short end-tags, short
start-tags, no tags at all, etc. are illegal in XML V1.0 and all conforming
parsers will throw well-formedness errors. There is - as far as I know - no
formal or informal note being taken of these discussions for any future
revision of the spec, whether or not such revision exists.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Tue May 12 23:51:58 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:17 2004
Subject: LISTRIVIA (Re: XML parser in C++)
In-Reply-To: <03d701bd7ced$502901c0$06001cac@mike>
Message-ID: <3.0.1.16.19980512195111.09076a24@pop3.demon.co.uk>
At 09:58 11/05/98 -0500, [an XML-DEV member] wrote:
[... helpful message snipped ...]
>
>Attachment Converted: "c:\eudora\attach\XMLparse.htm"
It's not a good idea to attach things to XML-DEV postings - they take up
bandwidth (costs me money) and eat up my disk space. Also they do terrible
things to the hypermail. I don't know if this was the problem, but my
browser went a horrid shade of turquoise on reading this message. (It
didn't actually throw a Draconian error, but I think it came close).
P.
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Tue May 12 23:52:38 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:17 2004
Subject: freewares..
In-Reply-To: <3.0.5.32.19980511194649.00808cf0@blr-nt-mail1.verifone.com
>
Message-ID: <3.0.1.16.19980512194742.09078402@pop3.demon.co.uk>
At 19:46 11/05/98 +0500, Rajesh N wrote:
>Hi all,
> could anyone give me some pointers regarding the availability
>of any freewares with/without source code for the following..
Rajesh,
There are now a *lot* or very good resources describing XML software - try
Robin Cover's XML page to start (www.sil.org/sgml/xml.html) It will give
you a better idea than getting random (or null) replies here and has *all*
the pointers to other resources.
P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From peter at ursus.demon.co.uk Tue May 12 23:57:08 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun 7 17:01:17 2004
Subject: Announcing SAXON
In-Reply-To: <001b01bd7dd9$0337b680$0c11e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.1.16.19980512200245.0957c554@pop3.demon.co.uk>
At 20:06 12/05/98 +0100, Michael Kay wrote:
>With SAX now stable, I feel ready to expose SAXON to the
>world.
[... details snipped...]
>
>Mike Kay, ICL
>M.H.Kay@eng.icl.co.uk
Thanks very much Mike - looks great fun - haven't downloaded it yet.
This interoperability of tools developed in a virtual environment is
exactly our original vision of one way that XML-DEV might work. It is
extremely gratifying to see it happen.
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From jeremy at omsys.com Wed May 13 03:15:52 1998
From: jeremy at omsys.com (Jeremy H. Griffith)
Date: Mon Jun 7 17:01:17 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
In-Reply-To: <35586A0E.4A785960@mecom.mixx.de>
References: <35586A0E.4A785960@mecom.mixx.de>
Message-ID: <355ff2c4.244161315@mail.together.net>
On Tue, 12 May 1998 17:26:09 +0200, james anderson
wrote:
>maybe i've been writing lisp too long, and i've even come to trust my text
>editor's ability to balance parens, but even before the days of emacs and co.,
>reading (and indenting, if need be) for balancing became second nature.
>i won't repeat paul graham's argument from "on lisp" here, but it came down to
>"you end up reading shapes, not parentheses. if you get to need to count the
>parentheses, the cause is lost anyway..."
>besides, the editors i've seen for c and java these days not only blink
>braces, they even display them in pretty colors.
>shouldn't one expect the same of xml editors - to the extent that they're not wysiwyg?
Wouldn't that be nice! Of course, asking an editor to balance based
on string comparison of the tag names isn't quite as easy as matching
parens... Too bad XML was defined like this:
...
instead of this:
< ... >
or even this:
< ... >
which would work with every programming editor around, right now...
--Jeremy H. Griffith
http://www.omsys.com/jeremy/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From h.rzepa at ic.ac.uk Wed May 13 09:30:01 1998
From: h.rzepa at ic.ac.uk (Rzepa, Henry)
Date: Mon Jun 7 17:01:17 2004
Subject: LISTRIVIA (Re: XML parser in C++)
In-Reply-To: <3.0.1.16.19980512195111.09076a24@pop3.demon.co.uk>
References: <03d701bd7ced$502901c0$06001cac@mike>
Message-ID:
>At 09:58 11/05/98 -0500, [an XML-DEV member] wrote:
>
>[... helpful message snipped ...]
>>
>>Attachment Converted: "c:\eudora\attach\XMLparse.htm"
>
>It's not a good idea to attach things to XML-DEV postings - they take up
>bandwidth (costs me money) and eat up my disk space. Also they do terrible
>things to the hypermail. I don't know if this was the problem, but my
>browser went a horrid shade of turquoise on reading this message. (It
>didn't actually throw a Draconian error, but I think it came close).
Attachments (and v-cards) are normally bad form for lists. Majordomo
does handle the attachment, but Hypermail (ancient as it is) does not.
More modern combinations might make a better job, but I would not
recommend using this list to send attachments (not modem friendly for
one).
If there is a real need to exchange such things, probably the easiest solution
is to mount your document on a server (ftp, http, etc) somewhere, and
just send the list the URL.
Dr Henry Rzepa, Dept. Chemistry, Imperial College, LONDON SW7 2AY;
mailto:rzepa@ic.ac.uk; Tel (44) 171 594 5774; Fax: (44) 171 594 5804.
URL: http://www.ch.ic.ac.uk/rzepa/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 13 13:53:59 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:17 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net>
<3554E2E6.6E13B796@allette.com.au>
<3554FCF0.D08AB8FB@technologist.com>
<35562374.DF2B9B4A@allette.com.au> <199805110611.CAA00265@unready.microstar.com>
Message-ID: <3559889A.C7C59C7B@technologist.com>
David Megginson wrote:
>
> Paul Prescod has quite rightly objected to a simplistic slippery-slope
> argument against short end tags. I have thought of what might be a
> stronger argument: as an (unstated?) design principle, XML provides
> exactly one alternative for nearly every markup item.
I think that you are correct that this is a stronger point. But XML does
allow alternative encodings. Consider defaulted attributes. Or
vs. . Consider < vs. < . vs. .
Each of these was consciously added to XML as a usability feature.
Nevertheless all of them cause implementors more work. But in my mind, not
one of them has the cost to implementors vs. benefit to users ratio of
short end tags.
Tim is right that the critical issue is the ease of processing of XML
software by "stupid" (regexp-based) software. He feels that this situation
has changed since we debated it. I don't agree. I thought that it was
evident then that there would be parsers for XML for Perl, Python and any
other regular-expression friendly. At the very least perlsgml, sgmlspl and
sgmllib would always exist.
Nevertheless the situation has changed in some ways. XML is now widely
hyped and destined for success. It is far more popular with implementors
than with users. This is the right time to try to rebalance the scales so
that XML becomes a megahit and not a hacker favourite.
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
Can we afford to feed that army,
while so many children are naked and hungry?
Can we afford to remain passive,
while that soldier-army is growing so massive?
- "Gabby" Barbadian Calpysonian in "Boots"
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Wed May 13 13:55:09 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:18 2004
Subject: A little wish for short end tags (Was: RE: SDD bogus)
References: <35586A0E.4A785960@mecom.mixx.de>
Message-ID: <35598924.AC686153@technologist.com>
> Bryan Gilbert wrote:
> >
> > Yes don't forget that XML is meant
> > for Humans too. Short end tags are a pain
> > when there is lots of intermediate text.
james anderson wrote:
>
> maybe i've been writing lisp too long, and i've even come to trust my text
> editor's ability to balance parens, but even before the days of emacs and co.,
> reading (and indenting, if need be) for balancing became second nature.
I think that even more important than this argument is that nobody is
proposing mandatory short end tags. XML with optional short end tags
offers the advantages of languages with uniform, short end markers but
also allows you to "be redundant" where that will help. I've proposed in
the past that full SGML should take optional redundancy farther to allow
something like this:
\n:\1\n:g;
# join hyphenated lines, take out hardreturns, replace multispaces with one
space.
# s:-\n::g;
s/\n+/ /g;
s/( )+/\1/g;
# removes spaces after and before tags.
s/> />/g;
s/ ::g;
s:::g;
s:::g;
s:::g;
s:::g;
s:::g;
s:::g;
# adds space after emphasis and s-script tags.
s:():\1 :g;
s:(): \1:g;
s:():\1 :g;
# gets the entities right.
s:°ree;:°:g;
s:&ree;:&:g;
s:@reg;:®:g;
# joins seqlists broken by paras.
# s:::g;
# dehyphenates mistakenly broken paras.
s:-?([a-z]):\1:g;
# puts seqlists inside the previous para.
s#(:|.)[\032]*()#\1\2#g;
# turns emphasis tags just inside paras into titles of the previous para0.
# works on nested emphasis tags as well, to two levels.
s#([0-z
\-\(\)/]*)\. #\2.#g;
s#([0-z \-\(\)/]*)\. #\2.#g;
# puts in numstyle attrib. in seqlists.
if(/<\/ITEM>[^<>\/]*<\/PARA><\/ITEM>[^<>/]*)<\/NOTE>/) {
s:([^<>/]*)([^<>/]*)+():\1\3\2:g;
}
if(/<\/FIGURE><\/CAUTION>/) {
s:([^<>/]*)([^<>/]*)+():\1\3\2:g;
}
if(/<\/FIGURE><\/WARNING>/) {
s:([^<>/]*)([^<>/]*)+():\1\3\2:g;
}
# puts untagged FIGURE inside tags.
if(/FIGURE [0-9 \-\.]* [^<>]*<\/PARA>/) {
s:FIGURE ([0-z\-\.]*\.) ?([^<>]*):\2:g;
}
# check for FIGURES with part of the label inside the title.
if(/[0-9\.]+ /) {
s:()([0-9\.]+) :\1\2\4\3:g;
}
# puts cautions, warnings, notes inside the previous item tag.
if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA>/) {
s:()([^<>]*):\2\1:g;
}
if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA>/) {
s:()([^<>]*):\2\1:g;
}
if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA>/) {
s:()([^<>]*):\2\1:g;
}
if(/">[A-Z \-\/\(\)\.]*\. /) {
s:(">)()([A-Z \-\/\(\)\.]*\.) :\1\3\2:g;
}
# joins seqlists that were created by bad tagging of cautions, warnings,
and notes.
if(/<\/CAUTION><\/ITEM><\/SEQLIST><\/PARA>):\1:g;
}
if(/<\/NOTE><\/ITEM><\/SEQLIST><\/PARA>):\1:g;
}
if(/<\/WARNING><\/ITEM><\/SEQLIST><\/PARA>):\1:g;
}
# fix erring para0s (figures, lb-in.)
s:(lb in.):\1
\2:g;
s:(lb
in.):\1 \2:g;
s:(lb in.):\1
\2:g;
s:(lb
in.):\1 \2:g;
s:figure:figureS \1:g;
s:figure:figureS \1:g;
s:figure:figureS \1:g;
s:figure:figureS \1:g;
s:figure:figureS \1:g;
# change the dtd header from docgasturb to dcgastep -- for meter books
s:\[:[]>:;
s:([^YH])*::;
print;
}
The bitch of it is, this script worked fairly well and saved us enormous
amounts of time.
I hope that this ugliness demonstrates that Perl and SGML-like text files
can allow a complete neophyte to do wonders, and that any hifalutin changes
to the relative simplicity of the current XML spec would be detrimental to
the average 'frustrated perl programmer'.
Steve
(I'm so ashamed)
--
"All the good geek things, schampeo@hesketh.com
only without all the http://a.jaundicedeye.com
bad geek things." http://hesketh.com/schampeo/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From papresco at technologist.com Sun May 17 21:39:29 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun 7 17:01:24 2004
Subject: A little wish for short end tags
References: <199805161649.JAA27414@boethius.eng.sun.com>
<3.0.5.32.19980516161020.00995ea0@wasabi.hesketh.com> <3.0.5.32.19980517123239.00afcde0@wasabi.hesketh.com>
Message-ID: <355F3D36.AADDE804@technologist.com>
Sorry, Steven. I still don't understand. The output of the scripts was
irregular, but it didn't use any of SGML's hundreds of "hard" features
like entities, whitespace in tags etc. The data may not have been
normalized in any formal sense, but it used a predictable set of SGML's
easier features. This is "good enough" to allow processing with regexps
and basically partial normalization. If the data had used entities,
whitespace in tags, comments etc., then I would say it was completely
unnormalized.
But anyhow, I don't understand how you can point to your *success* at
taming a system built around a language with hundreds of minimizations as
proof that one of those minimizations should not be allowed in another
language. Had you failed, solely because of short end-tags, that would
have been a persuasive argument. My belief is that that would not happen,
because there are so many other factors. Most data falls either into the
category of predictable and simple (which yours seems to have) or
unpredictable and requiring normalization (which most data authored by
people with text editors will look like -- short end-tags or not).
Paul Prescod - http://itrc.uwaterloo.ca/~papresco
"A writer is also a citizen, a political animal, whether he likes it or
not. But I do not accept that a writer has a greater obligation
to society than a musician or a mason or a teacher. Everyone has
a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From digitome at iol.ie Mon May 18 12:58:20 1998
From: digitome at iol.ie (Sean Mc Grath)
Date: Mon Jun 7 17:01:24 2004
Subject: A little wish for short end tags
Message-ID: <199805181058.LAA12969@GPO.iol.ie>
[Jon Bosak]
>If you really like using shortcuts, then go all the way: get a genuine
>SGML tool and define a DTD that allows not just end-tag minimization
>but full omission of both start-tags *and* end-tags. Knock yourself
>out. Just make sure to normalize the result before you call it XML
>and ship it out to the rest of us to work with.
I just had to reply and express my wholehearted aggreement with Jon's
posting. SGML and SGML power tools can make excellent XML production
systems.
James Clarks SGML to XML tool - SX - for example is built on top of the
core SP library and thus you get basically fully blown SGML power.
All the minimization power you can shake a stick at.
Sean
Sean Mc Grath
http://www.digitome.com/sean.htm
County Sligo, Ireland, Tel: +353 96 47391
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tyler at infinet.com Mon May 18 14:32:45 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun 7 17:01:24 2004
Subject: question: NodeIterator in DOM
References: <199805151519.RAA02352@hs2114.wdf.sap-ag.de>
Message-ID: <35602AC2.60993E3A@infinet.com>
Akitoshi Yoshida wrote:
> Hi,
> Another question regarding NodeIterator:
> You have an NodeIterator instance with the iterator
> position set somewhere in the middle. When you
> remove three nodes: first, the node just after the
> iterator position, second the node after this removed
> node, and finally the node just before
> the original iterator position,
> then what should toNextNode() and toPrevNode() return?
Essentially a similiar problem occurs in the JDK 1.2 Collection classes. What
JavaSoft suggests is that for all Iterators (a new interface in the JDK 1.2
Collection classes) you synchronize on the object being iterated so that no
internal changes to the object's state are made while iterating. Since DOM is
supposed to be architectural and language neutral, languages which lack
synchronization support may find some difficulty in making sure that an object
which returns an Iterator object is not mutated while iteration is occurring in a
multithreaded environment. If your application is single threaded, you should
not have to worry about your object (such as a List) that returns the Iterator
object being mutated except for programming error where you add and remove
objects to the object returning the Iterator object while iterating. Sorry to be
so convoluted here but I got 3 hours of sleep last night so I hope the above made
sense.
Tyler
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Amitr at abinfosys.com Mon May 18 16:42:46 1998
From: Amitr at abinfosys.com (Amit)
Date: Mon Jun 7 17:01:24 2004
Subject: Loading an External DTD for an XML using MSXML
Message-ID: <9805181515.AB25790@del2.vsnl.net.in>
Hello All,
I am invoking the MSXML java parser through the XMLDSO class which I am
embeding in the