.
They're meant to show that there is an infinite variety of meanings
available. I deliberately picked a couple of examples from outside
HTML to show that meaning was more than structural rules (a can
contain a ) or a tiny set of presentational rules (this is block
text, this is a link).
> As in all human communication (which includes XML or HTML no matter
> it's bias towards easy computer recognition of content), I think
> there's an assumed common referent for things like pliars and
> citations and such. Any marked departure from such common referents
> reduces the ability to communicate imho.
Absolutely correct, but how can we capture those referents for
automatic discovery?
> I don't think any markup language is, or has the potential to be, a
> universal language. It does offer the opportunity, within a
> community of interest, for people to agree on the meaning and
> structure of information that's relevant to that domain and for
> people who become interested in that domain to rapidly become
> fluent in the vocabulary of that domain.
Yes, this is my original point. Any given group of users can agree on
a severly restricted universe of meanings, but we're dealing with the
problem of blind exchange, where the receiver does not necessariily
belong to the same group as the sender: if I send you an arbitrary
chunk of XML, how do you figure out what to do with it? My claim is
that the problem is not solvable in the general case.
> I don't think it's appropriate for any single organization to attempt to
> set definitions (i.e. specify namespaces) for particular domains of
> interest. I see that as an exercise best left to those who know whatever
> particular domain they are creating a tagset/namespaces for.
That was my point, more or less.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ann at webgeek.com Mon Sep 6 16:36:20 1999
From: ann at webgeek.com (Ann Navarro)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
In-Reply-To: <37D35119.D05739D3@pacbell.net>
References: <4.1.19990905155519.0409f3a0@mail.webgeek.com>
Message-ID: <4.1.19990906103647.0409d430@mail.webgeek.com>
At 10:28 PM 9/5/99 -0700, David Brownell wrote:
>I hope this begins to reflect some kind of consensus -- "First,
>do no harm".
As always, this only reflects my personal thoughts and opinions, unless
clearly and explicitly stated otherwise.
Ann
---
Author of Effective Web Design: Master the Essentials
Coming in September --- Mastering XML
Founder, WebGeek Communications http://www.webgeek.com
Vice President-Finance, HTML Writers Guild http://www.hwg.org
Director, HWG Online Education http://www.hwg.org/services/classes
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ann at webgeek.com Mon Sep 6 16:31:45 1999
From: ann at webgeek.com (Ann Navarro)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
In-Reply-To: <3.0.1.32.19990905165428.0187fd80@mail.accessone.com>
References: <14290.56269.189604.432980@localhost.localdomain>
<4.1.19990905155519.0409f3a0@mail.webgeek.com>
<4.1.19990905155519.0409f3a0@mail.webgeek.com>
Message-ID: <4.1.19990906102427.04097280@mail.webgeek.com>
At 04:54 PM 9/5/99 -0700, David LeBlanc wrote:
>I don't think it's appropriate for any single organization to attempt to
>set definitions (i.e. specify namespaces) for particular domains of
>interest. I see that as an exercise best left to those who know whatever
>particular domain they are creating a tagset/namespaces for.
And this doesn't suggest that.
Leaving assertions of anthromorphization aside:
In today's Web browser, there is built-in "knowledge" of what HTML elements
are supposed to be and do. They have been programmed to recognize a
element as a new structural block, presented with a leading line space,
text flush left, unless otherwise indicated in a style sheet or align
attribute. Simple, primarily presentation information, but it's
preprogrammed in how to deal with such elements. That it doesn't recognize
and act upon the designed content model for P or UL or anything else, was a
design decision of the *browser* programmer, one that could be questioned.
Allowing for discovery, based on a schema, DTD, or whatever other defining
mechanism is provided, lets tomorrow's Web browser have that same
"knowledge" of a element, and any other knowledge deemed necessary or
prudent by those who will be using said element.
No one is suggesting we'll be creating Commander Data by doing this. Nor is
it "will o' the wisp".
I am not a software architect, I do not pretend to be one. Yet this is not
an idea that is foreign to many who are, and even thought of as necessary
by some of those.
The "rest of the world", as it were, is trying to do this already. By all
the misunderstandings that arise *outside of this forum* about the intent
and usefulness of namespaces, it should be evident to everyone that there
is a desire for such abilities, even if the more academic of us look down
on it as ill-informed.
It's certainly not something that should be laughed off the list.
Ann
---
Author of Effective Web Design: Master the Essentials
Coming in September --- Mastering XML
Founder, WebGeek Communications http://www.webgeek.com
Vice President-Finance, HTML Writers Guild http://www.hwg.org
Director, HWG Online Education http://www.hwg.org/services/classes
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Mon Sep 6 17:09:18 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
In-Reply-To: <4.1.19990906102427.04097280@mail.webgeek.com>
References: <14290.56269.189604.432980@localhost.localdomain>
<4.1.19990905155519.0409f3a0@mail.webgeek.com>
<3.0.1.32.19990905165428.0187fd80@mail.accessone.com>
<4.1.19990906102427.04097280@mail.webgeek.com>
Message-ID: <14291.55243.697468.921617@localhost.localdomain>
Ann Navarro writes:
> Allowing for discovery, based on a schema, DTD, or whatever other defining
> mechanism is provided, lets tomorrow's Web browser have that same
> "knowledge" of a element, and any other knowledge deemed necessary or
> prudent by those who will be using said element.
But what would it discover? Stylesheets can already tell you how
should be rendered in a browser. What other kind of information
could reasonably be discovered in the general case besides structural
validation rules and possibly simple data-typing? Neither of those is
going to tell my application much about if it doesn't know about
already.
> The "rest of the world", as it were, is trying to do this
> already. By all the misunderstandings that arise *outside of this
> forum* about the intent and usefulness of namespaces, it should be
> evident to everyone that there is a desire for such abilities, even
> if the more academic of us look down on it as ill-informed.
I'm still confused about exactly what 'this' is. *What* is the rest
of the world trying to do? I'm not asking for a software-architect's
perspective, but just for a clear requirement.
Perhaps an example would help. Here's an XML document (without
Namespaces for now):
112
g
xxx
Now, imagine that my XML application has just received this piece of
XML and knows nothing about the markup language used. What kind of
information should it be able to discover automatically, so that it
can process this document usefully?
All the best,
Daivd
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From brendan at w3s.ie Mon Sep 6 17:48:33 1999
From: brendan at w3s.ie (Brendan McKenna)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
In-Reply-To: Your message of "Mon, 06 Sep 1999 11:10:58 -0400."
<14291.55243.697468.921617@localhost.localdomain>
Message-ID: <199909061550.QAA27493@ns.w3s.ie>
Hi,
David Megginson wrote:
: Perhaps an example would help. Here's an XML document (without
: Namespaces for now):
:
:
: 112
: g
: xxx
:
:
: Now, imagine that my XML application has just received this piece of
: XML and knows nothing about the markup language used. What kind of
: information should it be able to discover automatically, so that it
: can process this document usefully?
:
I would say that that depended on the application. However, a
'general' XML browser -- in the sense that Netscape or IE are 'general' HTML
browsers -- would need to be able to discover a means of validating the
document -- whether it's one or more Schema's or DTD's is irrelevant -- and a
stylesheet for displaying it, or take some 'appropriate' action (such as
displaying the source in a 'raw' form) in the absence of one.
Brendan
--
Brendan McKenna
Technical Director Phone: +353-(0)61-338177 x4143
W3 Services Ltd. Fax: +353-(0)61-338065
Innovation Centre Email: brendan@w3s.ie
National Technological Park
Limerick
Ireland
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Mon Sep 6 17:51:55 1999
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
Message-ID: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
At 10:28 PM 9/5/99 -0700, David Brownell wrote:
>I hope this begins to reflect some kind of consensus -- "First,
>do no harm". The XHTML spec will be helpful without addressing
>the namespace issue -- a straight XML 1.0 application. Although
>I'd far prefer to see a single namespace defined for (X)HTML, it
>could wait for a while yet.
No it can't! The damage is already starting to creep in - for example,
look at IE5, which recognizes HTML tags using the hardwired prefix
"html:" - something any moron can see is fragile and broken. But we
can't reasonably get mad at Microsoft because the W3C just can't be bothered
to define a name for HTML, something that would take about 15 minutes
once they realized it was worth doing. Yes, I've raised this point
repeatedly within the W3C - I don't know what the problem is.
Maybe it's as Ann points out - since we haven't yet built a universal
related-package-of-truth discovery mechanism for XML, they're going to
punish us by denying us the limited (but real) benefits of a robust,
reliable way to distinguish which tags are HTML and which aren't.
Dammit, if the W3C obstinately refuses to give HTML a name for
programmers to hang their hats on, I'm going to start a campaign right
here in xml-dev for *us* to pick something, it'll be easier than
designing SAX. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Mon Sep 6 18:52:12 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:41 2004
Subject: an unfilled need
Message-ID: <004f01bef889$1f6763b0$5bf96d8c@NT.JELLIFFE.COM.AU>
From: Arjun Ray
>On Sun, 5 Sep 1999, Ann Navarro wrote:
>
>> That need is: a formal deterministic means of discovery.
>
>That's an AI problem at root, and for now, a will-o'-the-wisp.
I think people have misread Ann's comment.
She is asking "for a means of discovering what belongs
with that name, it's associated definitions, semantics, and other data
that
may be necessary to complete their operations."
In other words, what declarations does this name appear in? What
schema does this name participate in?
This seems a reasonable thing to ask.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Patrice.Bonhomme at loria.fr Mon Sep 6 18:56:31 1999
From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme)
Date: Mon Jun 7 17:14:42 2004
Subject: XML Database/Repository with content indexing ?
Message-ID: <199909061659.SAA05197@chimay.loria.fr>
I was wondering if it exists some XML database system that provide a content
(full text) indexing mechanism. This system should be able to answer to this
query : give me all documents that contains the expression "big blue" within
their "/TEI.2/teiHeader/*/title" element(s).
Thanks for all informations and pointers
Pat
--
==============================================================
bonhomme@loria.fr Tel : 03 83 59 30 52 / 06 11 34 03 85
http://www.loria.fr/~bonhomme Office : B.228
--------------------------------------------------------------
* Serveur Silfide : http://www.loria.fr/projets/Silfide
==============================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at prescod.net Mon Sep 6 18:56:42 1999
From: paul at prescod.net (Paul Prescod)
Date: Mon Jun 7 17:14:42 2004
Subject: Crazy idea
References: <02ba01bef84e$1041ae30$4602a8c0@capella.co.il>
Message-ID: <37D3EF98.BB075628@prescod.net>
This "stylesheet as schema" idea comes around every so often but I think
that it has one major flaw: stylesheets cannot drive syntax directed
editors.
> - XSLT is be "too strong". After all, it is Turing complete.
>
> XSchemas are already stronger then DTDs, for a good reason, so this might be
> an advantage. Otherwise, it might be useful to define a subset of XSLT to
> use in "schema" stylesheets.
>From a mathematical point of view, I don't think that XSchemas are much
stronger than DTDs. The set of tag-based languages they can describe are
pretty much the same. The XSLT set of languages would be radically
different. What's the XSLT equivalent for this content model:
((a*,(b|c)+,d)+|(d,(b|c)*,d?)+)
On the other hand there are constraints that XSLT could support that
schemas probably could not.
Paul Prescod
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From matthew at praxis.cz Mon Sep 6 19:04:04 1999
From: matthew at praxis.cz (Matthew Gertner)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
References: <14290.56269.189604.432980@localhost.localdomain>
<4.1.19990905155519.0409f3a0@mail.webgeek.com>
<3.0.1.32.19990905165428.0187fd80@mail.accessone.com>
<4.1.19990906102427.04097280@mail.webgeek.com> <14291.55243.697468.921617@localhost.localdomain>
Message-ID: <37D3F46B.EAC60CBD@praxis.cz>
> But what would it discover? Stylesheets can already tell you how
> should be rendered in a browser. What other kind of information
> could reasonably be discovered in the general case besides structural
> validation rules and possibly simple data-typing? Neither of those is
> going to tell my application much about if it doesn't know about
> already.
Wasn't this exactly the point of Jon Bosak's by-now-classic paper about XML and Java? Maybe
this has been generally written off as shameless promotion of Sun's interests, but if so this
seems like a shame to me, since the idea is absolutely brilliant. There should be a way to
associate tags with Java classes (and preferably a generic mechanism that at least opens up
the possibility of associating them with code of any kind). The class in question could be
determined in a number of ways (stylesheet, schema, whatever) and would then consume the
contents of the element, doing something useful. The nice thing about general-purpose
programming languages is that this something could be absolutely anything.
Matthew
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From matthew at praxis.cz Mon Sep 6 19:16:16 1999
From: matthew at praxis.cz (Matthew Gertner)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
Message-ID: <37D3F74D.AA84B8FF@praxis.cz>
> No it can't! The damage is already starting to creep in - for example,
> look at IE5, which recognizes HTML tags using the hardwired prefix
> "html:" - something any moron can see is fragile and broken. But we
> can't reasonably get mad at Microsoft because the W3C just can't be bothered
> to define a name for HTML, something that would take about 15 minutes
> once they realized it was worth doing. Yes, I've raised this point
> repeatedly within the W3C - I don't know what the problem is.
On reading this I can't resist the temptation to raise once again the issue of W3C's
confidentially rules, because this is a perfect example of the danger of this policy. On the
one hand, after countless discussions on this topic with countless knowledgeable people of
various backgrounds and viewpoints, I still can't for the life of me understand why this
policy exists. The argument about information overload is convincing in the context of giving
open access to voting on W3C recommendations and the like, but certainly doesn't preclude
giving everyone access to the minutes of working group meetings and standards (sorry,
recommendations) in progress. The argument that this will confuse people since the works in
progress change so often is clearly bogus: in that case W3C members are spending good money
(not to mention significant amounts of time) to be befuddled. The argument that members are
paying money to be given advance access to this information is frightening and not obviously
sensible. The competitors these companies might be concerned about are presumably all members
as well.
(The uncharitable view is that the primary motivation is to give members an elitist buzz every
time they say: "Of course, I can't go into detail about this", but who could believe that?)
There is, however, a very strong argument for changing this policy. Judging by recent
discussion on this list, the W3C is losing some of its implicit authority in the XML
community. No one objected to the W3C controlling XML at the onset because it was far from a
foregone conclusion that XML was win over a number of plausible (but in retrospect clearly
inferior) approaches. Now XML is mainstream and this no longer flies. Lack of complete buyin
(not to mention open hostility) from XML developers is certainly not in the W3C's interest,
and only opens the way for Microsoft and other major players to step in with their own
proprietary (and inevitably less well thought out) approaches.
I must be missing something. Can someone please set me straight?
Matthew
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From czinkos at mail.matav.hu Mon Sep 6 19:18:28 1999
From: czinkos at mail.matav.hu (Zsolt Czinkos)
Date: Mon Jun 7 17:14:42 2004
Subject: XML Database/Repository with content indexing ?
References: <199909061659.SAA05197@chimay.loria.fr>
Message-ID: <37D3F895.5BBF25D2@mail.matav.hu>
Patrice Bonhomme wrote:
>
>
>
> I was wondering if it exists some XML database system that provide a content
> (full text) indexing mechanism. This system should be able to answer to this
> query : give me all documents that contains the expression "big blue" within
> their "/TEI.2/teiHeader/*/title" element(s).
>
> Thanks for all informations and pointers
I've read that Oracle8i can do it. See http://www.oracle.com .
Best,
Zsolt Czinkos
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Daniel.Brickley at bristol.ac.uk Mon Sep 6 19:21:39 1999
From: Daniel.Brickley at bristol.ac.uk (Dan Brickley)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
In-Reply-To: <37D3F46B.EAC60CBD@praxis.cz>
Message-ID:
On Mon, 6 Sep 1999, Matthew Gertner wrote:
> > But what would it discover? Stylesheets can already tell you how
> > should be rendered in a browser. What other kind of information
> > could reasonably be discovered in the general case besides structural
> > validation rules and possibly simple data-typing? Neither of those is
> > going to tell my application much about if it doesn't know about
> > already.
>
> Wasn't this exactly the point of Jon Bosak's by-now-classic paper about XML and Java? Maybe
> this has been generally written off as shameless promotion of Sun's interests, but if so this
> seems like a shame to me, since the idea is absolutely brilliant. There should be a way to
> associate tags with Java classes (and preferably a generic mechanism that at least opens up
> the possibility of associating them with code of any kind). The class in question could be
> determined in a number of ways (stylesheet, schema, whatever) and would then consume the
> contents of the element, doing something useful. The nice thing about general-purpose
> programming languages is that this something could be absolutely anything.
While having a mechanism for relating classes (eg.
java:com.goodwebguide.rating) with Web namespaces
(eg. http://rating.goodwebguide.com/ns1/) would doubtless be handy,
this just punts the hard problem into a different arena. Instead of
software agents puzzling over what arbitrary XML data structures might
be telling us, or useful for, they'll be puzzling over what arbitrary
Java bean components can do for us. Mapping into beans is interesting
(eg. we for example get the notion of get/set'able bean properties),
but without taxonomies of java class behaviours, we're just as lost
when dealing with an unknown java classes as with unknown XML data
structures... I don't see any principled difference between these two
problem scenarios: in both case you've got some previously unencountered
Web resource (a URI-nameable thingy) and you want to know what it's
useful for.
Dan
--
Daniel.Brickley@bristol.ac.uk
Institute for Learning and Research Technology http://www.ilrt.bris.ac.uk/
University of Bristol, Bristol BS8 1TN, UK. phone:+44(0)117-9287096
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Mon Sep 6 19:26:10 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:42 2004
Subject: Crazy idea
Message-ID: <009801bef88d$c9320bd0$5bf96d8c@NT.JELLIFFE.COM.AU>
From: Oren Ben-Kiki
>A DTD/XSchema, as defined today, performs two tasks. One is to
validate a
>document. The output of this is a single Boolean value. The other is to
>"complete" the document - that is, add defaults.
>
>The latter part seems like a transformation task to me, which raises
the
>question: why not specify this as an XSLT stylesheet? Once you consider
>this, it becomes obvious that the first part can also be specified as a
>stylesheet, given that one introduces some way for a stylesheet to
indicate
>an error in the input (say, ). Actually, that's a good idea
by
>itself. I already simulate it in my stylesheets by emitting an
obtrusive
>error boilerplate, but that's a kludge.
>How about it?
I have a note on "Using XSL as a Validation Language" at
http://www.ascc.net/xml/en/utf-8/XSLvalidation.html also published in
July Interchange magazine (ISUG). The implications of XPath are
sketched out in the note "Axis Models and Path Models" at
http://www.ascc.net/xml/en/utf-8/validaxis.html
Francis Norton has a subsequent article "Generating XSL for Schema
Validation" at http://www.redrice.com/ci/generatingXslValidators.html
which gives an XSL stylesheet for generating an XSL validator from a DCD
schema. (Francis has worked further on this line, and may update that
article and the software sometime, I believe.)
I think it is worthwhile to raise the possibility that validation is not
really a binary function, though it is convenient to speak of it as such
for rhetorical purposes. In fact, a validator that merely says "valid"
or "invalid" is almost useless (not quite: it is still useful for giving
structural preconditons that a progammer can use to reduce the number of
cases their program has to deal with) and certainly unacceptable for
users: it is the diagnostic information which is the useful thing.
Rather than considering the diagnostics as nice side-effects that a
validator may do, by using XSL we can make the diagnostic information
the centre of attention for validation.
This has a big impact on schemas: rather than the schema being a passive
description, it should become a far more user-friendly and active thing:
for example, a schema language might require that repeated groups of
elements in content models should be given a name, which diagnostic
messages can give to the user to help them. Or the schema could include
declarations for error-recover from validity errors: if a certain
element is found out of context, then repair it or sanitize it. I don't
see why a schema should not include such things.
For anyone interested, there are also other articles on various
alternatives to DTDs and content models at
http://www.ascc.net/xml/en/utf-8/schemas.html If anyone is going to the
APWeb99 conference in Hong Kong later this month, I am giving a paper on
this subject. Anyone in Taiwan can contact Academia Sinica Computing
Centre; we are giving XML workshops that include the topic "Using XSL
for Validation".
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From whisper at accessone.com Mon Sep 6 19:31:01 1999
From: whisper at accessone.com (David LeBlanc)
Date: Mon Jun 7 17:14:42 2004
Subject: HTML namespace (was: an unfilled need)
In-Reply-To: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
Message-ID: <3.0.1.32.19990906103953.026b29d0@mail.accessone.com>
At 08:54 AM 9/6/99 -0700, Tim Bray wrote:
>Dammit, if the W3C obstinately refuses to give HTML a name for
>programmers to hang their hats on, I'm going to start a campaign right
>here in xml-dev for *us* to pick something, it'll be easier than
>designing SAX. -Tim
>
Hmm.. how about something after C++: Any tag which does not have a
namespace prefix or is not recognizably within a namespace is implicitly
HTML. Any tag which is not prefixed or not contained within a namespace and
is not defined by the version of HTML in use is an error.
When I first wrote this I thought it didn't answer the problem posed by
Tim, but then I realized that it does: if you don't have a doc header, you
don't have an HTML document thus you don't have tags which are prefixed or
within a namespace and thus the whole document is an error.
This, of course, is going to fly merrily down the runway and smack into a
wall consisting of all those pages out there that either don't use a doc
header or don't use it correctly (as was pointed out by a nearby post about
IBM's usage).
Excuse me if I really did miss your point Tim.
Dave LeBlanc
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Mon Sep 6 19:54:58 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:42 2004
Subject: Crazy idea
Message-ID: <00a701bef891$c7745970$5bf96d8c@NT.JELLIFFE.COM.AU>
From: Paul Prescod
>This "stylesheet as schema" idea comes around every so often but I
think
>that it has one major flaw: stylesheets cannot drive syntax directed
>editors.
Yes, but the idea is not "stylesheet as schema" but "validator
implemented by
stylesheet transformation language". As Francis Norton's tool (see
previous
email) shows, DCD (which is pretty suitable for a syntax-directed
editor)
validation can be implemented using XSLT.
Also, why should a syntax-directed editor work outside-in (i.e., using
content
models). That the currently do so is because they use grammars where the
children are keyed by the parent, but that is not the only way in which
a
grammar can be specified.
I do not see why a syntax-directed editor could not key off path
expressions
instead of the curent element type. That is surely how global exclusions
and
inclusion exceptions in SGML work: you can or cannot insert this element
type
because some ancestor's (extended) content model has allowed or banned
it.
>From a mathematical point of view, I don't think that XSchemas are much
>stronger than DTDs. The set of tag-based languages they can describe
are
Here here.
>pretty much the same. The XSLT set of languages would be radically
>different. What's the XSLT equivalent for this content model:
>
>((a*,(b|c)+,d)+|(d,(b|c)*,d?)+)
It would be a lot of typing in XSLT, but it cannot see why it could not
be
done. It can be a transformed from a schema anyway.
Another possibility is also that a validator need not perform all
possible
validations to be still useful. I have a note "Weaker Validation" at
http://www.ascc.net/xml/en/utf-8/weakvalid.html on this. In particular,
for documents in development is it useful to have a weaker validation.
>On the other hand there are constraints that XSLT could support that
>schemas probably could not.
I think abbreviated RDF and XML namespaces should be test cases for
XML schemas: if they can handle those, then we have a clear advance.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Mon Sep 6 20:05:55 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
In-Reply-To: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
Message-ID: <14291.64673.475435.922488@localhost.localdomain>
Tim Bray writes:
> Dammit, if the W3C obstinately refuses to give HTML a name for
> programmers to hang their hats on, I'm going to start a campaign
> right here in xml-dev for *us* to pick something, it'll be easier
> than designing SAX. -Tim
If MS and Netscape/Mozilla both buy into the same Namespace URI, then
it won't much matter what the W3C says, but I'd still like to hope
that the XHTML WG will give us a URI -- even just a NOTE with the line
The XML Namespace URI for HTML is "http://www.w3.org/Markup/".
would help prevent the nightmarish interoperability problems that are
coming up. If we had this, I'd be willing to wait a while for XHTML
proper.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Mon Sep 6 20:16:27 1999
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
Message-ID: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca>
At 01:54 PM 9/6/99 -0400, David Megginson wrote:
>If MS and Netscape/Mozilla both buy into the same Namespace URI, then
>it won't much matter what the W3C says, but I'd still like to hope
>that the XHTML WG will give us a URI -- even just a NOTE with the line
>
> The XML Namespace URI for HTML is "http://www.w3.org/Markup/".
>
>would help
Hmm, I observe that the namespace name hardwired to the prefix "xml:" is
http://www.w3.org/XML/1998/namespace
(amusing, since the namespace spec is dated January 1999). Anyhow, I
seem to recall that Dan Connolly cooked this up on behalf of the W3C and
there was a real good reason which I unfortunately forget for having a
date in there.
So by analogy, a candidate namespace name for HTML would be
http://www.w3.org/HTML/1999/namespace
And yes, a one-line NOTE on www.w3.org/TR would do the job. Hey there,
HTML WG, any chance? If you pick a URI, we all promise not to complain
about what it is. Right, guys? -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Mon Sep 6 20:24:32 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:42 2004
Subject: W3C (was Re: an unfilled need)
Message-ID: <00b801bef895$e9456f90$5bf96d8c@NT.JELLIFFE.COM.AU>
From: Tim Bray
>Yes, I've raised this point
>repeatedly within the W3C - I don't know what the problem is.
I think there is missing stage in procedures at W3C. Some people
want standards to allow plurality, others want standards to
prevent useless duplication. The "we don't need XSL" kafuffle
recently shows this. XML has attracted both peoples because its
extensibility allows plurality but it has a fixed delimiter set
which attracts the stern antiduplicationists.
The antiduplicationist see schemas like this: we will make a
universal schema language and then we can get rid of DTDs.
But the WWW was built on allowing plurality.
So the missing stage in procedures in the W3C is this: for every
new XML technology proposed, there should be a mechanism
first developed to allow alternatives. The stylesheet PI is an
exemplar for this.
In the case of schemas, before XML schemas are developed,
we should have a mechanism for invoking them, otherwise
what else is there but namespaces? I agree with Tim 100% here.
Even if just as an interim, the W3C should introduce a schema
declaration PI, which binds the document (or an element type)
to a schema URL.
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Mon Sep 6 21:07:49 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
Message-ID: <37D41196.3DFAFFB6@pacbell.net>
Tim Bray wrote:
>
> At 10:28 PM 9/5/99 -0700, David Brownell wrote:
> >I hope this begins to reflect some kind of consensus -- "First,
> >do no harm". The XHTML spec will be helpful without addressing
> >the namespace issue -- a straight XML 1.0 application. Although
> >I'd far prefer to see a single namespace defined for (X)HTML, it
> >could wait for a while yet.
>
> No it can't! The damage is already starting to creep in - for example,
> look at IE5, which recognizes HTML tags using the hardwired prefix
> "html:"
OK, I confess to overstating my real point: that the two issues
could in fact be separated:
* An DTD for XHTML is a useful thing in its own right,
usable immediately in conjunction with other parts
of the current XHTML draft.
* The namespace structure of XHTML is separable, and in
fact doesn't need a DTD (or schema) in order to be useful.
(With a good DTD structure it just costs an !)
It's clearly my preference to see one namespace. And I'll agree
that the issue can't remain unresolved for too much longer, since
people have needed this answer for longer than they've needed an
official XHTML DTD. (How many unofficial DTDs have folk seen or
used? Let me count the ways ... ;-)
But since they're separable, it still seems reasonable to me that
the XHTML PR train continue on its mad rush to REC status -- but that
the namespace issue could be untied from the railroad tracks and led
aside, first.
> Dammit, if the W3C obstinately refuses to give HTML a name for
> programmers to hang their hats on, I'm going to start a campaign right
> here in xml-dev for *us* to pick something, it'll be easier than
> designing SAX. -Tim
I've thought the very thing. Not about committee design of a URL
(pick a reasonable one and go with it), but about xml-dev acting as
the nucleus around which a better solution can form.
> http://www.w3.org/HTML/1999/namespace
Sure, I'd vote for it. Now we need DTDs using this.
Does anyone want to get one that's done right -- using conditional
sections to disable transitional or frameset rules appropriately,
rather than having three separate DTDs? :-)
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lauren at sqwest.bc.ca Mon Sep 6 21:27:17 1999
From: lauren at sqwest.bc.ca (Lauren Wood)
Date: Mon Jun 7 17:14:42 2004
Subject: confidentiality in W3C WGs
In-Reply-To: <37D3F74D.AA84B8FF@praxis.cz>
Message-ID: <199909061929.PAA19983@gw.sqwest.bc.ca>
The only really good reason I see for the confidentiality agreement
that exists in several W3C Working Groups has marketing, public
relations, and politics at its base.
Imagine a lively WG discussion, in which two large member
companies (let's call them A and B) disagree. The rest of the WG
agrees with A and that's what goes into the spec. Maybe B gives in
graciously, maybe B gives in less than graciously, maybe B
doesn't give in at all. Then imagine the minutes are all public, and
everyone can tell everyone else exactly what happened.
The upshot is that some journalist, or maybe a public relations
person in company A, starts making a big thing out of how
company A "won" over company B in this issue. Next thing you
know, company B starts fighting back in public, or refuses to give
in graciously on any issue, or doesn't contribute properly to the
discussions, or leaves the WG.
Could this happen? Yes. I've had journalists call me to talk about
specs such as the DOM or XML, and when I've said that there are
discussions and sometimes companies disagree, I've had
questions such as "tell me when company A won and company B
lost" (no prizes for guessing which companies they were
most concerned with). I've seen articles written about other
standards committees (not in W3C) where the journalist has tried
to make out there is a fight in every small disagreement, and to get
quotes from any participant and twist them to make it look like a
fight. Makes for better press, I guess. Not so great if you're on the
WG involved, trying to get some consensus on sticky technical
issues. That can be hard enough without press articles and PR
problems getting in the way.
So sometimes you need confidentiality, to build trust and a
knowledge that what is said in a WG remains within that WG, so
that people can concentrate on the technical work, and not on the
politics.
Lauren
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Mon Sep 6 21:26:23 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
References: <14290.56269.189604.432980@localhost.localdomain>
<4.1.19990905155519.0409f3a0@mail.webgeek.com>
<4.1.19990905155519.0409f3a0@mail.webgeek.com> <4.1.19990906102427.04097280@mail.webgeek.com>
Message-ID: <37D415F0.1E628C73@pacbell.net>
Ann Navarro wrote:
>
> Allowing for discovery, based on a schema, DTD, or whatever other defining
> mechanism is provided, lets tomorrow's Web browser have that same
> "knowledge" of a element, and any other knowledge deemed necessary or
> prudent by those who will be using said element.
>
> I am not a software architect, I do not pretend to be one. Yet this is not
> an idea that is foreign to many who are, and even thought of as necessary
> by some of those.
Those of us who _are_ software architects work with the issues of
extensible systems all the time. At some level it always boils
down to moving new code to someplace that can safely execute it,
which has been done since the first set of patch cords leaped out
of the primordial sea of circuitry onto a wall of clicking relays.
Given that it's been getting done sucessfully for so long, and
without addressing the "need" that started this thread, I still
fail to see why the W3C should try to involve itself in such work.
Whatever the W3C would try to dictate in this space, a better
solution will come along within the year. And I suspect neither
would get that much more use than the systems people use today.
Plugins? Applets? Active-security-hole? Yoiks.
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From oren at capella.co.il Mon Sep 6 22:38:14 1999
From: oren at capella.co.il (Oren Ben-Kiki)
Date: Mon Jun 7 17:14:42 2004
Subject: Crazy idea
References: <02ba01bef84e$1041ae30$4602a8c0@capella.co.il> <37D3EF98.BB075628@prescod.net>
Message-ID: <034801bef8a6$b2e90db0$4602a8c0@capella.co.il>
Paul Prescod wrote:
> This "stylesheet as schema" idea comes around every so often but I think
> that it has one major flaw: stylesheets cannot drive syntax directed
> editors.
Validation is one thing, and assisted construction is another. Related
issues, to be sure, but there is a lot of functionality you'd expect in
"assisted document construction" which simply does not belong in validation.
It is not at all clear to me that a specification for validation should
consider these issues. That said...
Stylesheets _can_ be used to assist construction, as in "transform an empty
node matching the following criteria into ...". The compromise I suggested -
that a "schema" stylesheet must be a no-op if applied twice - immediatly
suggests that such stylesheets would be very useful in assisted
construction. Just run the partial document through them, flag errors, and
repeat until all errors are gone and the user is happy.
> From a mathematical point of view, I don't think that XSchemas are much
> stronger than DTDs. The set of tag-based languages they can describe are
> pretty much the same.
The mathematical point of view is all well and good, but in practice
XSchemas meant to replace DTDs since they can do useful things DTD can't,
period. If we accept XSLT, we'll know for certain that whatever we throw at
it, it will be able to handle. With an XSchema type specification, there's
allways the worry that some case has not been covered, and we'd need
XSchema2.0 to make up for the lack.
> The XSLT set of languages would be radically
> different. What's the XSLT equivalent for this content model:
>
> ((a*,(b|c)+,d)+|(d,(b|c)*,d?)+)
This can be done in XSLT by converting the regexp into a series of 'choose'
statements. Admittedly, in this particular case, the result wouldn't be
pretty. If it turns out that this sort of constraints is common, we might
consider adding some extra ability into XSLT to make expressing them more
convenient.
> On the other hand there are constraints that XSLT could support that
> schemas probably could not.
And things which are trivial to express in XSLT which probably would be a
pain to express in a schema language (e.g., if the element has an 'a'
attribute with value 'v', it should not contain an 'e' sub-element).
I have no idea what the overall effect on real-world schemas would be. It
might be interesting to take some real world DTDs/XSchemas and convert them
to XSLT, as a test. Note that whoever does that gains the benefit of an
executable validation specification stronger then a DTD which runs today on
his XML system of choice.
Which brings up an interesting point: people who are interested in effective
validation can use the XSLT way today. It might be that some people have
already done so. Assume that it is a good way - that is, people get what
they need from it. Further assume that XSchema recommendation will take a
long time to finalize, and longer for implementations to become common,
while XSLT implementations are much more widespread. The expected result
would be that nobody would care about the XSchema recommendation in
practice, regardless of the W3C.
The only problem is that the W3C will, in the mean while, twist other
recommendations to be compatible with XSchemas. For example, specifying
three name spaces for XHTML - which brings us a full circle back to where we
started a week ago :-)
Have fun,
Oren Ben-Kiki
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Mon Sep 6 22:38:37 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
In-Reply-To: <199909061550.QAA27493@ns.w3s.ie>
References: <14291.55243.697468.921617@localhost.localdomain>
<199909061550.QAA27493@ns.w3s.ie>
Message-ID: <14292.9627.156385.196996@localhost.localdomain>
Brendan McKenna writes:
> I would say that that depended on the application. However, a
> 'general' XML browser -- in the sense that Netscape or IE are
> 'general' HTML browsers -- would need to be able to discover a
> means of validating the document -- whether it's one or more
> Schema's or DTD's is irrelevant -- and a stylesheet for displaying
> it, or take some 'appropriate' action (such as displaying the
> source in a 'raw' form) in the absence of one.
Well, if that's all we need, we've got it already. Linking to a
stylesheet is already standardized in a W3C Recommendation:
http://www.w3.org/TR/xml-stylesheet/
The XML 1.0 DOCTYPE declaration provides a crude, per-document schema
link, and once we have other kinds of schemas, we'll probably have a
mechanism for linking to those as well. If that's all people want
when they talk about "automatic discovery", then we have an easy road
ahead, but it really buys very, very little. As Brendan rightly
points out, meaning is not only document- but application-specific.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Mon Sep 6 22:41:54 1999
From: cbullard at hiwaay.net (Len Bullard)
Date: Mon Jun 7 17:14:42 2004
Subject: an unfilled need
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca> <37D3F74D.AA84B8FF@praxis.cz>
Message-ID: <37D4247B.4FD2@hiwaay.net>
Matthew Gertner wrote:
>
> No one objected to the W3C controlling XML at the onset because it was far from a
> foregone conclusion that XML was win over a number of plausible (but in retrospect clearly
> inferior) approaches.
Yes some did. The XML effort hijacked an international standard by
subsetting
it and putting the W3C trademark on it. The confidentiality rules of
the W3C
and the singleton control of its policy ensured that this standard would
be
modified and shaped by and for the goals of the members of a
consortium. Some
objected strenuously. As the twig is bent...
By the way, nice article, Jon. It says what namespaces are and aren't.
Otherwise,
it ducks all of the issues the list is trying to address without
any leadership. Yes, tie XML to Java. If that gets applications out
the door and checks coming back, by all means, do that.
Now, the rest of you should decide what you want to use namespaces
for and just do it. You don't need permission and if you wait for it,
you only fall behind in development.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Mon Sep 6 22:42:07 1999
From: cbullard at hiwaay.net (Len Bullard)
Date: Mon Jun 7 17:14:42 2004
Subject: confidentiality in W3C WGs
References: <199909061929.PAA19983@gw.sqwest.bc.ca>
Message-ID: <37D426E3.6EAA@hiwaay.net>
Lauren Wood wrote:
>
> So sometimes you need confidentiality, to build trust and a
> knowledge that what is said in a WG remains within that WG, so
> that people can concentrate on the technical work, and not on the
> politics.
All true, Lauren. Yet it does not relieve the working groups of
the responsibility for documenting issues and resolutions with regard
to the contents of the work. Such procedures have been established
and have worked in standards organizations for years. It is time for
the W3C to develop and use these procedures. The results of not doing
so
are plain to all. It is the responsibility of those who undertook
XML to do what is needed. That is the leadership, price and payment.
Tim Berners-Lee must begin to restructure the leadership roles and
responsiblities of the W3C to reflect the growing community. It is time
for a touch of Washingtonian sagacity about the nature, use and
preservation of power.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ken_north at compuserve.com Mon Sep 6 23:08:02 1999
From: ken_north at compuserve.com (Ken North)
Date: Mon Jun 7 17:14:42 2004
Subject: XML Database/Repository with content indexing ?
References: <199909061659.SAA05197@chimay.loria.fr>
Message-ID: <001701bef8ac$563f3980$0b00a8c0@grissom>
Patrice Bonhomme wrote:
> I was wondering if it exists some XML database system that provide a
content
> (full text) indexing mechanism.
Yes. Oracle 8i interMedia and IBM DB2 UDB (w/ XML Extender) manage
XML-enabled databases:
- various indexing schemes to provide stemming, linguistic searches, and so
on
- section searches of XML docs
- SQL queries over XML data using criteria such as CONTAINS 'impressionist'
or ABOUT 'impressionist'. Also query over rich types such as images and
audio.
===================================================
Don't miss XML One Europe, London, October 6-8, 1999
XML, Java, and Database Magic (Oct. 7)
http://www.xmlconference.com/xmleuro
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From xml at searchtools.com Mon Sep 6 23:15:03 1999
From: xml at searchtools.com (Avi Rappoport)
Date: Mon Jun 7 17:14:42 2004
Subject: XML Database/Repository with content indexing ?
In-Reply-To: <199909061659.SAA05197@chimay.loria.fr>
References: <199909061659.SAA05197@chimay.loria.fr>
Message-ID:
At 6:59 PM +0200 9/6/1999, Patrice Bonhomme wrote:
>
>
>I was wondering if it exists some XML database system that provide a content
>(full text) indexing mechanism. This system should be able to answer to this
>query : give me all documents that contains the expression "big blue" within
>their "/TEI.2/teiHeader/*/title" element(s).
Check out SIM (Structured Information Manager) at
, sgrep at
, BUS (Bottom Up
Search) at , Cheshire at
and the XQL proposal at
.
Best of luck,
Avi
_______________________________________________________
Guide to Local Site, Intranet, and Portal Search Engines:
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at docuverse.com Mon Sep 6 23:59:03 1999
From: donpark at docuverse.com (Don Park)
Date: Mon Jun 7 17:14:42 2004
Subject: Namespace for HTML (Signup Here)
In-Reply-To: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca>
Message-ID: <000401bef8b3$86d9aec0$6645fea9@w21tp>
Both Tim Bray and David Megginson proposed we settle on the Namespace URI
for HTML. I think this is a clear enough and small enough task for us
(XML-DEV) to accomplish here and now.
Here are some candidates:
1) "http://www.w3.org/Markup/" - by David Megginson
2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
Lets just settle on one and start using it. If W3C balks, we can go with:
4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
Signup by replying with one of the numbered options.
If you are against this proposal, select this:
0) "hell://no.stinking.namespace/4/HTML"
Best,
Don Park - mailto:donpark@docuverse.com
Docuverse - http://www.docuverse.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lisarein at finetuning.com Tue Sep 7 00:08:37 1999
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun 7 17:14:42 2004
Subject: Associated Press/ZDNet Blunder
References: <10443A04FBE2D21182800008C7335C067784@poha.com>
Message-ID: <37D440BD.16D6830F@finetuning.com>
Hello everyone:
I decided to try to get to the bottom of this quickly, before anyone
wasted any more energy on it. I'm cross-posting it to the dev list FYI
only (and, in all honesty, because i am a little curious if there are
any exceptions with regard to any of the technical assumptions that I
make below in number 6.)
I still wonder why the associated press even bothered with it? It must
have been a SLOW day...for them AND ZDNET. And, unfortunately, for a
couple Seattle papers and at one or two in northern California (the only
others i have heard about or seen first hand) over the last few days.
Ultimately, the Associated Press is who i blame for this product of
bumbling yellow journalism -- whoever wrote this "staff" release
couldn't have even gone to even one of the web site's of any of the
organizations involved for a second, and instead chose to take a chance
at misrepresenting a situation that basically amounts to a non-issue in
every way except politically -- Representing a slanted press release as
if the information it provided was some kind of setback to XML standards
is absolutely appalling.
If anyone knows jesse's email, please forward it to him. ZDNet doesn't
provide email links to any of the editors or writers on the site, and
really, under the circumstances, i figured they probably aren't too
concerned with making corrections anyway :-)
So first, here's the link to AP's embarrasing prose
http://www.freep.com/tech/qbnews4.htm
I will now provide a brief list of facts that should wrap this all up in
a jiffy. (okay maybe it's not so brief, but it's at least it will be
researched and referenced accordingly -- hey! what a concept -- maybe
i'll run it by AP :-) -- lisa
1. After researching this so-called "story" released by ZDNET, AP, and
various local newspapers that obviously picked it up the AP wire,
paraphrased it, without checking out any of the facts, and without
contacting anyone to verify the story. I was able to find out
everything in this email in a little more than twenty minutes.
(I merely read every single document on the CompTIA website) -- then
glanced over at RosettaNet, which was a no-brainer, because there is
nothing specific on the either site about any "rejection" (only a press
release from a month ago on the comptia site explaining that they have
submitted "comments" to RosettaNet.)
2. CompTIA has only "submitted its comments" on RosettaNet's PIPs -- it
hasn't "rejected" anything -- and this has NOTHING to do with the XML
standard itself, nor will it affect any of the XML standards work in
progress at the W3C, nor is there any vital XML standard in the works
that is now going to held up, due to the submission of these comments.
here's a quote right from CompTIA:
> "Over the last five years we've gained significant expertise in developing and implementing process, technology and data standards
> within the IT supply chain," said Reiner Schaaf, director of electronic commerce for Computer 2000 and chair of the CompTIA Europe
> EC standards board. "As an organization, we need to support RosettaNet in its global initiative through our extensive and detailed
> feedback on their specifications."
3. Additionally, there doesn't appear to be any sort of specification
that was expected to be released next February of next year or anything
that will now be held up. The statement amounts to little more than
unsubstantiated heresay, at best. (Sheesh, at this rate, maybe AP is
so confused it is referring to the core XML 1.0 W3C Recommendation
standard that already came out TWO februaries ago :-)
4. http://www.comptia.org
Background on the organization that has "rejected the XML standard":
CompTIA - The Computing Technology Industry Association (CompTIA)
is a not-for-profit organization that makes money selling certification
training. Apparently, due to their "other" declared goals of improving
ethics in the computer industry and being a largely POLITICAL force, in
theory, for the whole IT industry they are able to remain
not-for-profit.
Bottom line: This group lobbies congress to either get
legislation passed or stop legislation it doesn't like from being
passed, in the name of non-government interference and free enterprise,
and stuff like that :-) Whether a particular proposed legislation is
"good" or "bad" seems to hinge on whether its passing will cost money or
generate money for the IT departments of the (mostly) very large
corporations in the software/computer industry which make up its
members.
Just FYI: Here's their latest "battle" for instance -- trying to work it
so IT companies can receive tax credits for the cost of training their
employees they would have to train anyway. Nice angle, essentially free
money for the IT companies -- even though IT training is something that
all companies will have to be paying for...
http://www.techcoalition.org
5. This week's media coverage was the result of the calculated
measures of CompTIA, who apparently views XML's early and widespread
implementation as a bit of a threat. CompTIA makes its money certifying
IT people in many currently-implemented, largely proprietary, networking
technologies (corba, COM, probably SAP, PeopleSoft). Not only are most
of these technologies now integrating XML into their current framework
-- any kind of generalized certification program is immediately going to
leave its pupils instantly behind the times if they don't ALSO have some
XML experience.
In fact, it is already arguable that such experience could be deemed
MORE important than the previously aforementioned technologies
(especially at the rather limited and introductory skill level that a
graduate from one of these programs, who otherwise had NO real world
experience would have to offer.
This company (ooops, not for profit organization) has spent a lot of
time and money on its "certification" materials, and now, if they don't
integrate XML training into their certification program in general, and
quickly, because soon many of these complicated protocols will no longer
be necessary now that XML is making it easy to exchange data. Ouch for
them! No more complicated protocols for corporations to have to spend
thousands of dollars on training courses for -- CompTIA makes its bread
and butter on these complications. So the longer it can convince its
members that they don't really "need" XML, the better for
business...
6. To CompTIA's defense, I suppose they could be just really, really,
confused. As evidenced by this exerpt from their press release about
their comments on the RosettaNet PIPs:
> The ECSB, which reviewed all of the 42 RosettaNet Partner Interface Processes (PIPs) and the implementation framework, focused
> its comments on regulatory, legal, revenue, privacy and tax guidelines faced by companies conducting business in multiple
> countries. Specific comments concerned the inability to use an ANSI framework among large and small partners outside of the US,
> standard identification protocols such as the DUNS number that are not prevalent in Europe, Asia and Latin America and the
> requirement for UCC/EAN GTIN numbers that are not sufficient to uniquely identify trade globally.
Any XML E-commerce standard will undoubtedly be incorporating all of
these identification mechanisms into its larger framework. At least,
these were non-issues over a year ago when RosettaNet began (the article
is also incorrect when it says the organization was just launched in
June -- we all know that!)
> The standards-setting group, called RosettaNet,
> was launched with much ballyhoo in June to
> harness the XML software language to speed the
> transmission of data across electronic networks
> linking computer and software makers, parts
> suppliers and distributors.
I'm not even going to bother pointing out the other inaccuracies...
7. This part is just plain silly:
>
> The review also addressed problems communicating name and address purchase information across country boundaries,
> requirements of identification of country of origin and destination within the information technology fulfillment chain.
These "technical" issues were some of the first to be solved. Only the
politics of such transactions still need to be determined. The legal
ramifications of such transactions are still very much up in the air.
But "communicating" country and identification information across
international boundaries? I think almost ANY XML-based specification
would have that covered :-)
8. In conclusion: what a sad, sad day for american journalism indeed --
9. The moral: think twice before you take an AP story at face value --
this little episode should prove once and for all that they are NOT
checking for accuracy, verifying sources, or doing anything else that
ANY respectable news bureau, much less one of the two or three bureaus
who, if anything, should be taking EXTRA precautions due to their
position of privilege in the eyes of the public at large (rather than
setting new records of inaccurate, sloppy reporting).
The technical ramifications: NULL SET
Okay in all fairness, this took me about an hour to finish writing it
up, not just the twenty minutes to research -- but if I were to print a
story on XML.com I'd have to get original quotes and additional factual
verifications from every organization involved -- and i'm not sure i
wish to commit any more time to clearing up someone else's
misinformation. I'm still not convinced that I'm not just playing right
into their political press machine by writing about it at all, much less
giving them additional coverage online :-)
Hope this has been helpful in clearing up this press mess.
Needless to say, if anyone feels that anything contained in this email
is inaccurate, speak up and set me straight -- as long as you can
reference the source of your "corrected" information -- so we don't just
end up in heresay circles again, ok?
take care,
lisa rein
http://www.finetuning.com
Orin Rehorst wrote:
>
> Please comment on the following (source;
> http://www.zdnet.com/anchordesk/story/story_3824.html):
>
>
> The Computing Technology Industry Association rejected a proposed standard
> for XML software that would speed the easy exchange of data between
> networks. The group said the proposal skewed toward American firms and would
> prevent European companies from taking part. The move is likely to delay
> release of an XML standard, which previously had been expected next
> February. Jesse's take: Given the importance of XML to the Net's future,
> this is a disappointing and foolish setback; and smacks of anti-American
> sentiment.
>
> Regards,
> Orin Rehorst
> Port of Houston Authority
>
> ==========================================
> XML/EDI Group members-only discussion list
> Homepage = http://www.xmledi.com
>
> Brought to you by: Online Technologies Corporation
> Home of BizServe - www.bizserve.com
>
> TO UNSUBSCRIBE: Send email to
> Leave the subject blank, and
> In the body of the message, enter ONLY: unsubscribe
>
> Questions/requests should be sent to: owner-xmledi-list@bizserve.com
> To join the XML/EDI Group complete the form located at:
> http://www.geocities.com/WallStreet/Floor/5815/mail1.htm
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lee at fastwater.com Tue Sep 7 00:37:23 1999
From: lee at fastwater.com (Lee Fife)
Date: Mon Jun 7 17:14:42 2004
Subject: Namespace for HTML (Signup Here)
References: <000401bef8b3$86d9aec0$6645fea9@w21tp>
Message-ID: <37D44269.2216CDB2@fastwater.com>
My vote: either 3 or 4.
-Lee
Don Park wrote:
> Both Tim Bray and David Megginson proposed we settle on the Namespace URI
> for HTML. I think this is a clear enough and small enough task for us
> (XML-DEV) to accomplish here and now.
>
> Here are some candidates:
>
> 1) "http://www.w3.org/Markup/" - by David Megginson
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
>
> Lets just settle on one and start using it. If W3C balks, we can go with:
>
> 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
>
> Signup by replying with one of the numbered options.
>
> ...
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mrc at allette.com.au Tue Sep 7 01:19:58 1999
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun 7 17:14:42 2004
Subject: Namespace for HTML (Signup Here)
References: <000401bef8b3$86d9aec0$6645fea9@w21tp> <37D44269.2216CDB2@fastwater.com>
Message-ID: <37D44C8B.60684AD8@allette.com.au>
Lee Fife wrote:
> My vote: either 3 or 4.
>
> -Lee
Rather than getting a boxful of mail like this, could we ask someone to tally the votes? Might
I suggest that a subject reflecting the voters choice be used, for example "Re: Namespace for
HTML (3 or 4)" with nothing in the body? Don Park, you started this and I would have no
problem with entrusting you with the count, but if you can't do it, I could (if there were no
objections). All votes would be made available to anyone who asked (making the process
accountable :-) after the deadline.
--
Regards,
Marcus Carr email: mrc@allette.com.au
___________________________________________________________________
Allette Systems (Australia) www: http://www.allette.com.au
___________________________________________________________________
"Everything should be made as simple as possible, but not simpler."
- Einstein
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at qub.com Tue Sep 7 01:14:52 1999
From: paul at qub.com (Paul Tchistopolskii)
Date: Mon Jun 7 17:14:42 2004
Subject: Hierarchical namespaces. Re: Namespace for HTML (Signup Here)
References: <000401bef8b3$86d9aec0$6645fea9@w21tp>
Message-ID: <010301bef8be$5961aa00$5df5c13f@PaulTchistopolskii>
> Both Tim Bray and David Megginson proposed we settle on the Namespace URI
> for HTML. I think this is a clear enough and small enough task for us
> (XML-DEV) to accomplish here and now.
I'm having impression that I'm stupid, but
I don't think that designing even trivial hierarchy
is small and easy task, if taking into account that
almost any hierarchy should consider future inheritance.
The URIs will appear in thousands of documents.
I guess that folks who designed UNIX were not happy
with their 'creat' syscall ( instead of 'create' ).
It's to illustrate my point that sometimes just one character
matters. Of course, the URIs problem it's a small problem
if comparing it with other problems of XML.
Anyway. The URI is very importent chunk of information
about the namespace. Maybe it's my mistake to consider
it to be important, but my experience shows me that
when designing public API's spending just one day
thinking about good naming convention is importent.
In case that I have missed some previous attempt of
W3C to design the structure of URI sufficient for a long
run, could anybody please point me the URL I can read
on this?
> Here are some candidates:
>
> 1) "http://www.w3.org/Markup/" - by David Megginson
The best one. I like it more than other variants. It is clear
hierarchy :
Vendor
Area
But it is not perfect. Were is the word HTML here?
What would be the relation with XHTML ?
I think this name could be improved to become:
http://www.w3.org/Markup/HTML
Which would allow
http://www.w3.org/Markup/XHTML
In the future.
1. I don't think TR or other versioning information should
be presented in the URI, but I'm not sure.
2. I think that because namespaces are common for all
other standarts one should think about the flexible,
reasonable and expandable hierarchy of namespaces.
2. I don't think just simple prefix matching would work on
long run. For example, I can imagine a situation when
some new vendor comes with something 'inherited from'
http://www.w3.org/Markup/HTML/
But it would cause
http://the.vendor.id/Markup/HTML/extension
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
Don't think the information should be duplicated.
The URI will already appear in the 'namespace' instruction,
so there is no need in word 'namespace'.
I don't think information about the dates should be in
the URI.
Vendor
Area
Subarea
may be enough for most cases. Unfortunately
there is no way to do some complex things with
trivial inheritance, but I don't think it would be a
problem. My idea is that namespaces URIs should
not be as chaotic as they are now.
It is public API ! Who was blaming MS ? ;-)
Again - maybe I'm missing something.
Conclusion.
I don't think URIs should be just some abstract and
unpredictable strings because URIs are part of
XML public API. Namespaces affect every part
of XML. 'Saving' one day not designing this
part of public API seems strange. If that
day has been spend already - is there any way
to check what was the descision ?
Rgds.Paul.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
paul@pault.com www.renderx.com www.pault.com
XMLTube * Perl/JavaConnector * PerlApplicationServer
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mgl at cereus7.com Tue Sep 7 01:53:55 1999
From: mgl at cereus7.com (mgl@cereus7.com)
Date: Mon Jun 7 17:14:42 2004
Subject: XML through XSL to HTML translation
Message-ID: <199909062355.QAA11535@proxy3.ba.best.com>
Hi there XML/XSL folks,
I have a need to generate HTML pages (for 4.x and above browsers) and would
like to mainly author in HTML but would like to use XML for data storage
and XSL to "insert" the XML data into the HTML code. The problem I'm
facing at this point is that there seems (a) no tools for creating XSL
style sheets that emit HTML (without hand coding the entire HTML, i.e.
<BODY> etc., etc.) nnd/or (b) No existing tools for encoding HTML
with data references as an XSL style sheet automatically.
My current thoughts are to author the HTML and then go back and put
comments into the HTML that specify where the data extracted from the XML
should go and then translate (somehow) that HTML document into an XSL style
sheet so that when I use XML + XSL the result becomes well formed HTML.
As there is currently (as far as I can find) one browser (IE5) that
supports XSL directly and many of my target systems are non-MS-using
companies, just using XSL (via IE5) is not an answer.
If this has previously been discussed to death, I appologize and humbly ask
someone to send me the results / post-mortem or whatever. If not, I'd love
to hear anyone with tools, ideas and/or alternatives.
Regards,
Michael Lehman
System Architect
Cereus Design Corp.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lisarein at finetuning.com Tue Sep 7 02:43:07 1999
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun 7 17:14:42 2004
Subject: xmlns:html="foo" - a cross-browser solution?
References: <000401bef8b3$86d9aec0$6645fea9@w21tp>
Message-ID: <37D464F9.FB8DC8DE@finetuning.com>
Idea:
Right now, millions of confused HTML authors are anxiously waiting to
use namespaces, even though most of them are not sure why.
I'm not usually one to give HTML special treatment :-), but it might be
worthwhile in this case: there are a billion versions of the HTML specs
out there, all those HTML Doctypes addresses, W3C website addresses for
the specs themselves, and whatever else someone might mistake for the
correct URI value (should one ever be chosen).
With all this in mind, how can we just pick out a single HTML NameSpace
URI amongst ourselves during one holiday-ish afternoon on a mailing list
and then expect for that value (or any other single value) to be
wholeheartedly "adopted" by every past, present, and future follower?
That gave me an idea: What about allowing ANY unique URI value to be
declared within the html namespace; they would just be used for
rendering well-formed HTML from within a browser (all it really should
be used for anyway)...
This way, the IE5+ and Mozilla 5+ html namespace declarations could at
least be made compatible -- next time around -- no excuses :-)
Something like this:
or
Which is really just this to their respective, embedded processors:
Since each browsers' HTML display behavior is essentially hardcoded in,
making its rendering behavior effectively application-specific anyhow.
Wouldn't it at least be helpful to have that next time around thing
cleared up?
This would also make things "easy" for all walks of HTML authors -- and
keep things backwards-and-forwards-compatible.
Would this create other problems I haven't thought of yet?
2. And we are NOT talking about XHTML, right? XHTML is the components
made up of well-defined collections of HTML elements, and XHTML must
have an explicitly-defined Namespace, etc.....
Right? :-)
thanks,
lisa
Don Park wrote:
>
> Both Tim Bray and David Megginson proposed we settle on the Namespace URI
> for HTML. I think this is a clear enough and small enough task for us
> (XML-DEV) to accomplish here and now.
>
> Here are some candidates:
>
> 1) "http://www.w3.org/Markup/" - by David Megginson
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
>
> Lets just settle on one and start using it. If W3C balks, we can go with:
>
> 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
>
> Signup by replying with one of the numbered options.
>
> If you are against this proposal, select this:
>
> 0) "hell://no.stinking.namespace/4/HTML"
>
> Best,
>
> Don Park - mailto:donpark@docuverse.com
> Docuverse - http://www.docuverse.com
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Tue Sep 7 04:01:25 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:42 2004
Subject: xmlns:html="foo" - a cross-browser solution?
References: <000401bef8b3$86d9aec0$6645fea9@w21tp> <37D464F9.FB8DC8DE@finetuning.com>
Message-ID: <37D4725B.DAC8F996@pacbell.net>
Lisa Rein wrote:
>
> That gave me an idea: What about allowing ANY unique URI value to be
> declared within the html namespace;
Somehow I don't think making the prefix ("html") become
a reserved word is the right way to go ...
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From aray at q2.net Tue Sep 7 06:23:49 1999
From: aray at q2.net (Arjun Ray)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
In-Reply-To: <37D3F74D.AA84B8FF@praxis.cz>
Message-ID:
On Mon, 6 Sep 1999, Matthew Gertner wrote:
> No one objected to the W3C controlling XML at the onset because it was
> far from a foregone conclusion that XML was win over a number of
> plausible (but in retrospect clearly inferior) approaches. Now XML is
> mainstream and this no longer flies.
Actually, the origin of XML as a W3C activity was irregular: it didn't
become an approved activity until Summer '97, at which point the ERB+WG
structure (with a "closed but public" mailing list) was reorganized into
the new WG+SIG (and the mailing list was sucked into the Members Area.)
The original list did make of use of W3C facilities (generously extended),
but my impression is that there was a "wait and see" approach at work, and
XML wasn't really -ahem- assimilated until it had proved itself in the
form of the Nov'96 draft.
> Lack of complete buyin (not to mention open hostility) from XML
> developers is certainly not in the W3C's interest, and only opens the
> way for Microsoft and other major players to step in with their own
> proprietary (and inevitably less well thought out) approaches.
I would argue that it was in Microsoft et al's *interest* to "close" XML
to the wider community. How else could they exploit standardization as a
marketing tool?
I'll just point out that namespaces were discussed exhaustively at least
twice on the WG/SIG. Avaliable to the public is the discussion of
Microsoft's "Structured Data" proposal in May '97: SD5 was about
namespaces.
Very very unfortunately, the discussion preceding the XML Namespaces Draft
was on the new list, which the public can't see. (Since the SIG has been
dead for a year, the reasons why the archive shouldn't be made public seem
to be political.)
Arjun
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From gregg at informalsoftware.com Tue Sep 7 06:59:30 1999
From: gregg at informalsoftware.com (GW)
Date: Mon Jun 7 17:14:43 2004
Subject: Who's actually into VML?
Message-ID: <199909070501.WAA04363@proxy4.ba.best.com>
Hi -
Was wondering if anyone on the list is actually using VML for anything?
It's supported in IE5, and so forth but I never see it mentioned on this
list; hence I question its viability, reality, whether to invest in the
technology or not. Anyone have opinions on whether VML has life?
Thanks!
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Tue Sep 7 07:23:54 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
Message-ID: <00f301bef8f2$139be330$5bf96d8c@NT.JELLIFFE.COM.AU>
>On Mon, 6 Sep 1999, Matthew Gertner wrote:
>
>> No one objected to the W3C controlling XML at the onset because it
was
>> far from a foregone conclusion that XML was win over a number of
>> plausible (but in retrospect clearly inferior) approaches. Now XML is
>> mainstream and this no longer flies.
> > Lack of complete buyin (not to mention open hostility) from XML
> >developers is certainly not in the W3C's interest, and only opens
the
> > way for Microsoft and other major players to step in with their own
> > proprietary (and inevitably less well thought out) approaches.
I disagree on both counts. Firstly, W3C does not control XML to the
extent that they can control other syntaxes, because XML is SGML.
Secondly, I think that if Microsoft made a successor to XML, it is quite
possible it could be better than XML is, learning from experience. If
Microsoft cares to give me a million dollars, I am prepared to develop
such a thing!
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at docuverse.com Tue Sep 7 07:48:44 1999
From: donpark at docuverse.com (Don Park)
Date: Mon Jun 7 17:14:43 2004
Subject: RFP: Namespace URI for HTML
In-Reply-To: <37D44C8B.60684AD8@allette.com.au>
Message-ID: <000001bef8f5$1cd5c440$5be0fea9@w21tp>
All right. I wanted to see the momentum build visually on XML-DEV but I
guess there is enough interest to do this proper.
Request for Proposal: Namespace URI for HTML
Propose what you would like to see as the Namespace URI for HTML. I will
collect them all and post it on a public polling site (probably
www.freepoll.com) so that we can all take a wack at it. Please send it to
me personally rather than posting it to XML-DEV.
Current candidates:
0) "hell://no.stinking.namespace/4/HTML"
1) "http://www.w3.org/Markup/" - by David Megginson
2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
5) "http://www.w3.org/Namespace/HTML/" - by Don Park (2nd try)
Deadline for proposal is: September 8th, 1999. Voting can start on 9th and
I think one week should be enough. Your comments are welcome.
Best,
Don Park - mailto:donpark@docuverse.com
Docuverse - http://www.docuverse.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From aray at q2.net Tue Sep 7 07:47:37 1999
From: aray at q2.net (Arjun Ray)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
In-Reply-To: <00f301bef8f2$139be330$5bf96d8c@NT.JELLIFFE.COM.AU>
Message-ID:
On Tue, 7 Sep 1999, Rick Jelliffe wrote:
> Firstly, W3C does not control XML to the extent that they can control
> other syntaxes, because XML is SGML.
XML is a W3C trademark. I think that settles the "ownership" question.
> Secondly, I think that if Microsoft made a successor to XML, it is quite
> possible it could be better than XML is, learning from experience.
Just as Outlook Express is a "better" newsreader than the many that went
before.
Experience is one thing. Track record is another.
> If Microsoft cares to give me a million dollars, I am prepared to
> develop such a thing!
But Ricko, that's why they won't give you a million dollars!;)
Arjun
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From aray at q2.net Tue Sep 7 08:43:32 1999
From: aray at q2.net (Arjun Ray)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
In-Reply-To: <4.1.19990906102427.04097280@mail.webgeek.com>
Message-ID:
On Mon, 6 Sep 1999, Ann Navarro wrote:
> In today's Web browser, there is built-in "knowledge" of what HTML
> elements are supposed to be and do. They have been programmed to
> recognize a element as a new structural block, presented with a
> leading line space, text flush left, unless otherwise indicated in a
> style sheet or align attribute.
Well, this isn't really true. The popular browsers are inheritors of the
Mosaic paradigm, which starts with an inherently different view of tags to
begin with - as isolated (or "streamed") embedded commands; the concept of
"element" doesn't pertain at all. Consider, for instance, this elaborate
non-explanation of why
should make a difference:
(This is also why, IMHO, stylesheets never had a chance with these
browsers.)
> Simple, primarily presentation information, but it's preprogrammed in
> how to deal with such elements. [...] Allowing for discovery, based
> on a schema, DTD, or whatever other defining mechanism is provided,
> lets tomorrow's Web browser have that same "knowledge" of a
> element,
How? Either tomorrow's browser has the preprogramming, or it doesn't.
Arjun
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at qub.com Tue Sep 7 09:06:13 1999
From: paul at qub.com (Paul Tchistopolskii)
Date: Mon Jun 7 17:14:43 2004
Subject: XML belongs to itself. (Re: an unfilled need)
References: <00f301bef8f2$139be330$5bf96d8c@NT.JELLIFFE.COM.AU>
Message-ID: <029301bef900$2ee924a0$5df5c13f@PaulTchistopolskii>
> >On Mon, 6 Sep 1999, Matthew Gertner wrote:
> >
> >> No one objected to the W3C controlling XML at the onset because it was
> >> far from a foregone conclusion that XML was win over a number of
> >> plausible (but in retrospect clearly inferior) approaches. Now XML is
> >> mainstream and this no longer flies.
>
> > > Lack of complete buyin (not to mention open hostility) from XML
> > >developers is certainly not in the W3C's interest, and only opens the
> > > way for Microsoft and other major players to step in with their own
> > > proprietary (and inevitably less well thought out) approaches.
>
> I disagree on both counts. Firstly, W3C does not control XML to the
> extent that they can control other syntaxes, because XML is SGML.
I disagree that XML is SGML ( I'l try to explain that XML is
actualy UNIX ;-) but I agree that it is not *that* important
who is 'controling' XML, because from my point of view
XML is 'controling itself' more than any vendor or group
could do.
> Secondly, I think that if Microsoft made a successor to XML, it is quite
> possible it could be better than XML is, learning from experience. If
> Microsoft cares to give me a million dollars, I am prepared to develop
> such a thing!
It may be just a bit better ( more likely it would be
'a bit different'). The unbeatable thing about XML is the
concept. You can not invent the better concept .
( OK, maybe you can - I don't know... )
XML is good because it is not a big invention - it's kind of
reproducing well-known (old) UNIX concepts.
If not going into much detail, we could say that the (whole)
concept of UNIX was:
1. Everything is a text, because it's easier to read.
2. Do not build monsters but use pipes and small
nice bulding blocks instead.
The only difference is that XML says:
1. Yes, everything is a text, but we now also have
not only English ( unicode ) and also text should
be a bit better structured than 'a bunch of lines'
( 'bunch of lines' - is just a special case of a more
general - but still trivial - tree structure ).
It is kind of simplification, because UNIX has changed,
there were more concepts in UNIX, 'everything' means
'mostly everything you need in the real life' e t.c. -
I'm writing those 2 axioms because I think that XML
has a success *only* because it follows those plain
concepts of building scalable and open systems
( just making small change to one of the axioms ).
There was no big choice when designing
something to match the changed axiom.
Sure, we'l get ps -ax and ps -ef. I don't think it is
( would be ) a significant problem or significat
improvemet, like most of the problems discussed
in maling lists.
XML is good not because it is well-designed.
Note 1. Maybe it is - it's hard to understand yet, some parts
are much better than others ( many people are using
XT in the real life, but some standarts are not used
at all ).
Note 2. Maybe it is - but the design process is unspecified.
I can only guess what happens in the XSL FO WG, how
do people make analyzis, do they talk to end-users
e t.c.
XML is a good old concept that works for years (UNIX).
Nobody owns the concept. The concept owns itself.
Remember when MS tried to introduce UNIX-killer?
( Now known as 'better UNIX than UNIX" ).
Actualy, MS is now sponsoring perl ( which
is UNIX 'all in one' ) for Windows. Good concept is
hard to 'own' or 'kill' , more likely the concept
will 'own' you ;-)
Rgds.Paul.
PS. My apologies for a bit 'abstract' posting -
I now promise to stop flooding this malining list for
a while, but I realy think that XML is very special
case. With XML it is not *very* important who
'owns' the trademark or who is writing standards.
I'm not saying that W3C develops bad standards -
W3C does a great job. Any process could be
improved, of course, but in general - I like the
way it goes and I think that it would be hard to
find more brilliant persons to make a descisions.
Just - please - give a small vendors some way to
vote ( just to provide a reality check ) - and it would be
absolutely perfect ;-).
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
paul@pault.com www.renderx.com www.pault.com
XMLTube * Perl/JavaConnector * PerlApplicationServer
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at ito.tu-darmstadt.de Tue Sep 7 11:11:18 1999
From: rbourret at ito.tu-darmstadt.de (Ronald Bourret)
Date: Mon Jun 7 17:14:43 2004
Subject: confidentiality in W3C WGs
Message-ID: <01BEF922.56308FE0@grappa.ito.tu-darmstadt.de>
Lauren Wood wrote:
> The only really good reason I see for the confidentiality agreement
> that exists in several W3C Working Groups has marketing, public
> relations, and politics at its base.
>
> [very good reasons for maintaining confidentiality snippped]
>
> So sometimes you need confidentiality, to build trust and a
> knowledge that what is said in a WG remains within that WG, so
> that people can concentrate on the technical work, and not on the
> politics.
I've only briefly been following this whole discussion, but I don't think
anybody has asked for names of who wanted one namespace and who wanted
three and who wanted the discussion to simply end because they were late
for a flight.
I believe what people requested was an explanation of why the change was
made: at the very least, what arguments convinced the WG that it was
worthwhile, and hopefully a discussion of the alternate positions and why
they were rejected.
This kind of information, whether normative, in a non-normative appendix,
or simply provided in a forum like XML-DEV or an annotated specification,
is very helpful in getting buy-in for a specification and, let's face it, a
spec isn't useful if nobody buys in.
In my experience, most people will listen to an explanation. They might
still disagree, but the reaction is often, "Yipes! I see what you mean,"
and in any case explanations almost always help smooth the acceptance
process. On the other hand, when somebody is told, "That's private," the
feel rejected and throw out the good with the bad, not for technical
reasons, but for purely human ones.
-- Ron Bourret
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From steven.livingstone at scotent.co.uk Tue Sep 7 11:10:21 1999
From: steven.livingstone at scotent.co.uk (Steven Livingstone, ITS, SENM)
Date: Mon Jun 7 17:14:43 2004
Subject: Test - Pls Ignore
Message-ID: <8DCB90532FF7D211B34400805FD48853644CAC@SENMAIL3>
Steven Livingstone - http://www.deltabiz.com
07771 957 280 or +447771957280
Author -
Professional Site Server 3, Wrox Press
http://www.wrox.com/Store/Details.asp?Code=2696
Professional Site Server 3.0 Commerce Edition, Wrox Press
http://www.wrox.com/Store/Details.asp?Code=2505
President, AIP Scotland.
ceo@citix.com
http://www.citix.com
Join Association of Internet Professionals - http://www.citix.com/aip
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From digitome at iol.ie Tue Sep 7 11:16:17 1999
From: digitome at iol.ie (Sean Mc Grath)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
Message-ID: <3.0.6.32.19990907100342.009fa980@gpo.iol.ie>
>David Megginson wrote:
>
>: Perhaps an example would help. Here's an XML document (without
>: Namespaces for now):
>:
>:
>: 112
>: g
>: xxx
>:
>:
>: Now, imagine that my XML application has just received this piece of
>: XML and knows nothing about the markup language used. What kind of
>: information should it be able to discover automatically, so that it
>: can process this document usefully?
>:
>
[Brendan McKenna]
> I would say that that depended on the application.
Precisely! But the number of potential applications is infinite. The
amount of stuff you would need to "discover" is infinite. That
is why inventing declarative syntaxes for discoverable knowledge
such as rendering semantics, interactive behaviour, computation
etc. will always be a sandwich short of the full picnic.
The most general way to deal with the fact that so much depends
on the application, is to soft-code the application. In browser
land, this is like saying that we should take out
the hardwired rendering engine and download it along with the
data. The PostScript printer approach.
It is as Nicklaus Wirth put it : "Algorithms + Data Structures = Programs".
In the XML world we can re-phrase this as "Java + XML = Programs"
or "Perl + XML = Programs" or (my personal favourite) "Python + XML =
Programs". A truly general solution needs both algorithm and
data. The headling rush into XML seems to have caused people
to loose sight of this and put all their faith in XML syntaxes
doing *everything*.
Whatever way you cut it, to truly general-purposify a "browser"
you need to soft-code its algorithms. In some alternative
universe you can write this:-
Bank.Debit ( (bllrp.zata * rte.w )/ GoldenRatio )
112
g
xxx
This approach will get you there a lot faster than
waiting for the "bank debit golden ratio rte interest group"
to finalize their declarative syntax for what the bllrp
element really means...
regards,
Developers Day Co-Chair, 9th International World Wide Web Conference
16-19, May, 2000, Amsterdam, The Netherlands http://www9.org
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at ito.tu-darmstadt.de Tue Sep 7 11:19:21 1999
From: rbourret at ito.tu-darmstadt.de (Ronald Bourret)
Date: Mon Jun 7 17:14:43 2004
Subject: ANN: XML and Databases article
Message-ID: <01BEF923.76896540@grappa.ito.tu-darmstadt.de>
[My apologies if you get this twice -- my email's been on the blink and I'm
not sure if the first message got through.]
I've written an article about XML and databases -- why you might want to
use a database with XML, what are some of the technical issues involved,
how available middleware transfers XML documents to/from databases, and
what software is currently available to do this. You can find the article
at:
http://www.informatik.tu-darmstadt.de/DVS1/staff/bourret/xml/XMLAndDatab
ases.htm
Comments welcome.
-- Ron Bourret
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From matthew at praxis.cz Tue Sep 7 11:34:10 1999
From: matthew at praxis.cz (Matthew Gertner)
Date: Mon Jun 7 17:14:43 2004
Subject: W3C policies (Was Re: an unfilled need)
References: <00f301bef8f2$139be330$5bf96d8c@NT.JELLIFFE.COM.AU>
Message-ID: <37D4DC7D.7C17841E@praxis.cz>
> I disagree on both counts. Firstly, W3C does not control XML to the
> extent that they can control other syntaxes, because XML is SGML.
> Secondly, I think that if Microsoft made a successor to XML, it is quite
> possible it could be better than XML is, learning from experience. If
> Microsoft cares to give me a million dollars, I am prepared to develop
> such a thing!
Just to be clear, the XML recommendation in itself is a fantastic piece
of work, especially considering the tightrope that had to be walked
between SGML traditionalists and HTML mainstreamers. Could it be better?
Sure, and if anyone catches me drinking in a hotel lobby and wants to
hear a long unfocused rant about how much I hate external entity
references, I'm always amenable. The point is that for what XML tries to
do (and I refer here only to the Feb98 recommendation) it is more than
good enough. I can't imagine Microsoft or anyone else wanting to redo
all this work for the sake of some marginal benefit.
On the other hand, XML is incomplete. There seems to be a quite strong
consensus (at least in this list, which I assume is somewhat
representative) that the following aspects still need treatment if XML
is going to live up to all the promises we have been proclaiming from
the rooftops (universal data exchange, data perennity, application
interoperability, world peace, etc.):
1) A better schema language than DTDs
2) Some means of finding out what semantics are associated with a
document
I thought I was going to type a long list, but it seems to me that this
would actually be sufficient. There seems to be agreement that other
important issues (like stylesheets and namespaces) are solved or at
least well on their way. This might not seem clear in the case of
namespaces, but most of the objections to the current spec are really
just hankering for 2) in disguise.
So the question is, would we rather see these things come about through
some open process or by dictate from some corporate titan focused on its
own self-interest? I don't want to sound like a starry-eyed idealist,
but as XML folks we all understand the value of freely available
information. Can't we leverage the network infrastructure provided by
the Internet to reform the standards process into something more
appropriate for this day and age? The argument that "it's a hell of a
lot better than ISO" is not entirely satisfying. (No offense to any
ISOers out there; I have no experience with this myself but I've heard a
lot of people who do make this statement.)
I read Lauren's post on this issue with interest and she makes a very
sensible argument. I hadn't seen things from exactly this perspective,
so I was glad to be exposed to this viewpoint. But I'm still not
convinced. The press no longer controls the public's access to
information. Open access to working group proceedings would presumably
make it harder for journalists to misrepresent what went on between
company A and company B, not easier, since anyone with a Web browser
could surf over to the W3C and check it out themselves. If this forces
companies to be more aware of how their actions could affect the way the
public perceives them, isn't this an unambiguously good thing for
everyone except for a handful of large and/or well-connected companies
(and in the aggregate, probably for them too)?
The word "political" keeps coming up in this context, and yes, we need
to accept political realities. What I was trying to argue in my original
post is that the choice is not between closed (i.e. non-publicly
accessible) development of further XML-related standards (favored by
larger companies and insiders) or open processes (favored by the
unorganized masses). If this were the case, the political reality would
be that the processes are going to stay the way there are. Actually, the
W3C may well fail to provide for XML developer's needs in the areas I
mentioned above and be preempted by some proprietary approach. This
means that even those with a vested interest in maintaining the W3C
confidentiality policy (and the power to make this happen) need to think
twice about whether this is really in their interests.
Taking a "direct democracy" approach where the public at large votes on
major decisions in the XML standards process is politically unrealistic
and not obviously desirable. We now have the technical means to do the
same in the "real" political arena, but it is likely that people will
decide to have knowledgeable representatives continue to do the vast
majority of this work for them. But providing completely open access to
all about what is going on inside the W3C's various groups would bring a
ton of benefits, clearing away a lot of bad feelings and
misunderstandings, opening critical and tricky decisions to widespread
debate, enabling W3C members to participate more freely in outside
discussions instead of brushing off interested and justified enquiries,
etc., etc. Sure, there would be a couple of additional risks to watch
out for, but the conclusion that these are of overriding importance is
far from obvious.
Matthew
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From matthew at praxis.cz Tue Sep 7 12:01:11 1999
From: matthew at praxis.cz (Matthew Gertner)
Date: Mon Jun 7 17:14:43 2004
Subject: Discovering semantics (Was Re: an unfilled need)
References: <3.0.6.32.19990907100342.009fa980@gpo.iol.ie>
Message-ID: <37D4E2CB.1AF7E943@praxis.cz>
> Precisely! But the number of potential applications is infinite. The
> amount of stuff you would need to "discover" is infinite. That
> is why inventing declarative syntaxes for discoverable knowledge
> such as rendering semantics, interactive behaviour, computation
> etc. will always be a sandwich short of the full picnic.
And rightly so, since this will enable software vendors to create their
own sandwich, keeping the idea market open to innovation and progress.
But that still leaves a hell of a lot of beer and potato salad for the
standards makers:
1) Associating element types with rendering semantics
Everyone agrees on this. We want a default way to view a document in a
browser in some readable way. Right now the stylesheet associated with a
document is specified in the document instance, but it could just as
easily be placed in a central repository and retrieve via query. In
addition, there's obviously no reason why several stylesheets couldn't
be associated with a single document, au choix. I'm way out of my depth
here, but presumably if font-makers can come up with universally
understandable, orthogonal attributes like "sans serif", "bold" or
"italic", then the same could be done for stylesheets. I'm no expert on
aesthetics, but this would be cool even if I just got a list of
available stylesheets with a descriptive name for each when I access a
document.
2) Associating element types with other semantics
...maybe in the form of Java classes (beans or otherwise). You won't get
a fully fledged application, of course, you could at least have a
standard method for viewing (which might well be done through some other
means than a stylesheet -- imagine a CML document), printing and
editing. And there's no reason why arbitrary methods could not be
available too. So I receive my "invoice" document and go to the central
semantics repository to see what methods are available for it. If I find
a method for, say, calculating the total price of the invoice, I don't
have to duplicate this code in my application.
Objections to putative methods for discovering semantics tend to be
based on unrealistic expectations about what this could do. There isn't
any need for "Java ontologies" or code that works everywhere in every
situation in order to do useful stuff. If I can publish my schema in a
central place alongside some code, then I am providing far more for
others to reuse than if I publish the data alone. If others can extend
my code (through aggregation or -- please! -- inheritance) then there is
a real chance that a rich set of reusable syntaxes and associated code
will spring up organically for a variety of application domains. So what
if I have to go query the repository by hand and then hard-code the link
to the appropriate application logic into my document instance? And if
someone manages to figure out some basic hierarchical organization for
the repository, a la Yahoo, well it would seem petty to bicker about a
missing sandwich or two...
Matthew
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Tue Sep 7 12:11:59 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
In-Reply-To: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca>
References: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca>
Message-ID: <14292.58565.93178.895487@localhost.localdomain>
Tim Bray writes:
> At 01:54 PM 9/6/99 -0400, David Megginson wrote:
> Hmm, I observe that the namespace name hardwired to the prefix
> "xml:" is
>
> http://www.w3.org/XML/1998/namespace
>
> (amusing, since the namespace spec is dated January 1999). Anyhow,
> I seem to recall that Dan Connolly cooked this up on behalf of the
> W3C and there was a real good reason which I unfortunately forget
> for having a date in there.
Dan's suggestion is that if someone else ends up owning the w3.org
domain some day, they're less likely to create their own, identical
Namespace URI if the URI contains a stale date.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Tue Sep 7 12:20:35 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:43 2004
Subject: Namespace for HTML (Signup Here)
In-Reply-To: <000401bef8b3$86d9aec0$6645fea9@w21tp>
References: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca>
<000401bef8b3$86d9aec0$6645fea9@w21tp>
Message-ID: <14292.58847.24153.172583@localhost.localdomain>
Don Park writes:
> Both Tim Bray and David Megginson proposed we settle on the Namespace URI
> for HTML. I think this is a clear enough and small enough task for us
> (XML-DEV) to accomplish here and now.
No, I'm not proposing that quite yet -- the HTML WG is still
discussing this issue, and things might still change before the XHTML
REC comes out.
> Here are some candidates:
>
> 1) "http://www.w3.org/Markup/" - by David Megginson
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
None of these is under our control -- they would have to be proposed
from within the W3C, and I cannot see any being approved without the
HTML WG's consent.
> Lets just settle on one and start using it. If W3C balks, we can go with:
>
> 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
OASIS controls xml.org now, and they probably would not want to set
themselves up in direct opposition to the W3C like this.
> Signup by replying with one of the numbered options.
Let's just wait a little while. We've made our opinions well-known on
the XML-Dev mailing list, and I know that many people here have sent
their comments into the Director or the HTML WG. I don't think that
I'm breaching member-confidentiality rules by saying that our
discussions here have been noticed.
If anyone is really excited and cannot wait to act, go buy the
namespaces.com domain so that we have somewhere to hang this sort of
thing. purl.org might also be co-operative.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Tue Sep 7 12:34:20 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:43 2004
Subject: MSFT is innocent, this time (was Re: an unfilled need)
In-Reply-To:
References: <37D3F74D.AA84B8FF@praxis.cz>
Message-ID: <14292.59533.863961.498295@localhost.localdomain>
Arjun Ray writes:
> > Lack of complete buyin (not to mention open hostility) from XML
> > developers is certainly not in the W3C's interest, and only opens
> > the way for Microsoft and other major players to step in with
> > their own proprietary (and inevitably less well thought out)
> > approaches.
>
> I would argue that it was in Microsoft et al's *interest* to
> "close" XML to the wider community. How else could they exploit
> standardization as a marketing tool?
I've been a Linux user since 1993 (and a Minix user before that) and
am, in general, a rabid anti-Redmond person, but I'd like to point out
(at the risk of breaching confidentiality) that Microsoft is not
conspiring to keep XML secret, and that when its representatives have
pushed on committees where I've been present, they've generally pushed
for an earlier and simpler public release rather than a later or more
complex one, and have often accepted compromises to help consensus.
Others' mileage may vary, but my experience is that the big companies
need XML to become an open commodity, while the smaller ones sometimes
have more of an interest in obfuscation (because XML is just a side
dish for the big ones but bread-and-butter for the small).
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Daniel.Veillard at w3.org Tue Sep 7 14:20:32 1999
From: Daniel.Veillard at w3.org (Daniel Veillard)
Date: Mon Jun 7 17:14:43 2004
Subject: Namespace for HTML (Signup Here)
In-Reply-To: <14292.58847.24153.172583@localhost.localdomain>
References: <3.0.32.19990906111905.00abd6c0@pop.intergate.bc.ca> <000401bef8b3$86d9aec0$6645fea9@w21tp> <14292.58847.24153.172583@localhost.localdomain>
Message-ID: <19990907082301.G6649@w3.org>
I would just like to raise the point that, if within xml-dev
an HTML naming space is designated, then it will just make things harder,
software writer (and I'm one of those) will have to support 4 namespaces
instead of 3 (or 2 instead of one), since I don't believe the XHTML WG
would find it easy to change a Proposed REC draft (and all the
implementation which are certainly be worked on in various places).
As David said at least some of the WG members got the feedback
especially Ann (and independantly of the technical issue itself
she has done an impressive job at reading and answering people
mails, hat off ...). Let's see what is the final outcome.
> If anyone is really excited and cannot wait to act, go buy the
> namespaces.com domain so that we have somewhere to hang this sort of
> thing. purl.org might also be co-operative.
too late, it's already reserved :-)
Banyan Systems (NAMESPACE-DOM)
115 Flanders Road
Westboro, MA 01581
US
and the .org and .net are bought already too
Daniel
--
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | Today's Bookmarks :
Tel : +33 476 615 257 | 655, avenue de l'Europe | Linux, WWW, rpmfind,
Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | rpm2html, XML,
http://www.w3.org/People/W3Cpeople.html#Veillard | badminton, and Kaffe.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Tue Sep 7 15:15:07 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:43 2004
Subject: What XML-dev can do (was Re: an unfilled need)
In-Reply-To: <37D4247B.4FD2@hiwaay.net>
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
<37D3F74D.AA84B8FF@praxis.cz>
Message-ID: <199909071317.JAA05607@hesketh.net>
At 03:30 PM 9/6/99 -0500, Len Bullard wrote:
>Now, the rest of you should decide what you want to use namespaces
>for and just do it. You don't need permission and if you wait for it,
>you only fall behind in development.
and Matthew Gertner later wrote:
>So the question is, would we rather see these things come about through
>some open process or by dictate from some corporate titan focused on its
>own self-interest? I don't want to sound like a starry-eyed idealist,
>but as XML folks we all understand the value of freely available
>information. Can't we leverage the network infrastructure provided by
>the Internet to reform the standards process into something more
>appropriate for this day and age?
We already have an excellent forum for discussing all of these issues, for
making proposals and presenting implementations. XML-dev has shown itself
capable in the past of building standards through open process (SAX, DDML)
and even in getting those standards widely adopted (SAX). There are no
membership fees, there is no official process, there are no barriers.
In the past, a lot of projects on XML-dev have faced an uphill fight until
it became very clear that the W3C wasn't going to claim that space as its
own turf and thereby obsolete our work. (In DDML's case, the proposal we
wrote was submitted to the W3C as a Note and became one of several inputs
to their schema project.)
It doesn't seem sensible, however, to be afraid of the W3C. There are a
lot of projects they don't seem interested in working on, and there are
even more cases where people need standards (like the namespaces issues Len
noted above) that go beyond what the W3C is currently providing.
Matt's list included two possibilities:
>1) A better schema language than DTDs
>2) Some means of finding out what semantics are associated with a
>document
I'd add:
3) A way to associate prefixes with namespace URIs in DTDs (I could do that!)
4) Some means of discovering what kind of processing is needed for a document
5) A way to profile or subset XML itself (ties into 4 somewhat)
6) A set of tools for querying XML documents (discussed at W3C, but nothing
yet)
7) A convention for mapping data types into DTDs (XML Authority has one)
8) Whatever else comes to mind - I'm only on my first cup of coffee.
Some of these projects have been talked about at the W3C, some (Matt's #1)
are enjoying the full thrills of W3C WGs. Others, though they might be
useful, even easy, appear to have fallen off the radar.
This doesn't even begin to get into the application development
possibilities for XML, on which the members of this list are constantly
working. The surface has barely been scratched, though.
It's a lot of work, and I know it isn't always fun. Most of us do have
other lives and all of that, and a significant portion of us have other
lives already spent at the W3C. It does seem, however, like a lot of this
could be rewarding. Personally, professionally, even politically.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From dave at userland.com Tue Sep 7 15:41:22 1999
From: dave at userland.com (Dave Winer)
Date: Mon Jun 7 17:14:43 2004
Subject: Why focus on HTML?
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca><37D3F74D.AA84B8FF@praxis.cz> <199909071317.JAA05607@hesketh.net>
Message-ID: <014101bef937$73ad3040$1618ccce@pebbles>
I've been reading the messages lately on the XML-DEV list and wanted to
share an observation.
You have a choice as to where you channel your energies. Rearchitecting HTML
is going to be a problem because it is already so deeply rooted. It may look
like fun, or a place of power, but that's an illusion. No one has the power
to rearchitect it. It's a thing of nature. W3C is powerless, as are the
browser vendors, as is the XML-DEV list. It doesn't matter what politics
happen in W3C. They have no control over the HTML code that's already
deployed, and very limited influence over the HTML that will be deployed in
the future. Look at the low rate of adoption of everything related to web
browsers and don't blame Microsoft for that, it's a market speaking, and big
markets move s-l-o-w-l-y and for good reason.
So, if you believe that, as I do, why not channel your energies into spaces
where there is no installed base? XML opens all kinds of possibilities. Why
not spread out and focus on implementations that use the net in new ways? Or
try out some new ideas. Why channel all the energy into a place where not
much can be done?
My opinion only..
Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From dave at userland.com Tue Sep 7 16:44:55 1999
From: dave at userland.com (Dave Winer)
Date: Mon Jun 7 17:14:43 2004
Subject: Why focus on HTML?
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca><37D3F74D.AA84B8FF@praxis.cz><199909071317.JAA05607@hesketh.net> <3.0.6.32.19990907074023.00984e90@mail.dt1.sdca.home.com>
Message-ID: <014701bef940$64943410$1618ccce@pebbles>
> More energy into actual XML apps and things like XML-RPC.
Well, now that you mention it, there has been a lot of movement in
XML-RPC-land in the last week.
A summary is here:
http://discuss.userland.com/msgReader$10661
The party is just beginning! Still lots of wide open space and room for
creative contributions.
Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Mark.Volkmann at AGEDWARDS.com Tue Sep 7 16:58:22 1999
From: Mark.Volkmann at AGEDWARDS.com (Volkmann, Mark)
Date: Mon Jun 7 17:14:43 2004
Subject: need XML DOM parser for COBOL
Message-ID: <5D176C6118FFD211B3A700902761F0F0FC4327@hqexchn8.agedwards.com>
I'm interested in using XML for data transport between a PC or Solaris box
and a mainframe in an organization where COBOL is the preferred mainframe
language. I've searched for a COBOL-based DOM parser to no avail. Is
anyone aware of one?
--
__ __
/ \/ \ Object Computing, Inc.
\ / ark (314)589-1617 pager
\ / (314)955-8087 A.G.Edwards
\ / olkmann volkmann@inlink.com
\/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19990907/128d9e41/attachment.htm
From srn at techno.com Tue Sep 7 17:00:28 1999
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun 7 17:14:43 2004
Subject: ANN: XML and Databases article
In-Reply-To: <01BEF923.76896540@grappa.ito.tu-darmstadt.de> (message from
Ronald Bourret on Tue, 7 Sep 1999 11:23:47 +0200)
References: <01BEF923.76896540@grappa.ito.tu-darmstadt.de>
Message-ID: <199909071453.JAA01354@bruno.techno.com>
[Ron Bourret:]
> I've written an article about XML and databases -- why you might want to
> use a database with XML, what are some of the technical issues involved,
> how available middleware transfers XML documents to/from databases, and
> what software is currently available to do this. You can find the article
> at:
>
> http://www.informatik.tu-darmstadt.de/DVS1/staff/bourret/xml/XMLAndDatabases.htm
>
> Comments welcome.
I learned some interesting things from this article, for which I'm
grateful. Recommended reading!
I was disappointed that the grove paradigm
(http://www.prescod.net/groves/shorttut/) was never mentioned, nor was
GroveMinder (http://www.techno.com). The grove paradigm offers an
ISO-defined meta-DOM to all notations, including XML.
-Steve
--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com http://www.techno.com ftp.techno.com
voice: +1 972 231 4098
fax +1 972 994 0087
pager (150 characters max): srn-page@techno.com
3615 Tanner Lane
Richardson, Texas 75082-2618 USA
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mike.champion at sagus.com Tue Sep 7 17:04:10 1999
From: mike.champion at sagus.com (Michael Champion)
Date: Mon Jun 7 17:14:43 2004
Subject: XML Database/Repository with content indexing ?
References: <199909061659.SAA05197@chimay.loria.fr>
Message-ID: <011601bef942$e91e7e00$5bbeb3c7@mccdell>
----- Original Message -----
From: Patrice Bonhomme
To:
Sent: Monday, September 06, 1999 12:59 PM
Subject: XML Database/Repository with content indexing ?
>
>
>
> I was wondering if it exists some XML database system that provide a
content
> (full text) indexing mechanism. This system should be able to answer to
this
> query : give me all documents that contains the expression "big blue"
within
> their "/TEI.2/teiHeader/*/title" element(s).
My apologies if this goes out twice, but check out Software AG's soon to be
released native XML database system at http://www.softwareag.com/tamino/
Also, the "XML and Databases" article pointed to by Ron Bourret's recent
posting contains a nice summary of various XML database products.
Mike Champion
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From tbray at textuality.com Tue Sep 7 17:15:14 1999
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun 7 17:14:43 2004
Subject: an unfilled need
Message-ID: <3.0.32.19990907081729.00c01680@pop.intergate.bc.ca>
At 12:55 AM 9/7/99 -0400, Arjun Ray wrote:
>Actually, the origin of XML as a W3C activity was irregular: it didn't
>become an approved activity until Summer '97, at which point the ERB+WG
>structure (with a "closed but public" mailing list) was reorganized into
>the new WG+SIG (and the mailing list was sucked into the Members Area.)
Historically inaccurate. XML was an official W3C activity starting
in July '96. The re-organization you describe (accurately) was part
of a W3C-wide process regularization. It is correct to say that the
W3C staff & management didn't really take XML seriously until the
late spring of '97. -Tim
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From donpark at docuverse.com Tue Sep 7 17:42:02 1999
From: donpark at docuverse.com (Don Park)
Date: Mon Jun 7 17:14:43 2004
Subject: Namespace for HTML (Signup Here)
Message-ID: <000201bef948$07f4d540$16a1fea9@w21tp>
David and Tim,
Sorry about misrepresenting your 'pleas' (see below) to the HTML WG as
proposals to XML-DEV. Frankly, I am too much of an ego maniac to even
consider begging an unreasonable group to make a reasonable decision. While
I am certain that not all of the HTML WG members are unreasonable, I can not
ignore the decision of the whole nor the hand of 'god' that took part in its
unraveling decision.
I took the initiative on the Namespace for HTML issue because it is less
violent than asking W3C to disband the HTML WG and try again. It was less
impossible than expecting W3C to reorganize itself. Will the HTML WG
suddenly see the light? I prefer not to hold my breath in suspense and wait
helplessly.
Best,
Don Park
Docuverse
If MS and Netscape/Mozilla both buy into the same Namespace URI, then
it won't much matter what the W3C says, but I'd still like to hope
that the XHTML WG will give us a URI -- even just a NOTE with the line
The XML Namespace URI for HTML is "http://www.w3.org/Markup/".
would help prevent the nightmarish interoperability problems that are
coming up. If we had this, I'd be willing to wait a while for XHTML
proper.
"Hmm, I observe that the namespace name hardwired to the prefix "xml:" is
http://www.w3.org/XML/1998/namespace
(amusing, since the namespace spec is dated January 1999). Anyhow, I
seem to recall that Dan Connolly cooked this up on behalf of the W3C and
there was a real good reason which I unfortunately forget for having a
date in there.
So by analogy, a candidate namespace name for HTML would be
http://www.w3.org/HTML/1999/namespace
And yes, a one-line NOTE on www.w3.org/TR would do the job. Hey there,
HTML WG, any chance? If you pick a URI, we all promise not to complain
about what it is. Right, guys?
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Clark.Cooper at bender.com Tue Sep 7 17:44:33 1999
From: Clark.Cooper at bender.com (Cooper, Clark)
Date: Mon Jun 7 17:14:43 2004
Subject: Article on using expat at www.xml.com
Message-ID:
I've written an article for xml.com (http://www.xml.com) on using expat.
Since I've seen requests on this mailing list for information on expat, I
thought I'd let you know that it's currently running there.
Clark Cooper Logic Technologies, Inc. cccooper@ltionline.com
(800)424-4200 650 Franklin St., Suite 304 coopercc@netheaven.com
ext 3984 Schenectady, NY 12305 USA
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From dhunter at Mobility.com Tue Sep 7 18:22:13 1999
From: dhunter at Mobility.com (Hunter, David)
Date: Mon Jun 7 17:14:44 2004
Subject: W3C policies (Was Re: an unfilled need)
Message-ID: <805C62F55FFAD1118D0800805FBB428D02BBFE7D@cc20exch2.mobility.com>
> From: Matthew Gertner [mailto:matthew@praxis.cz]
> Sent: Tuesday, September 07, 1999 5:36 AM
> I read Lauren's post on this issue with interest and she makes a very
> sensible argument. I hadn't seen things from exactly this perspective,
> so I was glad to be exposed to this viewpoint. But I'm still not
> convinced. The press no longer controls the public's access to
> information. Open access to working group proceedings would presumably
> make it harder for journalists to misrepresent what went on between
> company A and company B, not easier, since anyone with a Web browser
> could surf over to the W3C and check it out themselves. If this forces
> companies to be more aware of how their actions could affect
> the way the
> public perceives them, isn't this an unambiguously good thing for
> everyone except for a handful of large and/or well-connected companies
> (and in the aggregate, probably for them too)?
I have to disagree with this one point, and agree with Lauren on this issue,
and it's mostly because of my lack of faith in the press. :-) We all know
that you can make "facts" and "numbers" say just about anything you want,
even when those facts and numbers are publicly available. If an issue comes
up in a W3C working group, and a member from company A disagrees with a
member from company B, the press can very easily write up a bunch of
articles about how "Ol' A and B are at it again. Like cats and dogs, those
two are.". Everyone will either
a) Not bother to go and read the archives to verify it. Why bother?
They're not allowed to LIE when they're writing news stories!
b) Go to the archives and find out that, yes indeed, A and B did disagree
on that point! Boy, those two can never get along!
In either case, the net result Lauren described will probably ensue.
Representatives from A and B can no longer feel free to work together
properly in the W3C.
OTOH, summary information is always good. Don't give out the
minutes of the meetings, or make public all of the emails, because we can
get the results above. But if the working groups could make things like the
following public:
"There was also [insert issue here] involved in making this decision. Some
members of the WG thought [insert opinion on issue here], but others [insert
other opinion on issue here]. In the end, we came to this decision because
of [A REASON]."
then we wouldn't get all of the backlash against the W3C that has come up in
the last week on xml-dev. It's not important who thought what about those
issues, and anyone who feels they need to know is either nosy or a reporter
looking for a story.
Just my 2 cents.
David Hunter
david.hunter@mediaserv.com
MediaServ Information Architects
http://www.MediaServ.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ricko at allette.com.au Tue Sep 7 19:25:36 1999
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun 7 17:14:44 2004
Subject: XML belongs to itself. (Re: an unfilled need)
Message-ID: <003a01bef956$d951e840$09f96d8c@NT.JELLIFFE.COM.AU>
From: Paul Tchistopolskii
>I disagree that XML is SGML
The XML Spec says XML is SGML (see the abstract).
The SGML standard says XML is SGML (see ISO 8879 Annex L).
Rick Jelliffe
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From wunder at infoseek.com Tue Sep 7 19:26:08 1999
From: wunder at infoseek.com (Walter Underwood)
Date: Mon Jun 7 17:14:44 2004
Subject: XML Database/Repository with content indexing ?
In-Reply-To: <199909061659.SAA05197@chimay.loria.fr>
Message-ID: <3.0.5.32.19990907102333.00bae100@corp.infoseek.com>
At 06:59 PM 9/6/99 +0200, Patrice Bonhomme wrote:
>
>I was wondering if it exists some XML database system that provide a content
>(full text) indexing mechanism. This system should be able to answer to this
>query : give me all documents that contains the expression "big blue" within
>their "/TEI.2/teiHeader/*/title" element(s).
Ultraseek Server is a full-text search engine that handles XML.
If most of your searches are by experts, with complex boolean
queries and arbitrary element context, Ultraseek Server is not
the answer. But if you want to find "big blue" in title elements,
search for phrases, handle plurals in English, French, and other
languages (stemming), then Ultraseek Server is a good fit.
See: http://software.infoseek.com/
wunder
--
Walter R. Underwood
wunder@infoseek.com
wunder@best.com (home)
http://software.infoseek.com/cce/ (my product)
http://www.best.com/~wunder/
1-408-543-6946
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From lauren at sqwest.bc.ca Tue Sep 7 20:58:22 1999
From: lauren at sqwest.bc.ca (Lauren Wood)
Date: Mon Jun 7 17:14:44 2004
Subject: confidentiality in W3C WGs
In-Reply-To: <01BEF922.56308FE0@grappa.ito.tu-darmstadt.de>
Message-ID: <199909071900.PAA25558@gw.sqwest.bc.ca>
On 7 Sep 99, at 11:15, Ronald Bourret wrote:
> I've only briefly been following this whole discussion, but I don't think
> anybody has asked for names of who wanted one namespace and who wanted
> three and who wanted the discussion to simply end because they were late
> for a flight.
I agree that technical reasons for why the spec is the way it is, and
reasons for change, should be made public. Particularly when the
issue is controversial, as this one is.
Lauren
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cowan at locke.ccil.org Tue Sep 7 23:07:03 1999
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun 7 17:14:44 2004
Subject: First draft of proposed XML TC for Unicode 3.0 (unofficial)
Message-ID: <199909072144.RAA08372@locke.ccil.org>
This is version 0.1 of a proposed technical corrigendum to XML 1.0
to incorporate the new characters of Unicode 3.0 into the allowable
sets used in XML Names. It presumes that XML should not
remain limited to an obsolete version of the Unicode and ISO 10646
standards.
The new scripts handled are:
Cherokee, Ethiopic, Khmer, Mongolian, Myanmar, Ogham, Runic, Syriac,
Thaana, Unified Canadian Aboriginal Syllabics, Yi.
These lists of new characters were constructed by using the current Unicode 3.0
data file from the Unicode Consortium and applying the rules given
in Appendix B to it. This version of the proposal does not
yet incorporate information from the Unicode 3.0 properties list.
(Unicode 3.0 is technically still in beta, but the character list has
been frozen for months now.)
New BaseChars (BNF rule 85):
[#x01F6-#x01F9] /* new Latin letters */
| [#x0218-#x021F]
| [#x0222-#x0233]
| [#x02A9-#x02AD] /* new IPA Latin letters */
| #x03D7 /* new Greek letters */
| #x03DB
| #x03DD
| #x03DF
| #x03E1
| #x0400 /* new Cyrillic letters */
| #x040D
| #x0450
| #x045D
| [#x048C-#x048F]
| [#x04EC-#x04ED]
| [#x06B8-#x06B9] /* new Arabic letters */
| #x06BF
| #x06CF
| [#x06FA-#x06FC]
| #x0710 /* new Syriac script */
| [#x0712-#x072C]
| [#x0780-#x07A5] /* new Thaana script */
| #x0950 /* OM letters */
| #x0AD0
| [#x0D85-#x0D96] /* new Sinhala script */
| [#x0D9A-#x0DB1]
| [#x0DB3-#x0DBB]
| #x0DBD
| [#x0DC0-#x0DC6]
| #x0E2F / * new Thai characters */
| #x0EAF
| #x0F00 /* Tibetan OM */
| #x0F6A /* new Tibetan letters */
| [#x1000-#x1021] /* new Myanmar script */
| [#x1023-#x1027]
| [#x1029-#x102A]
| [#x1050-#x1055]
| #x1101 /* Hangul jamo that are no longer compatibility characters */
| #x1104
| #x1108
| #x110A
| #x110D
| [#x1113-#x113B]
| #x113D
| #x113F
| [#x1141-#x114B]
| #x114D
| #x114F
| [#x1151-#x1153]
| [#x1156-#x1158]
| #x1162
| #x1164
| #x1166
| #x1168
| [#x116A-#x116C]
| [#x116F-#x1171]
| #x1174
| [#x1176-#x119D]
| [#x119F-#x11A2]
| [#x11A9-#x11AA]
| [#x11AC-#x11AD]
| [#x11B0-#x11B6]
| #x11B9
| #x11BB
| [#x11C3-#x11EA]
| [#x11EC-#x11EF]
| [#x11F1-#x11F8]
| [#x1200-#x1206] /* new Ethiopic script */
| [#x1208-#x1246]
| #x1248
| [#x124A-#x124D]
| [#x1250-#x1256]
| #x1258
| [#x125A-#x125D]
| [#x1260-#x1286]
| #x1288
| [#x128A-#x128D]
| [#x1290-#x12AE]
| #x12B0
| [#x12B2-#x12B5]
| [#x12B8-#x12BE]
| #x12C0
| [#x12C2-#x12C5]
| [#x12C8-#x12CE]
| [#x12D0-#x12D6]
| [#x12D8-#x12EE]
| [#x12F0-#x130E]
| #x1310
| [#x1312-#x1315]
| [#x1318-#x131E]
| [#x1320-#x1346]
| [#x1348-#x135A]
| [#x13A0-#x13F4] /* new Cherokee script */
| [#x1401-#x166C] /* new Canadian Syllabics script */
| [#x166F-#x1676]
| [#x1681-#x169A] /* new Ogham script */
| [#x16A0-#x16EA] /* new Runic script */
| [#x1780-#x17B3] /* new Khmer script */
| [#x1820-#x1842] /* new Mongolian script */
| [#x1844-#x1877]
| [#x1880-#x18A8]
| #x3006 /* Ideographic closing mark */
| [#x31A0-#x31B7] /* new Bopomofo letters */
| [#xA000-#xA48C] /* new Yi script */
IMHO none of these are controversial except perhaps the Hangul jamo.
Formerly, some Hangul jamo had compatibility decompositions into
sequences of other Hangul jamo. These decompositions have been
removed from the Unicode Standard (actually in 2.1), so the jamo
should now be allowed in XML names in accordance with the rules in Appendix B.
New Ideographics (BNF rule 86):
[#x3400-#x4DB5] /* CJK Ideograph Extension A */
New CombiningChars (BNF rule 87):
[#x0346-#x034E] /* new IPA combining characters */
| #x0362
| [#x0488-#x0489] /* new Cyrillic combining characters */
| [#x0653-#x0655] /* new Arabic combining characters */
| #x0711 /* combining characters for new Syriac script */
| [#x0730-#x074A]
| [#x07A6-#x07B0] /* combining characters for new Thaana script */
| [#x0D82-#x0D83] /* combining characters for new Sinhala script */
| #x0DCA
| [#x0DCF-#x0DD4]
| #x0DD6
| [#x0DD8-#x0DDF]
| [#x0DF2-#x0DF3]
| #x0F96 /* new Tibetan subjoined letters */
| [#x0FAE-#x0FB0]
| #x0FB8
| [#x0FBA-#x0FBC]
| #x0FC6 /* new Tibetan combining character */
| [#x102C-#x1032] /* combining characters for new Myanmar script */
| [#x1036-#x1039]
| [#x1056-#x1059]
| [#x17B4-#x17D3] /* combining characters for new Khmer script */
| #x18A9 /* combining character for new Mongolian script */
| [#x20E2-#x20E3] /* new general combining characters */
IMHO none of these are controversial except perhaps the #x20E2 and #x20E3,
which are primarily intended for use with symbol characters, and therefore
should perhaps be excluded as #x20DD-#x20E0 are.
New Digits (BNF rule 88):
[#x1040-#x1049] /* digits for new Myanmar script */
| [#x1369-#x1371] /* digits for new Ethiopic script */
| [#x17E0-#x17E9] /* digits for new Khmer script */
| [#x1810-#x1819] /* digits for new Mongolian script */
IMHO none of these will be controversial.
New Extenders (BNF rule 89):
#x02EE /* Modifier letter double apostrophe */
| #x1843 /* Modifier letter for new Mongolian script */
IMHO none of these will be controversial.
In addition, the following characters no longer pass the tests given
in Appendix B for valid name or name-start characters, but should
remain legal in XML names for backward compatibility, and therefore
should be explicitly enumerated in the corrigendum:
03D0;GREEK BETA SYMBOL
03D1;GREEK THETA SYMBOL
03D2;GREEK UPSILON WITH HOOK SYMBOL
03D5;GREEK PHI SYMBOL
03D6;GREEK PI SYMBOL
03F0;GREEK KAPPA SYMBOL
03F1;GREEK RHO SYMBOL
03F2;GREEK LUNATE SIGMA SYMBOL
0675;ARABIC LETTER HIGH HAMZA ALEF
0676;ARABIC LETTER HIGH HAMZA WAW
0677;ARABIC LETTER U WITH HAMZA ABOVE
0678;ARABIC LETTER HIGH HAMZA YEH
0E33;THAI CHARACTER SARA AM
0EB3;LAO VOWEL SIGN AM
0F77;TIBETAN VOWEL SIGN VOCALIC RR
0F79;TIBETAN VOWEL SIGN VOCALIC LL
1E9A;LATIN SMALL LETTER A WITH RIGHT HALF RING
212E;ESTIMATED SYMBOL
###
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From roddey at us.ibm.com Tue Sep 7 23:35:55 1999
From: roddey at us.ibm.com (roddey@us.ibm.com)
Date: Mon Jun 7 17:14:44 2004
Subject: NAMESPACES: expressing commonality or distinction
Message-ID: <872567E5.0076DF35.00@d53mta03h.boulder.ibm.com>
>and allow applications to match
>
>http://www.w3.org/HTML
>
>or
>
>http://www.w3.org/HTML/Strict
>
>or
>
>http://www.w3.org/HTML/Strict/1.0
>
>depending on what they care about
>
>PRO: uses existing namespace mechanism
>CON: would require modification to XPath, etc.
>
It would definitely have performance implications. It would no longer be
possible to put elements/attrs into pools and identify them by a pool id that
can be very efficiently checked and validated. I would be very concerned about
it from that perspective. If an element is no longer uniquely identified by
URI:Name, then it cannot be internally uniquely identified either.
And I wonder if saying "its an application thing" is going to float either. Many
apps will rely on tools to do a lot of work for them (otherwise XML will have
missed a big part of the boat), so all those tools would have to understand this
scheme or the apps that need this kind of functionality wouldn't be able to use
them very effectively perhaps. But how would you efficiently implement this kind
of thing in general purpose tools? For example, how would a DOM tree diff tool
handle such a thing so that only meaningful diffs to the application at hand
would show up?
I'm sure it could be done, but the question is it worth going down this road or
just forcing all apps that need to do this to have effectively their own
validation mechanisms, attribute uniqueness checks, and so on?
----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From roddey at us.ibm.com Tue Sep 7 23:47:21 1999
From: roddey at us.ibm.com (roddey@us.ibm.com)
Date: Mon Jun 7 17:14:44 2004
Subject: ATTN: Please comment on XHTML (before it's too late)
Message-ID: <872567E5.0077E475.00@d53mta03h.boulder.ibm.com>
>Adding three lines of code introduces the opportunity for three times
>the bugs in a real application? I don't buy it. Looking for "HTML 4.0
>strict" OR "HTML 4.0 frameset" is no harder than looking for "UL" OR
>"OL", if the software is set up right.
>
>And it is *just as necessary in the general case*. There will be tons of
>code that needs to work the same across "similar" element types across
>namespaces.
>
I said this elsewhere, so this is a little redundant but...
I think its more complex than that. Its more than adding 3 lines of code. What
if you want to validate the XHTML? That requires that the validation mechanism
in all parsers now does partial URL matching? On which URLs does it do this? How
will parsers, which use decl pools in which uniquely identified elements are
given unique ids, deal with this?
Otherwise, you push off validation, attribute uniqueness checks, attribute
normalization, etc... off out of the parser which is what the parser was kind of
designed for in the first place, right?
If you think that those apps that need this functionality are happy to do it all
themselves, then I have no problem with that. But I think that it deserves
consideration that such documents could not meaningfully be dealt with by
parsers that exist now, other than as non-validated, non-namespace aware
documents pretty much. Is that really useful? And would we want to add all that
complexity to parsers in order to deal with this issue, if its not sufficient to
have the apps that need it handle all the issues?
----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 00:20:59 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:44 2004
Subject: fixing (just) namespaces and validation
Message-ID: <199909072223.SAA30088@hesketh.net>
I brought this up earlier on a list of things that could be done to improve
XML, and it keeps ringing in my ears.
Namespaces and validation are incompatible in cases where the namespace
prefix changes for whatever reason, because XML 1.0 will validate based
only on the prefix. The Namespaces in XML Recommendation doesn't even
provide a mechanism for telling DTDs what URI might be associated with a
given prefix.
James Clark provides an explanation of this at
http://www.jclark.com/xml/xmlns.htm:
>It would of course be very useful to have namespace-aware validation:
>to be able to associate each URI used in a universal name with some
>sort of schema (similar to a DTD) and be able to validate a document
>using multiple such URIs with respect to the schemas for all of the URIs.
>The XML Namespaces Recommendation does not provide this. The reason is
>is that DTDs have many other problems and missing features in addition
>to lack of namespace-awareness. So the plan is to come up with a new
>schema mechanism that fixes the problems with DTDs and, as part of this,
>provides namespace-awareness. This work is being done by the XML Schema WG.
At the same time, Namespaces REC says: (end of Section 6, conformance)
>Thus, unless the use of a validating processor has been specified,
>there can be no assurance that the contents of attribute values
>have been checked for conformance to this specification.
Most people seem to have disregarded this notice, and use non-validating
processors when they use namespaces, if indeed they do use namespaces.
Those who validate simply pray that in these early days of XML processing
no one will have gotten around to changing a prefix for whatever reason.
It seems like DTDs will be here for a while, and it doesn't seem like
fixing this problem is difficult. The PI from the 5/18 draft, which was
meant for documents, would serve just as well in DTDs. It could provide a
simple mapping for prefixes as used in the DTD to the namespace URI,
allowing processors to validate against both the namespace URI and the
local part, rather than simply the element or attribute name.
For example: (Yes, I changed the colon to a dash.)
When it appeared in the DTD, this would tell a validating processor that
the prefix xpdl maps to http://www.simonstl.com/ns/1999/xpdl. If the
processor then encountered elements with a namespace URI of
http://www.simonstl.com/ns/1999/xpdl, it could validate them on the basis
of that URI and the local part even if the xpdl prefix had disappeared.
It seems very simple, even potentially worthwhile. I don't think that it
would give parser builders enormous headaches, and it would take away a lot
of the technical ammunition of the anti-namespaces crowd. Then we could
discuss the more painful issues of 'what does it mean?' without extra noise
about how weakly it's been implemented, and we wouldn't have to wait as
impatiently for the Schema WG to finish their labors.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From roddey at us.ibm.com Wed Sep 8 00:40:22 1999
From: roddey at us.ibm.com (roddey@us.ibm.com)
Date: Mon Jun 7 17:14:44 2004
Subject: why distinctions within XHTML?
Message-ID: <872567E5.007CCC3D.00@d53mta03h.boulder.ibm.com>
>Somebody did the basic math in a comment: three variants of XHTML will
>very quickly add an order of magnitude to the complexity of the systems
>built with it. That's a deterrent to the use of XHTML, and discards the
>simplification that's long been at the core of the XML movement.
>
Does not the 'X' in XHTML pretty much mean that technically there should be no
'non-strict' version? I mean if its HTML, its HTML. But, if its XML, then it
needs to be well formed XML. I think that its going a little too far in the
direction of backwards compatability to do anything else. All parsers out there
now would reject non-strict HTML as not well formed anyway, right? I'm assuming
that non-strict (or traditional, or classic or whatever it is :-) means you
don't need a
for every and so on, right?
So just making it really have to be well formed XML (which would avoid changing
all the parsers in the world just to deal with this) would get rid of one of the
DTDs right off the back, wouldn't it?
Maybe this was a totally ignorant statement (I've been off writing real code and
haven't been following this debate) or maybe its been said 20 times already and
sorry if so, but it makes perfect sense to me that there should be no XHTML
which is not really XML.
That still leaves the issues of how to progress standards, but it would remove
one big hair ball from our respective gullets :-)
----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From roddey at us.ibm.com Wed Sep 8 00:53:06 1999
From: roddey at us.ibm.com (roddey@us.ibm.com)
Date: Mon Jun 7 17:14:44 2004
Subject: How much extra code for multiple Namespaces?
Message-ID: <872567E5.007DF412.00@d53mta03h.boulder.ibm.com>
>From:
>If you have such a mechanism then it should be fairly trivial to map all
>three XHTML namespaces into one, if as you say 99% of applications will
>treat them all the same anyway. Then it is nowhere near the extra amount
>of code that you claim? If you could, pls give an example!
>
The problem is that this is just one instance of this situation. If you allow it
here, will you make this the one and only special dispensation for it? If so,
then maybe all apps that deal with HTML can deal with this one case specially
and perhaps its no big deal (to me anyway, since I just write parsers :-)
But, if its not, how do you generalize it? Do you fill your XML processor with
lists of equivalent URLs, and tell it when and where they are really equivalent?
Otherwise, there is no way for a parser to validate such documents or even
confirm that it meets XML 1.0 rules (such as no reuse of the same attribute in
an element.)
And, even if you do this, you've put a big new burden on all processors to
support this kind of functionality. It will mean a performance hit because there
is no longer a unique string (uri-name, probably put into a pool and turned into
a single unique number) which represents an element or attribute. It would be a
significant change required in processors to handle this kind of thing.
----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From roddey at us.ibm.com Wed Sep 8 01:03:15 1999
From: roddey at us.ibm.com (roddey@us.ibm.com)
Date: Mon Jun 7 17:14:44 2004
Subject: why distinctions within XHTML?
Message-ID: <872567E5.007EE110.00@d53mta03h.boulder.ibm.com>
>I agree with Dave. The more I look, the more I am convinced that
>the W3C is not the right standards body to deal with XML, which is being
>employed in many arenas that have absolutely nothing to do with the web. In
>fact, if/when the true potential of XML is realized, the web will be a minor
>player in that. SGML is/was not the native language of the web. HTML was
>derived for that purpose. XML is/has the potential to be used for much more
>then internet publishing of information. The web has benefited from many
>technological contributions, many of which predate the web by a couple of
>decades, and most of the technology that goes into it is not at all web
>specific (connection oriented stream TCP communications, MIME,
>request/response, etc.). To think that XML is a web-only or even a
>web-mostly language is to miss the boat so far as to not even notice the
>ripples.
>
An obvious question to ask though is that, once XML has been stretched to deal
with all these other issues, will it be a reasonable mechanism to use on the
web, which will still need such a mechanism. Part of the problem with XML, IMHO,
is that too many people want it to do too many things and the simplicity that
made it attractive is going fast. If it were explicitly turned into a 'its not
just for breakfast anymore' kind of standard, I think that its usefulness in the
web and its simplicity would just get smothered that much faster.
I think its fine for anyone to use it who needs to. But, there has to be some
focus as to what its intended applications are and things shouldn't be done to
it that make it difficult to use in those intended applications.
Just my opinion of course...
----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ann at webgeek.com Wed Sep 8 01:02:19 1999
From: ann at webgeek.com (Ann Navarro)
Date: Mon Jun 7 17:14:44 2004
Subject: why distinctions within XHTML?
In-Reply-To: <872567E5.007CCC3D.00@d53mta03h.boulder.ibm.com>
Message-ID: <199909072304.TAA08656@earth.isni.net>
At 04:43 PM 9/7/99 -0600, roddey@us.ibm.com wrote:
>
>
>
>>Somebody did the basic math in a comment: three variants of XHTML will
>>very quickly add an order of magnitude to the complexity of the systems
>>built with it. That's a deterrent to the use of XHTML, and discards the
>>simplification that's long been at the core of the XML movement.
>>
>
>Does not the 'X' in XHTML pretty much mean that technically there should
be no
>'non-strict' version? I mean if its HTML, its HTML. But, if its XML, then it
>needs to be well formed XML.
Nothing in the transitional and frameset flavors of XHTML 1.0 equate to not
being well formed.
If I correctly understand that to be your concern, it appears to not be an
issue.
Ann
---
Author of Effective Web Design: Master the Essentials
Coming in September --- Mastering XML
Founder, WebGeek Communications http://www.webgeek.com
Vice President-Finance, HTML Writers Guild http://www.hwg.org
Director, HWG Online Education http://www.hwg.org/services/classes
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cowan at locke.ccil.org Wed Sep 8 06:47:39 1999
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun 7 17:14:44 2004
Subject: First draft of proposed XML TC for Unicode 3.0 (unofficial)
In-Reply-To: <37D59BFC.4B37BFFC@pacbell.net> from "David Brownell" at Sep 7, 99 04:13:00 pm
Message-ID: <199909080525.BAA26575@locke.ccil.org>
David Brownell scripsit:
> Are you saying the proposal is that this should be an erratum
> to the "XML 1.0" specification? Or instead that this should
> be "XML 1.1" ?
This is a process question that I am not equipped to answer, though
someday I may be so equipped. :-)
The overall result is that certain documents which were not WF
before become WF; no WF documents become not WF. It is an
upward compatible extension of XML 1.0:1998.
I emphasize that this document has, at present, nothing to do
with the W3C, though I would be pleased if it did someday.
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cowan at locke.ccil.org Wed Sep 8 06:44:49 1999
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun 7 17:14:45 2004
Subject: why distinctions within XHTML?
In-Reply-To: <872567E5.007CCC3D.00@d53mta03h.boulder.ibm.com> from "roddey@us.ibm.com" at Sep 7, 99 04:43:05 pm
Message-ID: <199909080522.BAA26525@locke.ccil.org>
roddey@us.ibm.com scripsit:
> Does not the 'X' in XHTML pretty much mean that technically there should be no
> 'non-strict' version? I mean if its HTML, its HTML. But, if its XML, then it
> needs to be well formed XML. I think that its going a little too far in the
> direction of backwards compatability to do anything else. All parsers out there
> now would reject non-strict HTML as not well formed anyway, right? I'm assuming
> that non-strict (or traditional, or classic or whatever it is :-) means you
> don't need a
for every and so on, right?
No. "Strict" means "guaranteed not to have deprecated element types and
attributes" in the HTML context; it has nothing to do with SGML vs. XML.
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Patrice.Bonhomme at loria.fr Wed Sep 8 10:46:47 1999
From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme)
Date: Mon Jun 7 17:14:45 2004
Subject: [XT] XSL output question
Message-ID: <199909080849.KAA08564@chimay.loria.fr>
I have an XML file that is encoding both with ISO-8859-1 and ISO-8859-2
characters using external entities declaration (ISOlat1.pen and ISOlat2.pen).
1/ What is the value of the 'encoding' attribute within the XML declaration ?
ISO-8859-1 ? ISO-8859-2 ? None ?
2/ How could we generate an HTML file encoding in UTF-8 using an XSL
stylesheet from this XML file ?
We are currently using XT 19990822.
Here is the declaration of the XML file :
%ISOlat1;
%ISOlat2;
]>
A sample of our data :
Slavnost athénských
Thráků
The result with a very simple XSL stylesheet :
Slavnost athénských Thrák?
I am also looking for an XSL/XSLT tutorial compatible with the last XSLT WD.
Thanks for any help,
Pat.
--
==============================================================
bonhomme@loria.fr Tel : 03 83 59 30 52 / 06 11 34 03 85
http://www.loria.fr/~bonhomme Office : B.228
--------------------------------------------------------------
* Serveur Silfide : http://www.loria.fr/projets/Silfide
==============================================================
--
==============================================================
bonhomme@loria.fr Tel : 03 83 59 30 52 / 06 11 34 03 85
http://www.loria.fr/~bonhomme Office : B.228
--------------------------------------------------------------
* Serveur Silfide : http://www.loria.fr/projets/Silfide
==============================================================
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From James.Anderson at mecomnet.de Wed Sep 8 11:42:40 1999
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun 7 17:14:45 2004
Subject: fixing (just) namespaces and validation
References: <199909072223.SAA30088@hesketh.net>
Message-ID: <37D6301C.17055AB7@mecomnet.de>
Simon St.Laurent wrote:
>
> Namespaces and validation are incompatible in cases where the namespace
> prefix changes for whatever reason,
this claim is not universaly true. while there are dtd's for which it holds
(those with ambiguous prefixes are those with declarations with identical
prefixed names and no directly specified namespace binding.), this does not
hold for all dtd's
...
>
> For example: (Yes, I changed the colon to a dash.)
>
>
> When it appeared in the DTD, this would tell a validating processor that
> the prefix xpdl maps to http://www.simonstl.com/ns/1999/xpdl. If the
> processor then encountered elements with a namespace URI of
> http://www.simonstl.com/ns/1999/xpdl, it could validate them on the basis
> of that URI and the local part even if the xpdl prefix had disappeared.
i suspect that this instruction is not necessary. it does work (to wit the
parser which is available with the cl-http server supports an instruction of
this form), but it is not necessary
in those cases where it is possible to include it in order to resolve
otherwise ambiguous names, it would also be possible to modify the prefixes so
that they are not ambiguous.
in cases where the bindings are not ambiguous (see above), it is possible to
infer the necessary bindings and resolve apparent ambiguities. the above
mentioned parser does this for directly specified namespace bindings. it would
also be possible to infer indirectly specified bindings through the respective
content models ...
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From davidc at nag.co.uk Wed Sep 8 12:37:58 1999
From: davidc at nag.co.uk (David Carlisle)
Date: Mon Jun 7 17:14:45 2004
Subject: fixing (just) namespaces and validation
Message-ID: <199909081037.LAA21158@nag.co.uk>
James anderson wrote
> Simon St.Laurent wrote:
> >
> > Namespaces and validation are incompatible in cases where the namespace
> > prefix changes for whatever reason,
>
> this claim is not universaly true. while there are dtd's for which it holds
> (those with ambiguous prefixes are those with declarations with identical
> prefixed names and no directly specified namespace binding.), this does not
> hold for all dtd's
Surely _all_ DTDs do have this problem with the namespace REC.
The documents
and
are fully equivalent according to XML Namespace REC, but it is not
possible to write a DTD such that you can add
and
Well, you could write a DTD that worked for any finite set of prefixes,
just by duplicating all the declarations, but you can not write one that
works in general.
So, to validate with namespaces, you either have to pre-process the
document instance to normalise the prefixes to the prefix used in the
DTD, or you have to invent a new declaration in the DTD that says
`this DTD uses prefix foo: but a documenent instance may use any prefix,
so long as is bound to the namespace "http://here"'
This is (I assume) the intention of the PI in Simon's posting.
David
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 13:04:31 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: fixing (just) namespaces and validation
In-Reply-To: <199909081037.LAA21158@nag.co.uk>
References: <199909081037.LAA21158@nag.co.uk>
Message-ID: <14294.16724.994022.557375@localhost.localdomain>
David Carlisle writes:
> Surely _all_ DTDs do have this problem with the namespace REC.
>
> The documents
>
>
>
>
>
> and
>
>
>
>
> are fully equivalent according to XML Namespace REC.
That's because they're different layers.
Think of RDF; assuming appropriate Namespace declarations, these are
both equivalent in the RDF layer, but not in the XML 1.0 layer:
The Foo thing
and
DTD validation is applied to a very low-level layer of XML processing
(essentially, the DOM/SAX layer); Namespaces is concerned with a
higher layer, and RDF, with a higher layer still.
There is usually a many-to-one relationship as you go up to more
abstract layers: many different sequences of characters can be
interpreted as the same XML document, many different XML documents can
provide the same Namespaces (or Architectural Forms) view, many
different Namespaces views can provide the same RDF view, etc.
The point is that a single sequence of characters cannot represent two
different XML documents, nor can a single XML document represent two
different Namespace views, nor can a single Namepsace view represent
two different RDF views.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From davidc at nag.co.uk Wed Sep 8 14:08:56 1999
From: davidc at nag.co.uk (David Carlisle)
Date: Mon Jun 7 17:14:45 2004
Subject: fixing (just) namespaces and validation
In-Reply-To: message from David Carlisle on 8 Sep 1999 11:37:34 +0100
Message-ID: <199909081208.NAA17410@nag.co.uk>
Me> Surely _all_ DTDs do have this problem with the namespace REC.
David Megginson> That's because they're different layers.
Yes, agreed, it wasn't really a criticism. The fact remains that at the
current time the `problem' is that there is no standard way of getting
from one layer to the other.
That is, if I have a namespace aware application that really doesn't
mind what prefix is used in a document instance, there is no convenient
standard way of supplying a DTD against which a set of documents to be
used with that application may be validated.
Of course it's easy enough to _do_ the validation, you just need to
specify (for example) an XSL transform that normalises the prefix to
match the DTD or just build this knowledge in to an extended XML parser
but there is nothing like the following that works in a cross platform
way
...
David
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 14:29:57 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and validation)
In-Reply-To: <199909081208.NAA17410@nag.co.uk>
References: <199909081208.NAA17410@nag.co.uk>
Message-ID: <14294.21817.614899.758420@localhost.localdomain>
David Carlisle writes:
> Yes, agreed, it wasn't really a criticism. The fact remains that at
> the current time the `problem' is that there is no standard way of
> getting from one layer to the other.
Sure there is -- at least, the Namespaces REC defines pretty clearly
what Namespace declarations and prefixes do.
> That is, if I have a namespace aware application that really
> doesn't mind what prefix is used in a document instance, there is
> no convenient standard way of supplying a DTD against which a set
> of documents to be used with that application may be validated.
But that's not a problem of getting from one layer to another; it's
simply a problem of applying an operation to a layer. Here's one
layered view:
Layer 1: octets
Validate with: (custom code)
Layer 2: Unicode characters
Validate with: (regular expression)
Layer 3: XML
Validate with: DTD
Layer 4: Namespaces
Validate with: (XML Schemas, eventually)
Layer 5: RDF
Validate with: RDF schema
Layer 6: Application
Validate with: (local business rules)
Here's another layered view:
Layer 1: octets
Validate with: (custom code)
Layer 2: Unicode characters
Validate with: (regular expression)
Layer 3: XML
Validate with: DTD
Layer 4: Namespaces
Validate with: (XML Schemas, eventually)
Layer 5: XHTML
Validate with: (built-in XHTML processing rules)
Layer 6: Application
Validate with: (local business rules)
My applications have no problem at all getting from layer 3 to layer 4
in either example, because the path is fairly well defined; it just
happens that there is also a convenient schema formats for applying
structural validation or guided authoring to layer 3, but that's a
separate operation applied to the layer, not part of the layer
itself. Many layers do not have a standard validation technique yet.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From smohr at voicenet.com Wed Sep 8 15:00:11 1999
From: smohr at voicenet.com (Stephen T. Mohr)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and validation)
References: <199909081208.NAA17410@nag.co.uk> <14294.21817.614899.758420@localhost.localdomain>
Message-ID: <009c01bef9fa$7ad0d840$e9d9f2cc@omicron.com>
A key sticking point is layer 4. Until XML Schemas are a recommendation, we
have no standard way to validate XML containing namespace references.
Several proprietary or early implementation ways, yes, but none standard.
It is a shame the namespaces rec went out without going back and revising
the core XML Rec, or going forward and adding Schemas.
----- Original Message -----
From: David Megginson
To:
Sent: Wednesday, 8 September 1999 08:31 AM
Subject: Layers, again (was Re: fixing (just) namespaces and validation)
> David Carlisle writes:
>
> > Yes, agreed, it wasn't really a criticism. The fact remains that at
> > the current time the `problem' is that there is no standard way of
> > getting from one layer to the other.
>
> Sure there is -- at least, the Namespaces REC defines pretty clearly
> what Namespace declarations and prefixes do.
>
> > That is, if I have a namespace aware application that really
> > doesn't mind what prefix is used in a document instance, there is
> > no convenient standard way of supplying a DTD against which a set
> > of documents to be used with that application may be validated.
>
> But that's not a problem of getting from one layer to another; it's
> simply a problem of applying an operation to a layer. Here's one
> layered view:
>
>
> Layer 1: octets
> Validate with: (custom code)
>
> Layer 2: Unicode characters
> Validate with: (regular expression)
>
> Layer 3: XML
> Validate with: DTD
>
> Layer 4: Namespaces
> Validate with: (XML Schemas, eventually)
>
> Layer 5: RDF
> Validate with: RDF schema
>
> Layer 6: Application
> Validate with: (local business rules)
>
>
> Here's another layered view:
>
>
> Layer 1: octets
> Validate with: (custom code)
>
> Layer 2: Unicode characters
> Validate with: (regular expression)
>
> Layer 3: XML
> Validate with: DTD
>
> Layer 4: Namespaces
> Validate with: (XML Schemas, eventually)
>
> Layer 5: XHTML
> Validate with: (built-in XHTML processing rules)
>
> Layer 6: Application
> Validate with: (local business rules)
>
>
> My applications have no problem at all getting from layer 3 to layer 4
> in either example, because the path is fairly well defined; it just
> happens that there is also a convenient schema formats for applying
> structural validation or guided authoring to layer 3, but that's a
> separate operation applied to the layer, not part of the layer
> itself. Many layers do not have a standard validation technique yet.
>
>
> All the best,
>
>
> David
>
> --
> David Megginson david@megginson.com
> http://www.megginson.com/
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 15:42:47 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and validation)
In-Reply-To: <009c01bef9fa$7ad0d840$e9d9f2cc@omicron.com>
References: <199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<009c01bef9fa$7ad0d840$e9d9f2cc@omicron.com>
Message-ID: <14294.26532.335111.817603@localhost.localdomain>
Stephen T. Mohr writes:
> A key sticking point is layer 4. Until XML Schemas are a recommendation, we
> have no standard way to validate XML containing namespace references.
> Several proprietary or early implementation ways, yes, but none standard.
> It is a shame the namespaces rec went out without going back and revising
> the core XML Rec, or going forward and adding Schemas.
That is true -- several of the layers are missing standard validation
mechanisms. That has nothing to do with movement between the layers,
and won't inhibit most kinds of processing, but it is still a useful
thing to have. Note that several of layers in my original chart are
missing standard validation mechanisms.
If you're exchanging mainly serialized objects, you might want to
consider RDF schemas, which are admirably simple.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 17:04:20 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.21817.614899.758420@localhost.localdomain>
References: <199909081208.NAA17410@nag.co.uk>
<199909081208.NAA17410@nag.co.uk>
Message-ID: <4.0.1.19990908091744.01115340@216.27.10.33>
Amazing what proposing to fix namespaces on this list can bring out.
David's presented an astounding set of assumptions that I've never heard
voiced so explicitly before. In many ways, however, I think these
assumptions underlie many of the _problems_ in XML, not the promise of XML.
This set of assumptions also contradicts the quote I began my proposal with
from James Clark's XML Namespaces document
(http://www.jclark.com/xml/xmlns.htm), which complains about missing parts
in DTDs rather than any layering problems. It also doesn't match the Note
Web Architecture: Extensible Languages
(http://www.w3.org/TR/NOTE-webarch-extlang), and only sort of fits the
picture (not so much the text) in the Note RDF Architecture
(http://www.w3.org/TR/NOTE-rdfarch).
At 08:31 AM 9/8/99 -0400, David Megginson wrote:
>David Carlisle writes:
>
> > Yes, agreed, it wasn't really a criticism. The fact remains that at
> > the current time the `problem' is that there is no standard way of
> > getting from one layer to the other.
>
>Sure there is -- at least, the Namespaces REC defines pretty clearly
>what Namespace declarations and prefixes do.
I think you've read considerably more into the Namespaces REC than I as far
as _when_ that 'namespace doing' takes place. While it does discuss the
appearance of qualified names in DTDs and makes certain comments regarding
the non-reliability of attribute defaulting in non-validating parsers, it
doesn't go further. It doesn't specify explicitly that Namespace
processing is performed as a layer between the application and the parser,
or that all parser operation must be completed before namespace processing
begins.
More on this below, as we get into the details of this 'layering'.
> > That is, if I have a namespace aware application that really
> > doesn't mind what prefix is used in a document instance, there is
> > no convenient standard way of supplying a DTD against which a set
> > of documents to be used with that application may be validated.
>
>But that's not a problem of getting from one layer to another; it's
>simply a problem of applying an operation to a layer. Here's one
>layered view:
>
>
>Layer 1: octets
>Validate with: (custom code)
>
>Layer 2: Unicode characters
>Validate with: (regular expression)
>
>Layer 3: XML
>Validate with: DTD
>
>Layer 4: Namespaces
>Validate with: (XML Schemas, eventually)
>
>Layer 5: RDF
>Validate with: RDF schema
>
>Layer 6: Application
>Validate with: (local business rules)
>
>
>Here's another layered view:
>
>
>Layer 1: octets
>Validate with: (custom code)
>
>Layer 2: Unicode characters
>Validate with: (regular expression)
>
>Layer 3: XML
>Validate with: DTD
>
>Layer 4: Namespaces
>Validate with: (XML Schemas, eventually)
>
>Layer 5: XHTML
>Validate with: (built-in XHTML processing rules)
>
>Layer 6: Application
>Validate with: (local business rules)
>
>
>My applications have no problem at all getting from layer 3 to layer 4
>in either example, because the path is fairly well defined; it just
>happens that there is also a convenient schema formats for applying
>structural validation or guided authoring to layer 3, but that's a
>separate operation applied to the layer, not part of the layer
>itself. Many layers do not have a standard validation technique yet.
The problem in both of these examples is that you treat XML itself as
monolithic, and DTD validation as a tool that can only be used at the time
of parsing. As a result, we have multiple levels of checking that have to
be redundant if they're done at all. Check against schemas, DTDs, _and_
RDF? And then throw application rules on top of that? Forget it. These
'layers' are pretty much a guarantee that developers either need to make an
investment in large quantities of aspirin - or pick one tool and stick to it.
If I thought that schemas would be here soon, or that RDF really was the
answer to all of these, I wouldn't be pushing on DTD validation. DTDs do
seem to be a good answer - in the short term for many projects, in the long
term for a subset of projects - to the need for structural checking. It
doesn't seem that ridiculous to want to 'validate' the results of a
transformation (generated via XSL or the DOM) or to want to 'validate' a
document against a DTD structure while taking into account namespaces.
Because XML 1.0 was written so that everything from character checking to
entity replacement to attribute defaulting to structural inspections (DTD
and otherwise) are all performed by one monolithic 'parser', we haven't
been able to describe XML processors with any level of granularity. When I
talk about layers (for instance, in
http://www.simonstl.com/articles/layering/layered.htm), it's layering for
the sake of breaking things into the smallest usable components, not for
the sake of piling on more and more mostly redundant processing. Your
layer 3 is way too thick.
If treat validation as a process with its own life, outside of the Rube
Goldberg machine known as an XML processor, we might be able to solve a lot
of problems that currently look very difficult much more simply.
Namespaces included.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From matthew at praxis.cz Wed Sep 8 17:02:55 1999
From: matthew at praxis.cz (Matthew Gertner)
Date: Mon Jun 7 17:14:45 2004
Subject: What XML-dev can do (was Re: an unfilled need)
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca>
<37D3F74D.AA84B8FF@praxis.cz> <199909071317.JAA05607@hesketh.net>
Message-ID: <37D67B08.2B269E86@praxis.cz>
"Simon St.Laurent" wrote:
> It doesn't seem sensible, however, to be afraid of the W3C. There are a
> lot of projects they don't seem interested in working on, and there are
> even more cases where people need standards (like the namespaces issues Len
> noted above) that go beyond what the W3C is currently providing.
I really like the idea of addressing some of these issues in this forum
in a way complementary to the work done by the W3C, which of course is
extremely valuable. In this context I have a question: how many Java
developers are reading this mail and would be potentially interested in
participating in an open-source project centered around Java/XML?
Basically we are considering releasing some of our internal code as OSS
and it would be great to get a feeling for how many qualified people
would at least consider investing time in such a project. As someone
pointed out when this list started discussing XSchema, the SAX project
was successful because there were commitments from the very start from
application developers to implement the interface when completed. A
concrete implementation of a extensible architecture for XML processing
(Im being intentionally vague) would presumably be a great help in
testing new ideas and even creating usable code, be if for alternative
schema approaches, namespace-aware validation or whatever. But we only
want to do this if there is sufficient potential interest in the
developer community.
If anyone wants more details, please send me a private mail and I will
send you a short summary of our current system and our future plans for
it.
Thanks in advance,
Matthew
PS: My new Czech keyboard does not appear to have an apostrophe. Sorry.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cowan at locke.ccil.org Wed Sep 8 17:19:40 1999
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun 7 17:14:45 2004
Subject: First draft of proposed XML TC for Unicode 3.0 (unofficial)
In-Reply-To: <37D5ECB2.9794EBD5@jclark.com> from "James Clark" at Sep 8, 99 11:57:23 am
Message-ID: <199909081557.LAA02460@locke.ccil.org>
James Clark scripsit:
> > 0E33;THAI CHARACTER SARA AM
>
> That's very strange. SARA AM certainly needs to be allowed in names.
It's a funny story. In earlier versions of Unicode, SARA AM was treated
as canonically equivalent to NIKHAHIT followed by SARA AA; that is,
Unicode-conformant processes were not supposed to distinguish between
them. In the latest version, this equivalence has been downgraded to
a mere compatibility equivalence. As a result, SARA AM has become
a "compatibility character" and as such disallowed by the Appendix B rules.
> > 0EB3;LAO VOWEL SIGN AM
The same story applies here: the VOWEL SIGN AM is now a compatibility
equivalent of NIGGAHITA followed by VOWEL SIGN AA.
In any event, whatever XML worked before has to work now, so I am merely
proposing a statement that THAI CHARACTER SARA AM and LAO VOWEL SIGN
AM *are* legal in XML names, despite their status as Unicode compatibility
characters. In any case their legality in XML *text* is of course not affected.
If anyone thinks something is desperately broken here, please contact
unicode@unicode.org right away.
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at ito.tu-darmstadt.de Wed Sep 8 17:32:07 1999
From: rbourret at ito.tu-darmstadt.de (Ronald Bourret)
Date: Mon Jun 7 17:14:45 2004
Subject: ANN: XML and Databases article
Message-ID: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de>
Steven R. Newcomb wrote:
> I learned some interesting things from this article, for which I'm
> grateful. Recommended reading!
Thanks!
> I was disappointed that the grove paradigm
> (http://www.prescod.net/groves/shorttut/) was never mentioned, nor was
> GroveMinder (http://www.techno.com). The grove paradigm offers an
> ISO-defined meta-DOM to all notations, including XML.
I've read Paul's tutorial and the GroveMinder summary on the Web, so let's
see if I've got this straight. A grove is basically a property set, broken
down into classes, each of which has properties. There are probably
relationships between those classes. For example, a grove for XML could
have classes for elements, attributes, entities, and so on, where the
element class points to the attribute class. A grove for a relational
database would have classes for tables, columns, etc., where the table
class points to the column class.
(In this sense, the XML information set has much in common with groves, as
it is a property set. Similarly, the DOM could be viewed as an API for a
grove. The XML information set is not a grove because [why? The only
reason I can think of is that it is not been expressed in grove notation].
The DOM is not an API for a grove because it's a bit wishy-washy in places
-- for example, four characters of PCDATA could be one node or four, so
it's not built on a rigid enough data model.)
The nice thing about groves is that all groves, regardless of what they are
built on, have certain commonalities, such as addressability, so you can
perform certain common functions with them.
GroveMinder is generic grove middleware. It has plug-ins, called Minders (I
think of them as drivers), that can build groves over different property
sets. For example, there is one Minder for SGML/XML documents and a
different Minder for relational databases. (There can actually be different
property sets for a "type" of data. For example, one property set for XML
might include entities and another might not, specifying that each entity
is replaced by its value. A different Minder is needed for each property
set.)
One thing GroveMinder can do is store a grove in its own database. (Note
that this is separate from the database addressed by the relational
database Minder -- it has a structure designed to store groves.) Thus,
GroveMinder can store an XML document in a database as a grove and is what
I, in my article, called a content management systems. That is, it can
store and retrieve an XML document as a document.
Some questions:
1) Is it possible to combine groves of different types? For example, can I
take a grove representing a table in a relational database and stuff it
into a grove for an XML document, for example as the content of an element?
If so, does the table grove retain its table-ness, or is it converted to
one or more XML elements? Both cases seem reasonable, although the latter
would presumably require a special converter. If the latter case is true,
then GroveMinder might also fit what I call data transfer middleware,
depending on how the conversion is done.
2) Are groves themselves relevant at a high level in a discussion of XML
and databases? It strikes me that, like SAX and the DOM, they are a useful
tool in implementing software that stores/retrieves XML documents (or data
from those documents) in a database but are not directly relevant to the
discussion itself. Instead, they are most relevant to the user in that they
are likely to weigh heavily in the feature set exposed by a content
management system or (possibly) data transfer system.
3) This isn't directly related to XML/databases, but what other common
functionality do all groves have? For example, can I write an application
that navigates groves, regardless of their source (I think the answer is
yes)? Can I combine groves of different types or convert painlessly -- that
is, without writing any additional code -- from one type to another (I
think the answer is no -- additional code is needed)? Can I hyperlink from
one grove to another (I think the answer is yes)? And so on.
-- Ron
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 19:01:01 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <4.0.1.19990908091744.01115340@216.27.10.33>
References: <199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<4.0.1.19990908091744.01115340@216.27.10.33>
Message-ID: <14294.36560.725917.191918@localhost.localdomain>
Simon St.Laurent writes:
> At 08:31 AM 9/8/99 -0400, David Megginson wrote:
> >David Carlisle writes:
> >
> > > Yes, agreed, it wasn't really a criticism. The fact remains
> > > that at the current time the `problem' is that there is no
> > > standard way of getting from one layer to the other.
> >
> >Sure there is -- at least, the Namespaces REC defines pretty clearly
> >what Namespace declarations and prefixes do.
>
> I think you've read considerably more into the Namespaces REC than
> I as far as _when_ that 'namespace doing' takes place.
Oops! Note that I didn't say "what Namespaces do" -- that's
application-specific. I referred only to the parts of XML 1.0 syntax
that act as links to the Namespace layer (Namespace declarations and
prefixes). So, for
"what Namespace declarations and prefixes do"
try reading
"how expanded names are constructed from Namespace declarations and
prefixes".
In other words, given any XML 1.0 document, the XML Namespaces REC
precisely and unambiguously specifies how to determine the expanded
name of every element and attribute in the document. Thats *all* that
you need to move from the XML 1.0 layer to the Namespaces layer during
processing.
> While it does discuss the appearance of qualified names in DTDs and
> makes certain comments regarding the non-reliability of attribute
> defaulting in non-validating parsers, it doesn't go further. It
> doesn't specify explicitly that Namespace processing is performed
> as a layer between the application and the parser, or that all
> parser operation must be completed before namespace processing
> begins.
But there's no reason that the parser has to finish processing the XML
1.0 layer before it starts evaluating the Namespace layer -- that's a
matter of physical implementation, and the layers belong to a logical
model (jumping layers is always wrong in logical models, but it often
makes sense in implementations: note how many routers know about HTTP,
even though they're technically dealing with the IP layer). In the
SAX Namespaces filter for DATAX, for example, the Namespace processing
is done on the fly.
More seriously, neither XML nor Namespaces currently has a
standardized processing model of any sort, and I'm still trying to
decide whether I think they should. The data models and APIs that we
have specify only what should be available *after* processing; they
don't say how we get there.
> The problem in both of these examples is that you treat XML itself
> as monolithic, and DTD validation as a tool that can only be used
> at the time of parsing. As a result, we have multiple levels of
> checking that have to be redundant if they're done at all. Check
> against schemas, DTDs, _and_ RDF? And then throw application rules
> on top of that? Forget it.
No, you have the choice of applying validation only at the levels that
are important to you: if you're producing XHTML for display in legacy
browsers, then you might need to validate the character layer with
regular expressions to ensure that empty-element tags always have a
space before the closing delimiter (
, not
).
You can apply validation to *any* layer of processing, from the raw
bytes to the final application. Choosing where to validate is an
architectural decision, not a standards one. It just happens that
some of the layers do have shared specs that make validation easier
(regular expressions, DTDs, and RDF schemas), while others do not,
yet.
> These 'layers' are pretty much a guarantee that developers either
> need to make an investment in large quantities of aspirin - or pick
> one tool and stick to it.
No, that's wrong -- layered approaches like these have proven
themselves over and over (the Internet is only the most famous
example). Do you consider it redundant that both TCP and HTTP perform
different kinds of validity checks?
> If I thought that schemas would be here soon, or that RDF really
> was the answer to all of these, I wouldn't be pushing on DTD
> validation.
But you need all of these and more. Any higher-level layer will have
its own constraints that cannot (and should not) be expressed in a
generic XML structural schema language: read the TEI spec (for
example) to see how complex these constraints can be.
DTDs do two things very well -- they let you validate the surface
structure of an XML 1.0 document, and they provide production rules to
help with the creation of XML 1.0 documents. They could be extended
to do lots of other kinds of things, but I hardly see the point (ISO
8859-1 could have been extended to include markup, for example, but it
would have been a bad idea).
> DTDs do seem to be a good answer - in the short term for many
> projects, in the long term for a subset of projects - to the need
> for structural checking. It doesn't seem that ridiculous to want
> to 'validate' the results of a transformation (generated via XSL or
> the DOM) or to want to 'validate' a document against a DTD
> structure while taking into account namespaces.
Not at all, but you need something other than DTDs to do that, at
least for now.
> Because XML 1.0 was written so that everything from character
> checking to entity replacement to attribute defaulting to
> structural inspections (DTD and otherwise) are all performed by one
> monolithic 'parser', we haven't been able to describe XML
> processors with any level of granularity. When I talk about layers
> (for instance, in
> http://www.simonstl.com/articles/layering/layered.htm), it's
> layering for the sake of breaking things into the smallest usable
> components, not for the sake of piling on more and more mostly
> redundant processing. Your layer 3 is way too thick.
All of the layers are way too thick -- that's the joy of a high-level
logical model. I can break any one of them down into dozens of
smaller layers; in fact, the application layer will often be much more
complicated than the parser layer, simply because useful applications
generally do complicated things.
> If treat validation as a process with its own life, outside of the
> Rube Goldberg machine known as an XML processor, we might be able
> to solve a lot of problems that currently look very difficult much
> more simply. Namespaces included.
It's wonderful to see that we end up agreeing. Validation is a much
bigger problem than DTDS -- it's best to think of DTDs as a small
bonus (you can perform some types of structural validation on the XML
layer right now basically for free) rather than a liability, and to
think of the greater validation problem as still unsolved in the
general case. That's not because the XML Schema committee members are
stupid or obstructionist, but simply because validation in the general
case is a *very* hard problem.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Dapeng.Wang at Dresdner-Bank.com Wed Sep 8 19:01:50 1999
From: Dapeng.Wang at Dresdner-Bank.com (Wang, Dapeng)
Date: Mon Jun 7 17:14:45 2004
Subject: Is it possible to overwrite global variables?
Message-ID:
Hi,
Sometimes I feel that I need to overwrite the value of a globally declared
variable. Is that possible? I think it is very pratical to save some
tipping. I just want to give the same variable different values depending on
some condition, so that I can evaluate it later.
e.g.
It doesn't work with xt. Nor do I find anything in the spec.
Any idea?
Thanx.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From James.Anderson at mecomnet.de Wed Sep 8 19:04:14 1999
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun 7 17:14:45 2004
Subject: fixing (just) namespaces and validation
References: <199909081037.LAA21158@nag.co.uk>
Message-ID: <37D69791.652DF82A@mecomnet.de>
David Carlisle wrote:
>
> I wrote
> > Simon St.Laurent wrote:
> > >
> > > Namespaces and validation are incompatible in cases where the namespace
> > > prefix changes for whatever reason,
> >
> > this claim is not universaly true. while there are dtd's for which it holds
> > (those with ambiguous prefixes are those with declarations with identical
> > prefixed names and no directly specified namespace binding.), this does not
> > hold for all dtd's
>
> Surely _all_ DTDs do have this problem with the namespace REC.
We disagree. It is not the dtd?s themselves which "have the problem". A dtd
can actually encode sufficient information to identify intended universal
names. It is the parsers which cause the problem by ignoring the information provided.
>
> The [illustrated equivalent documents]
>
can be validated against a dtd equivalent to
iff the parser provides adequate support for name management. for example,
where it maps the names "foo:x", "bar:x" and "zap:x" all to the same symbol
before the validation process/application ever has access to them.
The parser which i noted in my previous message recognizes PI's of the form
as instructions to bind, within a given scope, the respective prefixes to
packages which are in addition bound statically to the identifier provided as
pseudo-attribute value. this works. no, it is not necessary. The bindings
implicit in attribute declarations are also be sufficient to enable the same
interning process - given the appropriate scoping rules. the name stem is then
interned into the respective package. the application works with the interned symbols.
> Well, you could write a DTD that worked for any finite set of prefixes,
> just by duplicating all the declarations, but you can not write one that
> works in general.
Please see above.
>
> So, to validate with namespaces, you either have to pre-process the
> document instance to normalise the prefixes to the prefix used in the
> DTD,
No, the processing happens in the course of parsing. There is no need for an
additional pass. No, it is better to map them all to a symbol associated with
the universal name. The prefixes are superfluous.
The application (and the validation process) works with these symbols only. It
never cares what the prefixes were. The "rewriting" happens on the fly as the
document is parsed. It is similar to the approach outlined by Mr Bray endless
months ago.
> or you have to invent a new declaration in the DTD that says
> `this DTD uses prefix foo: but a documenent instance may use any prefix,
> so long as is bound to the namespace "http://here"'
> This is (I assume) the intention of the PI in Simon's posting.
>
As i noted in the prior message, the pi is sufficient to support the process.
it is, however, not always necessary. For some dtds, the attribute
declarations suffice. I suspect that, with additional rules to specify their
scope, the attribute declarations would be as expressive as the proposed pi.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 19:13:23 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: ANN: XML and Databases article
In-Reply-To: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de>
References: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de>
Message-ID: <14294.39104.190422.759370@localhost.localdomain>
Ronald Bourret writes:
> (In this sense, the XML information set has much in common with groves, as
> it is a property set. Similarly, the DOM could be viewed as an API for a
> grove. The XML information set is not a grove because [why? The only
> reason I can think of is that it is not been expressed in grove
> notation].
I'm not aware of anything that would prevent the Infoset from being
described in Grove notation. There aren't many people alive who
actually know Groves (we couldn't all fit in a Cessna, but we probably
could squeeze into a Dash-8 with a few empty seats), so it had no real
familiarity advantage.
By the way, for a database-like XML thingy, see
http://www.megginson.com/DATAX/
The DATAX interfaces can be used on top of an RDBM as easily as they
can on top of the default in-memory data structures.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 19:21:24 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.36560.725917.191918@localhost.localdomain>
References: <199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<4.0.1.19990908091744.01115340@216.27.10.33>
<14294.36560.725917.191918@localhost.localdomain>
Message-ID: <14294.39695.624934.80200@localhost.localdomain>
David Megginson writes:
> DTDs do two things very well -- they let you validate the surface
> structure of an XML 1.0 document, and they provide production rules to
> help with the creation of XML 1.0 documents. They could be extended
> to do lots of other kinds of things, but I hardly see the point (ISO
> 8859-1 could have been extended to include markup, for example, but it
> would have been a bad idea).
Note that they do two additional things rather more poorly: they
specify the physical makeup of the document, and they provide default
values for attributes.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 20:08:38 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.36560.725917.191918@localhost.localdomain>
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<4.0.1.19990908091744.01115340@216.27.10.33>
Message-ID: <199909081810.OAA05438@hesketh.net>
This continues to be interesting, but I'm afraid you're convincing me more
and more that your layered approach is grotesque, inflexible, and liable to
tilt, rather than a good way to spare developers aspirin.
At 01:02 PM 9/8/99 -0400, David Megginson wrote:
>Oops! Note that I didn't say "what Namespaces do" -- that's
>application-specific. I referred only to the parts of XML 1.0 syntax
>that act as links to the Namespace layer (Namespace declarations and
>prefixes). So, for
>
> "what Namespace declarations and prefixes do"
>
>try reading
>
> "how expanded names are constructed from Namespace declarations and
> prefixes".
>
>In other words, given any XML 1.0 document, the XML Namespaces REC
>precisely and unambiguously specifies how to determine the expanded
>name of every element and attribute in the document. Thats *all* that
>you need to move from the XML 1.0 layer to the Namespaces layer during
>processing.
Again, you're assuming that everyone wants to treat Namespaces as a layer
that happens _after_ XML parsing (incl validation) is complete. I don't
want to move from XML 1.0 to Namespaces - I want to address Namespaces
within the context of XML 1.0 validation. There is nothing in either spec
that requires that they be treated as separate layers, by my reading. On
the other hand, breaking XML 1.0 into intelligible layers and integrating
Namespaces as a layer within that structure seems like a worthy project.
>But there's no reason that the parser has to finish processing the XML
>1.0 layer before it starts evaluating the Namespace layer -- that's a
>matter of physical implementation, and the layers belong to a logical
>model (jumping layers is always wrong in logical models, but it often
>makes sense in implementations: note how many routers know about HTTP,
>even though they're technically dealing with the IP layer). In the
>SAX Namespaces filter for DATAX, for example, the Namespace processing
>is done on the fly.
See above. The question is not whether I have to finish processing the
entire document with the parser before I begin namespace processing - it's
whether I can integrate namespace processing with XML 1.0 processing. Where
does the layer belong? I don't find Namespaces compelling as a 'logical
layer' of their own - rather, I see their existence in a separate document
as a historical accident.
>More seriously, neither XML nor Namespaces currently has a
>standardized processing model of any sort, and I'm still trying to
>decide whether I think they should. The data models and APIs that we
>have specify only what should be available *after* processing; they
>don't say how we get there.
Precisely. And they don't say what layer goes where, either. You read the
current situation as opening one set of possibilities, and I see it opening
a very different set of possibilities.
> > The problem in both of these examples is that you treat XML itself
> > as monolithic, and DTD validation as a tool that can only be used
> > at the time of parsing. As a result, we have multiple levels of
> > checking that have to be redundant if they're done at all. Check
> > against schemas, DTDs, _and_ RDF? And then throw application rules
> > on top of that? Forget it.
>
>No, you have the choice of applying validation only at the levels that
>are important to you: if you're producing XHTML for display in legacy
>browsers, then you might need to validate the character layer with
>regular expressions to ensure that empty-element tags always have a
>space before the closing delimiter (
, not
).
No, you don't have the choice of applying _validation_ at whatever level
you like - currently, you have the choice of applying it or not applying it
during the parser. You're applying the word validation in a much more
general sense here, ignoring the fact that DTD-based validation, which is
capable of addressing a significant range of problems _today_, is locked in
a box with the rest of XML 1.0 processing.
>You can apply validation to *any* layer of processing, from the raw
>bytes to the final application. Choosing where to validate is an
>architectural decision, not a standards one. It just happens that
>some of the layers do have shared specs that make validation easier
>(regular expressions, DTDs, and RDF schemas), while others do not,
>yet.
Again, you're using validation to mean anything you want here. Choosing
where and when to validate is as much a decision about which tools are
available as it is about the logical model you like and which I find so
illogical.
> > These 'layers' are pretty much a guarantee that developers either
> > need to make an investment in large quantities of aspirin - or pick
> > one tool and stick to it.
>
>No, that's wrong -- layered approaches like these have proven
>themselves over and over (the Internet is only the most famous
>example). Do you consider it redundant that both TCP and HTTP perform
>different kinds of validity checks?
TCP and HTTP do well with their different checks. (And UDP is available
for those who like to cut down redundancy.) But DTDs perform a subset of
the validity checks of schemas, while RDF schemas provide an overlapping
set. That sort of redundancy I find merely redundant. Your layers aren't
performing tasks which are different enough to justify calling them
separate 'logical' tasks.
> > If I thought that schemas would be here soon, or that RDF really
> > was the answer to all of these, I wouldn't be pushing on DTD
> > validation.
>
>But you need all of these and more. Any higher-level layer will have
>its own constraints that cannot (and should not) be expressed in a
>generic XML structural schema language: read the TEI spec (for
>example) to see how complex these constraints can be.
I may need all of these, someday, for certain types of projects. I do not
believe that I will need to use DTDs, schemas, and RDF all on the same
processing run of a document. I may need to be able to convert among them
for different processors, but stacking all of them (DTDs and schemas in
particular) seems foolish at best.
James Clark's note on the problems of DTDs and their effective uselessness
is much more believable to me than a claim that I'll need to use both forms
in some kind of layered processing. (Given that the whole thing will fold
at validation if I change a prefix anyway, I can't imagine why I'd bother.)
>DTDs do two things very well -- they let you validate the surface
>structure of an XML 1.0 document, and they provide production rules to
>help with the creation of XML 1.0 documents. They could be extended
>to do lots of other kinds of things, but I hardly see the point (ISO
>8859-1 could have been extended to include markup, for example, but it
>would have been a bad idea).
I don't think we're talking about an enormous extension ala schemas - we're
talking about integrating validation with qualified names. It's not an
enormous leap.
> > DTDs do seem to be a good answer - in the short term for many
> > projects, in the long term for a subset of projects - to the need
> > for structural checking. It doesn't seem that ridiculous to want
> > to 'validate' the results of a transformation (generated via XSL or
> > the DOM) or to want to 'validate' a document against a DTD
> > structure while taking into account namespaces.
>
>Not at all, but you need something other than DTDs to do that, at
>least for now.
Why? Because XML 1.0 was written as a monolithic spec and the XML
Namespaces rec didn't feel it was worth the time? This is not a difficult
problem to address, solving real needs now. We don't have _anything_ to do
that with now.
> > Because XML 1.0 was written so that everything from character
> > checking to entity replacement to attribute defaulting to
> > structural inspections (DTD and otherwise) are all performed by one
> > monolithic 'parser', we haven't been able to describe XML
> > processors with any level of granularity. When I talk about layers
> > (for instance, in
> > http://www.simonstl.com/articles/layering/layered.htm), it's
> > layering for the sake of breaking things into the smallest usable
> > components, not for the sake of piling on more and more mostly
> > redundant processing. Your layer 3 is way too thick.
>
>All of the layers are way too thick -- that's the joy of a high-level
>logical model. I can break any one of them down into dozens of
>smaller layers; in fact, the application layer will often be much more
>complicated than the parser layer, simply because useful applications
>generally do complicated things.
Then maybe we'd better take a closer look at the layers you propose -
piling many thick layers on top of each other doesn't sound like a very
good recipe.
> > If treat validation as a process with its own life, outside of the
> > Rube Goldberg machine known as an XML processor, we might be able
> > to solve a lot of problems that currently look very difficult much
> > more simply. Namespaces included.
>
>It's wonderful to see that we end up agreeing. Validation is a much
>bigger problem than DTDS -- it's best to think of DTDs as a small
>bonus (you can perform some types of structural validation on the XML
>layer right now basically for free) rather than a liability, and to
>think of the greater validation problem as still unsolved in the
>general case. That's not because the XML Schema committee members are
>stupid or obstructionist, but simply because validation in the general
>case is a *very* hard problem.
Validation in general is a very large problem, and I'm not trying to solve
it all at once. I'm proposing that we fix some tools we have today so that
they work better with other tools we have today.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Wed Sep 8 21:01:27 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:45 2004
Subject: Layers, again (was Re: fixing (just) namespaces and validation)
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<4.0.1.19990908091744.01115340@216.27.10.33> <199909081810.OAA05438@hesketh.net>
Message-ID: <37D6B308.AB465539@pacbell.net>
"Simon St.Laurent" wrote:
>
> This continues to be interesting, but I'm afraid you're convincing me more
> and more that your layered approach is grotesque, inflexible, and liable to
> tilt, rather than a good way to spare developers aspirin.
I find that puzzling. Layering is well known to be an essential
tool for managing complex systems. Heck, the proof is right in
front of me on my desk ... ;-)
> Again, you're assuming that everyone wants to treat Namespaces as a layer
> that happens _after_ XML parsing (incl validation) is complete. I don't
> want to move from XML 1.0 to Namespaces - I want to address Namespaces
> within the context of XML 1.0 validation.
But XML 1.0 doesn't include namespaces, so this can't be done. It's
a definitional thing. A revision of XML could do it, not that I'd
encourage such a thing right now. But it wouldn't be "XML 1.0" at all.
Some folk may be using validation in some other sense than the XML 1.0
specification defines it. If so, it'd help things to find some other
words or phrases to describe those processs -- at least apply adjectives
to disambiguate the various distinct notions.
> I don't find Namespaces compelling as a 'logical layer"
> lof their own - rather, I see their existence in a separate document
> as a historical accident.
Be that as it me, the _fact_ is that it's a separate and decoupled
document. If you hard-wire namespace conformance, you're no longer
conforming to the XML 1.0 spec, and vice versa. They're separate.
The only way to have them work in the same system is to use layers.
> > > Because XML 1.0 was written so that everything from character
> > > checking to entity replacement to attribute defaulting to
> > > structural inspections (DTD and otherwise) are all performed by one
> > > monolithic 'parser', we haven't been able to describe XML
> > > processors with any level of granularity. When I talk about layers
> > > (for instance, in
> > > http://www.simonstl.com/articles/layering/layered.htm), it's
> > > layering for the sake of breaking things into the smallest usable
> > > components, not for the sake of piling on more and more mostly
> > > redundant processing. Your layer 3 is way too thick.
Simon, what purpose is served by breaking things down that finely?
The flip side of recognizing layering as a useful tool is to know
that successful layers need to have a type of consistency that's
hard to create without careful integration. Even if it's not the
way I'd have integrated those components, XML 1.0 has that sort of
integration. Dis-integrating it shows no clear benefit to me; it
seems like reinventing the wheel or fire or something.
> >All of the layers are way too thick -- that's the joy of a high-level
> >logical model. I can break any one of them down into dozens of
> >smaller layers; in fact, the application layer will often be much more
> >complicated than the parser layer, simply because useful applications
> >generally do complicated things.
>
> Then maybe we'd better take a closer look at the layers you propose -
> piling many thick layers on top of each other doesn't sound like a very
> good recipe.
Hmm, that wasn't quite how I was taking it. While I'd admit that
many of the technology layers W3C is talking about seem rather thick
to me (== barriers to entry), what I heard David suggest was a way to
view those layers such that they didn't clash. (A role W3C should be
taking, but such architectural communication doesn't appear to be a
strong point over there -- sadly.)
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cowan at locke.ccil.org Wed Sep 8 21:01:58 1999
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun 7 17:14:46 2004
Subject: Test, please ignore
Message-ID: <199909081939.PAA10907@locke.ccil.org>
Test, please ignore.
--
John Cowan cowan@ccil.org
I am a member of a civilization. --David Brin
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 21:47:29 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <37D6B308.AB465539@pacbell.net>
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081810.OAA05438@hesketh.net>
Message-ID: <199909081949.PAA09724@hesketh.net>
At 12:03 PM 9/8/99 -0700, David Brownell wrote:
>I find that puzzling. Layering is well known to be an essential
>tool for managing complex systems. Heck, the proof is right in
>front of me on my desk ... ;-)
The problem isn't the use of layering itself - it's the choices about what
kinds of layers are used and what 'logically' qualifies as a layer.
>> Again, you're assuming that everyone wants to treat Namespaces as a layer
>> that happens _after_ XML parsing (incl validation) is complete. I don't
>> want to move from XML 1.0 to Namespaces - I want to address Namespaces
>> within the context of XML 1.0 validation.
>
>But XML 1.0 doesn't include namespaces, so this can't be done. It's
>a definitional thing. A revision of XML could do it, not that I'd
>encourage such a thing right now. But it wouldn't be "XML 1.0" at all.
I'm not especially concerned about it being "XML 1.0" at all. Yes, I'll
happily acknowledge that integrating namespaces and XML 1.0 in a strong
sense would require rewriting, probably resulting in XML 1.1 - which no one
seems interested in doing.
Proposing it in a weaker sense might encourage consistency among those who
need and use it, however. It doesn't seem like decoupling DTD processing
from XML 1.0 is especially difficult, and that integrating that with
namespaces in a better way (in fact, any way) than exists now would be useful.
>Some folk may be using validation in some other sense than the XML 1.0
>specification defines it. If so, it'd help things to find some other
>words or phrases to describe those processs -- at least apply adjectives
>to disambiguate the various distinct notions.
Agreed. We need better vocabulary. After a few years in the language
lawyering world of XML 1.0, 'validation' to me means something that happens
in XML 1.0 with DTDs. If I need to describe schema or RDF applications, I
tend to say 'schema validation' or 'validation against a schema', for
instance.
>> I don't find Namespaces compelling as a 'logical layer"
>> lof their own - rather, I see their existence in a separate document
>> as a historical accident.
>
>Be that as it me, the _fact_ is that it's a separate and decoupled
>document. If you hard-wire namespace conformance, you're no longer
>conforming to the XML 1.0 spec, and vice versa. They're separate.
>The only way to have them work in the same system is to use layers.
In other words, we're stuck, should wait for schemas, and should
discontinue use of DTDs in systems where namespaces may be important? (And
I mean namespaces, not namespace prefixes.)
I suspect that there's a sizable number of developers who might like more
options than that.
>> > > Because XML 1.0 was written so that everything from character
>> > > checking to entity replacement to attribute defaulting to
>> > > structural inspections (DTD and otherwise) are all performed by one
>> > > monolithic 'parser', we haven't been able to describe XML
>> > > processors with any level of granularity. When I talk about layers
>> > > (for instance, in
>> > > http://www.simonstl.com/articles/layering/layered.htm), it's
>> > > layering for the sake of breaking things into the smallest usable
>> > > components, not for the sake of piling on more and more mostly
>> > > redundant processing. Your layer 3 is way too thick.
>
>Simon, what purpose is served by breaking things down that finely?
Lots of purposes. You can reuse modules easily, integrate new technologies
(like namespaces) into existing structures, and take modules that were
originally built for use in parsing and apply them to results from other
XML operations (like XSL transforms, DOM work, etc.).
>The flip side of recognizing layering as a useful tool is to know
>that successful layers need to have a type of consistency that's
>hard to create without careful integration. Even if it's not the
>way I'd have integrated those components, XML 1.0 has that sort of
>integration. Dis-integrating it shows no clear benefit to me; it
>seems like reinventing the wheel or fire or something.
If reinventing the wheel would let me reuse the wheels on other devices,
I'd be happy to do it. Right now, XML 1.0 provides you with axles, wheels,
and tires, all glued together so that you can't take the wheels off and put
them on another car without yanking the axle off.
Integration is critically important. A key part of integration, though, is
clarifying the parts and how they fit together - and come apart.
>> Then maybe we'd better take a closer look at the layers you propose -
>> piling many thick layers on top of each other doesn't sound like a very
>> good recipe.
>
>Hmm, that wasn't quite how I was taking it. While I'd admit that
>many of the technology layers W3C is talking about seem rather thick
>to me (== barriers to entry), what I heard David suggest was a way to
>view those layers such that they didn't clash.
I agree with David (Megginson) that the layers shouldn't clash. However, I
don't think his model avoids clashes successfully (notably DTD validation
and namespaces), and I consider the thickness issue important. If you're
willing to break down some of those layers, I think you'll see lots of room
for improvement.
>(A role W3C should be
>>taking, but such architectural communication doesn't appear to be a
>>strong point over there -- sadly.)
Agreed. There are a few documents out there that might be guideposts (like
the Notes I mentioned earlier today), but they don't provide a clear
picture of where any of this is going. There seem to a large number of
very different perspectives. Hopefully those different perspectives will
play off each other and give us some useful tools.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From BMurri at wavephore.net Wed Sep 8 21:46:58 1999
From: BMurri at wavephore.net (Blair Murri)
Date: Mon Jun 7 17:14:46 2004
Subject: Namespace for HTML (Signup Here)
Message-ID: <31AA4BE99284D211B47A006008172E20016EA6@MAILMAN>
I vote 2 (or 4 if needed)
Blair L. Murri
Sr. Programmer/etc.
WavePhore, Inc.
> -----Original Message-----
> From: Don Park [SMTP:donpark@docuverse.com]
> Sent: Monday, September 06, 1999 4:02 PM
> To: xml-dev@ic.ac.uk
> Subject: Namespace for HTML (Signup Here)
>
> Both Tim Bray and David Megginson proposed we settle on the Namespace URI
> for HTML. I think this is a clear enough and small enough task for us
> (XML-DEV) to accomplish here and now.
>
> Here are some candidates:
>
> 1) "http://www.w3.org/Markup/" - by David Megginson
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
>
> Lets just settle on one and start using it. If W3C balks, we can go with:
>
> 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
>
> Signup by replying with one of the numbered options.
>
> If you are against this proposal, select this:
>
> 0) "hell://no.stinking.namespace/4/HTML"
>
> Best,
>
> Don Park - mailto:donpark@docuverse.com
> Docuverse - http://www.docuverse.com
>
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
> CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
> message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19990908/ba417c2b/attachment.htm
From david at megginson.com Wed Sep 8 21:54:31 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <199909081810.OAA05438@hesketh.net>
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<199909081810.OAA05438@hesketh.net>
Message-ID: <14294.47066.957120.565534@localhost.localdomain>
Simon St.Laurent writes:
> Again, you're assuming that everyone wants to treat Namespaces as a
> layer that happens _after_ XML parsing (incl validation) is
> complete. I don't want to move from XML 1.0 to Namespaces - I want
> to address Namespaces within the context of XML 1.0 validation.
In other words, you want to deal exclusively with the Namespaces
layer: I think that's a good idea for most applications, and it's
certainly necessary for XSL and RDF; it just happens that there's not
a standard schema for validating that layer yet (because it's so
new). Personally, I would not mind deprecating the XML 1.0 layer so
that we can remove it in a few years.
Some applications, however, *do* care about the XML 1.0 layer
(i.e. they care about what specific prefix you use) -- again, XHTML
for legacy browsers is a very good example -- and DTDs are well suited
for that work. See further, below.
My point is simply that it's not fair to complain that DTDs don't work
with the Namespaces layer, any more than it's fair to complain that
your screwdriver handle gets dented when you hit nails with it.
> See above. The question is not whether I have to finish processing
> the entire document with the parser before I begin namespace
> processing - it's whether I can integrate namespace processing with
> XML 1.0 processing. Where does the layer belong? I don't find
> Namespaces compelling as a 'logical layer' of their own - rather, I
> see their existence in a separate document as a historical
> accident.
It is an accident, partly, but that's the way that technology
develops. We have to deal with the fact that there is much XML
software that doesn't know about Namespaces, and it will likely be
deployed for a long time. That means that some applications will care
about the XML 1.0 level for the foreseeable future. As a result,
there are de facto two separate layers.
Ideally, the number of applications that care will diminish over
time.
[snip]
> No, you don't have the choice of applying _validation_ at whatever
> level you like - currently, you have the choice of applying it or
> not applying it during the parser. You're applying the word
> validation in a much more general sense here, ignoring the fact
> that DTD-based validation, which is capable of addressing a
> significant range of problems _today_, is locked in a box with the
> rest of XML 1.0 processing.
Validation is a big problem in software design, and even in the SGML
world DTD validation covered at best a tiny subset of it. It might be
useful to distinguish the following terms to avoid confusion:
validation: the act of ensuring that data conform to a set of known
rules.
XML validation: validation where the data are all or part of an XML
document.
DTD validation: XML validation using a DTD, as defined in the XML 1.0
specification.
The first term applies to any layer in any system that exchanges
information; most programmers who deal with validation problems have
never even heard of XML. An RDF schema (for example) provides
validation of the RDF data model, not of the XML markup (as a proof,
it can be applied after all of the original markup distinctions have
been removed). A Java interface, to give another example, is a schema
that the parser uses to validate a Java implementation.
The second, more specialised term applies to any kind of validation
performed on an XML document *as XML* -- it is broad enough to embrace
both the XML 1.0 layer and the Namespaces layer. The XML schema
effort is aimed at providing a general mechanism for XML validation,
but not at providing DTD validation.
The third term applies to the specific set of rules and constraints in
the XML 1.0 recommendation, where the target is an XML document and
the rules are expressed in a DTD. DTD validation applies only to the
XML 1.0 layer.
Most validation in software systems is done by custom code; in some
cases, there are higher level constructs (like DTDs or BNF) that can
help.
> I may need all of these, someday, for certain types of projects. I do not
> believe that I will need to use DTDs, schemas, and RDF all on the same
> processing run of a document.
I hope that you won't, but it's not hard to think of situations where
you would.
> Why? Because XML 1.0 was written as a monolithic spec and the XML
> Namespaces rec didn't feel it was worth the time? This is not a difficult
> problem to address, solving real needs now. We don't have _anything_ to do
> that with now.
It's not a question of being worth the time (God knows how many
person-hours we put into debating and designing Namespaces REC, but it
must be at least 10 person hours per word) -- it's a question of not
breaking XML 1.0. We deliberately haven't fiddled with XML during the
last year and a half -- it's been stable so that the companies
investing hundreds of millions of R&D dollars into XML don't see their
software become obsolete two weeks after (or before) release.
This approach wins us a lot of confidence in the corporate world, and
it's one of the biggest secrets of XML's success: every document
containing Namespaces can still be processed by an XML 1.0 processor
that knows nothing about Namespaces; processors that do know about
Namespaces can do additional kinds of value-added processing.
[snip]
> >All of the layers are way too thick -- that's the joy of a
> >high-level logical model. I can break any one of them down into
> >dozens of smaller layers; in fact, the application layer will
> >often be much more complicated than the parser layer, simply
> >because useful applications generally do complicated things.
>
> Then maybe we'd better take a closer look at the layers you propose -
> piling many thick layers on top of each other doesn't sound like a very
> good recipe.
I think that you've misunderstood. Each thick layer is an abstraction
of many thinner layers -- that's the way that high-level models work
(just as each folder in a file system may contain many other folders,
etc.). Remember that a model is an abstracted explanation of a system
design, not the system itself.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 22:06:45 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <199909081949.PAA09724@hesketh.net>
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<199909081810.OAA05438@hesketh.net>
<37D6B308.AB465539@pacbell.net>
<199909081949.PAA09724@hesketh.net>
Message-ID: <14294.49584.847567.906696@localhost.localdomain>
Simon St.Laurent writes:
> In other words, we're stuck, should wait for schemas, and should
> discontinue use of DTDs in systems where namespaces may be
> important? (And I mean namespaces, not namespace prefixes.)
No -- it shouldn't matter whether Namespaces are important or not.
You should discontinue using DTDs where you don't care about the
specific prefixes used, since you're not buying yourself anything by
using them when you don't care about the XML 1.0 layer.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 22:24:06 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.47066.957120.565534@localhost.localdomain>
References: <199909081810.OAA05438@hesketh.net>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<199909081810.OAA05438@hesketh.net>
Message-ID: <199909082026.QAA11768@hesketh.net>
In many ways, I agree with David's posting, and he goes well into providing
the more precise vocabulary that David Brownell and I both seem to agree
that any discussion of 'validation' needs. I'm addressing only the points
here where we disagree - few, but critical - in the hopes of keeping this
post briefer than its predecessors have been.
At 03:55 PM 9/8/99 -0400, David Megginson wrote:
>Simon St.Laurent writes:
> > I don't want to move from XML 1.0 to Namespaces - I want
> > to address Namespaces within the context of XML 1.0 validation.
>
>In other words, you want to deal exclusively with the Namespaces
>layer:
No. I want to take a very well understood technology that's in the XML 1.0
layer and apply it to the results of the Namespaces layer. Adding
qualified names processing to XML DTD validation is not very difficult (as
james anderson has pointed out several times on another thread). This
requires taking a layer out of XML 1.0 - though it was not described as a
separate layer in that document - and applying it to the namespace layer.
> I think that's a good idea for most applications, and it's
>certainly necessary for XSL and RDF; it just happens that there's not
>a standard schema for validating that layer yet (because it's so
>new). Personally, I would not mind deprecating the XML 1.0 layer so
>that we can remove it in a few years.
Personally, I would not mind using the technology we have today so that we
can validate those results - heck, any output that purports to be XML - today.
>My point is simply that it's not fair to complain that DTDs don't work
>with the Namespaces layer, any more than it's fair to complain that
>your screwdriver handle gets dented when you hit nails with it.
If there _were_ tools of any kind for processing the Namespaces layer, I
would be more inclined to agree with this analogy. As it stands, I'd
prefer a screwdriver with a blunt piece of metal on the end of the handle
to nothing, thanks.
> > I don't find
> > Namespaces compelling as a 'logical layer' of their own - rather, I
> > see their existence in a separate document as a historical
> > accident.
>
>It is an accident, partly, but that's the way that technology
>develops. We have to deal with the fact that there is much XML
>software that doesn't know about Namespaces, and it will likely be
>deployed for a long time. That means that some applications will care
>about the XML 1.0 level for the foreseeable future. As a result,
>there are de facto two separate layers.
There were warnings in the XML 1.0 spec about the use of colons and their
reserved nature. I don't think this situation qualifies for two 'separate
layers' or that the lack of standardized processing for namespace
validation of some sort is justified by the existence of XML 1.0. At
worst, this is a situation that should be left up to parser and application
developers - it certainly causes no more harm than optional retrieval of
external resources.
>I think that you've misunderstood. Each thick layer is an abstraction
>of many thinner layers -- that's the way that high-level models work
>(just as each folder in a file system may contain many other folders,
>etc.). Remember that a model is an abstracted explanation of a system
>design, not the system itself.
I think you've forgotten the cost of encapsulating smaller parts into
larger layers - the small parts are no longer mobile, and can't be
redeployed to other parts of the model. And because models affect system
design...
And a few minutes later, David writes:
>Simon St.Laurent writes:
>
>> In other words, we're stuck, should wait for schemas, and should
>> discontinue use of DTDs in systems where namespaces may be
>> important? (And I mean namespaces, not namespace prefixes.)
>
>No -- it shouldn't matter whether Namespaces are important or not.
>You should discontinue using DTDs where you don't care about the
>specific prefixes used, since you're not buying yourself anything by
>using them when you don't care about the XML 1.0 layer.
Except of course, that you might well buy yourself a screwdriver with a
blunt piece of metal of the end that you could use to validate documents
with namespaces _today_.
I don't buy the lines you draw between the layers - I don't find them
logical. If you don't like DTDs and don't want to see their use extended,
fine. I find your refusal to consider them a useful tool outside of their
original context extraordinarily puzzling.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Wed Sep 8 22:28:05 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:46 2004
Subject: list administration (was: Re: Layers, again (was Re: fixing (just)
namespaces andvalidation)
References: <852567E6.006E8C38.00@Venus.GlobalServeCorp.com>
Message-ID: <37D6C733.A089AEDB@pacbell.net>
Mike_D_Hall/Cincinnati/GlobalServe@GlobalServeCorp.com wrote:
>
> Can someone please help me get removed from this mailing list.
Every message you receive has a footer that says how to get
off this particular mailing list. Follow those directions.
There is not a snowball's chance in a hot place that ** I ** can
do anything about this. The same is true for all but one or two
members of this list. So please don't send such requests either
individuals not identified in the message footer, or to the list
as a whole. (You did both.)
> Someone must have applied my email for them, cause I've tried to remove myself
> from this email and my others and it's not working.
> I'm sorry to bother, but I'd really appreciate any help (My boss is mad that I
> get 100 emails daily!).
Hmm, I suggest that you use filtering tools such as the one
built into Netscape, or "procmail". They're essential tools
on the internet; they have been for most of the decade. Only
one hundered emails a day? What a pleasure that must be!
- Dave
> Thanks,
> mikey
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 22:44:58 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:46 2004
Subject: Someone's or something is filtering my message!
In-Reply-To: <14294.47066.957120.565534@localhost.localdomain>
References: <4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<199909081810.OAA05438@hesketh.net>
<14294.47066.957120.565534@localhost.localdomain>
Message-ID: <14294.51650.225619.594268@localhost.localdomain>
RGF2aWQgTWVnZ2luc29uIHdyaXRlczoNCg0KID4gSXQncyBub3QgYSBxdWVzdGlvbiBvZiBi
ZWluZyB3b3J0aCB0aGUgdGltZSAoR29kIGtub3dzIGhvdyBtYW55DQogPiBwZXJzb24taG91
cnMgd2UgcHV0IGludG8gZGViYXRpbmcgYW5kIGRlAAAAAGluZyBOYW1lc3BhY2VzIFJFQywg
YnV0IGl0DQogPiBtdXN0IGJlIGF0IGxlYXN0IDEwIHBlcnNvbiBob3VycyBwZXIgd29yZCkg
LS0gDQoNCkRlcGVuZGluZyBvbiB5b3VyIG1haWwgcmVhZGVyLCB5b3UgbWF5IG9yIG1heSBu
b3Qgc2VlIGZvdXIgbnVscyAoXkApDQppbiB0aGUgYWJvdmUgZXhjZXJwdCBpbiB0aGUgd29y
ZCBwcmVjZWRpbmcgIk5hbWVzcGFjZXMgUkVDIi4gIEhlcmUncw0Kd2hhdCBvcmlnaW5hbGx5
IHdlbnQgb3V0Og0KDQogIEl0J3Mgbm90IGEgcXVlc3Rpb24gb2YgYmVpbmcgd29ydGggdGhl
IHRpbWUgKEdvZCBrbm93cyBob3cgbWFueQ0KICBwZXJzb24taG91cnMgd2UgcHV0IGludG8g
ZGViYXRpbmcgYW5kIGRlc2lnbmluZyBOYW1lc3BhY2VzIFJFQywgYnV0IGl0DQogIG11c3Qg
YmUgYXQgbGVhc3QgMTAgcGVyc29uIGhvdXJzIHBlciB3b3JkKSAtLQ0KDQpIZXJlJ3MgaG93
IGl0IGxvb2tzIG9uIG15IHNjcmVlbiBub3c6DQoNCiAgSXQncyBub3QgYSBxdWVzdGlvbiBv
ZiBiZWluZyB3b3J0aCB0aGUgdGltZSAoR29kIGtub3dzIGhvdyBtYW55DQogIHBlcnNvbi1o
b3VycyB3ZSBwdXQgaW50byBkZWJhdGluZyBhbmQgZGVeQF5AXkBeQGluZyBOYW1lc3BhY2Vz
IFJFQywgYnV0IGl0DQogIG11c3QgYmUgYXQgbGVhc3QgMTAgcGVyc29uIGhvdXJzIHBlciB3
b3JkKSAtLQ0KDQpJIGxlZnQgb3V0IHRoZSAndGhlJyBteXNlbGYsIGJ1dCBzb21laG93IGJl
dHdlZW4gd2hlbiB0aGUgbWVzc2FnZSBsZWZ0IA0KbXkgY29tcHV0ZXIgYW5kIGl0IGFycml2
ZWQgYmFjayBmcm9tIHRoZSBzZXJ2ZXIsIHRoZSBzdHJpbmcgJ3NpZ24nIHdhcyANCnJlcGxh
Y2VkIHdpdGggZm91ciBudWxzIGluICJkZXNpZ25pbmciLiAgSSBoYWQgdGhlIHNhbWUgcHJv
YmxlbSB3aXRoIGEgDQpkaWZmZXJlbnQgc3RyaW5nIGluIGFub3RoZXIgcG9zdGluZywgYW5k
IEkgbm90aWNlZCB0aGUgcHJvYmxlbSBpbiBhDQpwb3N0aW5nIGZyb20gU3RldmUgTmV3Y29t
YiBhcyB3ZWxsLg0KDQpJdCB3b3VsZCBiZSB1c2VmdWwgdG8ga25vdyBpZiBhbnlvbmUgZWxz
ZSBpcyBzZWVpbmcgdGhpcywgc28gdGhhdCBJDQpjYW4gc3RhcnQgZmlndXJpbmcgb3V0IHdo
ZXJlIHRoaXMgaXMgaGFwcGVuaW5nIChpcyBpdCBoYXBwZW5pbmcgb24gdGhlDQp3YXkgb3V0
IG9yIHRoZSB3YXkgYmFjaz8pLiAgSSBkb24ndCBrbm93IGhvdyBhbnkgZmFtaWx5IGZpbHRl
ciBjb3VsZA0KbWlzdGFrZSAnc2lnbicgZm9yIGEgbmF1Z2h0eSB3b3JkLCBidXQgdGhleSBh
cmUgcHJldHR5IHN0dXBpZCBwaWVjZXMNCm9mIHNvZnR3YXJlLg0KDQoNCkFsbCB0aGUgYmVz
dCwNCg0KDQpEYXZpZA0KDQotLSANCkRhdmlkIE1lZ2dpbnNvbiAgICAgICAgICAgICAgICAg
ZGF2aWRAbWVnZ2luc29uLmNvbQ0KICAgICAgICAgICBodHRwOi8vd3d3Lm1lZ2dpbnNvbi5j
b20v
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 22:54:00 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <199909082026.QAA11768@hesketh.net>
References: <199909081810.OAA05438@hesketh.net>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<14294.47066.957120.565534@localhost.localdomain>
<199909082026.QAA11768@hesketh.net>
Message-ID: <14294.52037.477683.565382@localhost.localdomain>
Simon St.Laurent writes:
> No. I want to take a very well understood technology that's in the
> XML 1.0 layer and apply it to the results of the Namespaces layer.
> Adding qualified names processing to XML DTD validation is not very
> difficult (as james anderson has pointed out several times on
> another thread).
It's fairly simple to add it to the spec, but very hard to retrofit
deployed software to use it. It is not a good idea (or good
publicity) to have two different XML dialects floating around the Web
at the same time (see further, below).
> If there _were_ tools of any kind for processing the Namespaces
> layer, I would be more inclined to agree with this analogy. As it
> stands, I'd prefer a screwdriver with a blunt piece of metal on the
> end of the handle to nothing, thanks.
Does DDML handle Namespaces? If so, then there's your spec. There's
no law that says you have to wait for the W3C to bless something, and
DDML has the (good) characteristic of not breaking existing XML tools,
a characteristic missing from any modification of DTD syntax.
> There were warnings in the XML 1.0 spec about the use of colons and
> their reserved nature.
[snip]
There was no warning, however, that DTD syntax might change.
> I don't buy the lines you draw between the layers - I don't find them
> logical. If you don't like DTDs and don't want to see their use extended,
> fine. I find your refusal to consider them a useful tool outside of their
> original context extraordinarily puzzling.
See above. We talked about pre-release pragmatism during the XHTML
discussion -- keep everything small, simple, and modular -- but we
also have to consider post-release pragmatism -- don't screw around
with released specs until you have a very good,
world-is-about-to-blow-up reason. This is more of a
my-car-radio-is-stuck-on-AM reason.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Wed Sep 8 23:13:54 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.52037.477683.565382@localhost.localdomain>
References: <199909082026.QAA11768@hesketh.net>
<199909081810.OAA05438@hesketh.net>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<14294.47066.957120.565534@localhost.localdomain>
<199909082026.QAA11768@hesketh.net>
Message-ID: <199909082116.RAA14362@hesketh.net>
At 04:55 PM 9/8/99 -0400, David Megginson wrote:
>Does DDML handle Namespaces? If so, then there's your spec. There's
>no law that says you have to wait for the W3C to bless something, and
>DDML has the (good) characteristic of not breaking existing XML tools,
>a characteristic missing from any modification of DTD syntax.
Sure. But does DDML have any implementations? No. Perhaps that's my own
fault, but it seems a lot easier to add a PI to DTDs, which doesn't break
anything, than to write a DDML implementation from scratch.
>There was no warning, however, that DTD syntax might change.
I think you're overestimating the syntax changes being described. A PI
would be ignored by parsers that didn't understand it, and james anderson's
suggestion of interpreting attribute declarations makes even less impact.
(I don't think that one will necessarily work, however.)
>See above. We talked about pre-release pragmatism during the XHTML
>discussion -- keep everything small, simple, and modular -- but we
>also have to consider post-release pragmatism -- don't screw around
>with released specs until you have a very good,
>world-is-about-to-blow-up reason. This is more of a
>my-car-radio-is-stuck-on-AM reason.
Have you listened to AM radio in Central New York? Never mind that question.
I don't think what I'm proposing is inconsistent with the values you are
espousing. If I felt it was going to take a drastic reconfiguration of DTD
syntax, with implications for all those corporations in search of
stability, I wouldn't be bothering. Instead, it seems like I'm applying
the small, simple, and modular approach to a problem that has irritated a
lot of folks for a long time. It's taking something useful from one
situation and applying it to another - reuse in a different context, with a
little added information.
And, as always, nothing I post has any official standing anyway, right?
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Wed Sep 8 23:28:24 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <199909082116.RAA14362@hesketh.net>
References: <199909082026.QAA11768@hesketh.net>
<199909081810.OAA05438@hesketh.net>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<14294.47066.957120.565534@localhost.localdomain>
<14294.52037.477683.565382@localhost.localdomain>
<199909082116.RAA14362@hesketh.net>
Message-ID: <14294.54422.707751.653558@localhost.localdomain>
Simon St.Laurent writes:
> >There was no warning, however, that DTD syntax might change.
>
> I think you're overestimating the syntax changes being described.
> A PI would be ignored by parsers that didn't understand it, and
> james anderson's suggestion of interpreting attribute declarations
> makes even less impact. (I don't think that one will necessarily
> work, however.)
My fault: I didn't realise that your proposed changes used only PIs.
I'm not sure that I see how the documents could still be valid from
the XML 1.0 perspective, though, and if not, the new documents would
still be backwards incompatible with some (many?) existing XML 1.0
implementations.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From srn at techno.com Thu Sep 9 01:40:12 1999
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun 7 17:14:46 2004
Subject: ANN: XML and Databases article
In-Reply-To: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de> (message from
Ronald Bourret on Wed, 8 Sep 1999 17:35:58 +0200)
References: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de>
Message-ID: <199909082341.SAA02016@bruno.techno.com>
[Ron Bourret:]
> I've read Paul's tutorial and the GroveMinder summary on the Web, so
> let's see if I've got this straight. A grove is basically a
> property set, broken down into classes, each of which has
> properties. There are probably relationships between those
> classes. For example, a grove for XML could have classes for
> elements, attributes, entities, and so on, where the element class
> points to the attribute class. A grove for a relational database
> would have classes for tables, columns, etc., where the table class
> points to the column class.
Pretty close, but not quite on the money. First of all, a
terminological problem: A grove is the set of objects that results
from understanding (parsing and processing) some particular logical
resource. No grove is made from more than one logical resource (I say
"logical" resource because some single resources are distributed in
multiple physical containers). However, more than one grove can be
made from a single resource. This is because resources have multiple
layers. For example, in the case of XML documents, there is always
the XML syntax layer of "understanding". The property set (schema)
for this layer is probably strongly reminiscent of the DOM. However,
there are one or more vocabularies used in every XML document (there's
always at least one because the element types have names, even if
there's no DTD). The semantics of these vocabularies may imply
"emergent properties" of the information contained in the resource,
and there can be a property set for each vocabulary's emergent
properties. So preparing a single resource for application-internal
exploitation may involve creating groves for each vocabulary. By
giving names to the emergent properties of vocabularies, such property
sets can be, in effect, APIs to the semantics of each vocabulary, thus
opening the way for vocabulary-specific software engines, and for far
more reliable cross-application information interchange than the Web
has ever seen.
So, instead of saying,
> A grove for XML could have classes for elements, attributes,
> entities, and so on, where the element class points to the attribute
> class.
... you might better have said any one of the following (this has to
be said with extreme precision, so look closely):
| A property set for the XML language could have classes for elements,
| attributes, entity references, and so on, where the element class
| has, as one of its nodal properties an "attribute specification
| list" property, whose value is a list of "attribute value
| specification" nodes.
or:
| The primary grove form of an XML resource could have nodes
| conforming to the "element", "attribute specification", and "entity"
| classes, and so on, where the "element" class has, as one of its
| properties, an "attribute specification list" property, whose value
| consists of a list of nodes that all must be of the class "attribute
| value specification".
or, in view of the fact that the DTD of an XML resource is part of
its grove (when it appears or is referenced by the DOCTYPE
declaration in an XML resource):
| The primary grove from of an XML resource could have element type
| definitions, attribute list definitions, entity declarations, and so
| on, where the element type definition class has, as one of its nodal
| properties, an "attribute definition list" property, whose value
| consists of a list of nodes that all must be of the class "attribute
| definition".
The second problem with your summary statement is that "points to" is
actually an implementation detail. The standard only says that nodes
(objects) in groves have properties, and the some properties can be
"nodal" -- that is, the values of such properties can be other nodes
(in the same grove and/or in other groves). The manner in which a
node is represented to be a property value in any given implementation
is almost certainly going to be via pointing (at least in a von
Neumann architecture machine), but it's important to realize that that
is an implementation decision, and it's inaccurate to say that
"pointing" has anything to do with the grove paradigm. A property set
can only say that the value of a property is nodal, and
implementations of the grove paradigm must make it appear that the
value of such a property is indeed one or more nodes, but how that is
made to happen is not part of the standard (nor should it be).
So, instead of saying:
> "where the table class points to the column class"
... it would be much more accurate to say:
| where the "table" class has a property named "columns" whose value
| is a list of "column"-class nodes.
> In this sense, the XML information set has much in common with
> groves, as it is a property set.
Yes, except that it's not yet clear that the XML info set will be
expressed using the ISO Property Set DTD -- but this is merely a
syntax issue. I agree with David Megginson: I expect it to be readily
convertible.
> Similarly, the DOM could be viewed as an API for a grove.
Yes, to a single kind of grove, specifically an XML syntactic grove.
(A grove governed by the properties of XML's syntax.)
(Aside: I hope we're not facing a future in which the semantics of
certain chosen vocabularies will be directly supported by future
versions of the DOM. Such support should "plug into" (and be
unpluggable from) the DOM. No vocabulary-specific support should
become a required feature of all DOM implementations. For example,
making XLink a vocabulary is fine; making the DOM able to support
XLink but no other linking vocabularies would be the start of a long
nightmare with a bad ending. To do that would significantly reduce
the freedom of industries to design their own information
architectures, and to evolve them according to their own perceived
needs. It would also destroy the DOM, which must stay simple in order
to survive. No API can do everything for everybody, and once you
start putting support for DTD-specific (or namespace-specific)
semantics into the DOM, where do you stop? I've watched a couple of
systems bloat uncontrollably and meet their demise in similar ways,
and the stage is perfectly set for the same thing to happen to the
DOM.)
> The XML information set is not a grove because ... it is not
> ... expressed in grove notation.
If you replace the word "grove" with "property set" (twice) in the
above sentence, you are exactly correct. (There is no such thing as
"grove notation". "Grove" is an abstract concept that, when sensibly
implemented, makes a grove exactly as human readable as a hex dump of
RAM in which there are C structs in no particular order.)
> The DOM is not an API for a grove because it's a bit wishy-washy in
> places -- for example, four characters of PCDATA could be one node
> or four, so it's not built on a rigid enough data model.)
Close enough. I would put the same thought differently: The DOM
doesn't have a formalized underlying data model, so the DOM doesn't
answer the need for a solid basis on which to express the addresses of
the components of XML resources. I'm hoping and believing that after
the XML infoset is done we'll have a basis for implementing a powerful
version of XPath (or XPointers or whatever the idea of generalized
addressing of components of XML resources is being called at that
time).
> The nice thing about groves is that all groves, regardless of what
> they are built on, have certain commonalities, such as
> addressability, so you can perform certain common functions with
> them.
Right. All nodes in groves have the same "object model" (I'm using
this term in a more formal, scientific sense than the term is used in
the phrase "Document Object Model (DOM)".) The grove object model is:
Groves have nodes, nodes conform to classes, and classes have named
properties with value constraints. Nodes have named properties, and
values for those properties. That's about it; the rest is detail.
(It's pretty interesting detail.)
> GroveMinder is generic grove middleware. It has plug-ins, called
> Minders (I think of them as drivers),
Hooray, thank you! I have sometimes called them "notation drivers"
only to get the blankest stares imaginable. (I then have asked
something lame, like, "Do you know what a device driver is, and why we
have them?") But you obviously get the point of Minders: Minders
represent plug and play support for individual notations, in a system
that makes all content look alike (i.e., conform to the grove object
model).
> that can build groves over different property sets. For example,
> there is one Minder for SGML/XML documents and a different Minder
> for relational databases.
Well, actually, there's probably a one-to-one correspondence between
property sets and database schemas. In order to address information
in terms of its structure, you have to know the structure. In
grove-land, the structure is defined by a property set. Different
databases have different structures, normally expressed as database
schemas. Making a database look like a grove is very straightforward.
The bulk of the work is translating the schema into a property set
(which is, after all, a kind of schema). There's a bit of coding
involved, too, but the GroveMinder developer kit has tools that make
this amazingly easy. (At least the Lockheed-Martin people were
amazed, and they said so publicly at XML '98.)
The grove paradigm breaks down the distinction between documents
(resources) and databases. Everything, in its addressable form, is a
grove, and a grove is a database. But a grove is convertible into an
interchangeable resource (that is, if the property set is a
comprehensive expression of the syntactic features of the notation of
an interchangeable resource). Obviously, a resource is also
convertible into a grove, given a property set for its notation.
Property sets are the bridge between the world of information
interchange, and the world in which interchanged information is
immediately useful (i.e., the world in which information exists after
parsing and common semantic processing of interchangeable resources
has been done). If the resource is *already* a database, there's
probably no parsing or processing involved. All that needs to be done
is to put a translating layer over it that makes the database look
like a grove. Then, the database and all its contents are fully able
to participate in the wider world of interchangeable information
resources: they can be linked, re-used by reference, have any kind of
metadata associated with them, etc. etc.
> (There can actually be different property sets for a "type" of
> data. For example, one property set for XML might include entities
> and another might not, specifying that each entity is replaced by
> its value. A different Minder is needed for each property set.)
Strictly speaking, you're correct: people can disagree about the
properties of, say, PostScript as a notation, or they might agree
about the properties but not about what the names of the properties
should be. Nothing prevents people from writing their own property
sets. In fact, however, the situation is not as chaotic as your
example might lead one to believe, because of "grove plans". A "grove
plan" is a way of selectively deleting properties from classes, and of
deleting classes altogether, as a way of avoiding the overhead of
storing and/or processing those properties and classes. For example,
the property set for SGML is comprehensive, but an application may not
need, for example, to store nonsignificant white spaces found in the
start tags of SGML elements. The application may therefore use a
"grove plan" to delete the properties whose values would be those
white space characters.
The addresses of nodes in groves are always expressed with respect to
a property set and a grove plan. If it were not so, you wouldn't know
whether to count a certain node type or not, when counting nodes to
get to a particular node. And it's true that, for example, some
people want to count the text that was inserted via an entity
reference as a distinct node, while other people don't; this kind of
flexibility is needed in order to keep peace in the family, and allow
people to do addressing in the way they want to do it.
Property sets are modularizable, so that it's relatively easy to
express commonplace grove plans, to establish conformance levels for
processing systems, and to understand the rules for interpreting
address expressions.
A Minder that implements a property set comprehensively can optionally
view groves less comprehensively, so as to be able to resolve
addresses that were expressed according to lesser grove plans. There
doesn't have to be a different Minder for each different grove plan.
(And that's where your example might be misleading.)
> One thing GroveMinder can do is store a grove in its own
> database. (Note that this is separate from the database addressed by
> the relational database Minder -- it has a structure designed to
> store groves.) Thus, GroveMinder can store an XML document in a
> database as a grove and is what I, in my article, called a content
> management systems. That is, it can store and retrieve an XML
> document as a document.
Sounds right to me. ("...its own database" sounds a bit odd because
GroveMinder can use any ODBMS for grove storage.)
> Some questions:
> 1) Is it possible to combine groves of different types? For example,
> can I take a grove representing a table in a relational database and
> stuff it into a grove for an XML document, for example as the
> content of an element?
I'm afraid I don't grasp the intent of this question. When such an
XML document is exported from its grove as an XML document, what
should the document look like?
There's no need (and no way) to stuff something into something else.
It is only necessary that the "content" property of the element have,
as its value, the node in the database grove that represents the
table. The ISO standard SGML Property Set does not allow this; only
certain classes of nodes within the same grove are allowed as the
value of the "content" property of "element" nodes. However, if you
want to change your operative SGML Property Set so that this will be
permitted, nothing (other than good sense) prevents you from doing it;
the grove paradigm will readily support you in your madness.
I don't know why it would be sensible to regard an RDBMS table as the
content of an SGML or XML element. The normal meaning of "content" is
elements, character data, and/or other SGML constructs, right there,
inside the element. There is no way to write a general purpose
grove-to-SGML converter unless the classes of the nodes that can
appear in element content are limited and known. (We certainly don't
want to dump arbitrary data into the content of an element; this would
invite a situation in which the document that is ultimately exported
is unparsable.)
> If so, does the table grove retain its table-ness, or is it
> converted to one or more XML elements? Both cases seem reasonable,
> although the latter would presumably require a special converter. If
> the latter case is true, then GroveMinder might also fit what I call
> data transfer middleware, depending on how the conversion is done.
I would suggest that an efficient way to handle this would be to
convert the table into node classes that *are* permitted to appear in
element content, and then make *those* nodes the value of the content
property. If you do it this way, you're necessarily making the
decisions that must be made about how the XML document, when exported,
will reflect the table data.
You're right that one application of GroveMinder is data transfer
middleware. The conversion program is comparatively easy to write,
since everything already conforms to the same object model.
> 2) Are groves themselves relevant at a high level in a discussion of
> XML and databases? It strikes me that, like SAX and the DOM, they
> are a useful tool in implementing software that stores/retrieves XML
> documents (or data from those documents) in a database but are not
> directly relevant to the discussion itself. Instead, they are most
> relevant to the user in that they are likely to weigh heavily in the
> feature set exposed by a content management system or (possibly)
> data transfer system.
Good question. I guess that's for the person who's doing the
discussing to decide. Since groves can be persistent (e.g., stored in
databases), and since XML resources can become groves, it seems to me
that groves are relevant. You're right, the real reason they're
interesting is their impact on feature sets. But aren't feature sets
(and especially tradeoffs between feature sets) what technical
discussions are all about?
> 3) This isn't directly related to XML/databases, but what other
> common functionality do all groves have? For example, can I write an
> application that navigates groves, regardless of their source (I
> think the answer is yes)?
Yes. We have a demonstration of that.
> Can I combine groves of different types or convert painlessly --
> that is, without writing any additional code -- from one type to
> another (I think the answer is no -- additional code is needed)?
Probably no, but it really depends on what you mean by "code." You
have to decide how instances of nodes of particular classes and in
particular contexts will be mapped onto instances of nodes of
particular classes in the new context, and you have to express your
decisions in a formal, machine processable fashion. Right now, using
GroveMinder, you can do that with a Python script, which seems about
as quick, intuitive, and flexible a way to do it as any. I don't know
of any transformation specification language with which a similar feat
(transforming one kind of grove into another kind of grove) can be
done, except possibly DSSSL (which relies on (and was written in terms
of) the grove paradigm, by the way). We haven't implemented DSSSL,
but it shouldn't be too hard to do that on top of GroveMinder. Would
you call a DSSSL transformation specification "code"? (I guess I
would.)
> Can I hyperlink from one grove to another (I think the answer is
> yes)?
Yes. The interesting thing here is that traversal can be initiated
from any node in any grove, on account of a link in any grove, and
traversal can be made to any node in any grove. Neither the traversal
initiation point, nor the traversal target, has to be a linking
construct. Neither has to "know" anything about the fact that they
are actually anchors.
> And so on.
I'll provide you with a copy of the GroveMinder demo, if you like.
There are lots of playful possibilities. Some people have even
written their own HyTime documents to use with the demo software.
It's a challenge for puzzle lovers, because the demo does not report
errors in documents.
-Steve
--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com http://www.techno.com ftp.techno.com
voice: +1 972 231 4098
fax +1 972 994 0087
pager (150 characters max): srn-page@techno.com
3615 Tanner Lane
Richardson, Texas 75082-2618 USA
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From srn at techno.com Thu Sep 9 01:39:25 1999
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun 7 17:14:46 2004
Subject: ANN: XML and Databases article
In-Reply-To: <14294.39104.190422.759370@localhost.localdomain> (message from
David Megginson on Wed, 8 Sep 1999 13:14:35 -0400 (EDT))
References: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de> <14294.39104.190422.759370@localhost.localdomain>
Message-ID: <199909082020.PAA02004@bruno.techno.com>
[David Megginson:]
> There aren't many people alive who actually know Groves (we couldn't
> all fit in a Cessna, but we probably could squeeze into a Dash-8
> with a few empty seats), so it had no real familiarity advantage.
Groves are going to turn out to be like Linux, which began with a very
few people who had a vision that turned out to work. As was the case
with Linux in those early days, there is nobody doing big media
advertising about it, and even the trade press, whose income is
derived from such advertising, hasn't heard of groves very much. That
will change. Linux has risen on the strength of the idea that people
can and should be in direct control of their operating system, and
that the result of such control will be increased human productivity.
Similarly, groves will rise on the strength of the idea that people
should be in direct control of their information. The
product-differentiation barriers that vendors have set up around their
customers' data must come down. There is no information that
civilization can afford to leave out of the mainstream of information
processing. XML is a step toward this goal, but it requires that the
data be converted into XML; it will never happen that all data will be
stored (or even interchanged) as XML. The grove paradigm brings the
barriers down without necessitating data conversion. The grove
paradigm lets the markup be elsewhere than inside the data.
Even though groves are the technical foundation of the SGML, DSSSL,
and HyTime international standards (respectively the proud,
heavier-duty forerunners of XML, XSL, and XLink, among other W3C
Recommendations), there is no money for groves precisely because
system vendors have *less than no reason* to popularize this dangerous
idea. As with Linux, however, that is the very reason why the grove
paradigm will become commonplace: it will wring massive inefficiencies
out of the software systems marketplace, and out of software systems.
As everybody who attended Metastructures in Montreal last month knows,
people who are into solving tough real-world information management
problems, like DataChannel / ISOGEN, are selling and developing the
grove paradigm as a core strategy, because they know that there is
nothing else out there that compares to the power it brings to solving
tough business problems, both technically and politically. Other
system vendors cannot ignore this situation forever. It won't be too
long before groves are a mass-market phenomenon (even if they're not
called "groves" by then). The opportunities are almost unbelievably
large.
But don't worry, David: if you don't provide a property set for XML as
part of the work of the XML infoset group, we'll take what you do
produce and turn it into a property set. That way, it'll be machine
processable as just another notation, by engine software that is just
another plug-in to the wider world that includes all other notations,
and all other database schemas. That will be a good thing, and we
can't let a little matter of syntax stand in the way of progress.
-Steve
--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com http://www.techno.com ftp.techno.com
voice: +1 972 231 4098
fax +1 972 994 0087
pager (150 characters max): srn-page@techno.com
3615 Tanner Lane
Richardson, Texas 75082-2618 USA
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Thu Sep 9 01:59:45 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:46 2004
Subject: Layers, again (was Re: fixing (just) namespaces and
validation)
In-Reply-To: <14294.54422.707751.653558@localhost.localdomain>
References: <199909082116.RAA14362@hesketh.net>
<199909082026.QAA11768@hesketh.net>
<199909081810.OAA05438@hesketh.net>
<4.0.1.19990908091744.01115340@216.27.10.33>
<199909081208.NAA17410@nag.co.uk>
<14294.21817.614899.758420@localhost.localdomain>
<14294.36560.725917.191918@localhost.localdomain>
<14294.47066.957120.565534@localhost.localdomain>
<14294.52037.477683.565382@localhost.localdomain>
<199909082116.RAA14362@hesketh.net>
Message-ID: <199909090002.UAA20519@hesketh.net>
At 05:30 PM 9/8/99 -0400, David Megginson wrote:
>> >There was no warning, however, that DTD syntax might change.
>>
>> I think you're overestimating the syntax changes being described.
>>[...PIs, not syntax changes]
>
>My fault: I didn't realise that your proposed changes used only PIs.
>I'm not sure that I see how the documents could still be valid from
>the XML 1.0 perspective, though, and if not, the new documents would
>still be backwards incompatible with some (many?) existing XML 1.0
>implementations.
I'm not sure I'm quite off the hook yet. While the PI approach is nice
because it doesn't have many side effects (it's easily ignored), I'm still
trying to figure out if the processing shift might produce odd results in
some situations. I think that most of the time, it'll be okay, but I'm
still hunting for lurking weirdness.
My _hope_ is that it'll be 100% backwards compatible with XML 1.0 DTD
validation if the prefixes don't change, and that it'll let through the
cases where the prefix changed but nothing else did. In that sense, it'd
be incompatible - documents with good namespace matches would still spit
out from validating XML 1.0 processors. That doesn't seem any worse than
the current situation, but it's worth more thought.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Thu Sep 9 02:21:47 1999
From: cbullard at hiwaay.net (Len Bullard)
Date: Mon Jun 7 17:14:47 2004
Subject: Why focus on HTML?
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca><37D3F74D.AA84B8FF@praxis.cz> <199909071317.JAA05607@hesketh.net> <014101bef937$73ad3040$1618ccce@pebbles>
Message-ID: <37D6FD32.59EB@hiwaay.net>
Dave Winer wrote:
>
> So, if you believe that, as I do, why not channel your energies into spaces
> where there is no installed base? XML opens all kinds of possibilities. Why
> not spread out and focus on implementations that use the net in new ways? Or
> try out some new ideas. Why channel all the energy into a place where not
> much can be done?
>
> My opinion only..
A good one. Consider the results if XML applications don't do what some
tried
with SGML applications: one size fits all consumers. Well, you get the
reemergence
of helper applications (or.. applets) as dominant species.
Works for real audio. May be the best way to do 3D. Altogether, page
integration
is only meaningful to a given kind of page. On the other hand, using
operating
system services is easy to decouple when the protocols are commodities.
For my
money, the best idea of the Web was and is HTTP. The other good idea is
MIME.
Register types and interoperate through commodity interfaces. COM is
the best
CS invention of the 90s. HTML is GenCoding with Windows. Cool, because
it
works, but not much of a conceptual leap.
We already know what happens if HTML is endlessly extended:
MIL-D-28001.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Thu Sep 9 02:21:44 1999
From: cbullard at hiwaay.net (Len Bullard)
Date: Mon Jun 7 17:14:47 2004
Subject: an unfilled need [ tie XML to Java ]
References: <3.0.32.19990906085410.00b6d4e0@pop.intergate.bc.ca> <37D3F74D.AA84B8FF@praxis.cz> <37D4247B.4FD2@hiwaay.net> <00c001bef8ab$d1ab19a0$0a0a0a0a@deltabiz>
Message-ID: <37D6F89C.5100@hiwaay.net>
Steven Livingstone wrote:
>
> >Yes, tie XML to Java
>
> [ I have come into this discussion a bit late - where does this comment
> originate? ]
>
> This statement seems to defeat the one of the purposes of XML. I am not (not
> would want to) learn and use Java - for Web Applications there is no need
> for having to use Java. I have done plenty of exciting XML-based work using
> COM and scripting.
Tie XML to Java in an application. That means one knows what is XML
and what is application. If that works for the designer, it provides
both portable data and an interoperable application within the virtual
machine. If you have used COM to do that, you have tied your data
to a COM-aware application. Works for me. Now, after one does
that, what is still interoperable?
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From cbullard at hiwaay.net Thu Sep 9 02:58:59 1999
From: cbullard at hiwaay.net (Len Bullard)
Date: Mon Jun 7 17:14:47 2004
Subject: confidentiality in W3C WGs
References: <199909071900.PAA25558@gw.sqwest.bc.ca>
Message-ID: <37D70003.E62@hiwaay.net>
Lauren Wood wrote:
> I agree that technical reasons for why the spec is the way it is, and
> reasons for change, should be made public. Particularly when the
> issue is controversial, as this one is.
Right. Do it the OldeFashionedWay: Document by paragraph number the
text, the suggested edit to the text, the rationale for the technical
change, the submittor. the reason for accepting or rejecting the
change. There are some sage standards editors on this
list (Dr. Goldfarb? Dr. Newcomb?) who can provide a precise format
for this kind of document.
It really is that easy. We can't fix the W3C. We can't fix the
press. We can go our own way.... or we can state in earnest to
whatever authority can provide support that the need for clearly
documented public rationale for public utilities outweighs the
need for confidentiality.
Editing such documents before publishing them is good practice.
Publishing blow by blow summaries of meeting debates is bad practice.
There is plenty of experience in this community to explain those
practices.
len
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From martind at netfolder.com Thu Sep 9 04:07:04 1999
From: martind at netfolder.com (Didier PH Martin)
Date: Mon Jun 7 17:14:47 2004
Subject: ANN: XML and Databases article
In-Reply-To: <199909082341.SAA02016@bruno.techno.com>
Message-ID:
Hi Steven,
Steven said:
--------------------------------------------------
Probably no, but it really depends on what you mean by "code." You
have to decide how instances of nodes of particular classes and in
particular contexts will be mapped onto instances of nodes of
particular classes in the new context, and you have to express your
decisions in a formal, machine processable fashion. Right now, using
GroveMinder, you can do that with a Python script, which seems about
as quick, intuitive, and flexible a way to do it as any. I don't know
of any transformation specification language with which a similar feat
(transforming one kind of grove into another kind of grove) can be
done, except possibly DSSSL (which relies on (and was written in terms
of) the grove paradigm, by the way). We haven't implemented DSSSL,
but it shouldn't be too hard to do that on top of GroveMinder. Would
you call a DSSSL transformation specification "code"? (I guess I
would.)
Didier says:
--------------------------------------------------
You're absolutely right Steven, yes DSSSL could be made inter-operable with
grove engines quite easily. In fact, we are working on an interface for
grove engines in the OpenJade project. Actually, OpenJade includes a SGML
property set based grove and this grove "in memory" only (i.e. resident on
the heap). This limitation could be removed by allowing other grove engines
to be processed by DSSSL.
I would also call DSSSL a transformation specification code either from a
grove to a modified grove of from a grove into a flow object tree.
How can we bridge the vision to the reality simply by sitting around the
table and define the API between gove engines and transformation engines.
The DOM only reflects a particular interface to a particular property set
(If I can express myself that way). Obviously a grove is more than that
(anyway you know that). So, why not work on a grove API, publish it, and
then submit it to our collegues like those present in this list.
Call to action:
--------------
If anyone is interested by the task to define an API between grove engines
and transformation engines (XSL or DSSSL for instance), please send me an
email and we'll set a discussion group with the OpenJade team and you so
that we all together define the Linux of markup technologies ;-). Then we
will submit the document to other collegues for further discussion and
feedback.
regards
Didier PH Martin
mailto:martind@netfolder.com
http://www.netfolder.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From davidc at nag.co.uk Thu Sep 9 10:15:14 1999
From: davidc at nag.co.uk (David Carlisle)
Date: Mon Jun 7 17:14:47 2004
Subject: Is it possible to overwrite global variables?
Message-ID: <199909090815.JAA21092@nag.co.uk>
You'd probably be better asking xsl questions on xsl-list rather than
xml-dev:
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
but anyway...
The answer to your question is no, but don't despair:-)
You don't want a condition to control whether or not to set a variable,
you want to set a variable to an expression who's value depends on a
condition:
David
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Daniel.Veillard at w3.org Thu Sep 9 10:31:35 1999
From: Daniel.Veillard at w3.org (Daniel Veillard)
Date: Mon Jun 7 17:14:47 2004
Subject: ANN: XML and Databases article
In-Reply-To: <199909082020.PAA02004@bruno.techno.com>
References: <01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de> <14294.39104.190422.759370@localhost.localdomain> <199909082020.PAA02004@bruno.techno.com>
Message-ID: <19990909043413.A9400@w3.org>
> Groves are going to turn out to be like Linux, which began with a very
> few people who had a vision that turned out to work. As was the case
> with Linux in those early days, there is nobody doing big media
> advertising about it, and even the trade press, whose income is
> derived from such advertising, hasn't heard of groves very much. That
Hum, don't mix an match things at a different level. Linux got
success because of two things:
- it was pretty well defined, i.e. a reimplementation of the
UNIX without any attemps to get into "research" or new
semantic fields. So basically it was a implementation challenge
not a research one.
- The code source is available, moreover with a 'contaminating'
licence at the kernel level, allowing it to grow it's community
When faced with "groves" I still have a serious problem with both
points:
- Show me a definition so that I can understand the term and
underlying concept clearly enough that an implementation
time is spend not collecting and reading papers but implemening
something well defined.
Even reading http://www.prescod.net/groves/shorttut/ I still can't
get a clear definition of "what is a grove precisely".
Not at the concept level, but a implementable definition say
on top of the XML infoset (for XML documents).
- Show me the code. Not that there is none, I just don't know.
Is there a program available in source code, that I can run
on say a laptop in front of a novice (but programmer kind)
audience (say a Gnome developper's group) allowing me in 3 mn
to show a "grove" in action and what it does for them.
Get both, and if you're lucky you will experience the same success
as linux. In the meantime me and others are still wondering what's
really behind that 5 letters word !
Daniel
P.S.: Don't get me wrong, I not negative, mostly interrogative, and
honnestly a bit puzzled by the comparison. Give me both (or even
a set of clearly understandable graphics for the second point) and
I can try to propagate that notion once i have understood it.
--
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | Today's Bookmarks :
Tel : +33 476 615 257 | 655, avenue de l'Europe | Linux, WWW, rpmfind,
Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | rpm2html, XML,
http://www.w3.org/People/W3Cpeople.html#Veillard | badminton, and Kaffe.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From heiko.grussbach at crpht.lu Thu Sep 9 11:09:40 1999
From: heiko.grussbach at crpht.lu (heiko.grussbach@crpht.lu)
Date: Mon Jun 7 17:14:47 2004
Subject: XML-Editors with pluggable validation
Message-ID:
Hi,
XML-Editors usually let you edit an XML file, so that its valid against its
corresponding DTD. How about application specific validation. Just an
example, I want to make sure that an element "DATE" in the XML-Document
contains a valid date, but I don't want to express what a "valid date" is
in the DTD. So, whenever a user finishes editing a DATE element, my
application code would have to be executed to see, if the text entered by
the user is valid. Has anyone seen such an editor?
Regards
Heiko
Heiko.Grussbach@crpht.lu
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From rbourret at ito.tu-darmstadt.de Thu Sep 9 11:36:06 1999
From: rbourret at ito.tu-darmstadt.de (Ronald Bourret)
Date: Mon Jun 7 17:14:47 2004
Subject: ANN: XML and Databases article
Message-ID: <01BEFAB8.224F6E70@grappa.ito.tu-darmstadt.de>
Steven R. Newcomb wrote:
> > The nice thing about groves is that all groves, regardless of what
> > they are built on, have certain commonalities, such as
> > addressability, so you can perform certain common functions with
> > them.
>
> Right. All nodes in groves have the same "object model" (I'm using
> this term in a more formal, scientific sense than the term is used in
> the phrase "Document Object Model (DOM)".) The grove object model is:
> Groves have nodes, nodes conform to classes, and classes have named
> properties with value constraints. Nodes have named properties, and
> values for those properties. That's about it; the rest is detail.
> (It's pretty interesting detail.)
What other common functions can you perform besides addressing /
hyperlinking?
> > GroveMinder is generic grove middleware. It has plug-ins, called
> > Minders (I think of them as drivers),
>
> Hooray, thank you! I have sometimes called them "notation drivers"
> only to get the blankest stares imaginable. (I then have asked
> something lame, like, "Do you know what a device driver is, and why we
> have them?") But you obviously get the point of Minders: Minders
> represent plug and play support for individual notations, in a system
> that makes all content look alike (i.e., conform to the grove object
> model).
Actually, if you had said "notation driver" to me, I would have given you
the same blank stare. The problem for me is not "driver", but "notation"
-- it might mean something to SGML gurus, but is unlikely to mean much to
the average programmer, who at least has a chance with "driver". I can't
think of a simple generic term right off, but the following would do pretty
handily -- "Just like you need a different ODBC driver for each brand of
database, you need a different Minder for each type of data -- SGML
document, database, email, Word documents, and so on." It's probably not
quite technically correct, but it will get the point across.
>
> > that can build groves over different property sets. For example,
> > there is one Minder for SGML/XML documents and a different Minder
> > for relational databases.
>
> Well, actually, there's probably a one-to-one correspondence between
> property sets and database schemas. In order to address information
> in terms of its structure, you have to know the structure. In
> grove-land, the structure is defined by a property set. Different
> databases have different structures, normally expressed as database
> schemas. Making a database look like a grove is very straightforward.
> The bulk of the work is translating the schema into a property set
> (which is, after all, a kind of schema). There's a bit of coding
> involved, too, but the GroveMinder developer kit has tools that make
> this amazingly easy. (At least the Lockheed-Martin people were
> amazed, and they said so publicly at XML '98.)
What do you mean by "different databases" here? If you mean relational
databases v. hierarchical databases v. object-oriented databases, then
there's no problem -- I would expect each to have a different property set.
On the other hand, if you mean DB2 v. Oracle v. Informix v. SQL Server,
then it seems there is something broken -- I would very much expect all
relational databases to have the same property set.
> The grove paradigm breaks down the distinction between documents
> (resources) and databases. Everything, in its addressable form, is a
> grove, and a grove is a database.
Saying that a grove is a database strikes me as a bit misleading, at least
in the database world. A grove is a database, in the sense that it
contains data and you can extract that data, but in this sense, a
spreadsheet is a database, a Word document is a database, a file system is
a database, and so on. I think it would be more accurate to say that a
database (in the traditional sense of the term) can be used to persist a
grove, especially as you go on to say:
> ... If the resource is *already* a database, there's
> probably no parsing or processing involved. All that needs to be done
> is to put a translating layer over it that makes the database look
> like a grove. Then, the database and all its contents are fully able
> to participate in the wider world of interchangeable information
> resources: they can be linked, re-used by reference, have any kind of
> metadata associated with them, etc. etc.
>
> > One thing GroveMinder can do is store a grove in its own
> > database. (Note that this is separate from the database addressed by
> > the relational database Minder -- it has a structure designed to
> > store groves.) Thus, GroveMinder can store an XML document in a
> > database as a grove and is what I, in my article, called a content
> > management systems. That is, it can store and retrieve an XML
> > document as a document.
>
> Sounds right to me. ("...its own database" sounds a bit odd because
> GroveMinder can use any ODBMS for grove storage.)
What I meant to do here was distinguish between the database that
GroveMinder uses to store groves and other databases that might be external
resources, such as that Informix database run by Billy Bob over in
Engineering. (Granted, GroveMinder's database can undoubtedly be treated in
a fashion similar to Billy Bob's database, but that's getting a bit
self-referential and misses the point that GroveMinder runs just fine
without Billy Bob's database but can't run without its own.) I'm also
assuming that the database used by GroveMinder is configured for grove
storage, although this might not be such an earth-shaking amount of work --
I'll take a guess that the class definitions are fed pretty much directly
into the database as schema.
>
> > Some questions:
>
> > 1) Is it possible to combine groves of different types? For example,
> > can I take a grove representing a table in a relational database and
> > stuff it into a grove for an XML document, for example as the
> > content of an element?
>
> I'm afraid I don't grasp the intent of this question. When such an
> XML document is exported from its grove as an XML document, what
> should the document look like?
>
> There's no need (and no way) to stuff something into something else.
> It is only necessary that the "content" property of the element have,
> as its value, the node in the database grove that represents the
> table. The ISO standard SGML Property Set does not allow this; only
> certain classes of nodes within the same grove are allowed as the
> value of the "content" property of "element" nodes. However, if you
> want to change your operative SGML Property Set so that this will be
> permitted, nothing (other than good sense) prevents you from doing it;
> the grove paradigm will readily support you in your madness.
>
> I don't know why it would be sensible to regard an RDBMS table as the
> content of an SGML or XML element. The normal meaning of "content" is
> elements, character data, and/or other SGML constructs, right there,
> inside the element. There is no way to write a general purpose
> grove-to-SGML converter unless the classes of the nodes that can
> appear in element content are limited and known. (We certainly don't
> want to dump arbitrary data into the content of an element; this would
> invite a situation in which the document that is ultimately exported
> is unparsable.)
What I meant here was whether you could perform an operation similar to
embedding an Excel spreadsheet in a Word document -- that is, can I
(easily, generically, and without modification of property sets) combine
information from different groves into a single grove. This would be
extremely useful because it would be a step on the way towards being able
to query the entire enterprise with requests such as, "Get me the names,
addresses, and company prospectuses of all customers that I've sent email
to in the last three days". The names and addresses come from the corporate
database, the email information comes from my email, and the prospectuses
come from a document database somewhere or perhaps the Web. The result
could be navigable by a generic grove navigation tool.
My guess is that groves right now give me this functionality by
hyperlinking one grove to the next. This is fine in some cases (a
grove-based query tool), not in others (wanting to expose tabular database
data as XML). Actually, I would have been absolutely amazed if groves could
do this without any sort of conversion software.
(Note that when the Word document is persisted, the result isn't really a
"Word" document, it's (if I've got my terminology right), an OLE Compound
Document. When you get to the point where the spreadsheet is, Word no
longer has a clue how to process it. Instead, there's a flag that says,
"Yo! Go start Excel" and processing is handed over to Excel. A similar
situation would be reasonable in a "compound" grove. It could easily be
persisted to a grove database, which understands groves, but couldn't
readily be persisted as XML or a database without conversion software.
This is the fundamental difference between what I classify as data transfer
middleware and content management systems. Data transfer middleware views
the XML document as something it understand -- data -- and "converts" it to
a database format; this is similar to Word storing a spreadsheet as a
table. Content management systems view XML documents as documents and
simply store them in a database rather than a text file.)
> > If so, does the table grove retain its table-ness, or is it
> > converted to one or more XML elements? Both cases seem reasonable,
> > although the latter would presumably require a special converter. If
> > the latter case is true, then GroveMinder might also fit what I call
> > data transfer middleware, depending on how the conversion is done.
>
> I would suggest that an efficient way to handle this would be to
> convert the table into node classes that *are* permitted to appear in
> element content, and then make *those* nodes the value of the content
> property. If you do it this way, you're necessarily making the
> decisions that must be made about how the XML document, when exported,
> will reflect the table data.
Exactly.
> You're right that one application of GroveMinder is data transfer
> middleware. The conversion program is comparatively easy to write,
> since everything already conforms to the same object model.
I'm not quite sure I understand this. Above, you've just said you need to
make decisions about how database node classes are converted to XML node
classes, which makes sense to me. What do you mean by "everything already
conforms to the same object model"? As far as I can tell, groves allow
everything to be expressed as objects (which have a few common properties),
but these objects are no more the "same object model" than a model for a
person is the same as a model for a book. Groves may have gotten me from
some other format (notation) into an object model, but whether converting
one object to another is easy depends on the objects themselves.
> > 2) Are groves themselves relevant at a high level in a discussion of
> > XML and databases? It strikes me that, like SAX and the DOM, they
> > are a useful tool in implementing software that stores/retrieves XML
> > documents (or data from those documents) in a database but are not
> > directly relevant to the discussion itself. Instead, they are most
> > relevant to the user in that they are likely to weigh heavily in the
> > feature set exposed by a content management system or (possibly)
> > data transfer system.
>
> Good question. I guess that's for the person who's doing the
> discussing to decide. Since groves can be persistent (e.g., stored in
> databases), and since XML resources can become groves, it seems to me
> that groves are relevant. You're right, the real reason they're
> interesting is their impact on feature sets. But aren't feature sets
> (and especially tradeoffs between feature sets) what technical
> discussions are all about?
In this case (since I'm the discusser ;) I'd say they're worth discussing
in individual product descriptions, but not in the meat of the article, any
more than a discussion of the DOM is relevant to the discussion.
-- Ron
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From abcoates at TheOffice.net Thu Sep 9 11:49:36 1999
From: abcoates at TheOffice.net (Anthony B. Coates)
Date: Mon Jun 7 17:14:47 2004
Subject: RFP: Namespace URI for HTML
Message-ID: <199909090953.TAA20982@mail.internetezy.com.au>
** Reply to message from "Don Park" on Mon, 6 Sep 1999
22:50:56 -0700
Late proposal (sorry, not all of us get to read xml-dev in real time ...)
> Current candidates:
>
> 0) "hell://no.stinking.namespace/4/HTML"
> 1) "http://www.w3.org/Markup/" - by David Megginson
> 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
> 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
> 5) "http://www.w3.org/Namespace/HTML/" - by Don Park (2nd try)
6) Like 2/3/4, but change 1999 or 2000 to 19991225 (or as appropriate). How
very hopeful indeed, to assume that there will never be the need for more than
one version in a year. As it is, I can't be sure that the time of day shouldn't
be included as well - who knows what "Web time" will mean to developers in 5 or
10 years time ...
Actually, I really prefer the 3-namespaces idea: Strict, Transitional, etc.
Cheers,
Tony.
** Anthony B. Coates
** Software Engineer (Java). This is a 100% Pure Java e-mail.
**
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From davidc at nag.co.uk Thu Sep 9 12:12:05 1999
From: davidc at nag.co.uk (David Carlisle)
Date: Mon Jun 7 17:14:47 2004
Subject: fixing (just) namespaces and validation
Message-ID: <199909091012.LAA18444@nag.co.uk>
> We disagree.
Not by much.
> It is the parsers which cause the problem by ignoring the information
> provided.
>
Ah interesting. But then the `problem' is that there is no REC or NOTE
or hint or suggestion from anyone in authority that namespace aware
validating parsers are _supposed_ to interpret xmlns: attribute
declarations in this way.
So when I said that `all dtds have the problem' You argue (and I would
probably agree) that the problem is in the specification (or lack of)
in how DTDs are used in the presence of namespaces.
David Megginson argues that we shouldn't be using XML DTD if we are
interested in namespaces, which is an argument, but I think I would
still be in favour of an interpretation such as you suggest which
allows a notion of validation with DTDs that does not break existing
systems (they just say that the document doesn't validate) but would
allow a class of documents to be DTD-validated by namespace aware
systems without fixing prefixes.
David
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From marko.zerdin at ixtlan-team.si Thu Sep 9 12:28:33 1999
From: marko.zerdin at ixtlan-team.si (Marko Zerdin)
Date: Mon Jun 7 17:14:47 2004
Subject: MSXML for Java Questions
Message-ID:
Hi everybody.
1) Does anybody know if Microsoft is going to support public (free) MSXML
packages for Java? The signals from Microsoft have been far from unanimous.
2) I also have an interesting problem. I'm using Visual J++ for development
environment, and com.ms.xml.om package for parsing XML
(Document.load(InputStream) method, didn't try the other versions because I
need just this one). The problem is with the following code (I've made it
the shortest XML that produces this problem):
value
value
value
value
value
value
value
value
value
value
When I call the method Document.load(InputStream), I get the following
error:
Line 19, Column 44: Close tag BankAccountPart1 does not match start tag
BankAccountPart1.
I've also found out, that I don't have this error if I remove one letter
from the tag (for example, if I replace BankAccountPart in the tags by
BankAccountPar).
If anybody knows the problem and is able to explain to me, what's the rule
for this error to occur, I would really appreciate it.
Regards,
Marko.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From martind at netfolder.com Thu Sep 9 12:43:16 1999
From: martind at netfolder.com (Didier PH Martin)
Date: Mon Jun 7 17:14:47 2004
Subject: ANN: XML and Databases article
In-Reply-To: <19990909043413.A9400@w3.org>
Message-ID:
Hi Daniel,
Daniel said:
------------------------------------------------
When faced with "groves" I still have a serious problem with both
points:
- Show me a definition so that I can understand the term and
underlying concept clearly enough that an implementation
time is spend not collecting and reading papers but implemening
something well defined.
Even reading http://www.prescod.net/groves/shorttut/ I still can't
get a clear definition of "what is a grove precisely".
Not at the concept level, but a implementable definition say
on top of the XML infoset (for XML documents).
Didier says:
------------------------------------------------
I agree with you on that point. Actually groves are defined as abstract
entities and there is no common API to groves (each implementer ca define
its own interface).
Daniel says:
-----------------------------------------------
- Show me the code. Not that there is none, I just don't know.
Is there a program available in source code, that I can run
on say a laptop in front of a novice (but programmer kind)
audience (say a Gnome developper's group) allowing me in 3 mn
to show a "grove" in action and what it does for them.
Didier says:
-----------------------------------------------
Easy, just use the "Linux of the markup technologies" aka OpenJade. The code
is freely available, several developers around the globe are already
improving it each day. The source code is stored on a CVS server and like I
said, is freely available. OpenJade includes the SGML grove plan but the
grove is transient (i.e. resident on the heap). This is a stand alone module
that could be reused in other code. The grove is available as a DCOM object
or a C++ object. You may make your own implementation if you want or change
the interface. For more information:
http://www.netfolder.com/DSSSL
So, the first requirement is not yet fulfilled because actual documentation
about groves (I mean public one) is not targeted toward concrete
implementations and interface issues. Within the OpenJade project we are
correcting this situation by documentation a concrete freely available
component. But this is work in progress like it is for all open source
projects. The second requirement is already fulfilled, you just have to get
the code and play with it. Its free, the source is available and maintained
by an international team. So, the best way to know what's behind these 5
letters is to simply take a look at the OpenJade project. I suggest also
that you take a look at the PERL open source project because they too have
included a grove manager in the PERL package. Did you noticed that PERL
already implemented a Grove module?
Regards
Didier PH Martin
mailto:martind@netfolder.com
http://www.netfolder.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From Daniel.Veillard at w3.org Thu Sep 9 14:28:09 1999
From: Daniel.Veillard at w3.org (Daniel Veillard)
Date: Mon Jun 7 17:14:47 2004
Subject: ANN: XML and Databases article
In-Reply-To:
References: <19990909043413.A9400@w3.org>
Message-ID: <19990909083042.E9400@w3.org>
> Daniel said:
> ------------------------------------------------
> When faced with "groves" I still have a serious problem with both
> points:
> - Show me a definition so that I can understand the term and
> underlying concept clearly enough that an implementation
> time is spend not collecting and reading papers but implemening
> something well defined.
> Even reading http://www.prescod.net/groves/shorttut/ I still can't
> get a clear definition of "what is a grove precisely".
> Not at the concept level, but a implementable definition say
> on top of the XML infoset (for XML documents).
>
> Didier says:
> ------------------------------------------------
> I agree with you on that point. Actually groves are defined as abstract
> entities and there is no common API to groves (each implementer ca define
> its own interface).
Ok, having followed the http://www.netfolder.com/DSSSL/ link and
looking at http://www.oasis-open.org/cover/topics.html#groves
The following seems at least a clear and concise definition:
In HyTime ISO/IEC 10744:1997 "3. Definitions (3.35)": graph
representation of property values is 'An abstract data structure
consisting of a directed graph of nodes in which each node may be
connected to other nodes by labeled arcs.
Ahhh ... and
GROVE - "Graph Representation of Property Values."
Ok, this becomes clearer.
And the picture clarify it even more !
http://www.cogsci.ed.ac.uk/~ht/grove.html
> Daniel says:
> -----------------------------------------------
> - Show me the code. Not that there is none, I just don't know.
> Is there a program available in source code, that I can run
> on say a laptop in front of a novice (but programmer kind)
> audience (say a Gnome developper's group) allowing me in 3 mn
> to show a "grove" in action and what it does for them.
>
> Didier says:
> -----------------------------------------------
> Easy, just use the "Linux of the markup technologies" aka OpenJade. The code
> is freely available, several developers around the globe are already
> improving it each day. The source code is stored on a CVS server and like I
> said, is freely available. OpenJade includes the SGML grove plan but the
> grove is transient (i.e. resident on the heap). This is a stand alone module
> that could be reused in other code. The grove is available as a DCOM object
> or a C++ object. You may make your own implementation if you want or change
> the interface. For more information:
> http://www.netfolder.com/DSSSL
Ok, coincidentally i'm doing something similar it seems on top of my XML
parser except that I based my node structure on a DOM point of view:
http://rpmfind.net/veillard/XML/
Is there a clear distinction between the model exposed by a DOM API
and a grove, I can't even find a single DOM reference in
http://www.oasis-open.org/cover/topics.html#groves
Looking at Henry graphic, it's nearly 100% equivalent to what I build
so I'm wondering, but I remember people pointing out that those are
different.
thanks for the pointers,
Daniel
--
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | Today's Bookmarks :
Tel : +33 476 615 257 | 655, avenue de l'Europe | Linux, WWW, rpmfind,
Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | rpm2html, XML,
http://www.w3.org/People/W3Cpeople.html#Veillard | badminton, and Kaffe.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From philipnye at freenet.co.uk Thu Sep 9 15:07:30 1999
From: philipnye at freenet.co.uk (Philip Nye)
Date: Mon Jun 7 17:14:47 2004
Subject: fixing (just) namespaces and validation
References: <199909081037.LAA21158@nag.co.uk> <37D69791.652DF82A@mecomnet.de>
Message-ID: <37D7B21B.5CF634E@freenet.co.uk>
james anderson wrote:
...
> > So, to validate with namespaces, you either have to pre-process the
> > document instance to normalise the prefixes to the prefix used in the
> > DTD,
>
> No, the processing happens in the course of parsing. There is no need for an
> additional pass. No, it is better to map them all to a symbol associated with
> the universal name. The prefixes are superfluous.
Conceptually this is still pre-processing, even if the two are
integrated in a
single pass.
If you are going to use the DTD at all, there is pre-processing to be
done - transforming namespace references into a normalised prefix - and
then there is post-processing - using the namespace to apply semantics
of some sort to
the document.
This means that in David Megginson's layered model, the document bounces
up and down between layers in an inelegant and to my mind very wasteful
way, during the course of its processing. This is a result of the
inflexibility of the DTD or perhaps of the incompatibility of
namespaces.
Either way Simon St.Laurent's original point will not go away. There is
an incompatibility between DTDs and namespaces. To my mind these are
orthogonal concepts and should not interfere with each other.
Either DTDs change, or namespaces change, or you don't use them both or
you put up with this sort of inelegant processing.
Philip Nye
Engineering Arts
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mnutter at fore.com Thu Sep 9 15:32:26 1999
From: mnutter at fore.com (Mark Nutter)
Date: Mon Jun 7 17:14:47 2004
Subject: Groves, the next big thing (Re: ANN: XML and Databases article)
In-Reply-To: <199909082020.PAA02004@bruno.techno.com>
References: <14294.39104.190422.759370@localhost.localdomain>
<01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de>
<14294.39104.190422.759370@localhost.localdomain>
Message-ID: <4.2.0.58.19990909093108.00b88c00@pophost1.fore.com>
At 03:20 PM 09/08/99 -0500, you wrote:
>Groves are going to turn out to be like Linux, which began with a very
>few people who had a vision that turned out to work.
Ok, you've got me all excited about groves. Where do I go to learn more
about them? I have kind of a vague idea that a grove is basically a
collection of trees. Is there a web site somewhere that will give me a
broader understanding of the "grove paradigm"?
-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-
Mark Nutter,
Internet Applications Developer
FORE Systems
Q: How many surrealists does it take to change a light bulb?
A: A rubber pickle, because fish do *not* wear bow-ties.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david at megginson.com Thu Sep 9 15:33:20 1999
From: david at megginson.com (David Megginson)
Date: Mon Jun 7 17:14:47 2004
Subject: fixing (just) namespaces and validation
In-Reply-To: <37D7B21B.5CF634E@freenet.co.uk>
References: <199909081037.LAA21158@nag.co.uk>
<37D69791.652DF82A@mecomnet.de>
<37D7B21B.5CF634E@freenet.co.uk>
Message-ID: <14295.46046.647795.571535@localhost.localdomain>
Philip Nye writes:
> This means that in David Megginson's layered model, the document
> bounces up and down between layers in an inelegant and to my mind
> very wasteful way, during the course of its processing. This is a
> result of the inflexibility of the DTD or perhaps of the
> incompatibility of namespaces.
The only reason for bouncing back to the XML layer is to take
advantage of the fact that it *does* have a widely-implemented schema
language: that's not a design problem with DTDs or Namespaces; it's
just a (very kludgy) workaround because of the fact that the W3C's
schema standards are still under development for the Namespaces layer.
Besides, even if the W3C XML Schema WG released a schema REC tomorrow
(which won't happen), and we all agreed that it was suitable, we
wouldn't necessarily be any better off. After all, XML-Dev developed
the DDML spec (which would also solve this problem) but no-one has
decided to implement it yet, even though the list's membership
consists almost entirely of developers. Having a spec is no guarantee
of having implementations.
All the best,
David
--
David Megginson david@megginson.com
http://www.megginson.com/
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at prescod.net Thu Sep 9 15:50:04 1999
From: paul at prescod.net (Paul Prescod)
Date: Mon Jun 7 17:14:47 2004
Subject: an unfilled need
References: <3.0.6.32.19990907100342.009fa980@gpo.iol.ie>
Message-ID: <37D5725A.48908DC9@prescod.net>
Sean Mc Grath wrote:
>
>
>
> Bank.Debit ( (bllrp.zata * rte.w )/ GoldenRatio )
>
>
>
> 112
> g
> xxx
>
>
>
>
> This approach will get you there a lot faster than
> waiting for the "bank debit golden ratio rte interest group"
> to finalize their declarative syntax for what the bllrp
> element really means...
All you've done is shifted the problem from document type design to API
design. Now you need the "bank debit golden ration rte interest group"
to standardize their Bank objects and Debit methods. And anyhow, why
bother with XML then? You might as well have said:
data = bllrp( zata=-1, poiuk=112, pppzzz="g", ttt="xxx" )
Bank.Debit ( (bllrp.zata * rte.w )/ GoldenRatio )
Which is exactly what Javascript programmers do today...
Paul Prescod
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From macherius at darmstadt.gmd.de Thu Sep 9 15:51:40 1999
From: macherius at darmstadt.gmd.de (Ingo Macherius)
Date: Mon Jun 7 17:14:47 2004
Subject: fixing (just) namespaces and validation
In-Reply-To: <14295.46046.647795.571535@localhost.localdomain>
References: <37D7B21B.5CF634E@freenet.co.uk>
Message-ID: <199909091354.PAA06806@sonne.darmstadt.gmd.de>
David Megginson wrote at 9 Sep 99, 9:34:
> After all, XML-Dev developed
> the DDML spec (which would also solve this problem) but no-one has
> decided to implement it yet, even though the list's membership
> consists almost entirely of developers.
Someone has implemented DDML, see www.extensibility.com. However,
it's not open source.
++im
--
Ingo Macherius//Dolivostrasse 15//D-64293 Darmstadt//+49-6151-869-882
GMD-IPSI German National Research Center for Information Technology
mailto:macherius@gmd.de http://www.darmstadt.gmd.de/~inim/
Information!=Knowledge!=Wisdom!=Truth!=Beauty!=Love!=Music==BEST (Zappa)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From robin at isogen.com Thu Sep 9 16:01:00 1999
From: robin at isogen.com (Robin Cover)
Date: Mon Jun 7 17:14:47 2004
Subject: Groves, the next big thing (Re: ANN: XML and Databases article)
In-Reply-To: <4.2.0.58.19990909093108.00b88c00@pophost1.fore.com>
Message-ID:
Re:
> Is there a web site somewhere that will give me a
> broader understanding of the "grove paradigm"?
I try to maintain a collection of references on groves in
the "Topics" section of the SGML/XML Web Page, viz.,
http://www.oasis-open.org/cover/topics.html#groves
- Robin Cover
------------------------------------------------------
On Thu, 9 Sep 1999, Mark Nutter wrote:
> At 03:20 PM 09/08/99 -0500, you wrote:
> >Groves are going to turn out to be like Linux, which began with a very
> >few people who had a vision that turned out to work.
>
> Ok, you've got me all excited about groves. Where do I go to learn more
> about them? I have kind of a vague idea that a grove is basically a
> collection of trees. Is there a web site somewhere that will give me a
> broader understanding of the "grove paradigm"?
>
> -_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-_-
>
> Mark Nutter,
> Internet Applications Developer
> FORE Systems
>
> Q: How many surrealists does it take to change a light bulb?
> A: A rubber pickle, because fish do *not* wear bow-ties.
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Thu Sep 9 16:02:49 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:48 2004
Subject: DDML (was Re: fixing (just) namespaces and validation)
In-Reply-To: <199909091354.PAA06806@sonne.darmstadt.gmd.de>
References: <14295.46046.647795.571535@localhost.localdomain>
<37D7B21B.5CF634E@freenet.co.uk>
Message-ID: <199909091405.KAA12334@hesketh.net>
At 04:07 PM 9/9/99 +0200, Ingo Macherius wrote:
>Someone has implemented DDML, see www.extensibility.com. However,
>it's not open source.
Extensibility's XML Authority - a schema editor for those who haven't seen
it - will let you save DTDs to DDML format. It doesn't provide a DDML
validation tool for use in processing. It's nice to see, especially for
me, but it doesn't let you do anything with DDML in an application context.
And no, it isn't open source, but it is at least cheap.
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From digitome at iol.ie Thu Sep 9 16:24:56 1999
From: digitome at iol.ie (Sean Mc Grath)
Date: Mon Jun 7 17:14:48 2004
Subject: an unfilled need
In-Reply-To: <37D5725A.48908DC9@prescod.net>
References: <3.0.6.32.19990907100342.009fa980@gpo.iol.ie>
Message-ID: <3.0.6.32.19990909151750.009b6690@gpo.iol.ie>
>Sean Mc Grath wrote:
>>
>>
>>
>> Bank.Debit ( (bllrp.zata * rte.w )/ GoldenRatio )
>>
>>
>>
>> 112
>> g
>> xxx
>>
>>
>>
>>
>> This approach will get you there a lot faster than
>> waiting for the "bank debit golden ratio rte interest group"
>> to finalize their declarative syntax for what the bllrp
>> element really means...
>
[Paul Prescod]
>All you've done is shifted the problem from document type design to API
>design. Now you need the "bank debit golden ration rte interest group"
>to standardize their Bank objects and Debit methods.
But this only becomes an issue whenever there is
a need to interface between disparate systems. Of course you
need standards at that point -- be they APIs or Data Notations.
Here is the point I was trying to make. If the algorithm to
manipulate the data is downloaded *with* the data,
then the browser meeds no apriori knowledge about
the semantics of the data other than "Oh, here comes a package
of stuff that has an algorithm part and a data structure part.
Execute the algorithm part and hand it the data part to work
with".
Moreover, as an application developer I can do what I like
with the package of data. I can attach any semantics I like
to it. JavaScript is a case in point. Leaving aside any
dislike for the language or the data model exposed to the
language, it illustrates the "arbitrary semantics" point
well.
Case in point: There is no industry standard markup language
for creating a collapsible table of contents in a Web page.
There is a very nice one running on the Isogen site
(http://www.isogen.com/papers.htm) written in JavaScript.
Isogen did not have to wait for the W3C or Microsoft
on Netscape or some committee to quabble endlessly over
the markup language necessary to encode the semantics
of a collapsible toc.
>And anyhow, why bother with XML then?
XML gives me a beautifully simply, open systems, way
of expressing a hierarchical data structure. Why would
I throw that away?
Developers Day Co-Chair, 9th International World Wide Web Conference
16-19, May, 2000, Amsterdam, The Netherlands http://www9.org
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ken at bitsko.slc.ut.us Thu Sep 9 17:02:45 1999
From: ken at bitsko.slc.ut.us (Ken MacLeod)
Date: Mon Jun 7 17:14:48 2004
Subject: ANN: XML and Databases article
In-Reply-To: "Didier PH Martin"'s message of "Thu, 9 Sep 1999 06:45:55 -0400"
References:
Message-ID:
"Didier PH Martin" writes:
> Hi Daniel,
>
> Daniel said:
> ------------------------------------------------
> When faced with "groves" I still have a serious problem with both
> points:
> - Show me a definition so that I can understand the term and
> underlying concept clearly enough that an implementation
> time is spend not collecting and reading papers but implemening
> something well defined.
> Even reading http://www.prescod.net/groves/shorttut/ I still can't
> get a clear definition of "what is a grove precisely".
> Not at the concept level, but a implementable definition say
> on top of the XML infoset (for XML documents).
>
> Didier says:
> ------------------------------------------------
> I agree with you on that point. Actually groves are defined as abstract
> entities and there is no common API to groves (each implementer ca define
> its own interface).
> I suggest also that you take a look at the PERL open source project
> because they too have included a grove manager in the PERL
> package. Did you noticed that PERL already implemented a Grove
> module?
That would be the Data::Grove (in libxml-perl) module and it's only
current set of node classes, the XML::Grove module. If you'd like to
look at a very simple implementation of groves, those would be the
ones.
In Data::Grove, ``nodes'' are Perl objects (named, or ``blessed'',
hashes), node-lists are Perl arrays, and named-node-lists are
un-named, or ``anonymous'', hashes. Even though nodes are ``objects''
they are still accessed directly using Perl hash syntax (i.e. there
are no accessor methods). (The same could be done as easily in any
language, of course.)
In the current implementation of Data::Grove the grove is ``passive''
-- the view, or ``grove plan'', of the grove is fixed at the time the
grove is built, there's no support for enforcing constraints, and
character data does not appear normalized.
This differs from other grove implementations which I would call
``active'' -- the view of the grove can be changed dynamically,
constraints are checked and/or enforced, and groves always appear
normalized. Grove plans were described in another message, but an
example is where you could choose to see entity reference nodes in the
grove or just the characters that replace the entity reference.
Groves are often compared to DOM. In grove terms, DOM Level 1
presents two very specific grove plans, Core and HTML. DOM defines
these grove plans in terms of accessor methods for specific properties
and a fixed set of constraints. In contrast, groves act more like
generic containers. The API for accessing the generic containers is
left to the implementation, property and value constraints are defined
seperately, and the user is allowed to choose which set of property
and value constraints to use at any particular time.
Simply put, the idea behind groves is to seperate the definition of
properties from the API used to access those properties.
--
Ken MacLeod
ken@bitsko.slc.ut.us
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From schnitz at overflow.de Thu Sep 9 17:06:30 1999
From: schnitz at overflow.de (Sebastian Schnitzenbaumer)
Date: Mon Jun 7 17:14:48 2004
Subject: RFP: Namespace URI for HTML
Message-ID: <199909091609.SAA02326@server.overflow.de>
Hi,
> > 0) "hell://no.stinking.namespace/4/HTML"
> > 1) "http://www.w3.org/Markup/" - by David Megginson
> > 2) "http://www.w3.org/HTML/1999/namespace" - by Tim Bray
> > 3) "http://www.w3.org/HTML/2000/Namespace" - by Don Park (I like zeros :)
> > 4) "http://www.xml.org/HTML/2000/Namespace" - (if Jon Bosak agrees)
> > 5) "http://www.w3.org/Namespace/HTML/" - by Don Park (2nd try)
>
> 6) Like 2/3/4, but change 1999 or 2000 to 19991225 (or as appropriate). How
> very hopeful indeed, to assume that there will never be the need for more than
> one version in a year. As it is, I can't be sure that the time of day shouldn't
> be included as well - who knows what "Web time" will mean to developers in 5 or
> 10 years time ...
>
> Actually, I really prefer the 3-namespaces idea: Strict, Transitional, etc.
I'm very surprised that the list does not include the current
approach taken by the HTML WG. This poll seems to suggest that
one namespace for every flavor of XHTML is the only right choice. I
agree with Tony and others who consider 3 namespaces as a
possible solution.
XHTML 1.0 strict: http://www.w3.org/TR/xhtml1/strict
XHTML 1.0 transitional: http://www.w3.org/TR/xhtml1/transitional
XHTML 1.0 frameset: http://www.w3.org/TR/xhtml1/frameset
Regards,
Sebastian Schnitzenbaumer
---
Stack Overflow AG
Phone: +49-89-767363-70
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From mike.champion at sagus.com Thu Sep 9 17:19:30 1999
From: mike.champion at sagus.com (Michael Champion)
Date: Mon Jun 7 17:14:48 2004
Subject: Groves, the next big thing (Re: ANN: XML and Databases article)
References: <14294.39104.190422.759370@localhost.localdomain><01BEFA20.9D5E8F30@grappa.ito.tu-darmstadt.de><14294.39104.190422.759370@localhost.localdomain> <4.2.0.58.19990909093108.00b88c00@pophost1.fore.com>
Message-ID: <016401befad7$19d49f30$091d12d1@mccdell>
>Groves are going to turn out to be like Linux, which began with a very
>few people who had a vision that turned out to work.
I'll start by making it clear that I've been on the W3C DOM working group
for more than two years, so you know where my biases lie ... Nevertheless, I
speak as someone who has spent a fair amount of time wrestling with defining
abstract data models and APIs for XML documents and considering what the
grove paradigm offers.
There are three basic reasons why the DOM is not more groves-like. First, as
someone pointed on earlier in the "XML and Databases" thread, not enough
people outside the hard-core SGML/XML community understand the groves
paradigm, so there was no general familiarity that we felt we should
leverage. Second, the available documentation for groves (at least a
couple of years ago) was mainly the DSSSL spec, which is very difficult for
non-specialists to make sense out of. Third, there was a widespread
perception that the groves model implies, in DOM terms, that "every
character is a node", and people concerned about implementing the DOM API
felt strongly that this would lead to unacceptable footprint and run-time
overhead.
Groves may or may not become the next Linux; if this is going to happen, two
obstacles must be overcome.
Most importantly, someone is going to have to write a *clear* statement of
the paradigm, its power, why it's "the next big thing, etc. Sortof "Groves
for Dummies", or the "Grove Manifesto". I can't stress enough the
importance of writing this for a fairly general audience. I recall
conversations a couple of years ago with very smart technical marketing
people at large companies who are now big players in the XML world; the
level of fustration they expressed at trying to make sense out of things
like the DSSSL spec was quite memorable! I have not read the recent
attempts by groves adovocates to offer tutorials, etc., so forgive me if
this has already been done. I frankly doubt it, because if a clear and
compelling case for the groves paradigm has been made, it hasn't come to the
world's attention.
Also, even if the "grove paradigm" is a fundamentally more powerful way of
looking at XML and other types of data than what is in wide use today, it's
unlikely to be adopted unless there is a clean migration path from familiar
APIs like ODBC/SQL, the W3C DOM, the (forthcoming?) JCP XML data binding
spec, etc. One of the most eye-opening aspects of my experience on the DOM
WG has been to understand that most users of Web scripting languages, Visual
Basic, etc. know very little about computer science. I began my DOM
"career" assuming that everyone who would be using such APIs understood tree
and graph data structures and would understand how nicely they represented
the types of things we were talking about. I was quickly set straight by my
colleagues from companies with larger customer bases: ARRAYS are about as
sophisticated a data structure as the typical Web scripter or VB programmer
can handle. [I *know* that someone reading this wants flame me, but rest
assured that I don't like this notion any more than you do, and just about
every conceivable counter-argument has been raised, and very reluctantly
dismissed, by the DOM WG already.] So, I would *love* to see someone define
a grove API that extends the DOM, and/or to see the grove paradigm cleanly
incorporated into the Java Community Process XML data binding, and/or to see
a repository-friendly API that builds from ADO or JDO and incorporates
groves concepts. But don't expect the typical consumer of XML APIs to be
impressed by a groves API that offers a "new paradigm" that assumes that the
reader understands graph theory and data structures and builds up from there
with little reference to existing efforts.
Mike Champion
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at prescod.net Thu Sep 9 17:35:37 1999
From: paul at prescod.net (Paul Prescod)
Date: Mon Jun 7 17:14:48 2004
Subject: ATTN: Please comment on XHTML (before it's too late)
References: <872567E5.0077E475.00@d53mta03h.boulder.ibm.com>
Message-ID: <37FF5306.ADB5D501@prescod.net>
roddey@us.ibm.com wrote:
>
> >Adding three lines of code introduces the opportunity for three times
> >the bugs in a real application? I don't buy it. Looking for "HTML 4.0
> >strict" OR "HTML 4.0 frameset" is no harder than looking for "UL" OR
> >"OL", if the software is set up right.
> I think its more complex than that. Its more than adding 3 lines of code. What
> if you want to validate the XHTML?
I can validate all three variants of XHTML using parsers that are three
years old already!!!! Download the DTD, download nsgmls or IE5 and try
it yourself!
Paul Prescod
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ejfreed at infocanvas.com Thu Sep 9 17:46:25 1999
From: ejfreed at infocanvas.com (Erik James Freed)
Date: Mon Jun 7 17:14:48 2004
Subject: MSXML for Java Questions
In-Reply-To:
Message-ID:
I am not sure I understand question #1, but if it helps, I know is that
microsoft has in some way backed the company datachannel who
supply a java version of its IE5 xml/xsl com components. This product came
out earlier this year. (see www.datachannel.com) . Though
their documentation does not at all suggest this, datachannel is *not*
providing commercial support for this product unless you pay $5K per year.
(!)
The other problem is that there are those who believe that this product is
far from production worthy, and it is definitely far behind other
XML/XSL products in keeping up with current releases of standards. To make
matters even worse, both companies refuse to make a
public statement in regards to any of these issues and their intentions.
Thus one pretty much has to either cough up $5K on the off chance
that this makes any difference (I have my doubts), or do what we ended up
doing, porting off of their product to IBM (or whatever)
Interestingly, i think that this is going to really hurt XML/XSL because
most serious app developers would like to use a decent java
package for server development, but the most prevalent client environment is
IE5 which is turning out to be significantly different.
This means that server XML/XSL and client XML/XSL de facto standards are now
divergent. It remains to be seen what MS and or
datachannel have up their sleeve, but right now it is kind of bleak for
those like our company that would benefit a lot from identical
server and client standards for XML and XSL (T)
I personally am waiting to see how commercial licenses work with the major
vendors. The best seems to be IBM who give away source
and pretty open licenses for no charge (I am no lawyer, so YMMV)
good luck,
erik
-----Original Message-----
From: owner-xml-dev@ic.ac.uk [mailto:owner-xml-dev@ic.ac.uk]On Behalf Of
Marko Zerdin
Sent: Thursday, September 09, 1999 3:32 AM
To: 'xml-dev@ic.ac.uk'
Subject: MSXML for Java Questions
Hi everybody.
1) Does anybody know if Microsoft is going to support public (free) MSXML
packages for Java? The signals from Microsoft have been far from unanimous.
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From paul at prescod.net Thu Sep 9 17:50:26 1999
From: paul at prescod.net (Paul Prescod)
Date: Mon Jun 7 17:14:48 2004
Subject: DOM and Grove
References: <19990909043413.A9400@w3.org> <19990909083042.E9400@w3.org>
Message-ID: <37FF5D1E.77AEECB6@prescod.net>
Imagine that you had been trained for several years on the ODBC API. Now
you go to university and a professor starts talking about the
"relational database model." You might ask: "how is that different than
ODBC." The answer is that ODBC is a particular representation of the
relational model for programming languages. The DOM can be considered a
particular representation of the grove model for programming languages.
But the ODBC versus relational model is not quite sufficient to capture
the difference. ODBC was built on top of the relational model. The DOM
people used the grove model "for ideas" but the DOM is not based on any
well-defined abstract model at all -- not the grove, not the information
set, not the relational model, not anything. This is reflected in its
odd mix of methods, properties and names "getElementsByTagName"
"TagName", "NodeValue" etc. It's notion of iteration and state is famous
for being rather odd. The DOM is more like a pre-relational API in this
regard.
The major difference between the DOM and an API that could be built on
top of groves (such as that created by James Clark or by GroveMinder) is
that the DOM model is to add support for different data types one by one
through a central committee.
But the important thing to recognize is that the grove is not an API. It
can get lost in Steve's understandable focus on GroveMinder but the use
of the grove as an API is an implementation optimization (in the sense
of optimizing programmer time). The real reason groves were invented was
to answer the question:
* what is the result of hyperlinking into an arbitrary media type?
What are the properties of the abstract object returned? The grove
answers that question: the object has properties such as "parent",
(possibly null), "children" (possibly null), "containing entity" and so
forth.
You cannot build a sophisticated hypertext system without answering that
question. This will become apparent after XLink, XPointer and RDF are
implemented. We'll start to see many divergences of behavior when links
are made into (e.g.) PDF, MPEGs, JPEGs and so forth. Over time we will
need to develop a framework for describing the correct results of links
in a generic way. Then we will reinvent groves.
Or not... we could keep doing things in an ad hoc manner for ever I
suppose. It would be expensive and inefficient but it is possible.
Paul Prescod
Daniel Veillard wrote:
>
> > Daniel said:
> > ------------------------------------------------
> > When faced with "groves" I still have a serious problem with both
> > points:
> > - Show me a definition so that I can understand the term and
> > underlying concept clearly enough that an implementation
> > time is spend not collecting and reading papers but implemening
> > something well defined.
> > Even reading http://www.prescod.net/groves/shorttut/ I still can't
> > get a clear definition of "what is a grove precisely".
> > Not at the concept level, but a implementable definition say
> > on top of the XML infoset (for XML documents).
> >
> > Didier says:
> > ------------------------------------------------
> > I agree with you on that point. Actually groves are defined as abstract
> > entities and there is no common API to groves (each implementer ca define
> > its own interface).
>
> Ok, having followed the http://www.netfolder.com/DSSSL/ link and
> looking at http://www.oasis-open.org/cover/topics.html#groves
> The following seems at least a clear and concise definition:
> In HyTime ISO/IEC 10744:1997 "3. Definitions (3.35)": graph
> representation of property values is 'An abstract data structure
> consisting of a directed graph of nodes in which each node may be
> connected to other nodes by labeled arcs.
>
> Ahhh ... and
>
> GROVE - "Graph Representation of Property Values."
>
> Ok, this becomes clearer.
> And the picture clarify it even more !
>
> http://www.cogsci.ed.ac.uk/~ht/grove.html
>
> > Daniel says:
> > -----------------------------------------------
> > - Show me the code. Not that there is none, I just don't know.
> > Is there a program available in source code, that I can run
> > on say a laptop in front of a novice (but programmer kind)
> > audience (say a Gnome developper's group) allowing me in 3 mn
> > to show a "grove" in action and what it does for them.
> >
> > Didier says:
> > -----------------------------------------------
> > Easy, just use the "Linux of the markup technologies" aka OpenJade. The code
> > is freely available, several developers around the globe are already
> > improving it each day. The source code is stored on a CVS server and like I
> > said, is freely available. OpenJade includes the SGML grove plan but the
> > grove is transient (i.e. resident on the heap). This is a stand alone module
> > that could be reused in other code. The grove is available as a DCOM object
> > or a C++ object. You may make your own implementation if you want or change
> > the interface. For more information:
> > http://www.netfolder.com/DSSSL
>
> Ok, coincidentally i'm doing something similar it seems on top of my XML
> parser except that I based my node structure on a DOM point of view:
>
> http://rpmfind.net/veillard/XML/
>
> Is there a clear distinction between the model exposed by a DOM API
> and a grove, I can't even find a single DOM reference in
> http://www.oasis-open.org/cover/topics.html#groves
>
> Looking at Henry graphic, it's nearly 100% equivalent to what I build
> so I'm wondering, but I remember people pointing out that those are
> different.
>
> thanks for the pointers,
>
> Daniel
>
> --
> Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | Today's Bookmarks :
> Tel : +33 476 615 257 | 655, avenue de l'Europe | Linux, WWW, rpmfind,
> Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | rpm2html, XML,
> http://www.w3.org/People/W3Cpeople.html#Veillard | badminton, and Kaffe.
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
> FFrom zope-admin@zope.org Thu Sep 9 07:46:39 1999
> Received: from lambe.pair.com (lambe.pair.com [209.68.1.135])
> by amati.techno.com (8.8.8/8.8.8) with ESMTP id HAA02271
> for ; Thu, 9 Sep 1999 07:46:33 -0500
> Received: from zope.codeit.com (www.zope.org [209.67.167.110]) by lambe.pair.com (8.9.1/8.6.12) with ESMTP id IAA18429 for ; Thu, 9 Sep 1999 08:46:32 -0400 (EDT)
> X-Envelope-To:
> Received: from www2 (localhost [127.0.0.1])
> by zope.codeit.com (8.9.2/8.8.7) with ESMTP id FAA05917;
> Thu, 9 Sep 1999 05:37:21 -0700 (PDT)
> Received: from erwin.complete.org (erwin.complete.org [209.197.212.34])
> by zope.codeit.com (8.9.2/8.8.7) with ESMTP id FAA05895
> for ; Thu, 9 Sep 1999 05:37:15 -0700 (PDT)
> Received: (from jgoerzen@localhost)
> by erwin.complete.org (8.9.3/8.9.3/Debian/GNU) id HAA01185;
> Thu, 9 Sep 1999 07:36:55 -0500
> Sender: jgoerzen@erwin.complete.org
> To: "Phillip J. Eby"
> Cc: zope@zope.org
> Subject: Re: [Zope] and (was Re: Proper way to set variables)
> References: <199909060408.XAA26630@erwin.complete.org> <3.0.5.32.19990908171639.0090f6e0@telecommunity.com>
> Mime-Version: 1.0 (generated by tm-edit 7.108)
> Content-Type: text/plain; charset=US-ASCII
> From: John Goerzen
> Date: 09 Sep 1999 07:36:55 -0500
> In-Reply-To: "Phillip J. Eby"'s message of "Wed, 08 Sep 1999 17:16:39 -0500"
> Message-ID: <87btbc5kig.fsf@erwin.complete.org>
> Lines: 56
> X-Mailer: Gnus v5.6.45/XEmacs 20.4 - "Emerald"
> Sender: zope-admin@zope.org
> Errors-To: zope-admin@zope.org
> X-Mailman-Version: 1.0b8
> Precedence: bulk
> List-Id: Users of the Z Object Publishing Environment
> X-BeenThere: zope@zope.org
>
> Excellent, this sounds like exactly what I need to fix. Can you send
> the patch along to me?
>
> "Phillip J. Eby" writes:
>
> > At 02:02 PM 9/8/99 -0500, John Goerzen wrote:
> > >Heh, feels strange to reply to myself, but as I got no replies before,
> > >maybe I need to elaborate a bit. The original question is below.
> > >
> > >My main confusion is just how do I set a variable that is dynamically
> > >scoped without having to use dtml-with or dtml-let, both of which
> > >require matching closing tags?
> > >
> >
> > FYI, I have just written a patch for DT_Let.py that adds two new tags,
> > 'local' and 'set'. They are used like this:
> >
> >
> >
> >
> >
> > ...
> >
> >
> >
> >
> >
> > Both blocks and tags are dynamically scoped and
> > can be used anywhere as long as there is an enclosing
> > block defining the variable you want to set. You can only
> > set variables which are defined by the innermost block, but
> > you can reset their value any number of times, even within the same
> > tag.
> >
> > I am currently testing this code in an alpha app, but will be submitting it
> > some time in the near future to Digital Creations for consideration of
> > inclusion in future Zope versions.
> >
> >
> > _______________________________________________
> > Zope maillist - Zope@zope.org
> > http://www.zope.org/mailman/listinfo/zope
> >
> > (To receive general Zope announcements, see:
> > http://www.zope.org/mailman/listinfo/zope-announce
> >
> > For developer-specific issues, zope-dev@zope.org -
> > http://www.zope.org/mailman/listinfo/zope-dev )
> >
>
> --
> John Goerzen Linux, Unix consulting & programming jgoerzen@complete.org |
> Developer, Debian GNU/Linux (Free powerful OS upgrade) www.debian.org |
> ----------------------------------------------------------------------------+
> The 176,239,285th prime number is 3,697,131,523.
>
> _______________________________________________
> Zope maillist - Zope@zope.org
> http://www.zope.org/mailman/listinfo/zope
>
> (To receive general Zope announcements, see:
> http://www.zope.org/mailman/listinfo/zope-announce
>
> For developer-specific issues, zope-dev@zope.org -
> http://www.zope.org/mailman/listinfo/zope-dev )
>
> rom python-list-request@cwi.nl Thu Sep 9 07:46:40 1999
> Received: from smv03.iname.net (lmtp06.iname.net [165.251.8.61])
> by amati.techno.com (8.8.8/8.8.8) with SMTP id HAA02274
> for ; Thu, 9 Sep 1999 07:46:33 -0500
> Received: from hera.cwi.nl (hera.cwi.nl [192.16.191.1])
> by smv03.iname.net (8.9.3/8.9.1SMV2) with ESMTP id IAA05852
> for sent by ; Thu, 9 Sep 1999 08:46:32 -0400 (EDT)
> Received: from localhost by hera.cwi.nl with ESMTP
> id OAA04566; Thu, 9 Sep 1999 14:31:17 +0200 (MET DST)
> From: Guido van Rossum
> Newsgroups: comp.lang.python
> Subject: Re: SocketServer on NT slow for large transactions
> Date: 09 Sep 1999 08:11:28 -0400
> X-Organization: Corporation for National Research Initiatives
> Message-ID: <5logfc474f.fsf@eric.cnri.reston.va.us>
> References: <7r7ipf$lqj$1@news.duke.edu>
> X-Trace: ffx2nh5.news.uu.net 936879095 16700 132.151.1.38 (9 Sep 1999 12:11:35 GMT)
> X-Complaints-To: news@ffx2nh5.news.uu.net
> X-Newsreader: Gnus v5.6.45/XEmacs 21.1 - "Big Bend"
> To: python-list@cwi.nl
> Sender: python-list-request@cwi.nl
> Errors-To: python-list-request@cwi.nl
>
> "eric jones" writes:
>
> > I wrote a very simple program using based on the
> > SocketServer.StreamRequestHandler, and found that it crawled along at a
> > snails pace. I've traced the problem to the socket._fileobject.
>
> Yes. This has been fixed in the CVS version though -- see
> python.org/download/cvs.html.
>
> --Guido van Rossum (home page: http://www.python.org/~guido/)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From simonstl at simonstl.com Thu Sep 9 17:55:54 1999
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun 7 17:14:48 2004
Subject: ATTN: Please comment on XHTML (before it's too late)
In-Reply-To: <37FF5306.ADB5D501@prescod.net>
References: <872567E5.0077E475.00@d53mta03h.boulder.ibm.com>
Message-ID: <199909091558.LAA17320@hesketh.net>
At 07:36 AM 10/9/99 -0700, Paul Prescod wrote:
>roddey@us.ibm.com wrote:
>>
>> >Adding three lines of code introduces the opportunity for three times
>> >the bugs in a real application? I don't buy it. Looking for "HTML 4.0
>> >strict" OR "HTML 4.0 frameset" is no harder than looking for "UL" OR
>> >"OL", if the software is set up right.
>
>> I think its more complex than that. Its more than adding 3 lines of
code. What
>> if you want to validate the XHTML?
>
>I can validate all three variants of XHTML using parsers that are three
>years old already!!!! Download the DTD, download nsgmls or IE5 and try
>it yourself!
If all you care about is the prefixes, of course you can. If you actually
want the URIs, it's a lot more complicated, as we've been discussing for
the last few days.
(Unless, of course, nsgmls was built with the discussions we've had in the
last week regarding namespaces and validation in mind, which would involve
a remarkable amount of foresight.)
Now returning to my usual program of Central New York AM radio,
Simon St.Laurent
XML: A Primer (2nd Ed - September)
Building XML Applications
Inside XML DTDs: Scientific and Technical
Sharing Bandwidth / Cookies
http://www.simonstl.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ejfreed at infocanvas.com Thu Sep 9 18:04:28 1999
From: ejfreed at infocanvas.com (Erik James Freed)
Date: Mon Jun 7 17:14:48 2004
Subject: MSXML for Java Questions
In-Reply-To:
Message-ID:
oops! I should mention that the term 'commercial support' means that they
actually
fix simple trivially demonstrable showstopping bugs... Of course there were
no
guarantees expressed about when.
erik
-----Original Message-----
From: owner-xml-dev@ic.ac.uk [mailto:owner-xml-dev@ic.ac.uk]On Behalf Of
Erik James Freed
Sent: Thursday, September 09, 1999 8:50 AM
To: Marko Zerdin; xml-dev@ic.ac.uk
Subject: RE: MSXML for Java Questions
I am not sure I understand question #1, but if it helps, I know is that
microsoft has in some way backed the company datachannel who
supply a java version of its IE5 xml/xsl com components. This product came
out earlier this year. (see www.datachannel.com) . Though
their documentation does not at all suggest this, datachannel is *not*
providing commercial support for this product unless you pay $5K per year.
(!)
The other problem is that there are those who believe that this product is
far from production worthy, and it is definitely far behind other
XML/XSL products in keeping up with current releases of standards. To make
matters even worse, both companies refuse to make a
public statement in regards to any of these issues and their intentions.
Thus one pretty much has to either cough up $5K on the off chance
that this makes any difference (I have my doubts), or do what we ended up
doing, porting off of their product to IBM (or whatever)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ken_north at compuserve.com Thu Sep 9 18:05:15 1999
From: ken_north at compuserve.com (Ken North)
Date: Mon Jun 7 17:14:48 2004
Subject: ANN: XML and Databases article
References: <01BEFAB8.224F6E70@grappa.ito.tu-darmstadt.de>
Message-ID: <002301befadd$917f2d60$0b00a8c0@grissom>
Subject: RE: ANN: XML and Databases article
> Ronald Bourret wrote
> Steven R. Newcomb wrote:
> What do you mean by "different databases" here? If you mean relational
> databases v. hierarchical databases v. object-oriented databases, then
> there's no problem -- I would expect each to have a different property
set.
> On the other hand, if you mean DB2 v. Oracle v. Informix v. SQL Server,
> then it seems there is something broken -- I would very much expect all
> relational databases to have the same property set.
There are a few points that haven't been made in this discussion.
It's a mistake to classify DB2, Oracle, Informix, and SQL Server as
relational DBMSs having the same logical data model.
DB2, Oracle, Informix, and Microsoft SQL Server are all SQL DBMSs, but the
first three are object-relational products. They have extensible
architectures so you can add user-defined types and user-defined methods to
customize a database. You can add custom access methods and indexes by
writing server extensions. You can install Java classes in a database to add
types and methods
(http://www.devx.com/upload/free/features/javapro/1999/03mar99/kn0399/kn0399
.asp)
There is a distinction between an active database and a passive data store.
You can embed logic in active databases, so they can act as event alerters
and rule enforcers (e.g., reject data that isn't within a certain range of
values).
Finally, the days when SQL DBMSs stored only columns of number and
characters are gone. Some still do, but that is not a defining
characteristic. Most of the major SQL vendors moved to a universal server
model that supports rich types as well as traditional tabular data. For
storing an XML document, you have the option of decomposing it or storing it
as a whole.
query tool), not in others (wanting to expose tabular database
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From david-b at pacbell.net Thu Sep 9 18:27:38 1999
From: david-b at pacbell.net (David Brownell)
Date: Mon Jun 7 17:14:48 2004
Subject: MSXML for Java Questions
References:
Message-ID: <37D7E05B.D4BD233F@pacbell.net>
Erik James Freed wrote:
>
> The other problem is that there are those who believe that this product is
> far from production worthy, and it is definitely far behind other
> XML/XSL products in keeping up with current releases of standards. To make
> matters even worse, both companies refuse to make a
> public statement in regards to any of these issues and their intentions.
Depends what you mean by "public statement". I count leaving
major bugs as unfixed to be significant statements. Microsoft
bundling the rather ancient MSXML into their latest version of
a Java VM _without bugfixes_ is yet another sort of statement.
(About standards and perhaps DataChannel.)
> This means that server XML/XSL and client XML/XSL de facto standards are now
> divergent. It remains to be seen what MS and or
> datachannel have up their sleeve, but right now it is kind of bleak for
> those like our company that would benefit a lot from identical
> server and client standards for XML and XSL (T)
I'd put it differently: the XML/XSL standards are pretty clear,
but what's lacking is browser support, and there's no announced
path whereby Microsoft's browsers will conform. There may yet
be such a path -- but folk have been asking for conformance for
some time, and not seeing it.
- Dave
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From macherius at darmstadt.gmd.de Thu Sep 9 18:40:07 1999
From: macherius at darmstadt.gmd.de (Ingo Macherius)
Date: Mon Jun 7 17:14:48 2004
Subject: ANN: XML and Databases article
In-Reply-To: <002301befadd$917f2d60$0b00a8c0@grissom>
Message-ID: <199909091642.SAA16413@sonne.darmstadt.gmd.de>
Ken North wrote at 9 Sep 99, 9:08:
> Finally, the days when SQL DBMSs stored only columns of number and
> characters are gone. Some still do, but that is not a defining
> characteristic. Most of the major SQL vendors moved to a universal server
> model that supports rich types as well as traditional tabular data. For
> storing an XML document, you have the option of decomposing it or storing it
> as a whole.
Right, but the price is implementing stored procedures. I dunno how
the good Oracle 8i interfaces are, but with Informix it's no fun. We
did it, and abadoned it. The other problem is you can store XML, but
have to explicitely model relations (parent, sibling, child, ... and
all other axis you need) by foreign keys. The number of "metatables"
with structural information quickly outnumbers the useful data. Again
we tried, benched and canceled.
Is there a paper, page or software that can store XML along with it's
structural information in a RDBMS, be it classic or OO-relational,
you are thinking of ?
Currently we are investigating our PDOM, which just maps between a
W3C-DOM and persistent streams of serialized Java objects (paper will
be at OOPSLA99). Instead of modeling structure we try to scale by
means of intelligent, specialized cache strategies. Results are
promising.
Guido Moerkotte and C-C Kanne from the Univerity of Mannheim are
working on intelligent clustering strategies, which try to keep trees
in clusters suited for typical XML access patterns. They also have
means to cluster better when given semantic user constraints on
granularity.
While we currently work with the DOM, the results could be easily
applied to Groves. However, giving up the DOM would mean to lose lots
of handy middleware. Our XQL processor has to work on a well defined
data structure, the DOM is sufficient for now. Paul Prescod in his
Grove tutorial precisely points DOMs weaknesses, but there is nothing
to replace it right now.
We discussed to use the Infoset as a data model, but its hard to
write an optimizing XQL processor against a model where half the
features are either optional or not present (schema stuff).
Especially the fact you can't tell how many children a node will have
(e.g. when some are optional, or only present when validating) is not
what I call a well defined model for queries.
++im
--
Ingo Macherius//Dolivostrasse 15//D-64293 Darmstadt//+49-6151-869-882
GMD-IPSI German National Research Center for Information Technology
mailto:macherius@gmd.de http://www.darmstadt.gmd.de/~inim/
Information!=Knowledge!=Wisdom!=Truth!=Beauty!=Love!=Music==BEST (Zappa)
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
From ejfreed at infocanvas.com Thu Sep 9 18:58:40 1999
From: ejfreed at infocanvas.com (Erik James Freed)
Date: Mon Jun 7 17:14:48 2004
Subject: MSXML for Java Questions
In-Reply-To: <37D7E05B.D4BD233F@pacbell.net>
Message-ID: