From David.Brownell at Eng.Sun.COM Fri May 1 00:23:17 1998 From: David.Brownell at Eng.Sun.COM (David Brownell) Date: Mon Jun 7 17:00:59 2004 Subject: SAX 1.0beta: Three bugs so far Message-ID: <199804302221.PAA24043@argon.eng.sun.com> > > BUG #2: Parser.setLocale takes only one String argument > > > > As will quickly become apparent, I am not an expert in > > localisation. I have discovered that localisation requires both > > a language code _and_ a country code, so I have changed the > > interface prototype to > > > > public abstract void setLocale (String language, String country) > > throws SAXException; > > > > Does this look correct? Would people prefer that I use the > > java.util.Locale class? Locale ... > I think a single string, or unspecified parts, would be better. XML > allows RFC 1766 language identifiers, which can include i-cherokee and > x-klingon. The language-country form is only one class of valid > language identifier. But this is instructions to the parser, which will be using the standard Java localisation facilities which center around Locale objects in order to format any messages. It's not related to the text of the document. - Dave xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 1 00:28:30 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:00:59 2004 Subject: Separation of formatting... References: <199804301555.QAA00020@GPO.iol.ie> Message-ID: <3548B8B6.10DFFC2E@technologist.com> Sean Mc Grath wrote: > ... > Here is a pieces of data marked up three ways:- > > Version 1 : Purely Formatting Mindset (RTF) > "{\i Customer} Joe Bloggs \par" > > Version 2 : SGML - Generic Markup >

CustomerJoe Bloggs

> > Version 3 : SGML - Data Modelling > Joe Bloggs > > Somewhere along the line, people started thinking > as in version 3 above. I have no idea when this > started to happen. Anyone out there know? >... These live on a continuum. I don't think that there is a shift from one to another. SGML allows you to decide what is formatting and what is abstraction. It is decided on an element by element basis and most DTDs will have different parts at different points along the continuum. For instance bibliographies typically tend towards data modelling, predictable body text towards what the middle, and unpredictable body text (e.g. web pages) towards the formatting side. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From srn at techno.com Fri May 1 01:14:46 1998 From: srn at techno.com (Steven R. Newcomb) Date: Mon Jun 7 17:00:59 2004 Subject: Separation of formatting... In-Reply-To: <199804301555.QAA00020@GPO.iol.ie> (message from Sean Mc Grath on Thu, 30 Apr 1998 16:55:17 +0100) References: <199804301555.QAA00020@GPO.iol.ie> Message-ID: <199804302313.SAA03170@bruno.techno.com> [Sean Mc Grath :] > Version 2 : SGML - Generic Markup >

CustomerJoe Bloggs

Maybe the

is generic markup, but the is procedural markup, not generic markup. > Version 3 : SGML - Data Modelling > Joe Bloggs The above is generic markup, for sure. "Generic" means "according to kind". "Generic markup" means markup that indicates *what kind of thing* is being delimited, rather than *what to do with the thing* that is being delimited. The latter is usually called "procedural markup". > I think for some poople, SGML is all about > Version 2 above. Entire books have been written that > use SGML to abstract the concepts of "paragraph", > "artwork" etc. from the typographic codes required to > achieve the result. Ain't it the truth. Well, evidently it's a useful attitude. > Somewhere along the line, people started thinking > as in version 3 above. I have no idea when this > started to happen. Anyone out there know? I think it was long before there was SGML. The radical implications of the modeling power of DTDs still escape most people, even though they were surprisingly well understood right at the outset. I think SGML's roots are firmly in generic markup, way back to the mid-1960s and the precursorial beginnings of the GenCode committee. I don't know the details, but the history is long, has many details, and is only slightly contentious. Norm Scharpf, President of the GCA, is one authority on the early history. -Steve -- Steven R. Newcomb, President, TechnoTeacher, Inc. srn@techno.com http://www.techno.com ftp.techno.com voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137) fax +1 972 994 0087 (at ISOGEN: +1 214 953 3152) 3615 Tanner Lane Richardson, Texas 75082-2618 USA xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Fri May 1 01:53:22 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:00:59 2004 Subject: Off track References: Message-ID: <35490EB4.A58F6437@allette.com.au> Simon St.Laurent wrote: > Apart from the number of people working on XML who are from assorted empires > and settler states, I'm really not sure what on earth you're talking about. I > know there are several issues with Unicode (65000 characters isn't enough), > but what in XML is so 'colonial'? Nothing in the sense that you are thinking of - what I meant was that a relatively small albeit it open, etc. group of people are making decisions that are going to have a profound impact on many millions of people for some time to come. I believe we are preparing to unleash a new culture on everyone; I wasn't referring to any particular cultural or ethnic group or issues therewith. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Fri May 1 02:24:48 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:00 2004 Subject: Off track Message-ID: <3.0.32.19980430192254.006ae490@postoffice.swbell.net> At 09:52 AM 5/1/98 +1000, Marcus Carr wrote: >Simon St.Laurent wrote: > >> Apart from the number of people working on XML who are from assorted empires >> and settler states, I'm really not sure what on earth you're talking about. I >> know there are several issues with Unicode (65000 characters isn't enough), >> but what in XML is so 'colonial'? > >Nothing in the sense that you are thinking of - what I meant was that a relatively >small albeit it open, etc. group of people are making >decisions that are going to have a profound impact on many millions of people for >some time to come. I believe we are preparing to unleash a new culture on >everyone; I wasn't referring to any particular cultural or ethnic group or issues >therewith. But isn't this almost always how these things happen? There's maybe 20 people who have direct and continuous input into SGML, DSSSL, etc. It must be the same for things like HTML, internet standards, products, laws, etc. Certainly a large number of people will influence the decisions of these small groups, but ultimately it comes down to a few people making critical decisions. No matter how open a process is, it always comes down to what the people who do the work do. The US constitution was framed by no more than about 50 people, of whom maybe 10 were involved in the bulk of the decisions made, with the core ideas coming from just two or three people (Jefferson, Madison, etc.). The decisions of that small group of people working largely in secret in 1789 have profoundly affected world history. I would like to think that, as a body and as individuals, that the members of the XML WG take their resposibility very seriously--my experience is that we do--it's one of the reasons that I'm willing and proud to participate in the process. It doesn't mean we'll always make the best technical decision--we're only human, but we certainly don't make any decisions lightly. And of course, we have to play the hand we're dealt, whether we like it or not. Cheers, E. --

W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From greyno at mcs.com Fri May 1 03:12:13 1998 From: greyno at mcs.com (Gregg Reynolds) Date: Mon Jun 7 17:01:00 2004 Subject: Separation of formatting... References: <199804301555.QAA00020@GPO.iol.ie> <199804302313.SAA03170@bruno.techno.com> Message-ID: <354913CC.28B1@mcs.com> Steven R. Newcomb wrote: > > [Sean Mc Grath :] > > > Version 2 : SGML - Generic Markup > >

CustomerJoe Bloggs

> > Maybe the

is generic markup, but the > is procedural markup, not generic markup. > > > Version 3 : SGML - Data Modelling > > Joe Bloggs > > The above is generic markup, for sure. > > "Generic" means "according to kind". "Generic markup" means markup > that indicates *what kind of thing* is being delimited, rather than > *what to do with the thing* that is being delimited. The latter is > usually called "procedural markup". In light of the ongoing and eternal debate over just what markup is/means/does, the following passages may interest you. I think the line between, on the one hand, what you call generic and what Paul (if I understand him properly) calls abstract markup, and procedural or formatting instructions is pretty thin when you look closely. From "Ad Herennium", usually attributed to Cicero: "Colon or Clause is the name given to a sentence member, brief and complete, which does not express the entire thought, but is in turn supplemented by another colon, as follows: 'On the one hand you were helping your enemy.' That is one so-called colon; it ought then to be supplemented by a second: 'And on the other you were hurting your friend." This figure can consist of two cola, but it is neatest and most complete when composed of three, as follows: 'You were helping your enemy, you were hurting your friend, and you were not consulting your own best interests.' (Was the modern editor just being a smarty-pants by omitting colons from the quoted examples?) Tully's not talking about written punctuation marks here (markup), but about rhetorical techniques of the public orator - phrasing, rhythm, delivery. In other words, processing instructions. The text goes on to discuss "Comma or Phrase" and "Period" among other things, all in terms of the orator's art. Viewed in this light, the common punctuation marks (including paragraph and other layout conventions) which most of us probably think of as abstract or generic demarcations of thought-units (or something like that), may be viewed as notes on how delivery: distincly how-to. Still works that way, as anybody who's ever heard a recitation by a bad reader can attest. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Fri May 1 11:16:36 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:00 2004 Subject: Off track In-Reply-To: <35490EB4.A58F6437@allette.com.au> References: Message-ID: <3.0.1.16.19980501090437.62bf2ec4@pop3.demon.co.uk> At 09:52 01/05/98 +1000, Marcus Carr wrote: [... in response to others ...] [...] >Nothing in the sense that you are thinking of - what I meant was that a relatively >small albeit it open, etc. group of people are making >decisions that are going to have a profound impact on many millions of people for >some time to come. I believe we are preparing to unleash a new culture on >everyone; I wasn't referring to any particular cultural or ethnic group or issues >therewith. We are undoubtedly part of a very radical process in human history. When Faraday was asked about the use of one of his discoveries (I think it was electromagnetic induction) he is reputed to have said: "Madam, what is the use of a new born baby?". [BTW without electromagnetic induction there would be no XML...] I am very grateful to be part of a virtual community which is international and includes many disciplines, especially those related to linguistics. For example the curators of the TEI hold much of the world's heritage in their care; the quotation from Cicero is an example of the thread that binds us over space and time. [BTW I favour return to Roman monumental inscriptions - no case-sensitivity and no whitespace, and the *practice* has remained future-proof for 2 millennia.] Although it is confidential, the discussion about i18n and especially case-sensitivity on XML-SIG relied heavily on the members' knowledge of non-ISOLatin languages and cultures, and the SIG/WG possibly suffers through that not being widely known. I am very conscious of the separation of the world - I spent two years in W.Africa and am very grateful for the insight I gained into neo-colonialism. The information revolution has the power to bring great benefit to the world, especially in making education more easily available. I am sure that XML forms a major platform for the educational technology of the future - we have a clear responsibility to use it wisely. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tony at ems.uq.edu.au Fri May 1 11:20:16 1998 From: tony at ems.uq.edu.au (Anthony B. Coates) Date: Mon Jun 7 17:01:00 2004 Subject: Announce: xml-litprog-l Message-ID: <35499355.5D10F3B0@ems.uq.edu.au> I have created a mailing list, "xml-litprog-l", for discussions about how best to use XML/XSL/XLink/XPtr for constructing literate programming frameworks and tools. I'm interested in being able to share ideas with (i) people who know more about XML and/or SGML than I do; (ii) people who know more about literate programming than I do; (iii) anyone else who is interested. If you think that you fit into one of these groups, then please consider joining (I'm not expecting it to be a high traffic list, but I hope it will be a profitable list, in non-monetary terms). The Web page with all the details is at Some people have already expressed to me an interest in being on this list. However, for reasons of "netiquette", I won't subscribe anyone automatically. If you are interested, please visit the Web page (or send the message "subscribe xml-litprog-l" to "majordomo@ems.uq.edu.au". Cheers, Tony. -- Educational Multimedia Services = reduced workloads for lecturers, teachers, and tutors = better results for students. -- Another 100% Pure Java e-mail. Is yours? -- Anthony B. Coates. Multimedia Developer (Software Design) Educational Multimedia Services TEDI, The University of Queensland. AJUG-QLD Steering Committee Member (Australian Java Users' Group ). QMUG Committee Member (Queensland Macromedia Users' Group ). E-mail: tony@ems.uq.edu.au WWW: http://www.ems.uq.edu.au/People/Tony/ UIN: 5191015 -- All opinions are mine, and may not represent those of The University of Queensland. -- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tony at ems.uq.edu.au Fri May 1 11:22:44 1998 From: tony at ems.uq.edu.au (Anthony B. Coates) Date: Mon Jun 7 17:01:00 2004 Subject: Announce: xml-litprog-l Message-ID: <35499355.5D10F3B0@ems.uq.edu.au> I have created a mailing list, "xml-litprog-l", for discussions about how best to use XML/XSL/XLink/XPtr for constructing literate programming frameworks and tools. I'm interested in being able to share ideas with (i) people who know more about XML and/or SGML than I do; (ii) people who know more about literate programming than I do; (iii) anyone else who is interested. If you think that you fit into one of these groups, then please consider joining (I'm not expecting it to be a high traffic list, but I hope it will be a profitable list, in non-monetary terms). The Web page with all the details is at Some people have already expressed to me an interest in being on this list. However, for reasons of "netiquette", I won't subscribe anyone automatically. If you are interested, please visit the Web page (or send the message "subscribe xml-litprog-l" to "majordomo@ems.uq.edu.au". Cheers, Tony. -- Educational Multimedia Services = reduced workloads for lecturers, teachers, and tutors = better results for students. -- Another 100% Pure Java e-mail. Is yours? -- Anthony B. Coates. Multimedia Developer (Software Design) Educational Multimedia Services TEDI, The University of Queensland. AJUG-QLD Steering Committee Member (Australian Java Users' Group ). QMUG Committee Member (Queensland Macromedia Users' Group ). E-mail: tony@ems.uq.edu.au WWW: http://www.ems.uq.edu.au/People/Tony/ UIN: 5191015 -- All opinions are mine, and may not represent those of The University of Queensland. -- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tfj at apusapus.demon.co.uk Fri May 1 12:42:47 1998 From: tfj at apusapus.demon.co.uk (Trevor Jenkins) Date: Mon Jun 7 17:01:00 2004 Subject: Announce: xml-litprog-l In-Reply-To: <35499355.5D10F3B0@ems.uq.edu.au> Message-ID: <199805010948.tfj.21089@apusapus.demon.co.uk> > I have created a mailing list, "xml-litprog-l", for discussions > about how best to use XML/XSL/XLink/XPtr for constructing literate > programming frameworks and tools. I'm interested in being able to > share ideas with > (i) people who know more about XML and/or SGML than I do; > (ii) people who know more about literate programming than I do; > (iii) anyone else who is interested. What a good idea. Back in 1985/6 I proposed that the ISO Technical Report "Techniques for Using SGML" include an example of Literate Programming! Some of the other committee members prefered a Z example. I gave way to their "pressure" and forgot the literate programming example; no one provided the Z example either. There are some things in SGML that would mean existing web files could have been processed whereas they could not be processed with XML. Regards, Trevor. -- <>< Re: deemed xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Fri May 1 13:32:44 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:00 2004 Subject: SAX 1.0beta: Three bugs so far References: <199804300045.UAA00461@unready.microstar.com> Message-ID: <3549A239.C0199123@jclark.com> David Megginson wrote: > > BUG #2: Parser.setLocale takes only one String argument > > As will quickly become apparent, I am not an expert in > localisation. I have discovered that localisation requires both a > language code _and_ a country code, so I have changed the interface > prototype to > > public abstract void setLocale (String language, String country) > throws SAXException; > > Does this look correct? Would people prefer that I use the > java.util.Locale class? I would prefer java.util.Locale: setLocale(java.util.Locale locale) is the standard Java pattern. Since Java also provides a 3-argument variant of the Locale constructor, I don't think setLocale(String language, String country) is sufficient. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Fri May 1 14:38:01 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:00 2004 Subject: SAX 1.0: problem with InputSource Message-ID: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk> I am trying to subclass InputSource to allow a File to be supplied as an alternative to a SystemId, CharacterStream, etc. (The implementation of this goes through the magic incantations to turn a file name into a URL). This is proving clumsy because the InputSource class has no default constructor. Since there is a full range of setX() methods, and given the general usefulness of having a default constructor e.g. for JavaBeans / ActiveX, should this omission be rectified? Mike Kay xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 1 14:53:45 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:00 2004 Subject: SAX 1.0: problem with InputSource In-Reply-To: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk> References: <003101bd74fe$120d0400$1e09e391@mhklaptop.bra01.icl.co.uk> Message-ID: <199805011252.IAA00387@unready.microstar.com> Michael Kay writes: > Since there is a full range of setX() methods, and given the > general usefulness of having a default constructor e.g. for > JavaBeans / ActiveX, > should this omission be rectified? Yes. Thanks for pointing this out. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 1 16:02:15 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:00 2004 Subject: Open standards processes Message-ID: <3549D5D2.CE54C193@technologist.com> Here are some more thoughts about open standards processes: The XML SIG was pretty wide open, which was admirable, but XSL the DOM, etc. seem to have no equivalent. My hypothesis is that the XML SIG became large before anyone "important" realized it was going to be a crucial technology. On the other hand, since the W3C groups do much of their work online, there is more of a "bit-trail" (available only to members, but available nevertheless) that I cannot see in ISO standards processes. I am involved in the ISO process, but do not go to the meetings, and proposals seem to drop down from heaven for us to vote on or dismiss. Surely we could achieve a more global, technology-mediated discussion in this day and age! Clearly no process is perfect, but I think that whining about them on XML-DEV is an important part of determining the imperfections. Standards bodies derive all of their legitimacy from public acquiescence (else, I would start one myself!). If the public were more aware of their goings on, they could be forced open and the vendors would still have no choice but to play along. They also derive their legitimacy from the public. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Fri May 1 17:15:59 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:00 2004 Subject: Open standards processes Message-ID: <005201bd7514$1ed4cc20$1e09e391@mhklaptop.bra01.icl.co.uk> >Here are some more thoughts about open standards processes: > To add my tuppenceworth, I've been involved in the past in both de jure and consortium standards-making processes, though all before the days of the web. To get a successful standard you need a core team who work hard, who are technically highly competent, and who understand the needs of the users as well as (a more common reason for failure) the needs of potential vendors. You need a consensus on the general principles and objectives, an aversion to introducing unproven innovations, and an absence of people with an interest in obstructing the process. You don't need consultation or democracy or legal authority; these can sometimes help to achieve the necessary consensus but can also slow things down or send things off in the wrong direction. But I don't think it serves any purpose to be secretive. I have certainly always believed that the more people knew what was going on, the greater the chance of success. Publishing work in progress will enable the user and vendor community to respond more rapidly when the thing is finally published, and will harness the resources of a wider group of people to spot the errors. I find it a little disappointing, now that there is no cost argument to prevent open dissemination, that W3C should (apparently) have a policy of secrecy which goes beyond anything I ever encountered in ISO or ANSI or X/Open or OMG committees. Perhaps the problem is that they would be deluged by feedback, but I doubt it. Michael Kay xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From paul at arbortext.com Fri May 1 17:32:05 1998 From: paul at arbortext.com (Paul Grosso) Date: Mon Jun 7 17:01:00 2004 Subject: Open standards processes Message-ID: <98May1.112853edt.26884@thicket.arbortext.com> At 10:01 1998 05 01 -0400, Paul Prescod wrote: >Here are some more thoughts about open standards processes: > >The XML SIG was pretty wide open, which was admirable, but XSL the DOM, >etc. seem to have no equivalent. I don't quite understand this statement. The open, public www-dom@w3.org mailing list has been in existence "forever," and the DOM work has been receiving some very useful feedback from people on that list. The open, public xsl-list@mulberrytech.com has been in existence for only two weeks less than the xsl-wg itself, and it has been quite active since its inception. The "problem" with the XML SIG was that everyone on that list had to be an "invited expert" and they were all bound by W3C-member confidentiality regulations. This was a logistic nightmare, and it actually meant that some important discussion was pre-empted from truly public lists like xml-dev. The XSL WG decided instead not to have this "intermediate" (and confusing and problematic) level mailing group, but instead to have just the public xsl-list (kindly hosted by Mulberry Technologies). We felt that was the more open, democratic thing to do. As far as XSL, if you think you're not seeing something that W3C members are seeing, you're wrong. We do not have anything yet to show (other than the requirements document that we are in the process of releasing to the public as we speak). We plan to have a first draft of the XSL 1.0 spec released to the public in July--the same time it is released to W3C members, as I understand it [mega-apologies if I'm wrong about this, but this is my current understanding]. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Fri May 1 17:47:55 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:01 2004 Subject: Open standards processes Message-ID: <01bd7532$54288c40$efaddccf@uspppBckman> >>But I don't think it serves any purpose to be secretive. I have certainly always believed that the more people knew what was going on, the greater the chance of success. Publishing work in progress will enable the user and vendor community to respond more rapidly when the thing is finally published, and will harness the resources of a wider group of people to spot the errors. I find it a little disappointing, now that there is no cost argument to prevent open dissemination,<< I think thats very well said. Even if there were a deluge of feed back, the compilers of the standard would be free to ignore it. I can think of no legitimate reason for secrecy. Even if the member-developers did want an edge on non-member developers, (assuming that that's a legimitate reason, which is debateable) if they need that kind of edge they have real problems and are not going to last very long!! Frank -----Original Message----- From: Michael Kay To: xml-dev Date: Friday, May 01, 1998 8:21 AM Subject: Re: Open standards processes >>Here are some more thoughts about open standards processes: >> >To add my tuppenceworth, I've been involved in the past in >both de jure and consortium standards-making processes, >though all before the days of the web. > >To get a successful standard you need a core team who work >hard, who are technically highly competent, and who >understand the needs of the users as well as (a more common >reason for failure) the needs of potential vendors. You need >a consensus on the general principles and objectives, an >aversion to introducing unproven innovations, and an absence >of people with an interest in obstructing the process. You >don't need consultation or democracy or legal authority; >these can sometimes help to achieve the necessary consensus >but can also slow things down or send things off in the >wrong direction. > >But I don't think it serves any purpose to be secretive. I >have certainly always believed that the more people knew >what was going on, the greater the chance of success. >Publishing work in progress will enable the user and vendor >community to respond more rapidly when the thing is finally >published, and will harness the resources of a wider group >of people to spot the errors. I find it a little >disappointing, now that there is no cost argument to prevent >open dissemination, that W3C should (apparently) have a >policy of secrecy which goes beyond anything I ever >encountered in ISO or ANSI or X/Open or OMG committees. >Perhaps the problem is that they would be deluged by >feedback, but I doubt it. > >Michael Kay > > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jcupp at essc.psu.edu Fri May 1 17:54:36 1998 From: jcupp at essc.psu.edu (Jason R. Cupp) Date: Mon Jun 7 17:01:01 2004 Subject: Namespaces & Behavior Message-ID: <3549EFB3.6A24E89E@essc.psu.edu> Hello, If I were writing an XML parser (in PERL) and I wanted to consider adding functionallity to recognize tags in particular namespaces, then would it make sense that the parser could import locally, dynamically link, include, request from the namespace URI (broker fashion?)--whatever you want to call it--behaviors for that tag and its content such as simple tag replacement, special formatting, re-ordering of elements, basically full XSL I think? Jason R. Cupp (jcupp@essc.psu.edu) Deasy GeoGraphics The Pennsylvania State University xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Patrice.Bonhomme at loria.fr Fri May 1 19:32:57 1998 From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme) Date: Mon Jun 7 17:01:01 2004 Subject: XML 1.0 : Message-ID: <199805011731.TAA14818@chimay.loria.fr> I am very surprise to see that the XML spec 1.0 cannot authorize the following element declaration (because of the +): I am wrong ? Pat. -- ============================================================== bonhomme@loria.fr | Office : B.228 http://www.loria.fr/~bonhomme | Phone : 03 83 59 30 52 -------------------------------------------------------------- * Serveur Silfide : http://www.loria.fr/projets/Silfide * Projet Aquarelle : http://aqua.inria.fr ============================================================== xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 1 19:47:19 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:01 2004 Subject: XML 1.0 : In-Reply-To: <199805011731.TAA14818@chimay.loria.fr> References: <199805011731.TAA14818@chimay.loria.fr> Message-ID: <199805011746.NAA05101@unready.microstar.com> Patrice Bonhomme writes: > I am very surprise to see that the XML spec 1.0 cannot authorize > the following element declaration (because of the +): > > > > I am wrong ? Entirely correct; however, there is no XML content matched by (#PCDATA | s+)* that is not also matched by (#PCDATA | s)* More generally, (a+)* = (a*)+ = (a*)* = (a?)+ = (a?)* = (a)* XML forces you to reduce mixed content models to the simplest representation. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Patrice.Bonhomme at loria.fr Fri May 1 19:53:19 1998 From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme) Date: Mon Jun 7 17:01:01 2004 Subject: ELEMENT P (#PCDATA | S+)* In-Reply-To: Your message of "Fri, 01 May 1998 12:52:59 CDT." <199805011752.MAA07891@ACADCOMP.SIL.ORG> Message-ID: <199805011752.TAA15485@chimay.loria.fr> robin@ACADCOMP.SIL.ORG said: ] Hmmm... (#PCDATA | (s)+ )* should be OK. Yes it should be, but : /* Mixed-content Declaration */ [51] Mixed ::= '(' S? '#PCDATA' (S? '|' S? Name)* S? ')*' | '(' S? '#PCDATA' S? ')' Pat. -- ============================================================== bonhomme@loria.fr | Office : B.228 http://www.loria.fr/~bonhomme | Phone : 03 83 59 30 52 -------------------------------------------------------------- * Serveur Silfide : http://www.loria.fr/projets/Silfide * Projet Aquarelle : http://aqua.inria.fr ============================================================== xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Fri May 1 19:57:09 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:01 2004 Subject: XML 1.0 : Message-ID: <3.0.32.19980501105343.00b662e0@pop.intergate.bc.ca> At 07:31 PM 5/1/98 +0200, Patrice Bonhomme wrote: >I am very surprise to see that the XML spec 1.0 cannot authorize the following >element declaration (because of the +): > > That's right. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 1 20:31:11 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:01 2004 Subject: Open standards processes References: <98May1.112853edt.26884@thicket.arbortext.com> Message-ID: <354A151D.B718CB2C@technologist.com> Paul Grosso wrote: > > At 10:01 1998 05 01 -0400, Paul Prescod wrote: > >Here are some more thoughts about open standards processes: > > > >The XML SIG was pretty wide open, which was admirable, but XSL the DOM, > >etc. seem to have no equivalent. > > I don't quite understand this statement. > > The open, public www-dom@w3.org mailing list has been in existence "forever," > and the DOM work has been receiving some very useful feedback from people on > that list. www-dom@w3.org and xsl-list (which is not even a W3C mailing list) are NOT comparable to w3c-xml-sig@w3.org. The latter has published minutes, focused and moderated discussions that were carefully watched by essentially ALL members of the working group. The former is essentially a "feedback list" with little influence and hardly enough information to be influential even if it had more. I know, because I have the same role in both. > The "problem" with the XML SIG was that everyone on that list had to be > an "invited expert" and they were all bound by W3C-member confidentiality > regulations. This was a logistic nightmare, and it actually meant that > some important discussion was pre-empted from truly public lists like xml-dev. So the XML effort had three levels: "Voting members", "Influential non-voting experts" and "XML-DEV" and the DOM effort has essentially two levels "voting members" and "non-influential, non-voting experts" and you see that as more open? How so? Before there were some discussions I could only have "inside" (based on inside knowledge) and some I could only have "outside" (based on open knowledge) and now I can only have the "outside" discussions because nobody except the 20 people on the WG have inside knowledge. Wiping out a level of quasi-openness does not make the process more open, it makes it less. It's like abolishing the congress so that they don't "get in the way" of the president speaking directly to the people. It's good to give people more power (in this case information and voice). It doesn't have to be at the expense of someone else's power, however. > As far as XSL, if you think you're not seeing something that W3C members > are seeing, you're wrong. We do not have anything yet to show (other than > the requirements document that we are in the process of releasing to the > public as we speak). That the W3C members are as in the dark about what you are doing as we are is not comforting. I think that anyone should have access to the minutes of meetings as soon as they are released, as well as mailing list archives. In other words, the "audit trail" from the first idea to the last page-break in the rendered version of the spec. should be as transparent as possible. No, we do not achieve that in the XML WG. Yes, we come closer. This is why so many people in this mailing list who participated in that effort have warm, fuzzy feelings about the W3C right now. My point is that this warm and fuzzy cooperative effort was an aberration. As someone wrote in the Atlantic Monthly not too long ago: "Democracy's moment (such as it was) has passed." Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From digitome at iol.ie Fri May 1 21:29:43 1998 From: digitome at iol.ie (Sean Mc grath) Date: Mon Jun 7 17:01:01 2004 Subject: Open standards processes Message-ID: <1.5.4.32.19980501192208.00922c0c@gpo.iol.ie> On Wed 31 Dec 1997 I asked some basic questions about XSL on the DSSSL List. This was before the formation of the XSL list and after Jon Bosak saying that the DSSSL list was a suitable forum to discuss XSL. I sort of assumed that the XML-SIG style banter between the core experts and dabblers would be the order of the day. Paul Grosso replied to my enquiries with this comment >As a submission, it [XSL] is only a partial starting point, >and detailed discussion of it is of very questionable >utility (and I suspect most of the authors of the submission >will not find it well worth their time to reply). Some weeks later Arbortext announced an XSL product. My point? Any suggestion that the XSL etc. lists are run along the same "open" lines as XML-SIG is/was seems rather dubious to me. Sean Mc Grath http://www.digitome.com "there are two types of people in the world. Those who think the world consists of two types of people, and those who don't" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Sat May 2 00:26:38 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:01 2004 Subject: Imbed XML in HTML Message-ID: <01bd7569$ca762c40$e8acdccf@uspppBckman> Some of you might be interested in a demonstration of embedding XML in an HTML file. Its the third tip at. http://www.hypermedic.com/style/tips/tipindex.htm When you click on the link a msgbox with the generated HTML code appears followed by the Display of the XML document. Look at the source code to see how its done. This will only work on IE4 because the script uses the IE4 document object model. Frank Boumphrey XML and style sheet info at Http://www.hypermedic.com/style/index.htm xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 2 01:07:01 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:02 2004 Subject: Open standards processes In-Reply-To: <354A151D.B718CB2C@technologist.com> References: <98May1.112853edt.26884@thicket.arbortext.com> Message-ID: <3.0.1.16.19980501230607.77873dc8@pop3.demon.co.uk> At 14:31 01/05/98 -0400, Paul Prescod wrote: [...] > >www-dom@w3.org and xsl-list (which is not even a W3C mailing list) are NOT >comparable to w3c-xml-sig@w3.org. The latter has published minutes, >focused and moderated discussions that were carefully watched by >essentially ALL members of the working group. The former is essentially a >"feedback list" with little influence and hardly enough information to be >influential even if it had more. I know, because I have the same role in >both. > >> The "problem" with the XML SIG was that everyone on that list had to be >> an "invited expert" and they were all bound by W3C-member confidentiality >> regulations. This was a logistic nightmare, and it actually meant that >> some important discussion was pre-empted from truly public lists like xml-dev. > >So the XML effort had three levels: "Voting members", "Influential >non-voting experts" and "XML-DEV" and the DOM effort has essentially two >levels "voting members" and "non-influential, non-voting experts" and you >see that as more open? How so? Before there were some discussions I could Just to clear up any confusion - XML-DEV is not a W3C mailing list and has no formal part in the XML process - it's offered by the goodwill of Henry and myself in case people find it useful. Also -as far as XML-DEV is concerned - there is no reason why it can't play the same role for DOM as XML if people want (I at least am going to start hacking the DOM at some stage and will no doubt be asking the same sort of hairy questions as I ask here). I am reading the current discussion with interest. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From dan at intervista.com Sat May 2 01:45:49 1998 From: dan at intervista.com (Dan Ancona) Date: Mon Jun 7 17:01:02 2004 Subject: Separation of formatting... Message-ID: <3.0.32.19980501165640.00afa1e0@mail.intervista.com> At 08:14 PM 4/30/98 -0400, Gregg Reynolds wrote: > [ ... ] > about rhetorical techniques of the public > orator - phrasing, rhythm, delivery. In > other words, processing instructions. > [ ... ] IOW, the writing is to some extent output device independent, and oration is just one such device? That is SO cool. Anyway, the gist I'm getting from this thread is that a rigorous separation of formatting from any purely generic or abstract markup in a given XML file format is not strictly necessary. In fact, it could even be beneficial in some circumstances and for some formats. Correct? If I'm right, this is a good thing, since it always seemed to me that (despite how horribly broken in so many other ways) the unholy combination of a partial procedural markup and the complete lack of generic markup built into HTML was part of why it caught on. At least you could easily include the elements that you wanted, even if you couldn't actually lay them out at first (and could only do so by using kloogey stuff like tables and later). Dan __________________________________________________________ Dan Ancona "engage!" Tech Support, Evangelism and Cookery Intervista Software xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From lauren at sqwest.bc.ca Sat May 2 01:54:12 1998 From: lauren at sqwest.bc.ca (Lauren Wood) Date: Mon Jun 7 17:01:02 2004 Subject: Open standards processes In-Reply-To: <3.0.1.16.19980501230607.77873dc8@pop3.demon.co.uk> References: <354A151D.B718CB2C@technologist.com> <98May1.112853edt.26884@thicket.arbortext.com> Message-ID: <199805012354.QAA17523@sqwest.bc.ca> At 14:31 01/05/98 -0400, Paul Prescod wrote: >So the XML effort had three levels: "Voting members", "Influential >non-voting experts" and "XML-DEV" and the DOM effort has essentially two >levels "voting members" and "non-influential, non-voting experts" and you >see that as more open? How so? I don't think equating lack of voting rights and lack of influence is fair in terms of the DOM work. Just as in the XML SIG, where what was said was read and discussed by members of the XML WG, so are all postings to www-dom read and discussed by members of the DOM WG. Not all postings get direct replies, but the influence on the specs has been large. The people who take the time to read the specs and post to the www-dom list are not being ignored. Those who are interested in the current DOM discussions will find the archives for www-dom at http://lists.w3.org/Archives/Public/www-dom/ Lauren xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sat May 2 02:47:35 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:02 2004 Subject: Announcement: SAX 1.0gamma Message-ID: <199805020046.UAA02818@unready.microstar.com> Implementors have requested more time to test SAX 1.0 before it becomes final. There will be no additional changes other than bug fixes; however, since bugs have required several minor changes already, and since the final, official, righteous SAX 1.0 version might still be a week or more away, in the interest of openness I have uploaded SAX 1.0gamma (I don't like "beta2") to the following (new) URL: http://www.megginson.com/SAX/ (The final SAX 1.0 will be mirrored at http://www.microstar.com/XML/SAX/ as well). The latest JavaDoc is also available online. SAX is changing very little now, and it is probably safe to start basing implementations on it, as long as you're ready for a little tweaking. Here are the (very minor) changes since 1.0beta: Interface Changes ----------------- - bug fix: changed parameter to Parser.setLocale() to java.util.Locale (a single string is insufficient) - bug fix: added default constructor to InputSource, to simplify subclassing - bug fix: removed "extends IOException" from InputSource (it was retained accidentally from an earlier revision) - bug fix: in the ParserFactory helper class, used "org.xml.sax.parser" instead of "sax.parser" for the system property name Documentation Changes --------------------- - doc fix: clarify that attributes in AttributeList are zero-indexed - doc fix: clarify that Parser objects are non-re-entrant but reusable - doc fix: clarify that the parser should never modify the InputSource provided by the application - doc fix: clarify that the parser should not deliver the endDocument event until EOF or a non-recoverable error - moved sample drivers to com.megginson.sax package All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From greyno at mcs.com Sat May 2 04:22:43 1998 From: greyno at mcs.com (Gregg Reynolds) Date: Mon Jun 7 17:01:02 2004 Subject: Separation of formatting... References: <3.0.32.19980501165640.00afa1e0@mail.intervista.com> Message-ID: <354A75D1.4348@mcs.com> Dan Ancona wrote: > > Anyway, the gist I'm getting from this thread is that a rigorous separation > of formatting from any purely generic or abstract markup in a given XML > file format is not strictly necessary. In fact, it could even be > beneficial in some circumstances and for some formats. Correct? > Personally I've come around to a pragmatist position on this, after some time as a radical Free The Text purist. It's just not possible to encode information without also encoding something about what we're supposed to do with it, any more than it's possible to draw a "real" line segment. But we can certainly do some very useful things by trying. An interesting paper on a similar topic is at http://www.sil.org/sgml/ohco1.html, "Refining Our Notion of What Text Really Is: The Problem of Overlapping Hierarchies." -- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Sat May 2 05:19:13 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:02 2004 Subject: Separation of formatting... Message-ID: <004501bd7579$505bfba0$890b4ccb@NT.JELLIFFE.COM.AU> From: Gregg Reynolds >Dan Ancona wrote: >> >> Anyway, the gist I'm getting from this thread is that a rigorous separation >> of formatting from any purely generic or abstract markup in a given XML >> file format is not strictly necessary. In fact, it could even be >> beneficial in some circumstances and for some formats. Correct? > >Personally I've come around to a pragmatist position on this, after some >time as a radical Free The Text purist If we subdivide the logical/presentation split we can come up with the following 6 subdivisions: topical structures (e.g. ) editorial structures (e.g.

) characters glyphs page objects (e.g. a line) layout structures (e.g. a column) If there is a straightforward flow of dependence and synchronization between these different structures, then is possible to start at the topic structure and format all the way down to the layout (i.e. starting from a higher and piggybacking the settings of the lower based on the higher ones.) This is the underlying model of, e.g. CSS & DSSSL to a great extent: in DSSSL there is no feedback from the layout program to the stylesheet language. However, there are many kinds of documents where the flow of dependence is not one-way or straightforward. For example, in a magazine the page layout determines how many characters a heading sould have, and in some designs it might determine what kinds of editorial structures are possible I came across an extreme example in a loose leaf system: the writers of a prose section knew they had a fixed space to work in, but this could not be determined at authoring time. So they wrote knowing that the last paagraph might be deleted by the pagination system if it could not be fitted. This is a case where the layout structure determined the page-objects which determined which editorial structures made it into the final publication. Developers should be aware that if their publication requires more than just a straight-forward flow of dependence between the various structures (which are concurrently present in their publication) they may have to spend a lot more development effort. A simple flow of dependence with no reverse feeding back is easier to implement. Rick Jelliffe "The XML & SGML Cookbook" Prentice Hall, out in May 1998 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 2 06:39:14 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:02 2004 Subject: Open standards processes References: <354A151D.B718CB2C@technologist.com> <98May1.112853edt.26884@thicket.arbortext.com> <199805012354.QAA17523@sqwest.bc.ca> Message-ID: <354AA396.E5F1608C@technologist.com> Lauren Wood wrote: > > At 14:31 01/05/98 -0400, Paul Prescod wrote: > >So the XML effort had three levels: "Voting members", "Influential > >non-voting experts" and "XML-DEV" and the DOM effort has essentially two > >levels "voting members" and "non-influential, non-voting experts" and you > >see that as more open? How so? > > I don't think equating lack of voting rights and lack of influence is fair > in terms > of the DOM work. I didn't mean to equate lack of voting rights and lack of influence. In the XML effort there were people (like me) that had no voting rights but could still influence things sometimes. It isn't clear to me the extent that the DOM is the same, because I haven't tried to influence the DOM and haven't been watching the conversations clearly. I will say that in several months of watching the list I haven't seen any heated debates similar to those we had in the XML group. My *guess* is that these heated (and important) debates are taking place elsewhere. That impression is further supported by the fact that I often see questions/answers on the DOM list of the form: "Feature X seems strange to me." "Yes, we thought about that quite a bit and argued about it. Do you think we did the right thing?" which strikes me as nearly the opposite of the XML-SIG version: "We've got this hard problem to solve. We've got this group of smart people assembled. WHAT DO YOU THINK? We'll make a decision after hearing all of your opinions." Casting an opinion on an issue that was debated two months ago is *not* the same as participating in the debate. Paul Grosso said that the XML-SIG way was a procedural nightmare. In my experience, all forms of democracy, even limited ones, are procedural nightmares. But as Len Bullard once told me: "Chaos is the engine." I can't prove that the XML spec. is better because I could participate in ways that I cannot with the DOM and XSL (so far). But I know that it is the fact that some of my suggestions were taken seriously and even adopted in times of heated discussions that would never have come about at all under the DOM/XSL processes. Even given read-only access to the main conversational mailing lists (and meeting minutes), outsiders could influence the process through private email in support of or rebuttal to positions. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cmatei at agora.ro Sat May 2 14:38:23 1998 From: cmatei at agora.ro (Crystian) Date: Mon Jun 7 17:01:02 2004 Subject: Loops in DTD? Message-ID: <01BD75E0.75CF83E0@jackson.agora.ro> Hi, I make an vadidating XML processor. Until now, the processor can make a complex tree from data from Document Type Definition and validat the declaration from it. I am not sure if this tree can contain loops. I mean something like this: ... ... ]> Thank you, Cristian Matei xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 2 15:34:50 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:02 2004 Subject: Announcement: SAX 1.0gamma In-Reply-To: <199805020046.UAA02818@unready.microstar.com> Message-ID: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk> Many thanks David, At 20:46 01/05/98 -0400, David Megginson wrote: >Implementors have requested more time to test SAX 1.0 before it >becomes final. FWIW this suits my timetable as well :-) My plan is to assemble a manageable number (2-5) of Java-based parsers under JUMBO2 on the CDROM. The present parsers I have been looking at (with comments and queries) follow. I have listed these in public rather than approach individual authors - I hope that's OK. The selection is pragmatic - I can only deal with a smallish number at this stage (the CLASSPATHs, version differences, etc. can make my operation unstable at this stage). I also plan to use JUMBO to extract DTD information from at least one parser so that people can browse DTD-provided information such as attribute types and content specs. This is obviously a non-standard process at present and will either be through subclassing the parsers to implement a further (JUMBO-hacked) interface or simply if ... else if's. I'd be very grateful for the authors to confirm that I have got things right. If I have made any errors, please forgive me. The latest revision info is taken from RobinC's http://www.sil.org/sgml/xml.html#xmlSoftware. If there are later versions, I'd be very grateful to know what the official position is. Some parsers have their latest update pre-XML-V1.0. Unless I hear to the contrary I have to assume that these are not currently XML1.0-compliant (unless the author had a (paranormal) inside track to the spec). AElfred: Version: V1.1 Date in distribution 980309 XML1.0-compliant Yes SAX1.0pre-beta compliant Yes SAX1.0beta compliant after SAX completion + a little bit ? DTD-related information Yes? Validating No DXP: Version: V1.0beta1c Date in distribution 19980410 XML1.0-compliant Yes SAX1.0pre-beta compliant Yes SAX1.0beta compliant Presumably (timescale?) DTD-related information Yes? Validating Yes Lark: Version: V1.0beta (final) Date in distribution 19980105 XML1.0-compliant No? SAX1.0pre-beta compliant No SAX1.0beta compliant Megginsonian SAX wrapper(?) DTD-related information Yes? Validating No (but Larval, codistributed, is) XP: Version: V0.2 (from Version.java) Date in distribution 19980307 XML1.0-compliant Yes SAX1.0pre-beta compliant Yes SAX1.0beta compliant RSN DTD-related information Yes? Validating No These are the parsers I have worked with. They are all 100% Java and I plan to run them under JUMBO as both applications and applets. JUMBO will distribute a JRE for W95 and Solaris and for applets will assume a JDK1.1-compliant browser such as Netscape 4.05 (I haven't actually tested this - that's this afternoon's fun). ***AUTHORS***. Please correct any info here - I may well have missed later versions. I hope that the authors' timescales and JUMBO converge and that all 4 parsers will be available. I shall certainly wait for AElfred :-). The other *Java* parsers on Robin's list are: MSXML: Version: ? Date in distribution 19971204 (RobinC) XML1.0-compliant No?? SAX1.0pre-beta compliant No SAX1.0beta compliant Megginsonian SAX wrapper(?) DTD-related information ? Validating No? Pure Java ?? [PMR: I tried to install this some time ago and couldn't, but didn't give it lots of time. Clearly lots of people *can* run it. If it's trivial to run under (a) java (b) in an applet and there is a SAX wrapper, I might find the time to try it out.] IBM: Version: ? Date in distribution 19980416 (RobinC) XML1.0-compliant Yes? SAX1.0pre-beta compliant ? SAX1.0beta compliant ? DTD-related information ? Validating Yes Pure Java ? [PMR: I have no experience of this and all the information above comes from RobinC's page. If there are plans to make it SAX-compliant and it's easy to install I'd be interested in having a look.] XML-by-hand (Bert Bos). Non-validating. I haven't looked further than Robin's page. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 2 15:34:53 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:02 2004 Subject: Loops in DTD? In-Reply-To: <01BD75E0.75CF83E0@jackson.agora.ro> Message-ID: <3.0.1.16.19980502125757.7787e944@pop3.demon.co.uk> At 15:39 02/05/98 +-300, Crystian wrote: >Hi, > >I make an vadidating XML processor. Until now, the processor can make a complex tree from data from Document Type Definition and validat the declaration from it. I am not sure if this tree can contain loops. > >I mean something like this: > > ... > > ... > > ... >]> This is legal XML. Anyone who is in doubt about whether their XML is legal should download one of the free parsers and see if it throws errors. For some sorts of people (e.g. me) it's easy to rely on the computer first of all rather than reading the spec. Apart from a very few minor bugs (and pre-XML V1.0 implementations) the parsers should be the touchstone of your XML. I shall be releasing several parsers on the JUMBO-CDROM - see separate posting today. An XML document instance never contains loops. If the DTD is expanded so that element content is recursively displayed may contain loops. This can make it quite difficult to display. For example Earl Hood's very nice dtd2html tool showed the content of the first occurrence of an element and added an ellipsis for others. An alternative way is not to expand the children recursively but to add hyperlinks. This works fine on screen but not on cellulose. HTH P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jtauber at jtauber.com Sat May 2 15:37:53 1998 From: jtauber at jtauber.com (James K. Tauber) Date: Mon Jun 7 17:01:02 2004 Subject: RFD: comp.text.xml Message-ID: <000001bd75ce$fdf449c0$d86118cb@caleb> REQUEST FOR DISCUSSION (RFD) unmoderated group comp.text.xml This is a formal Request For Discussion (RFD) for the creation of a comp.text.xml newsgroup. This is not a Call for Votes (CFV); you cannot vote at this time. Procedural details are below. Newsgroups line: comp.text.xml The Extensible Markup Language (XML) RATIONALE: comp.text.xml Abstract A newsgroup for the discussion of the Extensible Markup Language (XML); including, but not limited to the specifications and syntax, document creation and editing, interchange, software, processing and database integration. This applies not only to XML itself but also the Extensible Linking Language (XLL), the Extensible Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML documents, and to document types and applications of XML. Justification XML is a new language, and it appears that it will be extremely popular. Over the past few months, traffic on the XML-DEV and XML-L mailing lists has grown rapidly. With the release of the W3C Recommendation for XML 1.0, developer interest is growing rapidly and an increasing amount of software is being released. A newsgroup would make it much easier for a wider audience to participate in and benefit from these discussions. While it may appear at first that comp.text.sgml is appropriate, there will be those interested in XML who do not care about the arcana of SGML, and there will be those interested in SGML who do not care to see many basic questions about XML. CHARTER: comp.text.xml comp.text.xml shall be an unmoderated newsgroup for the general discussion of the Extensible Markup Language (XML) and its application. This includes, but is not limited to the specifications and syntax, document creation and editing, interchange, software, processing and database integration. This applies not only to XML itself but also the Extensible Linking Language (XLL), the Extensible Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML documents, and to document types and applications of XML. Policy on Advertising We encourage discussion of the merits and shortcomings of commercial products. A certain amount of advertising (both objective and advocative) is to be expected and this is to be encouraged as long as the products and services relate directly to XML and the announcements are brief. Company representatives are expected to participate in discussions of their product that they themselves did not initiate. Please refer to the widely available Internet literature on the subject of common net abuses (indiscriminate newsgroup spamming, multiple advertisement posting, and chain letters being among the most frequent). END CHARTER. PROCEDURE: This is a request for discussion, not a call for votes. In this phase of the process, any potential problems with the proposed newsgroups should be raised and resolved. The discussion period will continue for a minimum of 21 days (starting from when the first RFD for this proposal is posted to news.announce.newgroups), after which a Call For Votes (CFV) may be posted by a neutral vote taker if the discussion warrants it. Please do not attempt to vote until this happens. All discussion of this proposal should be posted to news.groups. This RFD attempts to comply fully with the Usenet newsgroup creation guidelines outlined in "How to Create a New Usenet Newsgroup" and "How to Format and Submit a New Group Proposal". Please refer to these documents (available in news.announce.newgroups) if you have any questions about the process. END PROCEDURE. DISTRIBUTION: This RFD shall be posted to the w3c-xml-sig, XML-DEV and XML-L mailing lists and to the following newsgroups: comp.text comp.text.sgml alt.etext alt.hypertext comp.infosystems.www.announce comp.infosystems.www.authoring.html comp.infosystems.www.authoring.misc comp.infosystems.www.authoring.stylesheets Please see the newsgroup news.groups for discussions about this RFD, as mentioned above. Readers of the mailing lists and additional newsgroups listed in this section shall be referred to news.announce.newgroups for the eventual Call For Votes (CFV). END DISTRIBUTION. Proponent: James Tauber (jtauber@jtauber.com) With assistance from Chris Maden, Simon St Laurent, BK DeLong and Susan Lesch xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sat May 2 17:26:19 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:02 2004 Subject: Java-based XML Parsers In-Reply-To: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk> References: <199805020046.UAA02818@unready.microstar.com> <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk> Message-ID: <199805021525.LAA00247@unready.microstar.com> Peter Murray-Rust writes: > The other *Java* parsers on Robin's list are: > > MSXML: > Version: ? > Date in distribution 19971204 (RobinC) > XML1.0-compliant No?? > SAX1.0pre-beta compliant No > SAX1.0beta compliant Megginsonian SAX wrapper(?) > DTD-related information ? > Validating No? > Pure Java ?? > [PMR: I tried to install this some time ago and couldn't, but didn't give > it lots of time. Clearly lots of people *can* run it. If it's trivial to > run under (a) java (b) in an applet and there is a SAX wrapper, I might > find the time to try it out.] I have managed to use 1.8 (I think) out of the box on my notebook, and have taken the SAX driver as far as I can. I don't know if it's "pure Java", but it certainly didn't break under Linux. There are several serious and well-known bugs in the current version of MSXML that make it unreliable for production use (such as bogus validation errors, the failure to report defaulted attribute values and badly broken entity management); however, since future versions of MSXML will probably ship with MSIE, MSXML has a very strong distribution channel and deserves attention. It might be be worthwhile to plan to support it when Microsoft has a chance to go back and finish development. > IBM: > Version: ? > Date in distribution 19980416 (RobinC) > XML1.0-compliant Yes? > SAX1.0pre-beta compliant ? > SAX1.0beta compliant ? > DTD-related information ? > Validating Yes > Pure Java ? > > [PMR: I have no experience of this and all the information above comes from > RobinC's page. If there are plans to make it SAX-compliant and it's easy to > install I'd be interested in having a look.] IBM's XML for Java shipped with a SAX driver for the January draft (thanks, guys), and they included SAX 1.0 compliance on the time line during their presentation at the XML Developers' Day in Seattle. You should probably watch this one closely. XML for Java does provide DTD information and even guided authoring (though I haven't tested those features), and again, it runs under Linux JDK 1.1.5. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Sat May 2 17:34:11 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:02 2004 Subject: RFD: comp.text.xml References: <000001bd75ce$fdf449c0$d86118cb@caleb> Message-ID: <354B3C86.6C4@hiwaay.net> James K. Tauber wrote: > > REQUEST FOR DISCUSSION (RFD) > unmoderated group comp.text.xml > > This is a formal Request For Discussion (RFD) for the creation of a > comp.text.xml newsgroup. This is not a Call for Votes (CFV); > you cannot vote at this time. Procedural details are below. > > Newsgroups line: > comp.text.xml The Extensible Markup Language (XML) > > RATIONALE: comp.text.xml > > Abstract > > A newsgroup for the discussion of the Extensible Markup > Language (XML); including, but not limited to the specifications > and syntax, document creation and editing, interchange, software, > processing and database integration. This applies not only to XML > itself but also the Extensible Linking Language (XLL), the Extensible > Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML > documents, and to document types and applications of XML. > > Justification > > XML is a new language, and it appears that it will be extremely > popular. XML is a subset of SGML as specified by ISO. It is a consortium-owned subset without normative reference to the parent standard. > Over the past few months, traffic on the XML-DEV and XML-L mailing lists has > grown rapidly. With the release of the W3C Recommendation for XML 1.0, > developer interest is growing rapidly and an increasing amount of software > is > being released. A newsgroup would make it much easier for a wider audience > to participate in and benefit from these discussions. This is true. XML-DEV supports the public discussions. Invitation-only special interest group maillists support the XML core. The XML applications (eg, XSL, XLL) are supported as well. > While it may appear at first that comp.text.sgml is appropriate, there will > be those interested in XML who do not care about the arcana of SGML, and > there will be those interested in SGML who do not care to see many basic > questions about XML. This is speculative not substantive. The membership of the XML and XML application communities overlaps the SGML community. The extent of overlap should be examined, but it is probably greater than a majority. > CHARTER: comp.text.xml > > comp.text.xml shall be an unmoderated newsgroup for > the general discussion of the Extensible Markup Language (XML) and its > application. At this time, the comp-text-sgml newsgroup can support the discussion of SGML subsets such as the W3C consortium subset, XML and its applications. There is no need for a separate group which will weaken the tenuous and often politically difficult to maintain spirit of cooperation between international standards groups and consortium product working groups. This is not in the best interests of that alliance. Len Bullard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 2 18:32:05 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:03 2004 Subject: Java-based XML Parsers In-Reply-To: <199805021525.LAA00247@unready.microstar.com> References: <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk> <199805020046.UAA02818@unready.microstar.com> <3.0.1.16.19980502133350.60cfa996@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980502162740.5977dfea@pop3.demon.co.uk> Thanks very much David, At 11:25 02/05/98 -0400, David Megginson wrote: >Peter Murray-Rust writes: [...MSXML...] > >I have managed to use 1.8 (I think) out of the box on my notebook, and >have taken the SAX driver as far as I can. I don't know if it's "pure >Java", but it certainly didn't break under Linux. Thanks - glad to hear this - there was discussion at one stage about MS-specific classes. > >There are several serious and well-known bugs in the current version >of MSXML that make it unreliable for production use (such as bogus >validation errors, the failure to report defaulted attribute values >and badly broken entity management); however, since future versions of >MSXML will probably ship with MSIE, MSXML has a very strong >distribution channel and deserves attention. It might be be >worthwhile to plan to support it when Microsoft has a chance to go >back and finish development. Thanks. If this is the case (and I trust David's accuracy) I would not feel it appropriate to offer it as part of the currently planned JUMBO-CDROM. (I shall try to make sure that all parsers give 'the same results' - at least for my examples). However I would certainly wish to support MSXML in the future (assuming the distribution policy allows me to.) Any further clarification from MS would be welcome - I certainly don't want to undervalue the role that it plays. > > > IBM: > >IBM's XML for Java shipped with a SAX driver for the January draft >(thanks, guys), and they included SAX 1.0 compliance on the time line >during their presentation at the XML Developers' Day in Seattle. You >should probably watch this one closely. I downloaded it just before I got your mail :-) and shall hack this evening. [I haven't looked at the distribution conditions yet.] > >XML for Java does provide DTD information and even guided authoring >(though I haven't tested those features), and again, it runs under >Linux JDK 1.1.5. Excellent. Any info from IBM about timescales for SAX1.0 would be most welcome. > P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 2 18:34:25 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:03 2004 Subject: RFD: comp.text.xml In-Reply-To: <000001bd75ce$fdf449c0$d86118cb@caleb> Message-ID: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk> [reply to xml-dev only - I'm sure JamesT will read this]. At 21:34 02/05/98 +0800, James K. Tauber wrote: > REQUEST FOR DISCUSSION (RFD) > unmoderated group comp.text.xml > >This is a formal Request For Discussion (RFD) for the creation of a >comp.text.xml newsgroup. This is not a Call for Votes (CFV); >you cannot vote at this time. Procedural details are below. I am deliberately neutral about this - this message is simply to state how I see XML-DEV's position w.r.t. the propose comp.text.xml [...] >A newsgroup for the discussion of the Extensible Markup >Language (XML); including, but not limited to the specifications >and syntax, document creation and editing, interchange, software, >processing and database integration. This applies not only to XML >itself but also the Extensible Linking Language (XLL), the Extensible >Style Language (XSL), Cascading Style Sheets (CSS) as applied to XML >documents, and to document types and applications of XML. XML-DEV has had and continues to have robust and IMO valuable discussion in these areas. > >Justification > >XML is a new language, and it appears that it will be extremely >popular. > >Over the past few months, traffic on the XML-DEV and XML-L mailing lists has >grown rapidly. With the release of the W3C Recommendation for XML 1.0, >developer interest is growing rapidly and an increasing amount of software >is >being released. A newsgroup would make it much easier for a wider audience >to participate in and benefit from these discussions. I will leave Henry to answer whether there are any current problems of scale. [I would hate to promise his services without checking :-)] > [...] > > >Policy on Advertising > >We encourage discussion of the merits and shortcomings of >commercial products. A certain amount of advertising (both objective >and advocative) is to be expected and this is to be encouraged as >long as the products and services relate directly to XML and the >announcements are brief. Company representatives are expected to >participate in discussions of their product that they themselves did >not initiate. There has never been the slightest hint of abuse of this on XML-DEV and for this I thank the XML community. We aren't yet at the stage where there are a large number of XML products in common use, so haven't got to the 'I bought XML-FOO last week and when I try to run DUCKBOOK under it it ...'. > So far there have also been very few 'What is a DDT?' 'Where do I find the < and > on my keyboard?'. Peter Flynn and Robin Cover (and many others) must take great credit for providing answers to every conceivable question. Since I have asked these sort of Qs on comp.text.sgml 3 years ago, I expect that there needs to be a similar place for XML. XML-DEV isn't appropriate for large volumes of such Qs. I'm neutral whether it should be on comp.text.xml, comp.text.sgml or XML-L. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Sat May 2 19:20:26 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:03 2004 Subject: RFD: comp.text.xml Message-ID: Len made some comments about the RFD which I heartily disagree with. >> XML is a new language, and it appears that it will be extremely >> popular. > >XML is a subset of SGML as specified by ISO. It is a >consortium-owned subset without normative reference to >the parent standard. XML is more than a subset of SGML. It has its own limitations, its own practices, and its own audience. You can debate whether or not it is a 'new language' just as the XML-DEV list has argued whether the XML spec contains just syntax or syntax and semantics, but I think you're splitting hairs, at best. New acronym, new approach, fine - new language. Personally, I'd like to see XML make a clean parent-child break with SGML. XML should acknowledge SGML as the parent, the teacher, the source of inspiration, and move on to live its own life. I'd like to see XML have its own place to grow. If that means that it acquires features from SGML (like architectural forms, which David Megginson pointed out on XML-L), then fine - but it shouldn't be forever limited to being a 'mere' subset. >> While it may appear at first that comp.text.sgml is appropriate, there will >> be those interested in XML who do not care about the arcana of SGML, and >> there will be those interested in SGML who do not care to see many basic >> questions about XML. > >This is speculative not substantive. The membership of the XML and XML >application communities overlaps the SGML community. The extent of >overlap should be examined, but it is probably greater than a majority. The extent of overlap may be examined, but I doubt it will be a majority for more than few months. XML is arousing interest in many computing sectors (HTML developers, database developers, OOP programmers) who in the past had nothing (or as little as possible) to do with SGML. I can go through my email archive and find at least fifty people who have come to XML but had no previous experience in SGML and show no interest in learning it. There may be bias because my book deliberately targeted a non-SGML audience, but that audience certainly exists and is growing rapidly. >At this time, the comp-text-sgml newsgroup can support the discussion >of SGML subsets such as the W3C consortium subset, XML and its >applications. I've been perusing comp.text.sgml for the last few months. While I'm impressed with the quality of discussion (as I have been on XML-DEV and XML-L), I don't think it's a great place to learn about XML. It's not always easy to determine when SGML-only features are being discussed, and the question overlaps can lead to some fairly significant confusion, especially for beginners. >There is no need for a separate group which will weaken the tenuous >and often politically difficult to maintain spirit of cooperation >between international standards groups and consortium product working groups. >This is not in the best interests of that alliance. There is a lot of need for a separate group which will address the needs of large numbers of people moving into a new technology. 'Maintaining the spirit of cooperation between international standards groups and consortium product working groups' is _not_ the purpose of this newsgroups. Those groups, for better or worse, need to learn to work with each other while the rest of get on with learning the technologies that are appropriate to the tasks _we_ want to accomplish. I see comp.text.xml as a place for people, both new and experts, to communicate about the new things opened up by XML, not as a place for a relatively small club of experts to form and maintain alliances. XML-L is good for this, but newsgroups are easy places for the whole world to find and participate. I strongly hope comp.text.xml arrives soon. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From digitome at iol.ie Sat May 2 20:06:26 1998 From: digitome at iol.ie (Sean Mc grath) Date: Mon Jun 7 17:01:03 2004 Subject: Separation of formatting... Message-ID: <1.5.4.32.19980502175845.0090f500@gpo.iol.ie> [Rick Jelliffe] > >Developers should be aware that if their publication requires more than >just a straight-forward flow of dependence between the various structures >(which are concurrently present in their publication) they may have to >spend a lot more development effort. A simple flow of dependence with >no reverse feeding back is easier to implement. > Even reverse feeding has its limits. I suspect that no amount of algorithm driven formatting will ever produce the same result as a skilled human. IOW, there are limits to how good a pure SGML/XML driven formatting system can be. Thusly we need processing instructions and format oriented markup. I recently grappled with a large piece of legislation on paper which proved easy enough to model but I gave up trying to re-produce the "look" of the paper version. Individual sections exhibited clear signs of aesthetic tweaking. Infuriating because the darn thing looks so much nicer than mine yet I cannot determine why:-( To see this aesthetic effect in its extremes I heartily recommend "Le Ton beau de Marot" by Douglas Hofstadter. For example, according to the introduction, the section layout of chapter 2 was heavily influenced by the avoidence of descender characters in the first word! Sean Mc Grath http://www.digitome.com "there are two types of people in the world. Those who think the world consists of two types of people, and those who don't" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From h.rzepa at ic.ac.uk Sat May 2 20:38:22 1998 From: h.rzepa at ic.ac.uk (Rzepa, Henry) Date: Mon Jun 7 17:01:03 2004 Subject: RFD: comp.text.xml In-Reply-To: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk> References: <000001bd75ce$fdf449c0$d86118cb@caleb> Message-ID: >I will leave Henry to answer whether there are any current problems of >scale. [I would hate to promise his services without checking :-)] It scales more or less linearly, with about 10 individual addresses "decaying" each week and about ten new ones needing attention, out of a total of 1000 subscribers. I dont see a problem up to say 2000 subscribers. The spam/junk seems to have stabilised. Henry Rzepa. +44 171 594 5774 (Office) +44 171 594 5804 (Fax) http://www.ch.ic.ac.uk/rzepa/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sat May 2 21:14:53 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:03 2004 Subject: Announcement: AElfred 1.2, with SAX 1.0gamma support Message-ID: <199805021914.PAA01433@unready.microstar.com> A new version of Microstar's AElfred XML parser is available at the following URL: http://www.microstar.com/XML/ There are several user-visible changes since version 1.1: 1. the XmlParser.parse method for parsing from a URI now has a third argument (for an encoding, if known); 2. The XmlHandler.resolveEntity method is more powerful: you may return a String (for a URI), an InputStream, _or_ a Reader. If you return null, the parser will take the default action. 3. The SAX driver has been updated to SAX 1.0gamma (released 1 May 1998). SAX is available from http://www.megginson.com/SAX/ All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 3 00:23:08 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:03 2004 Subject: Announcement: AElfred 1.2, with SAX 1.0gamma support In-Reply-To: <199805021914.PAA01433@unready.microstar.com> Message-ID: <3.0.1.16.19980502221849.2f776e8a@pop3.demon.co.uk> At 15:14 02/05/98 -0400, David Megginson wrote: [... marvellous announcements about SAX and AElfred ...] Thanks very much David. I shall assume that the application-side of the SAX API will remain constant and therefore hope to release an alpha version of JUMBO2.0 quite shortly. P. I was just going to bed. [I thought I had a few weeks to relax in :-)] xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jtauber at jtauber.com Sun May 3 08:30:30 1998 From: jtauber at jtauber.com (James K. Tauber) Date: Mon Jun 7 17:01:03 2004 Subject: RFD: comp.text.xml In-Reply-To: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk> Message-ID: <000801bd765c$dcc66420$e66118cb@caleb> A couple of points: 1. The proposal of comp.text.xml in no way is to be taken as a negative comment on xml-dev or any other mailing list. I love xml-dev. 2. Discussion on the pros and cons of comp.text.xml should be held on the news.groups newsgroup as stated in the RFD. James -- James Tauber / jtauber@jtauber.com Perth, Western Australia XML Pages: http://www.jtauber.com/xml/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 3 12:46:28 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:03 2004 Subject: RFD: comp.text.xml In-Reply-To: <000801bd765c$dcc66420$e66118cb@caleb> References: <3.0.1.16.19980502161715.69af8892@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980503075802.3617e848@pop3.demon.co.uk> At 14:28 03/05/98 +0800, James K. Tauber wrote: > >A couple of points: > >1. The proposal of comp.text.xml in no way is to be taken as a negative >comment on xml-dev or any other mailing list. I love xml-dev. Thanks :-). I didn't take it as negative. I posted primarily because one posting could be read to give the wrong impression of the role of XML-DEV. Our intention (H+P) is that XML-DEV continue and if there are topics which are better suited to be on other fora we'll gently point that out. XML-DEV's role is clearly evolving and so our goals are deliberately simple 'a list for XML developers'. > >2. Discussion on the pros and cons of comp.text.xml should be held on the >news.groups newsgroup as stated in the RFD. Fully agreed. Any burgeoning discussion of c.t.x. will be gently pointed towards n.g P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Sun May 3 13:09:46 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:03 2004 Subject: Announcement: SAX 1.0gamma References: <199805020046.UAA02818@unready.microstar.com> Message-ID: <354C4EF0.81ADF185@jclark.com> I did a first pass at implementing a SAX 1.0gamma driver for XP today. Some nits: It should be specified whether a byte order mark at the beginning of a XML byte stream is included as part of the character stream. I don't think it should be since the byte-order mark isn't included the XML document production, and the XML spec says explicitly that the byte order mark "is an encoding signature, not part of either the markup or the character data of the XML document". How are relative system identifiers supposed to be handled in DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd, which declares an unparsed entity with a system id of foo.eps (which refers to dir/foo.eps). If the systemId argument to DTDHandler.unparsedEntityDecl is foo.eps, then the application is going to have problems. There's a similar issue with EntityResolver.resolveEntity. Parser.parse should be allowed to throw IOException in addition to SAXException. Since InputSource includes a Reader and InputStream, and methods on Reader and InputStream throw IOException, parse needs to throw IOException. It's ridiculous to require the parser to wrap an IOException in a SAXException when you know that the parser needs to throw IOException. There's nothing in the XML spec that says parsers have to make attribute types available. So I think the doc for AttributeList.getType should say that CDATA may be returned not only if the parser has not read the declaration, but also if the parser does not make this information available (alternatively it could return null in this case). Supporting changing of Locale in the middle of a parse would require me to redesign my native interface in a way I consider very undesirable, and I don't see any need for this, so I don't plan to support this. The basic issue is that my counterpart of Parser is reentrant unlike SAX's and I want to keep it that way. I think changing of Locale mid-parse is going to be difficult for anyone with a similar style of interface. It's probably too late for this, but I'm having problems seeing the logic in the exception handling design. The design seems to make things inconvenient for both users and implementors: implementors have to wrap SAXException in order to pass it up through their parsers, and in handler methods users have to wrap their exceptions in SAXException. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alain at cabinfo.com Sun May 3 14:55:17 1998 From: alain at cabinfo.com (Alain DESEINE) Date: Mon Jun 7 17:01:03 2004 Subject: announce : innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1 Message-ID: <354C68CF.2B7A@cabinfo.com> innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1 The first release (beta 1) of IRIS XML DTD GENERATOR is now available for download. This first beta version can open or create DTD for XML applications. The user can edit the DTD with a text view (like if you edit it in the Windows Notepad), or in a logical view. Both the text and logical view provide tools to insert ELEMENTS, ATTRIBUTES, COMMENTS, OR ENTITIES. In logical view you can also modify ***very easily*** ELEMENTS, ATTRIBUTES, ENTITIES, and COMMENTS. This software is design for user that don't know (and don't want to know) XML standards details. You can download the IRIS XML DTD GENERATOR from the following URL. http://www.cabinfo.com/download.htm All comments, bugs reports, functionnalities wishes are welcome at alain@cabinfo.com Well known in FRANCE for his HTML EDITOR ODYSSEE ( see http://www.mondial-telecom.com/odyssee ), Innovation Partners has start to think about XML many months ago. This thought tell us to develop 3 XML software. The first one, IRIS XML DTD GENERATOR, is a tool for building XML DTDs. The second is IRIS XML EDITOR for editing XML files. And the third one is a XSL stylesheet EDITOR for presenting and processing XML files. These 3 softwares can naturally work together, building a complete sofware suite for XML editing. For limitations, supported functionnalities, and bugs of beta versions, please see the readme file of the downloaded software. Now lets have a look to the DTD functionnalities. IRIS XML DTD GENERATOR - Editing existing DTDs - Creating DTDs - Logical and Text view mode for editing DTDs - Wizzard for creating ELEMENTS both in text an logical view - Wizzard for creating ATTRIBUTES both in text an logical view - Wizzard for creating ENTITIES both in text an logical view - Wizzard for creating COMMENTS both in text an logical view - Support of additionnals informations for IRIS XML EDITOR (saved as comment in the DTD) - Generation of XML DATA schemas - Find an replace both in logical and text view - Wizzard for creating xml standard attributes (like xml:lang for example) - support of XLL linking attributes - Glossaries - Printing a DTD - multi language software (currently French and English only - You can change the language in the file menu) - Automatic save capabilities - backup on save capabilities - MDI environment NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. IRIS XML EDITOR - Creating XML files according to a DTD generated by IRIS XML DTD GENERATOR, or generated by something else. - Project notion for managing XML file collections (like in the ODYSSEE HTML EDITOR). - glossaries - Dynamic toolbars with generated icons that can insert ELEMENTS, ATTRIBUTES, or ENTITIES you define in your DTD - Dynamic Wizzard generated automaticly for ELEMENTS with attributes, that ask user for informations. - Text view for editing XML files - Hierachical view for for editing the XML file (the tree object flow view) - Visualisation view for seeing the rendering of the XML file : - If an XSL file is specified in your XML document these one is use to generate the visualisation view. - if none is provide, a minimal built in style sheet is use to render your document. - Automatic save capabilities - backup on save capabilities - XML document printing - XML Visualisation view printing - FTP publication of XML files - Specific Wizzard adding capabilities through an Standardize DLL API (like in ODYSSEE HTML EDITOR) - Friend application capabilities - multi language software (currently French and English only - You can change the language in the file menu) - Find an replace both in tree and text view - Insertion of XML Standard Attributes (like xml:lang) NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. IRIS XSL STYLER - Creating XSL stylesheet - full support of XSL proposal and future ones. - glossaries - Text view for editing XSL files - Hierachical view for for editing the XSL rules - Import a DTD to re-use its elements, attributes, etc. in the style sheet - Visualisation view for seeing the rendering produce with the XSL file : - You can provide your own XML file to see the rendering. - if none is provide, a minimal built in XML file based on the target-elements is used. - Automatic save capabilities - backup on save capabilities - XSL document printing - XSL Visualisation view printing - Specific Wizzard adding capabilities through an Standardize DLL API (like in ODYSSEE HTML EDITOR) - Friend application capabilities - multi language software (currently French and English only - You can change the language in the file menu) - Find an replace both in tree and text view NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. Currently, only the IRIS XML DTD GENERATOR beta 1 is available. IRIS XML EDITOR beta 1 will be available in May, and IRIS XSL STYLER beta 1 will be available in June. All theses software are commercial, so they are copyright. The beta version of these software are freely usable for evaluation purpose, bugs report, and functionnality discussions. Once the final release (Version 1.00) launch up, you ***MUST*** register the software if you want to use it. For Bugs reports, Functionnalities discussion, technical questions, XML and XSL discussion, you can use the DTD GENERATOR mailling list, for details about the mailling list, please see the readme file include in the beta 1 distribution. You can download the Beta 1 distribution of IRIS XML DTD GENERATOR at : http://www.cabinfo.com/download.htm Paris the 20/04/1998 Alain DESEINE, Technical contact. ******************************************************************** Innovation Partners Commercial contact : M. Emmanuel CHAMBON innovationpartners@hol.fr Technical contact : M. Alain DESEINE alain@cabinfo.com http://www.mondial-telecom.com/odyssee http://www.cabinfo.com Postal Adress : 9, bis rue des Besnards 92260 FONTENAY AUX ROSES FRANCE Tel : 33 01 41 41 00 89 or 33 01 41 41 91 91 or 33 06 09 80 10 16 (GSM) Fax : 33 01 41 41 02 34 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sun May 3 15:13:49 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:03 2004 Subject: Announcement: SAX 1.0gamma In-Reply-To: <354C4EF0.81ADF185@jclark.com> References: <199805020046.UAA02818@unready.microstar.com> <354C4EF0.81ADF185@jclark.com> Message-ID: <199805031312.JAA00291@unready.microstar.com> James Clark writes: > I did a first pass at implementing a SAX 1.0gamma driver for XP today. > > Some nits: > > It should be specified whether a byte order mark at the beginning > of a XML byte stream is included as part of the character stream. > I don't think it should be since the byte-order mark isn't included > the XML document production, and the XML spec says explicitly that > the byte order mark "is an encoding signature, not part of either > the markup or the character data of the XML document". My first hunch is the opposite: the XML productions deal with characters, not bytes. When I provide a raw byte stream (java.io.InputStream), I'm requiring the XML parser to take on two logical tasks: 1) convert the bytes to characters 2) apply the XML productions to the characters You have already mentioned that, unlike many XML parsers (including AElfred), XP does not perform these as independent, serial steps; conceptually, however, the tasks are still distinct. The BOM is part of the raw byte stream, but not part of the character stream. I think that it also simplifies Java implementation if the parser can behave the same way with an InputStream from a URLConnection and an InputStream supplied explicitly by an application. > How are relative system identifiers supposed to be handled in > DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd, > which declares an unparsed entity with a system id of foo.eps > (which refers to dir/foo.eps). If the systemId argument to > DTDHandler.unparsedEntityDecl is foo.eps, then the application is > going to have problems. There's a similar issue with > EntityResolver.resolveEntity. This does seem to be a serious problem. One solution is to require the parser to fully resolve system identifiers before reporting them (as AElfred already does). This approach will work well with URLs, but may break for other URI schemes. Any other solutions? > Parser.parse should be allowed to throw IOException in addition to > SAXException. Since InputSource includes a Reader and InputStream, and > methods on Reader and InputStream throw IOException, parse needs to > throw IOException. It's ridiculous to require the parser to wrap an > IOException in a SAXException when you know that the parser needs to > throw IOException. This suggestion sounds quite reasonable. Any objections? > There's nothing in the XML spec that says parsers have to make > attribute types available. So I think the doc for > AttributeList.getType should say that CDATA may be returned not > only if the parser has not read the declaration, but also if the > parser does not make this information available (alternatively it > could return null in this case). I don't recall anything in the spec that requires parser to report the start and end of elements, or even character data other than whitespace -- this area needs attention from the XML WG. That said, I think that this documentation change would be useful. > Supporting changing of Locale in the middle of a parse would > require me to redesign my native interface in a way I consider very > undesirable, and I don't see any need for this, so I don't plan to > support this. The basic issue is that my counterpart of Parser is > reentrant unlike SAX's and I want to keep it that way. I think > changing of Locale mid-parse is going to be difficult for anyone > with a similar style of interface. I doubt that anyone else is supporting Locale at all right now, so this change should not cause any trouble. I have no objection to making it. > It's probably too late for this, but I'm having problems seeing the > logic in the exception handling design. The design seems to make > things inconvenient for both users and implementors: implementors > have to wrap SAXException in order to pass it up through their > parsers, and in handler methods users have to wrap their exceptions > in SAXException. The former problem exists only when SAX support comes through a separate driver (as, admittedly, is usually be the case). A new parser, written from scratch, could include SAXException in its throw clauses without using a wrapper. The latter problem is very annoying, but there seems to be no obviously correct solution. I received a very, very large number of objections to my use of java.lang.Exception. I don't want to vacillate any more, and have settled on a SAXException wrapper simply as the (slightly) lesser of two evils. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Gil_Zehavi at ibcomverse.com Sun May 3 15:18:18 1998 From: Gil_Zehavi at ibcomverse.com (Zehavi, Gil) Date: Mon Jun 7 17:01:04 2004 Subject: Using schemas with XML parser Message-ID: I wrote xml-data schema and instances that are based on that schema, but I don't know how to combine them in order that the parser (i'm using the Microsoft parser which is supposed to know how to deal with schemas) will recognize that a schema is being used. Putting both the schema and the instances in one files is not good since the parser recognizes only one root. I don't know how to link the two parts if they are defined in different files (my application is not a web application). I can use DTD but the use of schemas is very useful in my case. I would appreciate any information I can get. Gil Zehavi Comverse Network System email: gil_zehavi@ibcomverse.com xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Sun May 3 15:43:36 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:04 2004 Subject: Announcement: SAX 1.0gamma Message-ID: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca> At 09:12 AM 5/3/98 -0400, David Megginson wrote: >James Clark writes: > > It should be specified whether a byte order mark at the beginning > > of a XML byte stream is included as part of the character stream. > > I don't think it should be > >My first hunch is the opposite: the XML productions deal with >characters, not bytes. I think James has the spec on his side here. The BOM should *definitely* be hidden from SAX customers. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From elharo at sunsite.unc.edu Sun May 3 16:25:02 1998 From: elharo at sunsite.unc.edu (Elliotte Rusty Harold) Date: Mon Jun 7 17:01:04 2004 Subject: Cafe con Leche In-Reply-To: <199805021914.PAA01433@unready.microstar.com> Message-ID: I've begun building a new XML news and resources site called Cafe con Leche, modeled after my Java site Cafe au Lait. You'll find it at http://sunsite.unc.edu/xml/ It's pretty nascent at this point. Comments, suggestions, and critiques are apreciated. +-----------------------+------------------------+-------------------+ | Elliotte Rusty Harold | elharo@sunsite.unc.edu | Writer/Programmer | +-----------------------+------------------------+-------------------+ | Java Secrets (IDG Books 1997) | | http://www.amazon.com/exec/obidos/ISBN=0764580078/cafeaulaitA/ | +----------------------------------+---------------------------------+ | Cafe au Lait | | http://sunsite.unc.edu/javafaq/ | +----------------------------------+---------------------------------+ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From boumhome at agents-tech.com Sun May 3 17:05:40 1998 From: boumhome at agents-tech.com (Martin Bouchard Home) Date: Mon Jun 7 17:01:04 2004 Subject: unsubsribe Message-ID: <000d01bd76a5$168f1880$c002aa98@PETER.ATC> -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980503/9b89f881/attachment.htm From cbullard at hiwaay.net Sun May 3 17:57:43 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:04 2004 Subject: RFD: comp.text.xml References: Message-ID: <354C9391.60FA@hiwaay.net> Simon St.Laurent wrote: > > XML is more than a subset of SGML. It has its own limitations, its own > practices, and its own audience. So far, it has a new name. >You can debate whether or not it is a 'new > language' just as the XML-DEV list has argued whether the XML spec contains > just syntax or syntax and semantics, but I think you're splitting hairs, at > best. New acronym, new approach, fine - new language. As I said, so far, a new name. This is not splitting hairs. It is maintaining a correct and accurate history. > Personally, I'd like to see XML make a clean parent-child break with SGML. > XML should acknowledge SGML as the parent, the teacher, the source of > inspiration, and move on to live its own life. I'd like to see XML have its > own place to grow. If that means that it acquires features from SGML (like > architectural forms, which David Megginson pointed out on XML-L), then fine - > but it shouldn't be forever limited to being a 'mere' subset. It isn't limited. There is no normative reference and it is a product of the W3C Consortium whose rules and processes govern its growth. At any time the Director chooses, he can cast away that parentage and the alliance with ISO. > >This is speculative not substantive. The membership of the XML and XML > >application communities overlaps the SGML community. The extent of > >overlap should be examined, but it is probably greater than a majority. > > The extent of overlap may be examined, but I doubt it will be a majority for > more than few months. XML is arousing interest in many computing sectors > (HTML developers, database developers, OOP programmers) who in the past had > nothing (or as little as possible) to do with SGML. Nice. Take the ideas, abuse the parent in public, claim to have created a new technology. o HTML acknowledges it is an SGML application o XML acknowledges it is an SGML subset. o Both database and OOP programmers have in the past worked with SGML and developed applications of SGML. > I can go through my email archive and find at least fifty people who have come > to XML but had no previous experience in SGML and show no interest in learning > it. There may be bias because my book deliberately targeted a non-SGML > audience, but that audience certainly exists and is growing rapidly. Bias is bias but that is a different issue. Those who learn XML do not need to learn SGML. That has been a clear goal of the development of SGML On The Web. So? > >At this time, the comp-text-sgml newsgroup can support the discussion > >of SGML subsets such as the W3C consortium subset, XML and its > >applications. > > I've been perusing comp.text.sgml for the last few months. While I'm > impressed with the quality of discussion (as I have been on XML-DEV and > XML-L), I don't think it's a great place to learn about XML. It's not always > easy to determine when SGML-only features are being discussed, and the > question overlaps can lead to some fairly significant confusion, especially > for beginners. This is speculative but so far, the one issue that can be taken seriously. > There is a lot of need for a separate group which will address the needs of > large numbers of people moving into a new technology. 'Maintaining the spirit > of cooperation between international standards groups and consortium product > working groups' is _not_ the purpose of this newsgroups. Those groups, for > better or worse, need to learn to work with each other while the rest of get > on with learning the technologies that are appropriate to the tasks _we_ want > to accomplish. It is that spirit of cooperation which has enabled the standard (SGML) and the consortium-owned product (XML) to remain aligned. If you want to use SGML ideas (rename them if you must) and keep this alignment, maintain that spririt. It is in your own best interest. As for the royal "we", that can be dismissed. Technologies and standards are separate entities. > I see comp.text.xml as a place for people, both new and experts, to > communicate about the new things opened up by XML, not as a place for a > relatively small club of experts to form and maintain alliances. That relatively small club created everything that you are now claiming is a new language. You are being somewhat less than ethical in that comment. > XML-L is good for this, but newsgroups are easy places for the whole world to > find and participate. I strongly hope comp.text.xml arrives soon. The creation of comp.text.xml is inevitable. The reasons given should be truthful, legitimate, and compelling, not based on the politics of claiming the work of others. If you plan to be a successful author, do your homework and keep to the facts. len xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Sun May 3 18:47:28 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:04 2004 Subject: Announcement: SAX 1.0gamma References: <199805020046.UAA02818@unready.microstar.com> <354C4EF0.81ADF185@jclark.com> <199805031312.JAA00291@unready.microstar.com> Message-ID: <354C9E40.E13ED372@jclark.com> David Megginson wrote: > > James Clark writes: > > It should be specified whether a byte order mark at the beginning > > of a XML byte stream is included as part of the character stream. > > I don't think it should be since the byte-order mark isn't included > > the XML document production, and the XML spec says explicitly that > > the byte order mark "is an encoding signature, not part of either > > the markup or the character data of the XML document". > > My first hunch is the opposite: the XML productions deal with > characters, not bytes. When I provide a raw byte stream > (java.io.InputStream), I'm requiring the XML parser to take on two > logical tasks: > > 1) convert the bytes to characters > > 2) apply the XML productions to the characters > > You have already mentioned that, unlike many XML parsers (including > AElfred), XP does not perform these as independent, serial steps; > conceptually, however, the tasks are still distinct. The BOM is part > of the raw byte stream, but not part of the character stream. > > I think that it also simplifies Java implementation if the parser can > behave the same way with an InputStream from a URLConnection and an > InputStream supplied explicitly by an application. I'm a bit confused by your reply. You say you're disagreeing with me, but the points you make don't seem to contradict my suggestion. I agree there are conceptually two stages. My point is that the BOM bytes are removed as part of the first stage because they are part of the encoding signature not part of the sequence of characters that matches the document production. Thus the InputStream should include the BOM bytes, but the Reader shouldn't include the 0xFEFF character. > > How are relative system identifiers supposed to be handled in > > DTDHandler? Suppose I have a DTD with a system id of dir/foo.dtd, > > which declares an unparsed entity with a system id of foo.eps > > (which refers to dir/foo.eps). If the systemId argument to > > DTDHandler.unparsedEntityDecl is foo.eps, then the application is > > going to have problems. There's a similar issue with > > EntityResolver.resolveEntity. > > This does seem to be a serious problem. One solution is to require > the parser to fully resolve system identifiers before reporting them > (as AElfred already does). This approach will work well with URLs, > but may break for other URI schemes. > > Any other solutions? In XP, my analog of InputSource has both an InputStream and optionally a URL to use a base URL for system identifiers in that InputStream. In each case where the application is passed a system identifier (whether for parsed or unparsed entity), the parser passes both the specified system identifier and the base URL from the InputSource analog. This gives the application complete control over resolving relative URLs, although at the cost of some complexity. In implementing the SAX driver for XP I try to make an absolute URL from the specified system identifier and the base; if that succeeds I pass the result (after conversion to a String); if it fails (for example because it is parsing from an InputStream with no specified system identifier) I pass the specified system identifier. That is the approach I would suggest for SAX. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Sun May 3 19:01:23 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:04 2004 Subject: Parents and children (was RFD: comp.text.xml) Message-ID: >> XML is more than a subset of SGML. It has its own limitations, its own >> practices, and its own audience. > >So far, it has a new name. So far it has its own limitations, and 20-odd announced books (3 of which have SGML in the title) coming out, as well as a new name. It also has press attention like SGML hasn't _ever_ seen. It has ringing endorsements from major corporations, and promising new products. None of it's here yet, really, but the standard only arrived two months ago. Not a bad start. Hopefully, it will grow. >> The extent of overlap may be examined, but I doubt it will be a majority for >> more than few months. XML is arousing interest in many computing sectors >> (HTML developers, database developers, OOP programmers) who in the past had >> nothing (or as little as possible) to do with SGML. > >Nice. Take the ideas, abuse the parent in public, claim to have created >a >new technology. Um... where did I invent a new technology? I was discussing additional sectors of users who are taking advantage of XML and who didn't use SGML much previously. As for abusing the parent, I think you may be in denial about SGML's less-than-friendly reputation. There were good reasons that the W3C created XML, I understand, as well as good reasons for naming it XML rather than SGML-Lite. This doesn't mean that SGML is evil incarnate - it just means that it involved a learning curve that many people found less than exciting. I also said "XML should acknowledge SGML as the parent, the teacher, the source of inspiration, and move on to live its own life." This doesn't sound like I'm denying SGML's beneficent influence. >> I see comp.text.xml as a place for people, both new and experts, to >> communicate about the new things opened up by XML, not as a place for a >> relatively small club of experts to form and maintain alliances. > >That relatively small club created everything that you >are now claiming is a new language. You are being somewhat less than >ethical in that comment. I'm very thankful to that small club, as I've said repeatedly. I just don't think that comp.text.xml is meant as a forum for diplomacy. XML-DEV or other forums are probably more appropriate. I don't understand why you seem so dismissive of the needs of people who are coming to XML from non-SGML (and non-standards body) backgrounds. And, to end on a positive note: >> I've been perusing comp.text.sgml for the last few months. While I'm >> impressed with the quality of discussion (as I have been on XML-DEV and >> XML-L), I don't think it's a great place to learn about XML. It's not always >> easy to determine when SGML-only features are being discussed, and the >> question overlaps can lead to some fairly significant confusion, especially >> for beginners. >This is speculative but so far, the one issue that can be taken >seriously. Well, at least we seem to agree on the pedagogical value of a separate comp.text.xml newsgroup. That's a start. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sun May 3 19:35:27 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:04 2004 Subject: Using schemas with XML parser In-Reply-To: References: Message-ID: <199805031734.NAA00245@unready.microstar.com> Zehavi, Gil writes: > I wrote xml-data schema and instances that are based on that > schema, but I don't know how to combine them in order that the > parser (i'm using the Microsoft parser which is supposed to know > how to deal with schemas) will recognize that a schema is being > used. Putting both the schema and the instances in one files is not > good since the parser recognizes only one root. I don't know how to > link the two parts if they are defined in different files (my > application is not a web application). I can use DTD but the use of > schemas is very useful in my case. I would appreciate any > information I can get. You can have the best of both worlds: define your document type using any high-level schema language (it doesn't have to be XML-Data), then compile the schema into a standard XML 1.0 DTD for parsing or distribution. When you make any changes, make them to the high-level original rather than to the DTD, then recompile. This way, you can enjoy any advantages that the higher-level schema language offers, but you are still interoperable with conformant XML tools. You also have the advantage of being able to choose (or invent) any schema language. I can imagine specialised fields inventing their own abstract (and much simpler) schema languages that deal with only a few high-level, domain-specific units (like tasks or forms or financial tables) rather than low-level elements and attributes. XML-Data will undoubtedly satisfy some needs, there is probably not a one-size-fits-all solution to schemas. Has anyone written a compiler for XML-Data yet? All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Sun May 3 19:38:26 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:04 2004 Subject: Announcement: SAX 1.0gamma In-Reply-To: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca> References: <3.0.32.19980503064103.00b5bec0@pop.intergate.bc.ca> Message-ID: <199805031736.NAA00257@unready.microstar.com> Tim Bray writes: > At 09:12 AM 5/3/98 -0400, David Megginson wrote: > >James Clark writes: > > > It should be specified whether a byte order mark at the beginning > > > of a XML byte stream is included as part of the character stream. > > > I don't think it should be > > > >My first hunch is the opposite: the XML productions deal with > >characters, not bytes. > > I think James has the spec on his side here. The BOM should *definitely* > be hidden from SAX customers. My fault -- I am very tired (for non-SAX reasons), and misread James's comment: I somehow assumed that he was suggesting that the BOM be removed from _byte_ streams. Tim and James are entirely correct: the BOM should not appear in character streams. Sorry for any confusion, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From schampeo at hesketh.com Sun May 3 21:47:54 1998 From: schampeo at hesketh.com (Steven Champeon) Date: Mon Jun 7 17:01:04 2004 Subject: RFD: comp.text.xml In-Reply-To: <354C9391.60FA@hiwaay.net> Message-ID: On Sun, 3 May 1998, len bullard wrote: > Simon St.Laurent wrote: > > > > XML is more than a subset of SGML. It has its own limitations, its own > > practices, and its own audience. > > So far, it has a new name. Len, you're sounding a bit defensive. Do you think C would have gotten as far as it did if every time someone brought it up, someone else popped up to remind everyone that it was inspired by BCPL? Need we say that Java was once named Oak in order to reap its benefits? I started out working with SGML, waited for the CALS table model, hung out wondering when DSSSL would be done, learned Author/Editor's internal styles language as a desperation measure, bemoaned the ridiculous complexity of FOSI, and when the Web hit I never looked back. I'm thrilled to death that the idea of separating presentation from markup has become a reality for those of us without multi-million dollar consulting budgets, and I'm thrilled that XML will be part of the next generation of browsers. I'm afraid that Microsoft has cornered the market on Windows-based XML parsing, by virtue of its being built into the "Internet Explorer OS Upgrade" blob they seem to be pushing as a pre-req for any new application installs, but the fact that there is a standard to appeal to - a simple standard, which doesn't give MS much cover should they decide to slightly alter their implementation. I think it's amazing that there are fully-functional parsers out there today. AFAIK, there isn't a single application out there which fully supports all of the SGML arcana. Len, I've known you since my first days reading comp.text.sgml, and I respect your enlightened approach to subjects both technical and humane. Please don't expect XML to be a bait-and-switch move to bring SGML into everyone's home. And please recognize that XML is a powerful, simple, iteration on some of the ideas espoused by SGML. Where XML will not suffice, perhaps SGML will thrive. But where XML will suffice, its users need not know thing one about SGML. In my mind, this is a good thing. -- Steven Champeon | "While we're all very dependent on http://hesketh.com/schampeo/ | technology, it doesn't always work." http://a.jaundicedeye.com | - Bill Gates xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 3 21:54:20 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:04 2004 Subject: Using schemas with XML parser References: <199805031734.NAA00245@unready.microstar.com> Message-ID: <354CC7F8.9D3B78B5@technologist.com> David Megginson wrote: > > Has anyone written a compiler for XML-Data yet? I haven't tried it, but my *guess* is that if someone tried to implement XML-Data they would find it is more a repository for interesting ideas than a finished, implementable spec. When last I looked at it there were some things tht I considered to be holes. Some things seemed specified by example rather than by specification, for example. Of course none of that is a criticism. It's a NOTE, not a REC. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 3 21:54:32 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:05 2004 Subject: Parents and children (was RFD: comp.text.xml) References: Message-ID: <354CC975.ACB97DFC@technologist.com> I think that we could end this flamewar quite simply. XML is an SGML subset. That's a verifable, mathematical fact. As far as I can tell, Simon St. Laurent has no concrete proposal for how to change that fact and I can't think of any reasonable ones off of the top of my head. We could just throw in a feature to break compliance and if ISO adopts it, we could throw in another, and keep doing so until they gave up. NYAH NYAH! So there! We win! Since there is no proposal, there is nothing to argue about and argumentation is wasted breath. Nevertheless, XML practice is often quite different from SGML practice and there are many questions that apply only to one or the other (especially tools questions). That seems justification enough for separate newsgroups. Where concerns overlap, there is cross posting. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Sun May 3 23:08:54 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:05 2004 Subject: Parents and children (was RFD: comp.text.xml) Message-ID: >XML is an SGML >subset. That's a verifable, mathematical fact. As far as I can tell, Simon >St. Laurent has no concrete proposal for how to change that fact and I >can't think of any reasonable ones off of the top of my head. Great. I haven't actually suggested that anyone start throwing spare parts into XML for the sake of breaking SGML compliance. Nor have I suggested that the W3C should declare its independence of the ISO, like there's actually a tie there. What I have suggested is that the SGML community stop mixing SGML and XML like they are identical practices. They aren't. Note that XML-L isn't a subset of HyTime, XSL isn't a subset of DSSL, and XPointers don't fit TEI precisely either. This could change, but I think a lot of people would be disappointed. XML practice and the XML spec itself appear to be two different things; the child is in fact diverging from the ways of the parent. If the parent chooses not to acknowledge that, fine. It's early in the child's development, after all. As Paul went on to say, >Nevertheless, XML practice is often quite different from SGML practice and >there are many questions that apply only to one or the other (especially >tools questions). That seems justification enough for separate newsgroups. >Where concerns overlap, there is cross posting. Here we seem to be saying the same thing. That, to me, suggests the end of an argument. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 3 23:16:37 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:05 2004 Subject: Software specifications (was Re: announce : innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1) In-Reply-To: <354C68CF.2B7A@cabinfo.com> Message-ID: <3.0.1.16.19980503161019.2fcf7a6e@pop3.demon.co.uk> Thanks for the posting, At 14:53 03/05/98 +0200, Alain DESEINE wrote: >innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1 [... impressive specification of XML tools ...] I couldn't immediately see a statement of what platforms and/or language this system uses (Please forgive me if I've missed it). I suggest that those posting software announcements on XML-DEV include a very short summary of attributes such as: - platform - or language if it can run, or be compiled on multiple platforms - source availability - any usage or distribution restrictions - free or costs-money (no need to give prices). - any other software required (e.g. uses FOO-XML as a parser, BAR-DSSSL for stylesheet support, etc.) This helps some classes of user decide whether it is unnecessary to visit the site (e.g. can't run on a MAC, etc.). Sometimes (not on XML-DEV :-) with large downloads I have spent an hour or more (costs money) to find that the kit could not run on what I have. It also helps people like Robin Cover and many other who compile lists of software and related resources. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 3 23:36:04 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:05 2004 Subject: Crossposting to XML-DEV and XML-L (was Re: Parents and children (was RFD: comp.text.xml)) In-Reply-To: <354CC975.ACB97DFC@technologist.com> References: Message-ID: <3.0.1.16.19980503212724.2f1759c0@pop3.demon.co.uk> [crossposted to XML-L intentionally] At 15:45 03/05/98 -0400, Paul Prescod wrote: >I think that we could end this flamewar quite simply. XML is an SGML [...] > >Since there is no proposal, there is nothing to argue about and >argumentation is wasted breath. This discussion (?flame war?) is being carried on in duplicate on XML-L and XML-DEV. I believe it's a good thing to post only to a single list unless the posting is of *very* general value. (Automatic replies without reading the list of recipients is hardly ever a Good Thing.) I also suggest that on XML-DEV we favour discussions that seem to be leading to something concrete - whether software, resources, protocols or merely reshaping the future of the human race. In arguing for a new newsgroup it's worth making sure that the current organs fill different and non-overlapping roles. As I said I'm deliberately neutral. Relax... hack some SAX... P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From t.mueller at terus.envmtl.com Mon May 4 02:54:27 1998 From: t.mueller at terus.envmtl.com (T.Mueller) Date: Mon Jun 7 17:01:05 2004 Subject: comp.text.xml In-Reply-To: <000001bd75ce$fdf449c0$d86118cb@caleb> Message-ID: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com> James K. Tauber wrote: > Policy on Advertising > > We encourage discussion of the merits and shortcomings of > commercial products. A certain amount of advertising (both objective > and advocative) is to be expected and this is to be encouraged... While I can certainly buy into James premise that advertising can advocate and I can accept that in something like a legal context advocacy can be objective I am, however, having trouble trying to figuring out just what is "objective advertising" ? A small thought... you humour may vary... Terry xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jtauber at jtauber.com Mon May 4 04:05:33 1998 From: jtauber at jtauber.com (James K. Tauber) Date: Mon Jun 7 17:01:05 2004 Subject: comp.text.xml In-Reply-To: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com> Message-ID: <000401bd7701$17434880$ea6118cb@caleb> > James K. Tauber wrote: > > Policy on Advertising > > > > We encourage discussion of the merits and shortcomings of > > commercial products. A certain amount of advertising (both objective > > and advocative) is to be expected and this is to be encouraged... > > While I can certainly buy into James premise that advertising > can advocate and I can accept that in something like a legal context > advocacy can be objective I am, however, having trouble trying to figuring > out just what is "objective advertising" ? Ok, I confess to pinching that sentence from various sample RFDs around the place :-) I can see how one might imagine ads on an objective-advocative continuum but "objective advertising" is an amusing oxymoron. James -- James Tauber / jtauber@jtauber.com Perth, Western Australia XML Pages: http://www.jtauber.com/xml/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Mon May 4 04:44:53 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:05 2004 Subject: comp.text.xml References: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com> Message-ID: <354D2BB4.B1E30408@technologist.com> T.Mueller wrote: > > While I can certainly buy into James premise that advertising can advocate > and I can accept that in something like a legal context advocacy can be > objective I am, however, having trouble trying to figuring out just what > is "objective advertising" ? "Here's our product. It isn't the best in its category, but we think it's pretty good value for the money. You may or may not agree. However you feel, please tell us so that we can add your comments to our next round of advertising." Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alain at cabinfo.com Mon May 4 10:14:33 1998 From: alain at cabinfo.com (Alain DESEINE) Date: Mon Jun 7 17:01:05 2004 Subject: Software specifications (was Re: announce : innovation Partners and CEI announce IRIS XML Message-ID: <354D7882.1967@cabinfo.com> Peter Murray-Rust wrote: > > Thanks for the posting, > > At 14:53 03/05/98 +0200, Alain DESEINE wrote: > >innovation Partners and CEI announce IRIS XML DTD GENERATOR beta 1 > > [... impressive specification of XML tools ...] > > I couldn't immediately see a statement of what platforms and/or language > this system uses (Please forgive me if I've missed it). I suggest that > those posting software announcements on XML-DEV include a very short > summary of attributes such as: > - platform > - or language if it can run, or be compiled on multiple platforms > - source availability > - any usage or distribution restrictions > - free or costs-money (no need to give prices). > - any other software required (e.g. uses FOO-XML as a parser, BAR-DSSSL > for stylesheet support, etc.) > > This helps some classes of user decide whether it is unnecessary to visit > the site (e.g. can't run on a MAC, etc.). Sometimes (not on XML-DEV :-) > with large downloads I have spent an hour or more (costs money) to find > that the kit could not run on what I have. > > It also helps people like Robin Cover and many other who compile lists of > software and related resources. > > P. > > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic > net connection > VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary > http://www.venus.co.uk/vhg > We effectivly miss this informations so here they are : - Platform : Windows 3.x, Windows 95, Windows NT (Win 16 bits application) - No sources - Beta versions are free for evaluation purpose, feedback, bug report, and functionnalities wishes. - Future 1.00 version will be Commercial, so they won't be free ... You can download it at http://www.cabinfo.com/download.htm Alain. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 11:06:04 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:05 2004 Subject: comp.text.xml In-Reply-To: <354D2BB4.B1E30408@technologist.com> References: <005c01bd76f7$367244e0$71012fd1@tribune.terus.envmtl.com> Message-ID: <3.0.1.16.19980504075617.297f3254@pop3.demon.co.uk> At 22:45 03/05/98 -0400, Paul Prescod wrote: [...] > >"Here's our product. It isn't the best in its category, but we think it's >pretty good value for the money. You may or may not agree. However you >feel, please tell us so that we can add your comments to our next round of >advertising." About the maximum number of words for a general statement. On xml-dev we have always assumed that some tools will be commercial and that an 'objective' announcement may - if appropriately presented - be of value to the XML development community. In the early stages of development there might only be a single (commercial) tool for an operation, or a conflict between syntactic/semantic interpretation (see following message). It's useful to know about these tools. Factual statements will often benefit the community. As any set of tools matures, then we would expect the value of posting such information to diminish, being replaced by those that address the next set of problems. I should compliment the XML/SGML commercial community in that the balance appears to have been well kept. Some criteria might be: - will my product help the XML development community in general? [This is the most critical.] - is my product almost entirely concerned with XML? - does the use of my product address horizontal problems rather than vertical ones? - does the use of my product raise important issues for software/protocols/syntax/semantics - is a free version of my product available for test/proof_of_concept/academics/non-profits/personal? - am I responding to queries addressed on the XML-DEV list (or possibly elsewhere, but raising important XML-DEV-related issues). - is my product novel? It will *not* be appropriate to mention competitors's products ("FOO-BAR doesn't do this"). Any comparison of products should be for the general development of the XML-DEV community and preferably by someone not involved with a manufacturer. If so, a *brief* statement of the attributes of the product (see previous message about missing platform information) is valuable at the start of the posting. Try to refrain from describing every sub-menu item unless they are novel. There are specialist fora and resources for the announcement of XML software and other products, and it may be more advantageous to post them there. We shall *certainly* reach the stage where, if someone posts "convert your SGML documents to XML for only $5000" (yes, I got a paper flier offering this), we shall suggest it is inappropriate for XML-DEV as there are already many robust ways of doing this. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 11:06:10 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:05 2004 Subject: Attributes with Intent Message-ID: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> The XML Recommendation creates a category of attributes which have intent. (2.10 xml:space, 2.12 xml:lang). I believe that this raises implementation problems for which we may wish to devise a common protocol. The same problem arises in implementing XLink which, although not final, appears likely to contain the same (or closely related constructs). Consider the following macaronic document (about as much as I can manage :-): ]> gut What are the attributes of ? Correct: there aren't any, but the "intent" of xml:lang="de" applies to . I regard this as a subtle form of minimisation which may create considerable problems further downstream. I think this will be particularly problematical for software which relies on attributes to identify or process parts of a document. It's reasonable to assume that generic text-aware XML software might be asked "please find all elements in a document which are expressed in German." It can't look for all elements with xml:lang="de" because they don't have this explicitly. So general mechanisms such as XPointers can't be used, and bespoke software must be written. This software (presumably) finds all elements which *do* have the attribute and continues recursively. At this point an application developer has to ask: "[how] should I include support for xml:lang in my *application*?". The spec makes it clear that the *parser* does not add an attribute xml:lang, i.e. the above document is not equivalent to: ]> gut (which is well-formed but *invalid*). or ]> gut which explicitly shows the author's intent. The three documents will presumably *behave* similarly (I hope identically). The same concern applies to xml:space. XLink is similar but uses a different phrasing (4.3): "If any such [semantic] attribute [such as role] is omitted from a locator element, the value providing on the containing linking element is to be used". Example: The locator elements *behave* *as if* they were represented as: Note: this would be well-formed but *invalid* unless was declared with ---------------------------------- The problem is that *all* applications dealing with XML and XLink now have to consider three separate non-trivial questions. They can ignore the first two and assume that humans can deal with the resulting fuzziness (though the automation of the language aspect is important for machine-based translation and terminology). If XLink is to work, some mechanism *must* be provided and I am struggling with this at present :-) Since the three mechanisms seem to be supportable by a single software module it seem to make sense to discuss whether this is possible. If so perhaps we can devise a communal mechanism/interface. Otherwise we shall have semantic fragmentation where some tools support parts of some of these and other support other parts and some behave inconsistently w.r.t others. I have voiced these concerns before - are there others who see this as needing addressing at an implementation level? If so, we should tackle this quickly before the fuzziness increases [and certainly before other drafters of new specifications adopt this approach, thus increasing the load on the implementers.] P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jtauber at jtauber.com Mon May 4 11:46:57 1998 From: jtauber at jtauber.com (James K. Tauber) Date: Mon Jun 7 17:01:05 2004 Subject: expat question: processStream vs processFile/filemap Message-ID: <000001bd7741$84c62680$ea6118cb@caleb> As I've mentioned before, I'm writing some documentation for expat and I'm currently doing a walkthrough of xmlwf. My lack of knowledge of C file I/O is letting me down. Can somebody who has worked with the expat source (or who is at least willing to have a quick look at xmlwf.c) tell me what the difference is between using processStream and using processFile/filemap? Why is it a command-line option? Why would you pick one over the other? Thanks James -- James Tauber / jtauber@jtauber.com Perth, Western Australia XML Pages: http://www.jtauber.com/xml/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Mon May 4 11:58:44 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:05 2004 Subject: New version of expat Message-ID: <354D9000.64F6C1D2@jclark.com> A new version of expat (my XML parser written in C) is now available. See http://www.jclark.com/xml/expat.html expat is an XML parser written in C. This version just fixes bugs (some of which are quite serious). James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Mon May 4 12:43:43 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:05 2004 Subject: expat question: processStream vs processFile/filemap References: <000001bd7741$84c62680$ea6118cb@caleb> Message-ID: <354D9260.1A7A7944@jclark.com> James K. Tauber wrote: > > As I've mentioned before, I'm writing some documentation for expat and I'm > currently doing a walkthrough of xmlwf. My lack of knowledge of C file I/O > is letting me down. > > Can somebody who has worked with the expat source (or who is at least > willing to have a quick look at xmlwf.c) tell me what the difference is > between using processStream and using processFile/filemap? Why is it a > command-line option? > Why would you pick one over the other? Using file mapping is typically more efficient, but you might need to use processStream if: - your OS doesn't support file mapping (although xmlwf can fake it by reading the entire file into memory) - you don't want to deal with the portability problems that using file mapping brings - you have files larger than can be mapped (eg > 2Gb) - you are reading from a network connection or pipe rather than a file It's a command-line option for testing purposes. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Mon May 4 12:51:15 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:05 2004 Subject: SAX Exception Handling Message-ID: <354D9C4F.8FF94AA8@jclark.com> David Megginson wrote: > > James Clark writes: > > It's probably too late for this, but I'm having problems seeing the > > logic in the exception handling design. The design seems to make > > things inconvenient for both users and implementors: implementors > > have to wrap SAXException in order to pass it up through their > > parsers, and in handler methods users have to wrap their exceptions > > in SAXException. > > The former problem exists only when SAX support comes through a > separate driver (as, admittedly, is usually be the case). A new > parser, written from scratch, could include SAXException in its throw > clauses without using a wrapper. > > The latter problem is very annoying, but there seems to be no > obviously correct solution. I received a very, very large number of > objections to my use of java.lang.Exception. I don't want to > vacillate any more, and have settled on a SAXException wrapper simply > as the (slightly) lesser of two evils. With my application writer's hat on (I just converted XMLTest to the new SAX), the current solution is a significant step backwards. The typical simple SAX program that processes an XML document and produces some sort of output is going to have to make calls in its DocumentHandler implementation that throw IOExceptions, and each of these methods now have to do: try { ... } catch (IOException e) { throw new SAXException(e); } The original goal of SAX was to be simple and easy to use for application writers. I don't think requiring this sort of mumbo jumbo even for trivial programs is consistent with this goal. I don't see an ideal solution, but I can think of several possibilities in addition to the old solution and the current solution, any of which I think would be an improvement over the current solution: 1. The handler methods are declared to throw both SAXException and IOException (as I proposed for parse). My guess is that throwing IOException is going to be very common, and will avoid the need for the user to wrap exceptions themselves in a large proportion of simple programs. I can't see any disadvantages over the current proposal. 2. A variation on this would be to make SAXException extend IOException and then declare both the handler methods and parse as throwing IOException. 3. Another possibility would be keep the handler methods declared as throwing java.lang.Exception but declare parse as throwing SAXException. In other words it would be the responsiblity of the parser to do something like: try { handler.startDocument(); } catch (RuntimeException e) { throw e; } catch (Exception e) { throw new SAXException(e); } around each call of a SAX handler method. This makes life slightly harder for application writers but makes life much easier for users. I can see the objection to having parse throw java.lang.Exception, but I can't see the objection to having the interface handler methods throwing java.lang.Exception. Implementations of that interface are free to declare that they throw only some subclass of java.lang.Exception, and so no exception type checking is lost. It does make life a little harder for the parser writer, but I think they can cope: you could even declare a convenience class that wrapped around a DocumentHandler and dealt with the exception wrapping: class DocumentHandlerWrapper { DocumentHandler handler; DocumentHandlerWrapper(DocumentHandler handler) { this.handler = handler; } void startDocument() throws SAXException { try { handler.startDocument(); } catch (RuntimeException e) { throw e; } catch (Exception e) { throw new SAXException(e); } } // ... } The parser writer could just wrap a DocumentHandlerWrapper around the users DocumentHandler and get the same convenience they get with the current solution. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 14:29:20 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:05 2004 Subject: SAX Exception Handling In-Reply-To: <354D9C4F.8FF94AA8@jclark.com> Message-ID: <3.0.1.16.19980504114900.3bb74718@pop3.demon.co.uk> At 17:45 04/05/98 +0700, James Clark wrote: >The original goal of SAX was to be simple and easy to use for >application writers. I hope that it still is - I appreciate that we have had some drift, but much of it has centered on Exceptions. >I don't think requiring this sort of mumbo jumbo >even for trivial programs is consistent with this goal. > >I don't see an ideal solution, This seems to be a problem that is fundamental to Exception handling in general (not SAX-specific). It's compounded by the fact that we need to (a) interoperate with Java Exceptions (b) assume that SAX will be put alongside packages that have yet another set of Exceptions. I spent yesterday afternoon hacking the new SAX/AElfred including the extraction of error messages and transferring them to display of nonwf documents. My blunderings weren't pretty (I failed to realise that one test example threw FileNotFoundException rather than a parsing error, and couldn't understand why I couldn't extract the SAXException message). But I think I can work with the current SAX. (I was one of those who suggested SAX should 'wrap' Exceptions, so pillory me.) >but I can think of several possibilities >in addition to the old solution and the current solution, any of which I >think would be an improvement over the current solution: [... workable solutions snipped ...] I am sure that David will look at these carefully - I think the final decision is essentially his - he has put in *so* much work and got enough public and private mail to make a balanced decision. David must feel like he's on a never-ending marathon - if he feels that SAX1.0gamma is 'it', I think we should accept it. [Every finalisation has bits that someone doesn't like, even XML...] P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Mon May 4 14:35:31 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent In-Reply-To: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> Message-ID: <199805041233.IAA00288@unready.microstar.com> Peter Murray-Rust writes: > It's reasonable to assume that generic text-aware XML software > might be asked "please find all elements in a document which are > expressed in German." It can't look for all elements with > xml:lang="de" because they don't have this explicitly. So general > mechanisms such as XPointers can't be used, and bespoke software > must be written. This software (presumably) finds all elements > which *do* have the attribute and continues recursively. >From a logical perspective, your query is simply Find all elements in a document where the element or the nearest ancestor with a value for the xml:lang attribute specifies "de" as the attribute's value. With SDQL, for example, this test is surprisingly simple: (equal? (inherited-attribute-string "xml:lang") "de") Your point is well-taken, though: fragments of XML documents are often tightly bound to their context (similar situations would involve attributes for effectivity, security level, revision status, etc.). All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Mon May 4 14:54:24 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:06 2004 Subject: SAX: Exception Handling In-Reply-To: <354D9990.5410F401@jclark.com> References: <199805020046.UAA02818@unready.microstar.com> <354C4EF0.81ADF185@jclark.com> <199805031312.JAA00291@unready.microstar.com> <354D9990.5410F401@jclark.com> Message-ID: <199805041253.IAA00394@unready.microstar.com> James Clark writes: > With my application writer's hat on (I just converted XMLTest to the new > SAX), the current solution is a significant step backwards. The typical > simple SAX program that processes an XML document and produces some sort > of output is going to have to make calls in its DocumentHandler > implementation that throw IOExceptions, and each of these methods now > have to do: > > try { > ... > } > catch (IOException e) { > throw new SAXException(e); > } > > The original goal of SAX was to be simple and easy to use for > application writers. I don't think requiring this sort of mumbo jumbo > even for trivial programs is consistent with this goal. I stood by the same position for nearly five months, in the face of very loud opposition from people whose opinions I have great reason to respect (as I have yours). However, upon reflection (or perhaps simply mental exhaustion), I realised that throwing java.lang.Exception was a false economy. When I write public void endElement (String name) throws java.lang.Exception { [..] } The Java compiler will not help me discover which constructors or methods invoked in the body can throw exceptions that I have not yet considered. With public void endElement (String name) throws org.xml.sax.SAXException { [..] } The Java compiler will raise an error the first time I compile. I will discover that java.sql.Connection.createStatement (for example) can throw a SQLException, and I will take steps either to deal with the exception here in the handler (perhaps by reformulating the query) or for packing it up to send it to the top level. Of course, I can always avoid doing so by wrapping the entire contents of each handler in a try...catch statement public void endElement (String name) throws org.xml.sax.SAXException { try { [..] } catch (SAXException e) { throw e; } catch (Exception e) { throw new SAXException(e); } } By doing so, however, I have consciously pulled the pin out of the grenade myself, rather than having the grenade handed to me with the pin already missing. > I don't see an ideal solution, but I can think of several possibilities > in addition to the old solution and the current solution, any of which I > think would be an improvement over the current solution: > > 1. The handler methods are declared to throw both SAXException and > IOException (as I proposed for parse). My guess is that throwing > IOException is going to be very common, and will avoid the need for the > user to wrap exceptions themselves in a large proportion of simple > programs. I can't see any disadvantages over the current proposal. Your guess supposes that one class of XML application dominates. Database applications (and they may come to dominate XML) will more typically need to throw java.sql.SQLException, while GUI applications will often need to throw java.awt.AWTException. > 2. A variation on this would be to make SAXException extend IOException > and then declare both the handler methods and parse as throwing > IOException. Again, this variation benefits only one group of users. If SQLException were a subclass of IOException (for example), then the choice might be clearer. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Mon May 4 16:02:06 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:06 2004 Subject: SAX: Exception Handling References: <199805020046.UAA02818@unready.microstar.com> <354C4EF0.81ADF185@jclark.com> <199805031312.JAA00291@unready.microstar.com> <354D9990.5410F401@jclark.com> <199805041253.IAA00394@unready.microstar.com> Message-ID: <354DC8FC.B4291CEC@jclark.com> David Megginson wrote: > > James Clark writes: > > > With my application writer's hat on (I just converted XMLTest to the new > > SAX), the current solution is a significant step backwards. The typical > > simple SAX program that processes an XML document and produces some sort > > of output is going to have to make calls in its DocumentHandler > > implementation that throw IOExceptions, and each of these methods now > > have to do: > > > > try { > > ... > > } > > catch (IOException e) { > > throw new SAXException(e); > > } > > > > The original goal of SAX was to be simple and easy to use for > > application writers. I don't think requiring this sort of mumbo jumbo > > even for trivial programs is consistent with this goal. > > I stood by the same position for nearly five months, in the face of > very loud opposition from people whose opinions I have great reason to > respect (as I have yours). > > However, upon reflection (or perhaps simply mental exhaustion), I > realised that throwing java.lang.Exception was a false economy. When > I write > > public void endElement (String name) > throws java.lang.Exception > { > [..] > } > > The Java compiler will not help me discover which constructors or > methods invoked in the body can throw exceptions that I have not yet > considered. Right, so don't do that. Declaring an *interface* as throws java.lang.Exception does not constrain an *implementation* of that interface to be declared as throws java.lang.Exception. It can be declared to throw the appropriate subclass of java.lang.Exception. If you declare the interface public void endElement(String name) throws java.lang.Exception; then I can declare my implementation as public void endElement(String name) throws java.lang.IOException { } or public void endElement(String name) throws java.awt.AWTException { } or whatever is in fact appropriate to my implementation. In other words, it is only be declaring the interface as java.lang.Exception, that I can correctly declare the exceptions that by implementation throws and thereby take advantage of Java's exception checking. I just don't follow your argument at all. > > I don't see an ideal solution, but I can think of several possibilities > > in addition to the old solution and the current solution, any of which I > > think would be an improvement over the current solution: > > > > 1. The handler methods are declared to throw both SAXException and > > IOException (as I proposed for parse). My guess is that throwing > > IOException is going to be very common, and will avoid the need for the > > user to wrap exceptions themselves in a large proportion of simple > > programs. I can't see any disadvantages over the current proposal. > > Your guess supposes that one class of XML application dominates. > Database applications (and they may come to dominate XML) will more > typically need to throw java.sql.SQLException, while GUI applications > will often need to throw java.awt.AWTException. I am not saying it will dominate but I am saying it will be a significant class, and the other classes wouldn't be hurt by making it easier for this class. I don't see much merit in the argument that because we can't make things simpler for *all* applications we therefore shouldn't make things simpler for *some*. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 16:07:11 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent In-Reply-To: <199805041233.IAA00288@unready.microstar.com> References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980504140252.3bb717e8@pop3.demon.co.uk> At 08:33 04/05/98 -0400, David Megginson wrote: >Peter Murray-Rust writes: > > > It's reasonable to assume that generic text-aware XML software > > might be asked "please find all elements in a document which are > > expressed in German." It can't look for all elements with > > xml:lang="de" because they don't have this explicitly. So general > > mechanisms such as XPointers can't be used, and bespoke software > > must be written. This software (presumably) finds all elements > > which *do* have the attribute and continues recursively. > >From a logical perspective, your query is simply > > Find all elements in a document where the element or the nearest > ancestor with a value for the xml:lang attribute specifies "de" as > the attribute's value. > >With SDQL, for example, this test is surprisingly simple: > > (equal? (inherited-attribute-string "xml:lang") "de") > >Your point is well-taken, though: fragments of XML documents are often >tightly bound to their context (similar situations would involve >attributes for effectivity, security level, revision status, etc.). > > >All the best, > > >David > >-- >David Megginson ak117@freenet.carleton.ca >Microstar Software Ltd. dmeggins@microstar.com > http://home.sprynet.com/sprynet/dmeggins/ > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 16:09:49 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:06 2004 Subject: SAX in Applets Message-ID: <3.0.1.16.19980504140216.33370ee6@pop3.demon.co.uk> I am trying to put together the JUMBO package to run as an applet as well as an application. [Jumbo1.0 does this]. Since JUMBO2.0 is written in JDK1.1, with Swing, there are some new aspects for me. I'd be very grateful for any pointers - if you think they are general to XML-DEV, please reply here, else to me. I need three types of operation: 1. java/jre applications 2. applets from a remote site (i.e. from a server) 3. applets on the same file system (a CDROM) as the data and *.html 2 and 3 are close but there are differences in that the local CLASSPATH can interact differently with the browser. For example, in 3 it was important not to have a CLASSPATH set. My current approach is to structure directories: jumbo classes greetings.html (for driving the applet) *.jar (aelfred.jar, sax.jar. swing.jar) jumbo xml Jumbo.class (class files, etc.) greetings.xml example files *.cfg configuration files (e.g. parsers available, etc) To run a jre we have: jre -cp C:\jumbo\classes;C:\jumbo\classes\aelfred.jar;C:\jumbo\classes\sax.jar;C:\ju mbo\classes;C:\jdk1.1.5\classes.zip jumbo.xml.Jumbo greetings.xml This works! To run an applet the applet contains: [I have not put Jumbo in a jar, though presumably I could do so.] I'd be grateful for the following info: - will the jar files sax.jar, swing.jar, etc. work in applets (are they compressed? does it matter?) - can the swing.jar be put in the Netscape *.jar library (I'm using Netscape 4.05 which says it supports JDK1.1.5). - do the top-of-the-classes and *.jar *have* to be in the same directory as the *.html (I believe this used to be a requirement). - how do I manage resource files? In applications they can be picked up either from a -D argument or - ?deprecated - from an environment variable. This isn't possible for applets. - if there are other problems (or solutions) I haven't thought of, I'd be grateful. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Mon May 4 16:20:10 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent Message-ID: <3.0.32.19980504091654.00762d10@postoffice.swbell.net> At 08:33 AM 5/4/98 -0400, David Megginson wrote: >Your point is well-taken, though: fragments of XML documents are often >tightly bound to their context (similar situations would involve >attributes for effectivity, security level, revision status, etc.). But one of the advantages of a hierarchically-structured data representation is the ability to have scopes to which properties apply. If a processor can't at least maintain a stack of the attributes of the ancestors of the current element, it's pretty darn braindead. I mean come on, we have to presume *some* intelligence in processors. Cheers, E. --

W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 16:23:25 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent In-Reply-To: <199805041233.IAA00288@unready.microstar.com> References: <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> <3.0.1.16.19980504090531.2fa7aec0@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980504134317.2fcfada0@pop3.demon.co.uk> At 08:33 04/05/98 -0400, David Megginson wrote: >Peter Murray-Rust writes: > > >From a logical perspective, your query is simply > > Find all elements in a document where the element or the nearest > ancestor with a value for the xml:lang attribute specifies "de" as > the attribute's value. Absolutely. And with the tools I have written it's trivial (for me). And for those who have SQDL implementations, it's trivial as well. And so on. But for the newcomer to XML this is a major concern that they have to address. > >With SDQL, for example, this test is surprisingly simple: > > (equal? (inherited-attribute-string "xml:lang") "de") > >Your point is well-taken, though: fragments of XML documents are often >tightly bound to their context (similar situations would involve >attributes for effectivity, security level, revision status, etc.). The real point is that X*L/W3C approaches require *almost any* application writer to implement this. If I have to write > (equal? (inherited-attribute-string "xml:lang") "de") or (as I do) String lang = (currentNode.getXPointerNode("ANCESTOR(1,*,xml-lang,*)").getAttributeValue(" xml:lang")); it would be nice if we all had a consistent way of doing it. Whatever the solution it appears that processing links is significantly more effort because of this. Respecting confidentiality, I may or may not have made this point on XML-SIG but the decision has been taken and we have to find the best way of implementing it. The software not only has to search for (say) those elements with (say) role attributes, but those which are 'intended' to have role elements. It gets more fun with attribute remapping: which says that wherever you encounter the attribute "my-role" in a it is *really* a 'role' attribute. So we have: What is the 'role' of the ? [BTW it matters getting this right, since if you get the wrong reactants there might be an unexpected exothermic reaction.] Implementer 1 says: "map all molecule|new-role to behave like molecule|role" "make all locator molecules inherit the value of the parent roles (after mapping" Implementer 2 says: "make all locator molecules inherit the value of the parent roles (without mapping)" I suspect I side with 2, but can I be sure that everyone sees it that way? In any case there will be lots of times that we want an attribute from an element. For certain types of element/attribute we may have to ask: - give me the attributes - does this also have some 'intended attributes' - oh, and does it also have some attributes which are mapped by yet another level of indirection? The only way that I can see myself offering reliable software is if we agree communally on what the semantics of this are *and* implement some software/APIs to help ensure consistency. For the present I suspect I shall say: JUMBO does not support attribute remapping. Sorry. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Mon May 4 16:57:58 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent Message-ID: >But one of the advantages of a hierarchically-structured data >representation is the ability to have scopes to which properties apply. If >a processor can't at least maintain a stack of the attributes of the >ancestors of the current element, it's pretty darn braindead. This attribute discussion is interesting, because I'm not sure you want to expect the processor to have the entire document available for stack building. One of the projects that I'm working on (slowly) is an implementation of XPointers that uses the server's processing power to chop just out the section of a document the user wants and avoiding sending out millions of bytes of wasted data. I suspect that we'll need to make the server handle the xml:lang and xml:space stack, and re-enter their values into the topmost element returned, or something like that. xml:lang and xml:space seem like they're worth the extra effort, though. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From David.Brownell at Eng.Sun.COM Mon May 4 17:57:45 1998 From: David.Brownell at Eng.Sun.COM (David Brownell) Date: Mon Jun 7 17:01:06 2004 Subject: SAX: Exception Handling Message-ID: <199805041556.IAA27904@argon.eng.sun.com> > > throws java.lang.Exception > > { > > [..] > > } > > > > The Java compiler will not help me discover which constructors or > > methods invoked in the body can throw exceptions that I have not yet > > considered. > > Right, so don't do that. Declaring an *interface* as throws > java.lang.Exception does not constrain an *implementation* of that > interface to be declared as throws java.lang.Exception. It can be > declared to throw the appropriate subclass of java.lang.Exception. But then since application programmers write to interfaces, not implementations, then application programmers would have all those nasty problems with getting handed random exceptions. A primary intent of exception declarations is to constrain the kinds of errors that a substystem (e.g. parser) reports, so that its clients have a clean "contract" and know exactly the kinds of "expected faults" they have to deal with. Clauses like "throws Exception" are basically counter to the core philosophy of exceptions in Java. - Dave xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Mon May 4 18:25:31 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent Message-ID: <3.0.32.19980504112217.00da4a20@postoffice.swbell.net> At 02:56 PM 5/4/98 UT, Simon St.Laurent wrote: >>But one of the advantages of a hierarchically-structured data >>representation is the ability to have scopes to which properties apply. If >>a processor can't at least maintain a stack of the attributes of the >>ancestors of the current element, it's pretty darn braindead. > >This attribute discussion is interesting, because I'm not sure you want to >expect the processor to have the entire document available for stack building. You don't need the entire document, only the attributes of the ancestry of the root of the subchunk you're processing. This is not a big deal. Cheers, Eliot --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Mon May 4 21:16:47 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:06 2004 Subject: AElfred: URL Change Message-ID: <199805041915.PAA01628@unready.microstar.com> Microstar is moving the AElfred XML parser a little. Please change all links and bookmarks to point to http://www.microstar.com/XML/AElfred/ The old URL, "http://www.microstar.com/XML/", still contains a prominent pointer to AElfred for now, so stale links will not be disastrous. Thanks, and all the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From maillist at chris.hubick.com Mon May 4 22:07:55 1998 From: maillist at chris.hubick.com (Chris Hubick) Date: Mon Jun 7 17:01:06 2004 Subject: PEReference Message-ID: I am writing an XML analization tool upon which I am building a parser. At the bottom level it reads in XML and generates start/end/character events based on the productions in the XML spec. For example, when the parser encounters something matching the Name production, say the Name "foo", it would generate events: f o o When it is complete I hope to build an actual parser on top of it. The analizer can currently read most any document, "all" it is lacking support for, and which I am working on, is production [29] markupdecl, and all of it's dependencies. Now this is where I hit a snag. In the XML spec at: http://www.w3.org/TR/REC-xml#NT-markupdecl It states: > The markup declarations may be made up in whole or > in part of the replacement text of parameter entities. > The productions later in this specification for > individual nonterminals (elementdecl, AttlistDecl, > and so on) describe the declarations after all the > parameter entities have been included. I want the productions for an XML document BEFORE the parameter entities have been included. I really think the XML spec should have included productions for before as well as after PEReference inclusion. I want to do PEReference inclusion at the parser level, not at my lower "analizer" level, which I want to generate events that directly reflect what is in the document (before inclusion). So for my purposes, I need to figure out the grammer for an _unprocessed_ XML document. So my first step/idea was to just look at the current grammer, and start adding PEReferences where I thought necessary: [45] elementdecl ::= '' [48] cp ::= (Name | PEReference | choice | seq) ('?' | '*' | '+')? [51] Mixed ::= '(' S? '#PCDATA' (S? '|' S? (Name | PEReference))* S? ')*' | '(' S? '#PCDATA' S? ')' [52] AttlistDecl ::= '' [53] AttDef ::= S (Name | PEReference) S AttType S DefaultDecl [54] AttType ::= StringType | TokenizedType | EnumeratedType | PEReference [58] NotationType ::= 'NOTATION' S '(' S? (Name | PEReference) (S? '|' S? (Name | PEReference))* S? ')' [59] Enumeration ::= '(' S? (Nmtoken | PEReference) (S? '|' S? (Nmtoken | PEReference))* S? ')' Where I get really confused is: [9] EntityValue ::= '"' ([^%&"] | PEReference | Reference)* '"' | "'" ([^%&'] | PEReference | Reference)* "'" If, as the spec states, these are the declarations AFTER PE inclusion, how can there be PEReferences??? Part of the reason I am writing this is to get a better grip on (read learn) XML. Any guidance would be much appreciated, thanks! --- Chris Hubick mailto:chris@hubick.com http://www.hubick.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 4 22:09:58 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent In-Reply-To: <3.0.32.19980504091654.00762d10@postoffice.swbell.net> Message-ID: <3.0.1.16.19980504200839.29af6aba@pop3.demon.co.uk> At 09:17 04/05/98 -0500, W. Eliot Kimber wrote: >At 08:33 AM 5/4/98 -0400, David Megginson wrote: > >>Your point is well-taken, though: fragments of XML documents are often >>tightly bound to their context (similar situations would involve >>attributes for effectivity, security level, revision status, etc.). > >But one of the advantages of a hierarchically-structured data >representation is the ability to have scopes to which properties apply. If >a processor can't at least maintain a stack of the attributes of the >ancestors of the current element, it's pretty darn braindead. I suppose most of my code must be pretty darn undead then... The point I was making is that 'scope' is introduced in a total of about 3-4 sentences in total in the XML + XLink specs. It's actually quite easy for an implementer to miss. I'm not complaining - I'm saying that it would be useful to discuss whether we can all implement this easily and if so how. [BTW how much freely available is actually out there which implements any/all of these attributes?] The touchstone of XML-applications - the desperate Perl Hacker - is not necessarily going to build a stack of the attributes of an/every element. And I'd represent - gently - that some people (at least one) are going to have met attribute re-mapping for the first time and possibly get muddled. For example it seems that a Node must have 2-3 sets of methods: getAttributeValue(String attName); getPseudoAttributeValue(String specialAttnameInContext) and getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName) If this is a useful way forward (other than the somewhat contrived names) it would be nice to know. If not, what? P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From donpark at quake.net Mon May 4 23:12:14 1998 From: donpark at quake.net (Don Park) Date: Mon Jun 7 17:01:06 2004 Subject: ANN: SAXDOM Updated Message-ID: <002b01bd77a0$51b597d0$2ee044c6@arcot-main> A preliminary version of SAXDOM supporting the latest DOM spec (04/16/98) is released. There are still some clean ups (i.e. syncronization blocks for concurrent access to DOM objects) and plenty of tests to go. This version of SAXDOM includes NodeIterator.release method which is currently not in the spec. I am hoping it is included in the spec. You will find SaxContainerIterator class particularly interesting because it is implemented as a Node to allow 'between' iterator position and 'live' model requirements. Enough said. Here is the URL: http://www.docuverse.com/personal/saxdom.html Regards, Don Park http://www.docuverse.com/personal/index.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Mon May 4 23:35:38 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:06 2004 Subject: Attributes with Intent Message-ID: <3.0.32.19980504163150.006fd2f8@postoffice.swbell.net> At 08:08 PM 5/4/98, Peter Murray-Rust wrote: >For example it seems that a Node must have 2-3 sets of methods: > getAttributeValue(String attName); > getPseudoAttributeValue(String specialAttnameInContext) >and > getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName) Pretty much. I usually have these functions: - AttValue(string LocalAttName) Returns value of attribute given name as specified for the element. - LocalAttName(string ArchAttName) Returns local name mapped to architectural name (e.g., whatever xml:lang has been mapped to for this element). - ArchAttvalue(string ArchAttName) Returns the value of an attribute given its architectural name. Resolves remapping as necessary. - AttEffectiveValue(string LocalAttName) Returns the effective value of an attribute given its local (to the document) name. - ArchAttEffectiveValue(string ArchAttName) Returns the effective value of an attribute given its architectural name. Obviously, the last two are implemented using the first three to recursively examine the ancestry of the current element. There's another case, less common, but to think about, which is the case where the effective value of an ancestor element's attribute is dependent on the value in its content. You see this most often with security classification attributes, where the effective security classification of an element is the highest classification of any of its content, e.g., in this example, the document element's effective security "eyes-only": ]> ... ... Where "eyes-only" is a higher classification than "top-secret". I would not expect to see a generalized method for determining the effective value of attributes in this way. Propogation of attribute values down the hierarchy is also trickier than just looking up the ancestry, because you might have rules about how defaults get set or reset. HyTime's "default value list" (clause A.5.6, Default value list, part of the General Architecture), provides a more complete facility for specifying exactly what the defaults for implied attributes are and how they should be propogated. It's an example of the sorts of options you need to anticipate in the general case. >The touchstone of XML-applications - the >desperate Perl Hacker - is not necessarily going to build a stack of the >attributes of an/every element. How hard can it be?: # NOTE: forgive my pseudo Perl syntax-- you know what I mean. @attstack() ; # stack of attlist structures @attlist{name} = "value" ;# Associative array of atts values by name sub process_element() { push(attstack, attlist); # Push associative array of atts on stack for each att in element.atts ; # Method that returns array of att objects $attlist{att.name} = att.value next } Cheers, Eliot --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Tue May 5 00:58:02 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:07 2004 Subject: Attributes with Intent In-Reply-To: <3.0.32.19980504163150.006fd2f8@postoffice.swbell.net> Message-ID: <3.0.1.16.19980504225658.302f3ca6@pop3.demon.co.uk> At 16:32 04/05/98 -0500, W. Eliot Kimber wrote: >At 08:08 PM 5/4/98, Peter Murray-Rust wrote: > >>For example it seems that a Node must have 2-3 sets of methods: >> getAttributeValue(String attName); >> getPseudoAttributeValue(String specialAttnameInContext) >>and >> getPseudoAttributeValueAfterRemapping(String possiblyRemappedAttributeName) > >Pretty much. I usually have these functions: > >- AttValue(string LocalAttName) > Returns value of attribute given name as specified for the element. >- LocalAttName(string ArchAttName) > Returns local name mapped to architectural name (e.g., whatever xml:lang > has been mapped to for this element). >- ArchAttvalue(string ArchAttName) > Returns the value of an attribute given its architectural name. Resolves > remapping as necessary. >- AttEffectiveValue(string LocalAttName) > Returns the effective value of an attribute given its local (to the >document) > name. >- ArchAttEffectiveValue(string ArchAttName) > Returns the effective value of an attribute given its architectural > name. > >Obviously, the last two are implemented using the first three to recursively >examine the ancestry of the current element. This is exactly was I was after :-). These may seem nobrainers to you, but they aren't to people like me :-) [...] >>The touchstone of XML-applications - the >>desperate Perl Hacker - is not necessarily going to build a stack of the >>attributes of an/every element. > >How hard can it be?: > ># NOTE: forgive my pseudo Perl syntax-- you know what I mean. >@attstack() ; # stack of attlist structures >@attlist{name} = "value" ;# Associative array of atts values by name > >sub process_element() { > > push(attstack, attlist); # Push associative array of atts on stack > for each att in element.atts ; # Method that returns array of att objects > $attlist{att.name} = att.value > next > The point is not that it is not hard to *do*, it's knowing exactly *what* you have to do when. For you it's second nature :-). That's why I'm asking... P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Tue May 5 01:11:44 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:07 2004 Subject: Attributes with Intent Message-ID: <3.0.32.19980504180736.00e25644@postoffice.swbell.net> At 10:56 PM 5/4/98, Peter Murray-Rust wrote: >At 16:32 04/05/98 -0500, W. Eliot Kimber wrote: >> >>Obviously, the last two are implemented using the first three to recursively >>examine the ancestry of the current element. > >This is exactly was I was after :-). These may seem nobrainers to you, but >they aren't to people like me :-) My appologies: it does seem obvious to me, but I've been doing this sort of thing about 15 years now, so I guess it would. I've also had the advantage of being able to crib from lots of good code (such as the old sgmls.pl that you got with James Clark's old sgmls package...)... There's probably an "XML Processing Patterns" book in there somewhere, although I suspect that Bob Ducharme's book is that (I haven't read it yet). >The point is not that it is not hard to *do*, it's knowing exactly *what* >you have to do when. For you it's second nature :-). That's why I'm asking... Again, good point...keep asking. I'll try to be a bit more supportive in my answers. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From dan at intervista.com Tue May 5 02:00:17 1998 From: dan at intervista.com (Dan Ancona) Date: Mon Jun 7 17:01:07 2004 Subject: Separation of formatting... Message-ID: <3.0.32.19980504171122.0097fe20@mail.intervista.com> At 09:24 PM 5/1/98 -0400, Gregg Reynolds wrote: >An interesting paper on a similar topic is at >http://www.sil.org/sgml/ohco1.html, "Refining Our Notion of What Text >Really Is: The Problem of Overlapping Hierarchies." Ooh, that's a *really* good link. Unfortunately, it melted my cerebral cortex. I'll try to read it in more detail later. Re: the later posts in this thread, I hadn't considered the text/presentation feedback loop or how to handle it, but that will definitely have an effect on what I'm working on. Thanks for the thoughtful answers... Dan __________________________________________________________ Dan Ancona "engage!" Tech Support, Evangelism and Cookery Intervista Software xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Tue May 5 08:29:49 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:07 2004 Subject: SAX: Exception Handling References: <199805041556.IAA27904@argon.eng.sun.com> Message-ID: <354EB022.28130075@jclark.com> David Brownell wrote: > James Clark wrote: > > Declaring an *interface* as throws > > java.lang.Exception does not constrain an *implementation* of that > > interface to be declared as throws java.lang.Exception. It can be > > declared to throw the appropriate subclass of java.lang.Exception. > > But then since application programmers write to interfaces, not > implementations, then application programmers would have all those > nasty problems with getting handed random exceptions. > > A primary intent of exception declarations is to constrain the > kinds of errors that a substystem (e.g. parser) reports, so that > its clients have a clean "contract" and know exactly the kinds of > "expected faults" they have to deal with. You're not distinguishing two quite different uses of interface: 1. There are interfaces like Parser. These are interfaces that are implemented by the SAX implementation and called by clients of the application. For interfaces that are used in this way, I completely agree with the points you make. I did not suggest that methods on parser should be declared as throws Exception: they should be declared as throws SAXException. 2. There are interfaces like DocumentHandler. These are not implemented by the SAX implementation, nor are they called by the client. Instead the reverse is true: these are called by the SAX implementation and implemented by the client. Your arguments don't carry through to interfaces that are used in this way. Declaring methods in these interfaces as throws Exception does not cause a client to have to deal with random errors because they are not called by the client. Nor does it require the parser to deal with random errors > Clauses like "throws Exception" are basically counter to the core > philosophy of exceptions in Java. It's not that simple. The real problem ehere is the lack of parameterized types in Java. If Java had a concept of parameterized types, we could get the typing exactly write, eg (using a C++ like syntax): template interface DocumentHandler { void endElement(String name) throws ET; } template interface Parser { void setDocumentHandler(DocumentHandler handler); void parser(InputSource in) throws ET; } class MyDocumentHandler implements DocumentHandler { void endElement(String name) throws IOException { } } class Demo { void foo(Parser parser) { parser.parse(new MyDocumentHandler()); } } The use of Exception that I'm arguing for is analogous to the use of Object in a class like Vector. Because Java doesn't have parameterized types, you can't express the idea that the type you can get out of the Vector constrains the type you can put in (or vice-versa), so you have to settle for using Object. Similarily with SAX you can't express that the idea that the type of the exception that the Parser can throw contrains the type of the exception that the handler methods can throw, so you have to settle for using Exception. Just as with Vectors, you can easily add a type-safe wrapper: interface IODocumentHandler extends DocumentHandler { void endElement(String name) throws IOException; } class IOParser { Parser parser; public void setDocumentHandler(IODocumentHandler handler) { parser.setDocumentHandler(handler); } public void parser(InputSource in) throws IOException { try { parser.parse(in); } catch (SAXException e) { // The cast can't fail since the DocumentHandler is an // IODocumentHandler that can only throw IOExceptions throw (IOException)e.getException(); } } } With this wrapper I now get a version of SAX specialized for handlers that throw IOExceptions. Using this wrapper I get complete exception type-safety. I can similarily do a wrapper for AWTException or SQLException which will give me exception type-safety for applications using AWT or SQL. Note that these wrappers have minimal run-time cost, because the handler methods are called directly. The more I think about this the more certain I am that having the handler methods declared as throws Exception is the right solution. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Tue May 5 12:59:14 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:07 2004 Subject: AElfred: Missing files added Message-ID: <199805051057.GAA00457@unready.microstar.com> There were a couple of non-critical files missing from the AElfred 1.2 distribution (in particular, README.txt and ReaderDemo). I have updated the distribution with no other changes; you may download it from the following URL: http://www.microstar.com/XML/AElfred/aelfred-1.2.zip Please note that if you have already downloaded AElfred 1.2, you do not need to do so again unless you are interested in the missing files. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Tue May 5 13:01:20 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:07 2004 Subject: SAX: Exception Handling References: <199805041556.IAA27904@argon.eng.sun.com> <354EB022.28130075@jclark.com> Message-ID: <354EEFF9.BC000335@jclark.com> I've just switched XP to use the exception handling approach I'm advocating for SAX, and I'm very happy with it: - if you import com.jclark.xml.parse.*; import com.jclark.xml.parse.io.*; you get an interface where the handler methods can throw IOException and where the parse method throws IOException (the same as the old XP interface) - if you import com.jclark.xml.parse.*; import com.jclark.xml.parse.awt.*; you get an interface where the handler methods can throw AWTException and where the parse method throws AWTException and IOException; - if your application needs to throw some other kind of exception, you can either create a package with 4 trivial wrapper classes (which could be automatically generated from the com.jclark.xml.parse.awt.* source), or you can use com.jclark.xml.parse.base.* where the handler methods are declared as throwing Exception and the parse method is declared as throwing IOException and ApplicationException (which is a wrapper analagous to SAXException). James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From clovett at microsoft.com Tue May 5 19:04:09 1998 From: clovett at microsoft.com (Chris Lovett) Date: Mon Jun 7 17:01:07 2004 Subject: Is MSXML pure Java? Message-ID: <2F2DC5CE035DD1118C8E00805FFE354C0395C744@red-msg-56.dns.microsoft.com> > I just installed it to check it out and I noticed that it installs > a DLL. It also doesn't come in a standard jar or zip file, > but uses an installer which appends to the Microsoft Java VM > classes.zip file in \winnt\java\classes. In order words, it > seems too close and comfy with the Microsoft VM which scares me > into thinking that perhaps it uses one of MS's horible extensions > to tie it to IE/Windows. > > Is it non-pure? Or is the DLL stuff just some kind of ActiveX/DSO > bridge junk? > [Chris Lovett] The DLL that is an optimization that only works under Windows to improve the speed of Java IO by a large amount (I think it is a factor of 4 or something). This will simply not load on any other platform and become a no-op. To be sure you can remove the XMLStream classes all together. The rest of the parser is very much standard plain vanilla Java, although it does use some JDK 1.1 features. There is also a switch in their that looks for JDK 1.1, so if you have a newer JDK than 1.1 you will need to disable that test and simply set the jdk11 member variable to "true". > The docs aren't JavaDoced either :(, but it does appear to > have its own Object Model which seems suspiciously like > DOM (which is understandable since MS is one of the submitters), > but different enough to be confusing. Any word whether > MSXML will eventually support DOM native? > [Chris Lovett] Yes, the plan is to support DOM. The Java parser was written last year when the DOM was still in its infancy. So far the DOM is still a moving target. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From clovett at microsoft.com Tue May 5 19:16:55 1998 From: clovett at microsoft.com (Chris Lovett) Date: Mon Jun 7 17:01:07 2004 Subject: [xX][mM][lL] is reserved Message-ID: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com> Spec issue: Section 2.3 paragraph 3: "A Name is a token beginning with a letter or one of a few punctuation characters, and continuing with letters, digits, hyphens, underscores, colons, or full stops, together known as name characters. Names beginning with the string "xml", or any string which wouild match (('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved for standardization in this or future versions of this specification". Currently MSXML disallows "xml" from being used as a "namespace", for example: "xml:foo" since this seems to be the way the XML spec is heading with "xml:space" and "xml:lang". So is the above paragraph just a little too restrictive and should it include the colon ? If not then the following very common XML example is illegal: This is a test. Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that "xmlu" is an illegal name, but James Clark's expat seems to allow this ?? Can the XML spec gurus clarify this paragraph ? Thanks. Chris Lovett. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Tue May 5 19:55:44 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:07 2004 Subject: [xX][mM][lL] is reserved In-Reply-To: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com> (message from Chris Lovett on Tue, 5 May 1998 10:15:35 -0700) Message-ID: <199805051754.NAA21373@ruby.ora.com> [Chris Lovett] > "A Name is a token beginning with a letter or one of a few > punctuation characters, and continuing with letters, digits, > hyphens, underscores, colons, or full stops, together known as name > characters. Names beginning with the string "xml", or any string > which wouild match (('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved > for standardization in this or future versions of this > specification". [...] > Can the XML spec gurus clarify this paragraph ? Thanks. The spec seems clear to me. Names may not begin with "xml" or any case variation thereof. What's unambiguous? If you're asking if it *should* say what it does, the answer is yes, to me. The XML WG can always choose to free the string for wider use later, but would be unable to capture it again having done so. As with many other decisions, they've chosen to err (if an error it is) on the side of caution. However, one side effect of this is that a conformant XML 1.0 processor may reject a conformant XLink document for using "xml:link" as an attribute name. -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Tue May 5 20:45:29 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:07 2004 Subject: [xX][mM][lL] is reserved Message-ID: <3.0.32.19980505114421.00c608b0@pop.intergate.bc.ca> At 10:15 AM 5/5/98 -0700, Chris Lovett wrote: >Currently MSXML disallows "xml" from being used as a "namespace", for >example: "xml:foo" since this seems to be the way the XML spec is heading >with "xml:space" and "xml:lang". So is the above paragraph just a little >too restrictive and should it include the colon ? Yes, this should be disallowed as a namespace prefix... the latest namespace draft supports this point of view. >If not then the following >very common XML example is illegal: > > This is a test. > Yes. >Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that >"xmlu" is an illegal name, but James Clark's expat seems to allow this ?? Well, end-users certainly shouldn't invent such names in their documents. On the other hand, it's not clear that a processor should reject it, because... >Can the XML spec gurus clarify this paragraph ? Thanks. Well, actually, the spec is a bit mushy on this; for PI targets, it excludes "^xml*" right in the grammar, but for other names, it just says such names are "reserved," whatever that means. My interpretation has been that "reserved" meant, in practical terms, reserved precisely for things like xlink and namespaces and so on; but the spec doesn't do a good job of making this clear. I think, Chris, you ought to get Charles or Jean to raise this formally in the WG. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From donpark at quake.net Tue May 5 22:17:05 1998 From: donpark at quake.net (Don Park) Date: Mon Jun 7 17:01:07 2004 Subject: Is MSXML pure Java? Message-ID: <000a01bd785a$5c558460$2ee044c6@arcot-main> >> The docs aren't JavaDoced either :(, but it does appear to >> have its own Object Model which seems suspiciously like >> DOM (which is understandable since MS is one of the submitters), >> but different enough to be confusing. Any word whether >> MSXML will eventually support DOM native? >> >[Chris Lovett] Yes, the plan is to support DOM. The Java parser was >written last year when the DOM was still in its infancy. >So far the DOM is still a moving target. FYI, My FREE-DOM package (formerly called SAXDOM) can be used with MSXML if you combine two drivers: one for bridging FREE-DOM to SAX and another for bridging SAX to MSXML. A direct driver from FREE-DOM to MSXML as well as other popular parsers is planned. FREE-DOM can be obtained at: http://www.docuverse.com/personal/freedom/index.html Regards, Don Park http://www.docuverse.com/personal/index.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From elm at arbortext.com Wed May 6 01:35:26 1998 From: elm at arbortext.com (Eve L. Maler) Date: Mon Jun 7 17:01:07 2004 Subject: [xX][mM][lL] is reserved In-Reply-To: <98May5.140037edt.26883@thicket.arbortext.com> References: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com> Message-ID: <3.0.5.32.19980505193327.00a2bac0@village.promanage-inc.com> At 01:54 PM 5/5/98 -0400, Chris Maden wrote: >However, one side effect of this is that a conformant XML 1.0 >processor may reject a conformant XLink document for using "xml:link" >as an attribute name. FYI, because of the namespace work, we voted to change the xml:link attribute to xlink:form, though this doesn't appear in the spec yet. You'll see it in the next draft. (No, I'm not sure when that will be. :-) Eve xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Wed May 6 03:19:05 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:07 2004 Subject: RFD: comp.text.xml References: Message-ID: <354FBA1C.B61@hiwaay.net> Steven Champeon wrote: > > Len, you're sounding a bit defensive. Quite true. Some ideas and principles are worth defending to a point. This post is the last point. I've not time for a flamewar, and anyway, that is what the icon that looks like crossed sabres on the Outlook menu is there to stop. :-) > Do you think C would have gotten as > far as it did if every time someone brought it up, someone else popped up > to remind everyone that it was inspired by BCPL? Need we say that Java was > once named Oak in order to reap its benefits? No. That we remember this is the case is sufficient. OTOH, in my own descriptions of these, I decline to call Java a standard. It is a product of Sun well on its way to becoming a standard technology similar to RealMedia. These distinctions enable me to look at customers and say, "yes, runs best on X machine" and not endure the endless "But Why Won't They Implement the Standard?" questions. When it goes to ISO, as HTML has, I will call it a standard. Until then, XML is a standard technology and the property of the W3C. > I started out working with SGML, waited for the CALS table model, hung > out wondering when DSSSL would be done, learned Author/Editor's internal > styles language as a desperation measure, bemoaned the ridiculous complexity > of FOSI, and when the Web hit I never looked back. With you all the way. I spent time and personal capital pleading for a simplification. When the SGML On The Web project was announced, I stepped up to endorse it. It was an idea whose time had come and in fact, was overdue. I had watched customers, friends, companies, pay large prices. OTOH, I was also there to see certain ones pay off as SGML delivered on the lifecycle properties promised. Markup works. Since XML is the same subset of SGML that most of us had always been using, I feel confident it will deliver the same benefits. No quarrel there. > I'm thrilled to death > that the idea of separating presentation from markup has become a reality > for those of us without multi-million dollar consulting budgets, So am I Steve, but frankly, I am the rogue who didn't buy into the whole arcana and was using an SGML hypertext system with stylesheets by 1992-93. I even arranged to give it away: IADS. It wasn't everything I wanted, but it sure proved the point, and hey, it even let you design and use your own DTDs. That reality has been real for almost a decade. > and I'm > thrilled that XML will be part of the next generation of browsers. I'm > afraid that Microsoft has cornered the market on Windows-based XML parsing, > by virtue of its being built into the "Internet Explorer OS Upgrade" blob > they seem to be pushing as a pre-req for any new application installs, This doesn't bother me a bit. The rules for introducing standard technologies using the W3C processes have been followed apparently to the letter. Practicality prevails. At this point, MS is the defacto standard platform. Having XML on it is the right thing. It will be interesting to see how well it is used and by whom. There are some real issues about integrating Chrome with other standard languages, but that is another problem to be worked. I think XML and MS may soon face a pretty unruly crowd over that one. > but the fact that there is a standard to appeal to - a simple standard, > which doesn't give MS much cover should they decide to slightly alter > their implementation. I think it's amazing that there are fully-functional > parsers out there today. Since the goal was to enable the DPH to write a parser in a week or so, this shouldn't amaze you too much. Meeting the design requirements is usually good practice in engineering efforts. Given that no one had to start from scratch and most of the major decisions were pruning decisions over existing solid work, I would be very surprised if that weren't the case. > AFAIK, there isn't a single application out there > which fully supports all of the SGML arcana. AFAIK, you are right and that hasn't made a warts worth of difference. SGML has been and is applied to large publishing and database projects with much success. Cost of tools varied, but that was mainly in the rendering. The SGML vendor community overcharged and often overemphasized complete compliance. It was suicide and opened the door to the events that followed. The SGML Way was a recipe for commercial failure. XML proves that lesson was learned. Good. > Len, I've known you since my first days reading comp.text.sgml, and I > respect your enlightened approach to subjects both technical and humane. Thank you. > Please don't expect XML to be a bait-and-switch move to bring SGML into > everyone's home. I don't expect it to be. Nor did I expect SGML On The Web to be a bait and switch move to replace SGML with XML, but I was wrong. OTOH, at certain levels, I don't care about that either. For those of us who were fortunate enough to watch Dr. Goldfarb, James Mason, Lynn Price, Martin Bryan, Steve Newcomb, Sharon Adler, Anders Berglund, and the others who created SGML, XML is the crowning achievement, the proof that their ideas and their work have succeeded. If a name change is required to ensure the continuing success, so be it. But I will not sit by and watch them be robbed by the also rans and newtoBes of the industry of the credit for that. Not only does that deny what I know to be true, it is wrong. It also repeats and legitimizes that same precedent set by those who have publicly derided a work to capture their own market, then turn to take that same work, rename it, and call themselves fathers of a new language. History requires a certain ethical and moral payment: respect. > And please recognize that XML is a powerful, simple, > iteration on some of the ideas espoused by SGML. To be sure: SGML On The Web. > Where XML will not suffice, > perhaps SGML will thrive. ISO 8879 is dead. Without a base of implementors, no ideas however good can thrive. But there will be a price for it. ISO rules and processes had a hint of openness and rule of law. A Don Park stood a chance in that. Yet these rules and processes are made by men and in that, justice is the gift of the individual. I remember that very strange fellow with the prison tattoos who made it into the HyTime meeting in Almaden (sorry eliot, this predates you). I thought it awkward that he was there, yet, Dr Goldfarb neither feared him nor took measures to remove him. It was an awesome demonstration of integrity. If this is the case with the Director, then let him change the rules and open the meetings. This technology which he espouses and has done so much to promote enables it and without nearly the problems Dr. Goldfarb overcame. Let him be that much a man, and I will give him that much credit. > But where XML will suffice, its users need not > know thing one about SGML. In my mind, this is a good thing. In my view, they are the same thing and that is a good thing. If we have to accept that our *standards* are the product of The Director's approval, and that only the elect can know the shape of things to come such that a Don Park doesn't even have half a chance to compete, then it is time to go back to playing guitar for happy hour crowds. It is something I understand and can see the good of doing. I appreciate your comments, Steve. I really do. But my time here is past and there are other tasks to be done. There will be comp.text.xml, then there will be whatever follows that. As the twig is bent.... len xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Wed May 6 03:34:53 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:07 2004 Subject: Parents and children (was RFD: comp.text.xml) References: Message-ID: <354FBD8D.329C@hiwaay.net> Simon St.Laurent wrote: > > So far it has its own limitations, and 20-odd announced books (3 of which have > SGML in the title) coming out, as well as a new name. Congratulations. That is a good start. > It also has press > attention like SGML hasn't _ever_ seen. Umm... press in the web is a taken for granted thing now. As for SGML, check the references in the early part of the decade including a whole issue of Byte. Remember, those were different times before the deflated value of the press today. In those days, even a column in Byte caused quite a stir. > It has ringing endorsements from > major corporations, and promising new products. None of it's here yet, > really, but the standard only arrived two months ago. Since it is a pruned version of SGML, one might say that even Frankenstein could walk on the first day since the legs were already able. "It's Alive!!!" > Not a bad start. Hopefully, it will grow. Hopefully it will stay small and its applications will grow. > As for abusing the parent, I think you may be in denial about SGML's > less-than-friendly reputation. No. I have had to contend with that for some years. > There were good reasons that the W3C created > XML, I understand, as well as good reasons for naming it XML rather than > SGML-Lite. I agree with the first, not the last. We have to let that go. > This doesn't mean that SGML is evil incarnate - it just means that > it involved a learning curve that many people found less than exciting. So does XML. I think once it is out there awhile, you will get to endure that as well. You see, SGML did not start out with that reputation. It grew as some found it convenient. Compared to tagging Digital Standard Runoff by hand, it was a piece of cake. > I just don't > think that comp.text.xml is meant as a forum for diplomacy. XML-DEV or other > forums are probably more appropriate. I don't understand why you seem so > dismissive of the needs of people who are coming to XML from non-SGML (and > non-standards body) backgrounds. You will find that the first time a Jorn Barger drops on the list, diplomacy will be of some use. I don't dismiss their needs. I hope to see them met. I dismiss the arguments that say they become confused by SGML arcana. It is the same people answering both sets of questions. So far, that hasn't been a problem. > I don't think it's a great place to learn about XML. It's not always > >> easy to determine when SGML-only features are being discussed, and the > >> question overlaps can lead to some fairly significant confusion, especially > >> for beginners. > > >This is speculative but so far, the one issue that can be taken > >seriously. > > Well, at least we seem to agree on the pedagogical value of a separate > comp.text.xml newsgroup. That's a start. Yes. Much success to you. BTW, rumors are always suspect. ;-) len xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From donpark at quake.net Wed May 6 03:50:39 1998 From: donpark at quake.net (Don Park) Date: Mon Jun 7 17:01:07 2004 Subject: RFD: comp.text.xml Message-ID: <001a01bd7890$62c2a8b0$2ee044c6@arcot-main> Guys, >ISO 8879 is dead. Without a base of implementors, no ideas however >good can thrive. But there will be a price for it. ISO >rules and processes had a hint of openness and rule of law. >A Don Park stood a chance in that. Yet these rules and What is A Don Park? Are you guys using my name in the same spirit as John Smith or as in Roadkill? Don Park /don bark'/ n 1. Hopeless idealistic person who is destined to be a roadkill in the information age dominated by large corporations. || vt 2. to place and leave (a vehicle) temporarily only to find it flattened under a giant lowercase letter E. At least my name will be above Don Quixote whose entry will be amended to read "See Above". As silly as a Donut, Don Park http://www.docuverse.com/personal/index.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 6 05:48:42 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:08 2004 Subject: Parents and children (was RFD: comp.text.xml) References: <354FBD8D.329C@hiwaay.net> Message-ID: <354FDDC1.A80CEBB@technologist.com> len bullard wrote: > > Umm... press in the web is a taken for granted thing now. As for > SGML, check the references in the early part of the decade including > a whole issue of Byte. And even another misleading Economist article! And articles in PC Magazine, etc. > > This doesn't mean that SGML is evil incarnate - it just means that > > it involved a learning curve that many people found less than exciting. > > So does XML. I think once it is out there awhile, you will get to > endure that as well. You see, SGML did not start out with that > reputation. To me, this is such a crucial point that it deserves re-emphasis (and re-re-emphasis). XML inherits 95% of everything that made SGML difficult to apply and use. Not 95% of what made it hard to memorize the SGML standard, but that was never relevant, because only one person ever memorized the standard (at which point, presumably, SGML got boring and he moved on to other things). But 95% of the stuff that makes SGML hard to *use* is in XML. The flexibility, the dangerous parameter entities, the responsibility for defining your own tags, your own stylesheets and your own systems. The responsibility to define standards and stick to them. The perceived and real need to bend them. It's all there. Compared to HTML, XML is rocket science, just as SGML was before it. The Byte/Salon/comp....html backlash is a re-occurrence of a very old, very boring phenomenon. Generic markup makes people's lives harder (in the short term) in several ways. People have enough on their plates without digital elites trying to shove down their throats complicated, multi-layered junk that makes their lives with the promise that it will have long term benefits. The best (only?) way to get past that is with instant feedback, immediate gratification software. Using XML for designing data protocols provides pretty immediate gratification, which is why it has taken off in that capacity. On the other hand (and perhaps this is heretical...sorry), the data protocols world got along okay without XML in the first place. I have personally documented some interesting applications of XML to software interoperability problems, but they are essentially incremental changes: a more expressive IDL, HTTP with richer objects, and so forth. XML makes these things easier, but not fundamentally more powerful. In my opinion, XML's real power remains in the ability to make persistent data truly persistent -- across technological sea changes. XML browsers will one day provide more or less instant gratification for Jane Schmoe. She will still have to endure an extra level of abstraction, understanding and responsibility, but at least she can have her web page up and running in an evening. At that point, SGML/XML will have really rounded a corner. Right now, it merely seems to have done so, because it is the media's interest to promote it as such. And of course it will be similarly in their benefit to report and perhaps exaggerate the backlash. Luckily, people are willing (for now) to believe us that we will one day provide them with more interesting, interactive web pages. This is in large part because it is in the media's interest to promote that vision with us. I hope we do not keep the webmasters waiting so long that they start to disbelieve. Considering that the browser vendors cannot even implement CSS right, I sometimes have moments of shaky belief myself. Without the browsers, XML is a programmer's toy and we must begin our long, slow climb to usability again. Still, with Netscape's source available, we should not be subject to their whims anymore, so I am not too worried. In the meantime, if you teach XML, as I do, I would suggest that the closest thing to instant gratification we have today is Jade+Docbook for XML. Creating a series of web pages and a nice printed document from the same source file is pretty damn cool and much more exciting than looking at the output of a parser. > In my view, they are the same thing and that is a good thing. If > we have to accept that our *standards* are the product of > The Director's approval, and that only the elect can know > the shape of things to come such that a Don Park doesn't even > have half a chance to compete, then it is time to go back to > playing guitar for happy hour crowds. It is something I > understand and can see the good of doing. It wasn't so long ago that we were having a similarly pessimistic conversation about web standards (at that time, HTML and CSS) and you admitted that despite all of the setbacks, it was a good time for hypertext. That is more true today than ever. XML may still be due for a backlash, but lights are going on in people's heads all over the place. As long as the pattern is two steps forward, one step back, we are still making progress. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From pl at xmatrix.com Wed May 6 05:54:55 1998 From: pl at xmatrix.com (Philippe Lourier) Date: Mon Jun 7 17:01:08 2004 Subject: ANN: Tim Bray on TechNetcast, Transcript Released Message-ID: <3.0.5.32.19980505235220.00798ca0@mail.bettynet.com> The transcript of Tim Bray's appearance on DDJ TechNetcast has been posted on our website, www.technetcast.com Thanks to all who provided comments and suggestions PL Dr. Dobb's TechNetcast xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Wed May 6 06:25:41 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:08 2004 Subject: Parents and children (was RFD: comp.text.xml) References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com> Message-ID: <354FE614.2958E331@allette.com.au> Paul Prescod wrote: > Thank you. And thank you Len Bullard also. Although SGML has its warts, many of us have been using it and providing real solutions for many years - it is disheartening to see it kicked from pillar to post, often by those who have never had a commercial imperative to make it work. (This is a general observation and is not necessarily directed at any of the current participants of this thread.) That said, the parent/child analogy is entirely appropriate - the child has the responsibility for forging its own way in a different world than the parent grew up in, however the genetic bond is indisputable. XML wasn't invented, it was selectively bred. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Wed May 6 08:52:19 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:08 2004 Subject: Parents and children (was RFD: comp.text.xml) Message-ID: <00a901bd78bb$c06d2dc0$880b4ccb@NT.JELLIFFE.COM.AU> From: Paul Prescod > XML inherits 95% of everything that made SGML difficult >to apply and use...The flexibility, the dangerous parameter entities, the >responsibility for defining your own tags, your own stylesheets and your >own systems. The responsibility to define standards and stick to them. The >perceived and real need to bend them. It's all there. I think this is a good point. Markup languages have to cope with text--it is a different world to tagging nice simple fielded records of databases. People who are coming to text new usually have no idea the enormous richness of text--standard generalized markup languages (i.e. XML aka SGML) attempt to provide a very simple mechanism to cope with it--an element hierarchy with attributes and ID/IDREFs to allow graph structures to be represented. But the richness of text, and the sophistication of what people want to do with it, means that a simple tool like a standard generalized markup language makes many problems tractable but still not simple. People will blame the tool (XML), but the enemy is text itself. Anyway, people have trouble with standards (e.g. look at all the W3C applications which give a puported EBNF syntax), with the generalized approach (e.g. that famous web page "The Web is ruined and I ruined it"--sorry no URL), with markup (e.g. those who want documents to be embedded in programming languages) and with languages (e.g. the people who want to use a binary format for everything because of sometimes misplaced notions of efficiency). People already baulk at each of these 4 issues, sometimes appropriately, sometimes inappropriately. These central issues are much more important than the readability or implementability of ISO 8879 versus REC-XML. No matter how simple XML became, IMHO people will still resist buying into XML's standard generalized markup langage approach. In the Byte issue about XML, Gates says something about XML being preferred for the moment. So from their top we can see that Microsoft has not bought into XML as a strategy, but just as a tactic. The RDF people still see XML as a serialization format, which suggests that they may see DOM as more important than XML. So I think we should not fool ourselves that XML's future is assured. People who have not grown used them often have a mental block against standard generalized markup languages and always wish they would go away in favour of semi-proprietary, low-level, procedural, binary formats. Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jmulet at datalab.es Wed May 6 15:04:20 1998 From: jmulet at datalab.es (Jordi Mulet) Date: Mon Jun 7 17:01:08 2004 Subject: Architectural Forms, separation of formatting and loose-leaf management Message-ID: <01bd78ee$cb8ff240$69ac92c0@jordi.praxis.es> Hello, We are involved in a SGML project and we are finalisizing the analysis phase. At present we are thinking about how to implement the management of loose-leaf pages updates because any availabe off-the-shelf comercial implementation can cope with our needs. We think that the concept of architectural forms can help to manage the processing the flow between the intial information structures (SGML semantic tags ) and the presentational/layout structures of the final pages. As Rick Jelliffe said in a past discussion on XML-dev DSSSL ( with the MIF/RTF backend for example) can help to do the transformations to the the layout structures ( down-sizing), but we can't obtain any feedback from the layout program to the stylesheet language. We know that others programs as Omnimark or Balise can help to down-size the SGML instances but we will need a flow of dependence from the pages/layout to the original SGML. A possible implemention of our system would be to build three independent Databases: - Information SGML Database. - Layout Database (Quark, FrameMaker,...). - Page Objects Database ( PDF). and a "meta-Database" to manage all the "processing" an "relationships" between the elements of each database. This "meta-Database" would control and manage the workflow and updates of the different elements. Can architectural forms model this meta-database schema to control the three database from a top structure? Architectural forms can store all the information about processing. How difficult would be to build a similar system ? It will be necessary to define property sets and grove plans for the Layout scheme and Page scheme, doesn't ? Is there any working experience on this topic ? ( PDF, Quark, FrameMaker,...) Any feedack will be of great value for us! Thanks, Jordi Mulet Editorial Praxis S.A -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/b2514885/attachment.htm From jmarckel at netco.com Wed May 6 15:07:52 1998 From: jmarckel at netco.com (jeff marckel) Date: Mon Jun 7 17:01:08 2004 Subject: unsubscribe Message-ID: <35505F60.547F50CF@netco.com> unsubscribe -- Jeff Marckel -- Lead Software Engineer, WAM!BASE jmarckel@netco.com --or-- 612.886.5642 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mintert at irb.informatik.uni-dortmund.de Wed May 6 15:23:48 1998 From: mintert at irb.informatik.uni-dortmund.de (Stefan Mintert) Date: Mon Jun 7 17:01:08 2004 Subject: SGML declaration for XML Message-ID: <199805061323.PAA15216@brown.informatik.uni-dortmund.de> Hi! which is the correct SGML declaration for XML? I found three: 1a) I am new to SGML/SMIL, and I find myself wondering if an SMIL editor would be a good choice of commercial products to target. Do I need a wake-up call? I am imagining a specialized XML editor that includes temporal attributes and a frame by frame like content view. If the project is a go, then I would have moderate development funds to work with. I am aware there are key individuals on this list who are well established in such technology, and who are already working on their own agendas. Could we cooperate for mutual benefit? I would be interested in knowing more about desired product features, design issues, foreseeable risks, and whether anyone would be interested in participating with development and to what extent. Regards, Rolande Kendal kendal@interlog.com xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 6 16:22:39 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:08 2004 Subject: ANN: Tim Bray on TechNetcast, Transcript Released References: <3.0.5.32.19980505235220.00798ca0@mail.bettynet.com> Message-ID: <354FF09E.EDF19719@technologist.com> This transcript is really excellent stuff. Tim has hit the nail on the head about many issues. It should be required reading for people in the media. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 6 16:24:13 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:08 2004 Subject: Parents and children (was RFD: comp.text.xml) References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com> <354FE614.2958E331@allette.com.au> Message-ID: <354FF0A0.C1BF555F@technologist.com> Marcus Carr wrote: > > That said, the parent/child analogy is entirely appropriate - the child has the > responsibility for forging its own way in a different world than the parent grew > up in, You can call XML a child of SGML, or a new iteration of it. One view establishes a separation and the other eliminates it. Neither has any particular basis in reality: they are both rhetorical views. Consider: C++ (as a random example) must also forge its own way in a different world than previous iterations of it grew up in. Unlike XML, it wasn't renamed as it was adapted for new environments and tasks. Nevertheless, it went from being a Unix-centric front-end to C into a massive programming language used on every platform with compilers that border on artificial intelligence. Is modern C++ a "child" of that older one? Or is it just a new version? Stroustrop once told me that if he were going to update the ARM, he would essentially have to rewrite it from scratch. The differences between C++ in 1986, 1990, 1993 and 1998 are much more severe than those between "old SGML" and XML. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Wed May 6 16:30:00 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:08 2004 Subject: SGML declaration for XML Message-ID: <002401bd78fb$8c90b4a0$a10b4ccb@NT.JELLIFFE.COM.AU> From: Stefan Mintert >What is the correct SGML declaration for XML? The correct one is: tags, otherwise just use for EMPTY element types. "ISO 8879:1986 (ENR)" is transitional only. Finally tags, otherwise just use for EMPTY element types. >(2): > a Name may start (NAMESTRT) with a ':' All uses of ":" are still experimental. Starting a name with ":" would presumably be done to implement some scoping. In general, XML has avoided anything which makes parsing or understanding a name depend on context--it has a terrible complicating effect. (Note however, that attribute and data values are full of context-dependent connotations.) My understanding is that the namespace proposal is currently not intending to allow ":" to prefix names, but you should ask the WG people for some confirmation: (Tim?) Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Wed May 6 16:52:47 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:08 2004 Subject: Architectural Forms, separation of formatting and loose-leaf management Message-ID: <002c01bd78fe$bd3e87a0$a10b4ccb@NT.JELLIFFE.COM.AU> Here are three random things which may be useful to consider. 1) The first is that DSSSL allows you to have external functions. So even though DSSSL itself has no way to query the pagination system, DSSSL does allow you to stick in your own queries or functions. You can do all sorts of tricks with these. I dont know to what extent JADE supports this, though. One trouble with stream-based SGML processors is that they often have an output buffer (or are in a pipe) so unless you can flush the output buffers, your SGML processor may be left stranded if it waits for some feedback from a downstream program. A DSSSL system built on top of a general purpose Scheme would be most likely to cope with feedback from layout engines. Tony Graham of the DSSSL list would be a good contact in this regard. 2) People often put pagination information in processing instructions. Or the information can be kept in an external database with, for example, HyTime locators. If you can decide in advance to only break pages on paragraph boundaries, then you can piggyback the pagination information on top of element markup. 3) If you find you have many of these concurrent structures, you may opt for "point markup", which is rather extreme, and would be an interesting challenge for some stream-based processors. In point markup, your main text is just marked up using Then you have as separate element trees for each kind of structure: these trees probably contain no character data of their own, just IDREFs to the start and end of their range. In this way you can represent concurrent, overlapping hierarchies in SGML. For example: ]> here is some data of no interest. This structure has the advantage of neatness, and provides a lot of modeling power for just one extra level of indirection. If you used HREF rather than REFID, you can use external point markup too. The effect, of course, is to have concurrently here is some data of no interest. and

here is some data of no interest.

Rick Jelliffe Author, "The XML & SGML Cookbook", out in May from Prentice Hall. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/36137d71/attachment.htm From ricko at allette.com.au Wed May 6 16:56:33 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:08 2004 Subject: SGML declaration for XML Message-ID: <003301bd78ff$64eba3c0$a10b4ccb@NT.JELLIFFE.COM.AU> From: Rick Jelliffe >My understanding is that the namespace proposal is currently not >intending to allow ":" to prefix names, but you should ask the WG people >for some confirmation: (Tim?) Oops I mean I believe they are not allowing ":" to *start* names. red:dog = OK :dog = not OK Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ray at guiworks.com Wed May 6 17:34:19 1998 From: ray at guiworks.com (Ray) Date: Mon Jun 7 17:01:08 2004 Subject: Should I develop an SMIL editor? In-Reply-To: <3.0.32.19980506101721.0181b100@interlog.com> from Rolande Kendal at "May 6, 98 10:17:27 am" Message-ID: <199805061532.JAA03075@coldsnap.guiworks.com> > > I am new to SGML/SMIL, and I find myself wondering if an SMIL editor would > be a good choice of commercial products to target. Do I need a wake-up > call? I am imagining a specialized XML editor that includes temporal > attributes and a frame by frame like content view. I think like anything, you need to define yourself a niche and attack it. How are you going to differentiate yourself? Remember, the basic functions of editing and playback are going to in products from Real Networks, Adobe, Macromedia, etc. But who knows, the big guys might trivially support it as an import/export format just to say "I support another feature" On the other hand, I don't think "just an SMIL editor" is a commercial product, because it's too simple. It has to be combined with something, like an existing audio/video authoring suite that is being converted to support the web, etc. If something is not complicated to implement, and easy to author without a tool, it is probably a good candidate for freeware. I was toying with the idea myself of implementing an SMIL player using the Java Media Framework, but I've got too many projects on my lap already. :) Now, I think a MathML editor/displayer has potential. :) For instance, a tool that could convert LaTeX to MathML would be valuable, and perhaps could export from MathML to Mathematica/Maple. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Wed May 6 18:40:22 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:09 2004 Subject: Tim Bray on TechNetcast, Transcript Released Message-ID: <01bd7926$e8915a40$8d431ecc@uspppBckman> I have just finished reading the transcript. It makes great reading, especially for those who have been following the "parent's and children" saga!! Frank Boumphrey http://www.hypermedic.com/style/index.htm -----Original Message----- From: Philippe Lourier To: xml-dev@ic.ac.uk Date: Tuesday, May 05, 1998 8:56 PM Subject: ANN: Tim Bray on TechNetcast, Transcript Released > >The transcript of Tim Bray's appearance on >DDJ TechNetcast has been posted on our >website, > >www.technetcast.com > >Thanks to all who provided comments and >suggestions > >PL >Dr. Dobb's TechNetcast > > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Wed May 6 19:38:34 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:09 2004 Subject: Parents and children (was RFD: comp.text.xml) Message-ID: >People will blame the tool (XML), but the enemy is text itself. That says it all, though I'd expand the enemies list to include many more kinds of data. During the excitement this weekend over these issues, I wrote an essay that more completely describes my take on these issues. I'm still tinkering, but the basic framework is there. You can check it out, if you like, at: http://members.aol.com/simonstl/xml/lettinggo.htm Comments and suggestions are welcome, though this list may not be the right place for that. The document has already been strengthened by a number of assaults; just try to keep them friendly. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Wed May 6 20:02:50 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:09 2004 Subject: SGML declaration for XML In-Reply-To: <199805061323.PAA15216@brown.informatik.uni-dortmund.de> (message from Stefan Mintert on Wed, 06 May 1998 15:23:23 +0200) Message-ID: <199805061801.OAA20446@ruby.ora.com> [Stefan Mintert] > which is the correct SGML declaration for XML? > > I found three: > > 1a) "ISO 8879:1986 (ENR)" > > 1b) "ISO 8879:1986 (WWW)" > > > both in http://www.w3.org/TR/NOTE-sgml-xml-971215 > Comparison of SGML and XML > > 2) "ISO 8879:1986 (WWW)" > > in http://www.sgmlsource.com/8879rev/n1955.htm > ISO 8879 TC2 XML is from the W3C. Trust the resources from there. The "SGML Declaration for XML" in the corrigendum for SGML is a sample. It is in no way authoritative with respect to XML. > As far as I know (1b) is correct, since XML 1.0 reads: > > [5] Name ::= (Letter | '_' | ':') (NameChar)* > > which means, a Name may start (NAMESTRT) with a ':' That's correct, in pre-namespace XML. The namespace specification will impose additional constraints which a namespace-aware processor would enforce. -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Wed May 6 20:13:56 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:09 2004 Subject: Parents and children (was RFD: comp.text.xml) Message-ID: <3.0.32.19980506131034.00dfbcd4@postoffice.swbell.net> A non-text attachment was scrubbed... Name: not available Type: text/enriched Size: 5284 bytes Desc: not available Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980506/e9ae16c6/attachment.bin From tbray at textuality.com Wed May 6 20:43:21 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506112553.009aaad0@pop.intergate.bc.ca> At 01:10 PM 5/6/98 -0500, W. Eliot Kimber wrote: >>>> He also makes a clear distinction between use domains that require full strength SGML (and can therefore afford to implement its use) and use domains that do not. This is a key message: XML->full SGML represents a continuum of cost/value that you can choose any point along. <<<< The only problem being that we lack the industrial experience that would tell us at what point XML runs out of steam and you need SGML extras. I must say, that is one of the #1 questions I get at customer sites, and I just don't have a good answer yet. It's not easy; just because they're totally web-oriented doesn't mean they don't need SGML, and just because they're currently using the &-connector doesn't mean they couldn't succeed with just XML. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Wed May 6 21:05:27 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506140210.00de98bc@postoffice.swbell.net> At 11:26 AM 5/6/98 -0700, Tim Bray wrote: >At 01:10 PM 5/6/98 -0500, W. Eliot Kimber wrote: >>>>> >He also makes a clear distinction between use domains that require full strength SGML (and can therefore afford to implement its use) and use domains that do not. This is a key message: >XML->full SGML represents a continuum of cost/value that you can choose any point along. ><<<< >The only problem being that we lack the industrial experience >that would tell us at what point XML runs out of steam and you >need SGML extras. I must say, that is one of the #1 questions >I get at customer sites, and I just don't have a good answer >yet. It's not easy; just because they're totally web-oriented >doesn't mean they don't need SGML, and just because they're >currently using the &-connector doesn't mean they couldn't >succeed with just XML. -Tim I say start with XML by itself and see how far you get--that's one of the benefits of XML being SGML--you can slide along the continuum at will at minimal cost. By the time you get a pilot project in place, you'll probably know. The main determiners seem to be: 1. How complex are my DTD development and maintenance requirements? XML DTDs are essentially unmaintainable because of the limitations on the use of parameter entities. This means you either use some non-DTD syntax (e.g., your favorate schema scheme) and generate XML declaration sets as needed or use normal SGML and generate XML documents as needed. The latter is easier to do quickly because there are free tools to do most or all of the work (e.g., SX, etc.). The former might be a better solution in the long term because you can do more clever things. 2. What are my authoring requirements? If you need authoring beyond text editing, your only real choices today are SGML editors with XML support (e.g., ADEPT*Editor's save as XML feature). But even here, most tools should be able to handle XML directly (possibly with a little fiddling). If you need text-based authoring that depends on markup minimization, and such use environments do exist, then you have no choice but to author in SGML and generate XML. Downstream processes (transforms, production) are largely insensitive to the use of XML or SGML because they are not dependent on syntax details but on information semantic (element types rather than omitted start tags). This analysis does not address application issues like the use of XLink, XPointers and XSL. In general, XPointers are not useful for authoring because they are too rigid and lack any form of indirection. XSL is of course not yet defined to the point where you would be safe using it in production. As rule, the XML add-on standards are designed for Web-based delivery only and do not meet the requirements of authoring and archiving. This is because authoring and archiving impose requirements that delivery either does not impose or cannot tolerate (such as indirection or dependence on arbitrary queries for indirection). Fortunately, all of the SGML add-on standards can be applied to XML documents as well as to full-SGML documents, so you can mix and match as you need to. Cheers, Eliot --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Wed May 6 21:21:28 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506121625.00915100@pop.intergate.bc.ca> At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote: >XML DTDs are essentially unmaintainable because of the limitations on the >use of parameter entities. Hmm, but those limitations really only apply to the internal subset. Am I missing something? (Mind you, there are those who say that the absence of exclusion exceptions makes 'em unmaintainable). As for the rest of Eliot's remarks, right on. In particular, the idea that you should start with XML see how far that gets you. It's risk-free. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Wed May 6 21:29:50 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net> At 12:20 PM 5/6/98 -0700, Tim Bray wrote: >At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote: >>XML DTDs are essentially unmaintainable because of the limitations on the >>use of parameter entities. > >Hmm, but those limitations really only apply to the internal subset. Am >I missing something? (Mind you, there are those who say that the absence >of exclusion exceptions makes 'em unmaintainable). The inability to use parameter entities inside of declarations is the problem--unless I've misunderstood the restriction. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at step.de Wed May 6 21:42:20 1998 From: larsga at step.de (Lars Marius Garshol) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net> Message-ID: <3550BBC5.71CF5C5C@step.de> W. Eliot Kimber wrote: > > The inability to use parameter entities inside of declarations is the > problem--unless I've misunderstood the restriction. You have. (Thankfully. Reassuring to see that you can be wrong about something. :) is entirely valid in the external subset, but not in the internal. This is the relevant part of the spec (from section 2.8): "Well-Formedness Constraint: PEs in Internal Subset In the internal DTD subset, parameter-entity references can occur only where markup declarations can occur, not within markup declarations. (This does not apply to references that occur in external parameter entities or to the external subset.)" I still haven't found a sensible way to implement this in the external subsets, though. :-( --Lars M. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Wed May 6 21:52:32 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506144723.006aa4f0@postoffice.swbell.net> At 02:26 PM 5/6/98 -0500, W. Eliot Kimber wrote: >At 12:20 PM 5/6/98 -0700, Tim Bray wrote: >>At 02:02 PM 5/6/98 -0500, W. Eliot Kimber wrote: >>>XML DTDs are essentially unmaintainable because of the limitations on the >>>use of parameter entities. >> >>Hmm, but those limitations really only apply to the internal subset. Am >>I missing something? (Mind you, there are those who say that the absence >>of exclusion exceptions makes 'em unmaintainable). > >The inability to use parameter entities inside of declarations is the >problem--unless I've misunderstood the restriction. Tim pointed out that this restriction is only in the internal subset, not the external subset. I had forgotten this detail. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Wed May 6 21:52:38 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3.0.32.19980506144910.006a00cc@postoffice.swbell.net> At 09:36 PM 5/6/98 +0200, Lars Marius Garshol wrote: >W. Eliot Kimber wrote: >> >> The inability to use parameter entities inside of declarations is the >> problem--unless I've misunderstood the restriction. > >You have. (Thankfully. Reassuring to see that you can be wrong about >something. :) I'm sure I voted for consistency, and thus against allowing them in the external subset, but such discussions are lost to my memory. I am happy to stand corrected on this issue. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at step.de Wed May 6 22:07:21 1998 From: larsga at step.de (Lars Marius Garshol) Date: Mon Jun 7 17:01:09 2004 Subject: #cdata? Message-ID: <3550C1AC.35D27450@step.de> (I post this here since it's XML-relevant and I know at least some of the XPointer/XLink WG members read this list.) I'm currently writing a toy XPointer implementation in Python (which will quite possibly become serious later) and have to say I'm impressed with the XPointer specification. Very clear descriptions and a nice syntax. However, I'm rather surprised that the 19980303 WD allows pointers to distinguish nodes of type #cdata, that is, CDATA marked sections, so that root().child(3,#cdata) points to the third CDATA section that's a direct child of the document element. The reason I'm surprised that this is in the WD is that to support this the XPointer implementation will have to ride on top of a parser that reports CDATA sections as a distinct syntactic construct, instead of classing it as text, which is the usual solution. SAX does not do this, nor does the DOM level 1 (WD 19980416) AFAICS. I also have problems seeing the utility of this feature. Can anyone comment on on whether it will be an XPointers feature in future WDs and if so why? --Lars M. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 6 22:52:14 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML Message-ID: <3550C7DD.AC2641F8@technologist.com> "The similarities between the two languages, in syntax, origin, and possible application, have led several members of the community to deny any separation between the two markup languages. From their perspective, XML is SGML - less SGML perhaps, but still SGML. Consortia, publications, newsgroups, and mailing lists currently mix discussion of XML and SGML, blurring the distinctions further." I think that the inaccuracies of English are more at fault than anything, but I don't think that the two statements above belong together. There is no separation between XML and SGML, because every application of SGML has always, and will always, live on a continuum from simplicity (now explicitly embodied in XML) and sophistication (I'm tempted to say this is embodied in HyTime, but things like RDF also show major signs of sophistication and complexity). But the fact that there is no separation does not imply that there should not be no separate publications, newsgroups and mailing lists for XML. There are magazines/newsgroups/consortia targeted towards beginning webmasters and gurus, OO dabblers and UML masters. As the easy form of SGML, XML deserves magazines, books, classes etc. I reject the parent/child analogy because it would imply, for instance, that RDF belongs in a magazine on XML and not SGML, and "an intro to architectural forms" belongs in a magazine on SGML but not XML. The reverse is true. RDF applies at least as much to SGML as it does to XML (perhaps more, because it is in the complexity class of HyTime, not HTML). Architectural forms will eventually apply to XML as much as to SGML unless someone comes up with something better. You couldn't really come up with something much simpler that would do the same job...but you could perhaps come up with something technically better. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cso at vignette.com Wed May 6 23:21:59 1998 From: cso at vignette.com (Conleth O'Connell) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML In-Reply-To: <3550BBC5.71CF5C5C@step.de> References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net> <3550BBC5.71CF5C5C@step.de> Message-ID: <199805062038.PAA15561@zeus.vignette> Lars Marius Garshol writes: > W. Eliot Kimber wrote: > > > > The inability to use parameter entities inside of declarations is the > > problem--unless I've misunderstood the restriction. > > You have. (Thankfully. Reassuring to see that you can be wrong about > something. :) > > > > is entirely valid in the external subset, but not in the internal. This > is the relevant part of the spec (from section 2.8): > > "Well-Formedness Constraint: PEs in Internal Subset > > In the internal DTD subset, parameter-entity references can > occur only where markup declarations can occur, not within > markup declarations. (This does not apply to references that > occur in external parameter entities or to the external > subset.)" > > I still haven't found a sensible way to implement this in the external > subsets, though. :-( It depends on your intent, but you may have use yet another level of indirection. Let's say I have DTD foo located at http://mycom.com/f.dtd The XML instance might look like: ]> This is valid because the internal subset entity declaration is resolved before the entity declaraition in the external subset. The area I got tripped up was in the Attribute declarations: - only one element can be declared in each ATTLIST - it is more difficult to declare and use common attributes This requires an intervening external subset to declare and use the entities. Too many years on DocBook I guess |-) Cheers, Con xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Thu May 7 00:51:59 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:09 2004 Subject: XML & SGML References: <3.0.32.19980506112553.009aaad0@pop.intergate.bc.ca> Message-ID: <3550E923.8CD@hiwaay.net> Tim Bray wrote: > The only problem being that we lack the industrial experience > that would tell us at what point XML runs out of steam and you > need SGML extras. I must say, that is one of the #1 questions > I get at customer sites, and I just don't have a good answer > yet. It's not easy; just because they're totally web-oriented > doesn't mean they don't need SGML, and just because they're > currently using the &-connector doesn't mean they couldn't > succeed with just XML. -Tim The point at which you can't tell the difference is the point of success. In practice, tools will know. Now there will be that awkward problem with reserved words and PIs. len xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Thu May 7 01:09:41 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:09 2004 Subject: Parents and children (was RFD: comp.text.xml) References: <354FBD8D.329C@hiwaay.net> <354FDDC1.A80CEBB@technologist.com> Message-ID: <3550ED44.4894@hiwaay.net> Paul Prescod wrote: > > It wasn't so long ago that we were having a similarly pessimistic > conversation about web standards (at that time, HTML and CSS) and you > admitted that despite all of the setbacks, it was a good time for > hypertext. That is more true today than ever. XML may still be due for a > backlash, but lights are going on in people's heads all over the place. As > long as the pattern is two steps forward, one step back, we are still > making progress. > Yes. We have more toys, better toys and some pretty neat content. I am looking forward to playing with SMIL a la RealMedia and VRML. So, will I use XML? You bet. After using HTML so much, I hope i can still remember how to use another DTD. ;-) Just to be schizophrenic, on the VRML list I have argued for XML and the use of RealMedia streaming technology. It isn't inconsistent. RealMedia offers us working streaming now when we need it. RA provides a broadcast format; the archival format is still wav, etc., so my lifecycle/standards needs are met and my performance/runningCode needs are met. With the content hat on, I am very pleased. Two weeks ago, the world's first virtual theatre production was broadcast. That is something I've been waiting for for a LONNNNG time. Cool. Standards are nice to have. Running code is nice to have. We just have to keep both priorities up front and keep talking openly. That way, we measure and meet all of our needs. The game is a long one. Play to that. Simon: Beers together sometime. Even if the arguments look fierce, at the end of the day, beers together. After awhile one finds it is like being in bands and that once one has done it long enough, other musicians are the best company after 2AM even if before 2AM you were cutting heads. len xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 7 01:32:46 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:10 2004 Subject: XML & SGML References: <3.0.32.19980506142627.00de9b6c@postoffice.swbell.net> <3550BBC5.71CF5C5C@step.de> <199805062038.PAA15561@zeus.vignette> Message-ID: <3550F2F9.CEDC8489@technologist.com> Conleth O'Connell wrote: > > The XML instance might look like: > > > ]> > > Note that this sort of trickery is poorly supported by SGML editors. They like to think of a DTD as a single immutable thing that spans documents (as users do). But technically, DTDs are really just properties of documents, and documents can fiddle with them as you demonstrate above. I think that I would advise against depending on this sort of fiddling until we see whether XML editors are smarter (or at least more compliant) in this regard than their old-SGML counterparts. This one feature probably makes implementation of an efficient, friendly XML editor harder than most of the features that were eliminated from SGML (omittag, shorttag and rank put together are comparatively trivial...). The other hard feature is entity references in attribute values (yipes! try to make a simple, friendly MFC dialog box that handles *that* properly). Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 7 02:23:42 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:10 2004 Subject: XML & SGML References: <3550C7DD.AC2641F8@technologist.com> Message-ID: <3550F9FA.7E5CD761@technologist.com> Paul Prescod wrote: >... > But the fact that there is no separation does not imply that there > should not be no separate publications, newsgroups and mailing lists > for XML. Egad! I meant: "Even though there need be no separation between XML and other uses of SGML (they all live on a continuous complexity continuum), there could still be separate publications, newsgroups and mailing lists for XML." Re: Consortia: I'm not sure that XML consortia make sense. XML usage will soon be so diverse that I'm not sure that various "XML" companies would have much in common to discuss. It strikes me as somewhat like an "HTTP" consortia. SGML was already showing signs of that problem despite the community's emphasis on document processing. XML will have succeeded when people choose not to get together and talk about XML, but rather about XML-based OO protocols, XML-document processing, XML-knowledge representation and so forth. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Thu May 7 04:10:45 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:10 2004 Subject: XML & SGML Message-ID: Paul Prescod wrote: >XML will have >succeeded when people choose not to get together and talk about XML, >but rather about XML-based OO protocols, XML-document processing, >XML-knowledge representation and so forth. Tim Bray compares it to ASCII. Does anyone discuss ASCII anymore? Once in a while, maybe, in the context of Unicode or other character encodings that are supplanting it. You're completely right on this one. XML should become invisible and well-understood as it becomes ubiquitous; the problems it solves should be the focus of the discussion. Len Bullard wrote: >Beers together sometime. Even if the arguments look >fierce, at the end of the day, beers together. I think we could all use more beers together. There will always be arguments, but there will always be (hopefully) beer. Someday I'll actually make it to one of these conferences, and we'll have beer. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From trafford at interlog.com Thu May 7 04:22:57 1998 From: trafford at interlog.com (Ben Trafford) Date: Mon Jun 7 17:01:10 2004 Subject: SP and Expat Message-ID: <35511CED.9A8E678@interlog.com> I've got a question: how does Expat's parsing differ from the XML parsing available in SP 1.3? --->Ben Trafford xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 7 05:14:51 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:10 2004 Subject: SP and Expat References: <35511CED.9A8E678@interlog.com> Message-ID: <355126D6.DB98086E@technologist.com> Ben Trafford wrote: > > I've got a question: how does Expat's parsing differ from the XML > parsing available in SP 1.3? Smaller. Faster. No validation. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization." - Stewart Brand, founder of the Whole Earth Catalog http://www.wired.com/news/news/culture/story/10124.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Thu May 7 10:36:22 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:10 2004 Subject: XML & SGML In-Reply-To: <3550F9FA.7E5CD761@technologist.com> References: <3550C7DD.AC2641F8@technologist.com> Message-ID: <3.0.1.16.19980507083126.08c78824@pop3.demon.co.uk> [many people] wrote: [... interesting material snipped ...] Though deliberately neutral about the ethics and validity of XMLogenesis, it's very important that the contributions of the *people* involved is highlighted and valued. I think I can take an appropriately dispassionate view of SGML, not being a practitioner. I have found the commitment of the WG members to the XML process and their technical ability to be outstanding. I have been privileged on XML-SIG to see the (often weekly) reports from the WG and the frequent list of questions that were addressed to the SIG. What may not be generally realised is that the WG *worked very hard indeed at the nitty-gritty as well as the strategy*. When the XML process ran into problems someone/subgroup on the WG would be asked to take the problem away and come back with a proposal that addressed the queries raised by the SIG. I remember several summaries that, after a really intractable discussion, were a delight to read. During the whole process there was never a hint of vested interests. Occasionally there is the sort of statement - " I don't think my colleagues would find it easy to use this", "We do this, and it works for us", etc. But not "FooCorp would see X as giving away market advantage". And, although I have no knowledge of the companies involved, I suspect their has been enlightened managerial oversight in allowing so much effort to be put into this process. I have no doubt it will amply repay this investment, since XML has to be a corporate philosophy as well as a technology. There is no doubt that we owe a great deal to the XML-WG (and related groups) and I have found the process to be wholly admirable. I am trying to champion it in my own vertical area (chemistry) as an efficient, creative, democratic and open process. It's probably impossible to combine all of these, but most current efforts in chemical informatics could learn a great deal from it. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From yosr.hmaied at inria.fr Thu May 7 16:04:16 1998 From: yosr.hmaied at inria.fr (Yosr HMAIED) Date: Mon Jun 7 17:01:10 2004 Subject: parser for xml-data? Message-ID: <3551BF43.2F3@inria.fr> Do you know parser for xml-data? -- Yosr HMAIED mailto:yosr.hmaied@inria.fr tel :01 39 63 51 71 Projet Rodin INRIA Rocquencourt xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Thu May 7 16:24:31 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:10 2004 Subject: ANY Message-ID: <199805071337.PAA27616@berlin.dvs1.tu-darmstadt.de> The definition of ANY in the spec seems to be limited to the following sentence (section 3): "The declaration matches ANY, and the types of any child elements have been declared." From "child elements," I assume the following: 1) An element of type ANY can have zero or more children. 2) An element of type ANY cannot contain PCDATA. That is, it cannot have a mixed content model. Is this correct? Thanks. -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at step.de Thu May 7 16:35:26 1998 From: larsga at step.de (Lars Marius Garshol) Date: Mon Jun 7 17:01:10 2004 Subject: ANY References: <199805071337.PAA27616@berlin.dvs1.tu-darmstadt.de> Message-ID: <3551C561.47C36B2C@step.de> Ron Bourret wrote: > > From "child elements," I assume the following: > > 1) An element of type ANY can have zero or more children. Correct. > 2) An element of type ANY cannot contain PCDATA. That is, it cannot have a > mixed content model. Wrong. lots of text and here is valid XML. --Lars M. PS: This seems to be a candidate for a FAQ, a specification clarification, as well as a good argument for comp.text.xml xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Thu May 7 16:50:50 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:10 2004 Subject: ANY Message-ID: <3.0.32.19980507074659.00b7ce50@pop.intergate.bc.ca> At 03:37 PM 5/7/98 +0200, Ron Bourret wrote: >From "child elements," I assume the following: > >1) An element of type ANY can have zero or more children. >2) An element of type ANY cannot contain PCDATA. That is, it cannot have a >mixed content model. > >Is this correct? Thanks. No - the spec is less than explicit, but since it doesn't rule out text content, it is allowed. The annotated spec at http://www.xml.com has some further elucidation on this subject. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From elm at arbortext.com Thu May 7 18:55:39 1998 From: elm at arbortext.com (Eve L. Maler) Date: Mon Jun 7 17:01:10 2004 Subject: #cdata? In-Reply-To: <98May6.160827edt.26886@thicket.arbortext.com> Message-ID: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com> At 04:01 PM 5/6/98 -0400, Lars Marius Garshol wrote: >(I post this here since it's XML-relevant and I know at least some of >the XPointer/XLink WG members read this list.) > >I'm currently writing a toy XPointer implementation in Python (which >will quite possibly become serious later) and have to say I'm impressed >with the XPointer specification. Very clear descriptions and a nice >syntax. Thank you. :-) I hope you'll share future news of your implementation as you go along. >However, I'm rather surprised that the 19980303 WD allows pointers to >distinguish nodes of type #cdata, that is, CDATA marked sections, so >that > >root().child(3,#cdata) > >points to the third CDATA section that's a direct child of the document >element. > >The reason I'm surprised that this is in the WD is that to support this >the XPointer implementation will have to ride on top of a parser that >reports CDATA sections as a distinct syntactic construct, instead of >classing it as text, which is the usual solution. SAX does not do this, >nor does the DOM level 1 (WD 19980416) AFAICS. > >I also have problems seeing the utility of this feature. > >Can anyone comment on on whether it will be an XPointers feature in >future WDs and if so why? It was decided to add this back in December. However, several WG members have since had a change of heart. So I have a feeling that it may not survive. My own belief is that having the "string" location term is sufficient, and sufficiently robust, to do any addressing you'd want to do into a CDATA section. If we do want to allow addressing into some construct that the DOM doesn't support, it's incumbent on the XLink side to request new functionality from the DOM folks. In other words, we're not artificially going to restrict ourselves to what the DOM has today. This approach was actually first suggested by Lauren Wood (the DOM chair) to ensure that the dependencies flow in the right direction. Eve xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From sudar at pspl.co.in Fri May 8 07:05:32 1998 From: sudar at pspl.co.in (Sudarshan Purohit) Date: Mon Jun 7 17:01:10 2004 Subject: parser for xml-data? References: <3551BF43.2F3@inria.fr> Message-ID: <355292B9.A2E581A1@pspl.co.in> Yosr HMAIED wrote: > Do you know parser for xml-data? I think this is a good place to talk of the project I'm doing right now. Basically, it's a translator between an XML-Data document and any ODBC interface. This involves : 1. Taking an input XML-Data file and outputting a series of SQL statements that result in the same information being stored in an RDBMS. 2. Using an ODBC Recordset interface to get an equivalent XML-Data file. ( I'm still working out this one) The first part involves parsing the XML file, so I might be able to help out somewhere, Yosr. While we're on the topic, I'd like to ask a few things. 1. XML-Data stores Data as "entities". This implies that we've got a more abstract representaion of data than what would have been possible through relational databases. For example, we've got this "ONEORMORE,ZEROORMORE" occurence attribute for child elements. Implementing this in an RDBMS would mean creating another linked table for that data item. The same goes for complex data types. Am I on the right track? 2. If we're creating XML specifically for this purpose, it'll have to be in the format .... .... ..... This restrictive structure, of course, means that you end up not using this kind of document for anything else. Am I right in thinking that this has to be done? 3. The XML-Data specification says that it's open-ended and new types of entities may be added. I'm assuming this means I'm able to add lines like or into my element delcarations, which will be ignored by other parsers than mine. Am I right ? I came rather late into the discussion, so it's possible that this stuff may have been figured out already. If so, tell me. Thanks, Sudarshan 1030hrs. -- ------------------------------------------------------ The only difference between a rut and a grave is the depth. ------------------------------------------------------ Sudarshan "Connection Machine" Purohit Persistent Systems Private Limited, Pune ------------------------------------------------------ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From maillist at chris.hubick.com Fri May 8 07:41:02 1998 From: maillist at chris.hubick.com (Chris Hubick) Date: Mon Jun 7 17:01:11 2004 Subject: Character Encoding Detection Message-ID: I am new to Character Encodings, and am trying to implement them for my XML parser. As I understand it, UCS has two flavors, UCS-2 and UCS-4, either of which can optionally have a UCS transformation applied to them. It is my understanding that you could author an XML document in either of these, without applying a transformation. The UTF-16 spec at: http://www.stonehand.com/unicode/standard/wg2n1035.html states: "In UTF-16, any UCS character from the BMP shall be represented by its UCS-2 coded representation." Now in UCS-2: '<' is 00 3C '?' is 00 3f So the start of a UCS-2 or UTF-16 encoded XML document would be 00 3C 00 3F In the section on autodetection of character encodings the XML spec states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and thus, strictly speaking, in error)" My question is, why is this an error rather than a perfectly acceptable untransformed UCS-2 document? --- Chris Hubick mailto:chris@hubick.com http://www.hubick.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at step.de Fri May 8 09:28:46 1998 From: larsga at step.de (Lars Marius Garshol) Date: Mon Jun 7 17:01:11 2004 Subject: #cdata? References: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com> Message-ID: <3552B2DD.ADB174D7@step.de> Eve L. Maler wrote: > > I hope you'll share future news of your implementation as > you go along. With pleasure. :) I'll post again with more information and feedback, when I've got enough of it covered to make a release. > It was decided to add this back in December. However, several WG members > have since had a change of heart. So I have a feeling that it may not > survive. That would make at least one XML hacker slightly happier. > My own belief is that having the "string" location term is sufficient, and > sufficiently robust, to do any addressing you'd want to do into a CDATA > section. That's a belief I share. > If we do want to allow addressing into some construct that the DOM doesn't > support, it's incumbent on the XLink side to request new functionality from > the DOM folks. Then I see something that seems highly important: a method in the Document interface for returning the element with a given ID. Something like this: Element getElementWithID(in wstring id); You've very likely already thought of and discussed this, so it would be nice to hear what is likely to happen in this area. (I could subclass the Document object and the DOM builder in the Python DOM implementation to get this information, but it seems pointless if the DOM is going to include it.) --Lars M. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Fri May 8 10:52:08 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:11 2004 Subject: [xX][mM][lL] is reserved References: <2F2DC5CE035DD1118C8E00805FFE354C0395C746@red-msg-56.dns.microsoft.com> Message-ID: <3552B03E.616451C5@jclark.com> Chris Lovett wrote: > > Spec issue: > > Section 2.3 paragraph 3: > > "A Name is a token beginning with a letter or one of a few > punctuation characters, and continuing with letters, digits, hyphens, > underscores, colons, or full stops, together known as name characters. > Names beginning with the string "xml", or any string which wouild match > (('x' | 'X')('m' | 'M') ('l' | 'L')) are reserved for standardization in > this or future versions of this specification". > > Currently MSXML disallows "xml" from being used as a "namespace", for > example: "xml:foo" since this seems to be the way the XML spec is heading > with "xml:space" and "xml:lang". So is the above paragraph just a little > too restrictive and should it include the colon ? If not then the following > very common XML example is illegal: > > > This is a test. > > > Tim Bray's web site (http://www.xml.com/axml/axml.html) says explicitly that > "xmlu" is an illegal name, but James Clark's expat seems to allow this ?? > > Can the XML spec gurus clarify this paragraph ? Thanks. I would say this is an "error" as defined in 1.2 of the spec but not a fatal error nor a violation of a validity constraint. Your processor is therefore free to ignore it or to detect and report it. I think it's probably best to report this only at user option. If tools blow up when they see such names, it's going to cause problems if future versions of XML make use of these names. For example, I don't want my XML processor to complain because I'm using an xml:stylesheet PI, although it would be nice to have a "pedantic" option to tell me that this is strictly an error (until such time as this gets licensed by the XML spec). James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 11:47:57 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:11 2004 Subject: parser for xml-data? Message-ID: <199805080934.LAA00769@berlin.dvs1.tu-darmstadt.de> Sudarshan Purohit wrote: > I think this is a good place to talk of the project I'm doing right now. > Basically, it's a translator between an XML-Data document and any > ODBC interface. This involves : > 1. Taking an input XML-Data file and outputting a series of SQL > statements that result in the same information being stored in an RDBMS. I'm working on a similar project (moving data from XML to relational databases and vice versa). Do you really mean you are moving data from an XML-Data document? XML-Data is an XML language for describing metadata, including DTDs. As such, the data in an XML-Data file is metadata, not data. This is suitable if you want to create tables in the RDBMS that match the XML-Data schema. However, an XML-Data file won't give you any data to put in those tables. By the way, there was a question the other day about how one might actually deploy XML-Data as a substitute for a DTD. Was this ever answered? > 2. Using an ODBC Recordset interface to get an equivalent XML-Data > file. ( I'm still working out this one) > The first part involves parsing the XML file, so I might be able to help out > somewhere, Yosr. > > While we're on the topic, I'd like to ask a few things. > 1. XML-Data stores Data as "entities". This implies that we've got > a more abstract representaion of data than what would have been > possible through relational databases. For example, we've got this > "ONEORMORE,ZEROORMORE" occurence attribute for child elements. > Implementing this in an RDBMS would mean creating another linked > table for that data item. The same goes for complex data types. > Am I on the right track? You are correct. XML describes an object model, so the problem is roughly the same as moving object data to and from an RDBMS -- you might want to look at some papers on the subject. There is an excellent white paper introducing the subject at: http://www.ontos.com/mapcon.htm Another good paper is: Shekar Ramanathan, Julia E. Hodges: Extraction of Object-Oriented Structures from Existing Relational Databases, SIGMOD Record, Vol. 26, Number 1, March 1997, pp. 59-64. Postscript: http://bunny.cs.uiuc.edu/sigmod/sigmod_record/9703/rama.ps The only major difference I have found so far for XML is that the elements in XML documents are ordered, while data members in OO programming languages are not. > > 2. If we're creating XML specifically for this purpose, it'll have to > be in the format > > > .... > > > .... > > > > ..... > > > > > > > This restrictive structure, of course, means that you end up > not using this kind of document for anything else. Am I right > in thinking that this has to be done? Again, this makes sense if you are storing schema information, but I'm not sure that that is what you want. The problem is one of creating a mapping from the DTD (expressed as a DTD, XML-Data, or another competing schema language) to tables/columns and then using that mapping to move data from an XML file (which uses that DTD) to the RDBMS. You can certainly do this for a large number of DTDs -- I'm not yet convinced it is possible for all. > > 3. The XML-Data specification says that it's open-ended > and new types of entities may be added. I'm assuming this means > I'm able to add lines like > > > or > into my element delcarations, which will be ignored by other > parsers than mine. Am I right ? > > I came rather late into the discussion, so it's possible that > this stuff may have been figured out already. If so, tell me. > > Thanks, > > Sudarshan 1030hrs. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From sudar at pspl.co.in Fri May 8 13:27:06 1998 From: sudar at pspl.co.in (Sudarshan Purohit) Date: Mon Jun 7 17:01:11 2004 Subject: parser for xml-data? References: <199805080934.LAA00769@berlin.dvs1.tu-darmstadt.de> Message-ID: <3552EC2D.7BDC0297@pspl.co.in> Ron Bourret wrote: > Do you really mean you are moving data from an XML-Data document? XML-Data is > an XML language for describing metadata, including DTDs. As such, the data in > an XML-Data file is metadata, not data. This is suitable if you want to create > tables in the RDBMS that match the XML-Data schema. However, an XML-Data file > won't give you any data to put in those tables. Yes, you're right. My mistake. I'll be using the schema to create the tables, then adding rows to this from other (XML) files. > By the way, there was a question the other day about how one might actually > deploy XML-Data as a substitute for a DTD. Was this ever answered? I don't think so. What I'm doing at the moment is ignoring any other schema that appears in the data file itself. So in a sense, I'm actually using the XML-Data file as a DTD. I'm also not yet clear on whether and how the data file refers to the schema. In my program the user will have to specify them both. > Again, this makes sense if you are storing schema information, but I'm not sure > that that is what you want. The problem is one of creating a mapping from the > DTD (expressed as a DTD, XML-Data, or another competing schema language) to > tables/columns and then using that mapping to move data from an XML file (which > uses that DTD) to the RDBMS. You can certainly do this for a large number of > DTDs -- I'm not yet convinced it is possible for all. Um, yes, actually the end result of my project should be an XML web page that is updated whenever a change is made in a company's database. I think I'll be storing schema info at the same location so that the data is more generally accessible. Thanks , Sudarshan 1700hrs. PSPL,Pune 8th may 98 ------------------------------------------------------ The only difference between a rut and a grave is the depth. ------------------------------------------------------ Sudarshan "Connection Machine" Purohit Persistent Systems Private Limited, Pune ------------------------------------------------------ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 14:27:54 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:11 2004 Subject: Attribute type NMTOKEN Message-ID: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de> Why does the attribute type NMTOKEN exist and how is it intended to be used? My best guess is that it is the same as an enumerated type, except that the enumeration is not actually specified in the DTD, perhaps because it is meant to be open-ended or is too big to list, such as all words in the English language. Is this true? -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 14:50:23 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:11 2004 Subject: XML & SGML References: <3.0.32.19980506121625.00915100@pop.intergate.bc.ca> Message-ID: <3552FF7D.33CA5886@technologist.com> Tim Bray wrote: > > As for the rest of Eliot's remarks, right on. In particular, the idea that > you should start with XML see how far that gets you. It's risk-free. I think that this was always prudent. You, too, Tim. IIRC, the OED didn't have a DTD, for example. It was essentially XML. Varying levels of tool support for different features created a defacto simple SGML/full SGML split, and it was very expensive to start walking up the complexity ladder because tool support started to drop away. "Sorry, I've got this really neat DTD for you, but you'll have to buy all new software...and it isn't written yet." Some features in XML are already a little ways up that ladder. I think that Eliot, on the other hand, felt it was his personal duty to use every SGML feature possible so that he would have an excuse to tell vendors that their products were broken. :) That is good for the rest of us, because he kept raising the bar, but it must have been very painful out there on the bleeding edge. I think I'll continue to duck in behind him. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 8 14:51:52 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:11 2004 Subject: parser for xml-data? Message-ID: <199805081244.OAA01325@berlin.dvs1.tu-darmstadt.de> Sudarshan Purohit wrote: > > By the way, there was a question the other day about how one might actually > > deploy XML-Data as a substitute for a DTD. Was this ever answered? > > I don't think so. What I'm doing at the moment is ignoring any other schema > that appears in the data file itself. So in a sense, I'm actually using the > XML-Data file as a DTD. I'm also not yet clear on whether and how the data file > refers to the schema. In my program the user will have to specify them both. One possibility is that something in your XML document, such as an attribute at the root, would refer to the XML document containing the XML-Data definition of your grammar: ... Another (uglier) possibility is that you use namespaces: the XML-Data namespace and the namespace your XML-Data data defines. I haven't looked enough at either the namespaces or XML-Data specs to be sure how this would work, but it seems the object structure might be something like: Root object XML-Data root XML-data data... Your data root Your data... In either case, a validating parser would need to know that it used the XML-Data schema to validate the XML-Data object and the schema defined by the XML-Data data to validate your object, which is essentially what you're doing. Any hints from the XML-Data folks? -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 15:17:43 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:11 2004 Subject: SDD bogus Message-ID: <3553061D.BF3A3461@technologist.com> Is the standalone document declaration bogus and perhaps dangerous? The whole feature strikes me as over-complicated and over-specific for a language like XML, but I'm aware of the historical processes that gave rise to it. My understanding of a typical usage scenario goes like this: a sender creates a document. It creates it specifically so that it will be standalone. It validates that this is the case (while it validates everything else) and then it sends it to the receiver who hopes to consume it without validating it. Things already strike me as a little bizarre, because if your protocol is designed such that the consumer trusts the receiver, then couldn't the SDD be implied in your out-of-band agreement? Further, what do you do if the SDD is other than you expect? Halt the parse and start again with a validating processor? But that's not what I'm concerned about. I'm concerned because I believe this to be a valid XML document: ]> In my opinion, section 5.1 will require the non-validating parser to skip the attribute list declaration, even if memo.dtd is an empty file. The receiver has no way of knowing that this case has occured if it uses a "standard parser" (since XML's semantics are, for the moment at least, imprecisely specified, I only know what that means intuitively ... SAX, Lark, Expat, etc. would not give you enough information to detect this case). This to me suggest that applications cannot trust the SDD and it must therefore be presumed to be meaningless. But I'm glad to be proven wrong. Despite its reputation to the contrary, XML is intricate and deep and I may have missed something important. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 15:27:42 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:11 2004 Subject: SDD again Message-ID: <3553086A.3494D7F9@technologist.com> Let me risk another step into the language courtroom. Validating parsers must always read the whole DTD. So the SDD is only for non-validating parsers. Non-validating parsers do not read element type declarations. So what is the point of this line: "The standalone document declaration must have the value "no" if any external markup declarations contain declarations of:" ... "element types with element content, if white space occurs directly within any instance of those types." First, why does a non-validating parser care about element/mixed content? It has no responsibility to do any marking of insignificant whitespace anyhow. Second, if there is no class of processor that can reliably reproduce the intended parse tree without reading the whole DTD, then doesn't that significantly weaken the utility (okay, "purity") of the SDD? Even if I am wrong on the last point, it seems that it does not do what it is supposed to do properly. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From simpson at polaris.net Fri May 8 15:50:42 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:11 2004 Subject: Attribute type NMTOKEN In-Reply-To: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de> Message-ID: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> At 02:13 PM 5/8/98 +0200, Ron Bourret wrote: >Why does the attribute type NMTOKEN exist and how is it intended to be used? My >best guess is that it is the same as an enumerated type, except that the >enumeration is not actually specified in the DTD, perhaps because it is meant to >be open-ended or is too big to list, such as all words in the English language. >Is this true? As I understand it (would appreciate being corrected if wrong), NMTOKEN essentially applies a constraint on the form of the attribute's value (i.e. value must be a valid name token -- start with either a letter or an underscore) without constraining the *specific* value. So yes, your "open-ended or... too big to list" assumption is correct, with the qualification that a type of NMTOKEN would not allow a value such as "1ABC," "//ENGLISH," or " WHITESPACE." xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Fri May 8 15:53:14 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:11 2004 Subject: SDD bogus Message-ID: <3.0.32.19980508065011.00b3b520@pop.intergate.bc.ca> At 09:18 AM 5/8/98 -0400, Paul Prescod wrote: >But that's not what I'm concerned about. I'm concerned because I believe >this to be a valid XML document: > > > > >]> > > >In my opinion, section 5.1 will require the non-validating parser to skip >the attribute list declaration, even if memo.dtd is an empty file. Welll, it can't be valid if memo.dtd is an empty file, because you don't have anywhere. But yes, 5.1 suggests the attribute default shouldn't be used. >The receiver has no way of knowing that this case has occured if it uses a >"standard parser" If the sender is stupid enough to send something like this to a non-validating parser, he gets what he deserves. If it's a validating parser, then of course the emptiness of memo.dtd will be detected. > (since XML's semantics are, for the moment at least, >imprecisely specified, I only know what that means intuitively ... SAX, >Lark, Expat, etc. would not give you enough information to detect this >case). Huh? >This to me suggest that applications cannot trust the SDD and it must >therefore be presumed to be meaningless. You do raise a good question; it would seem that standalone='true' *ought* to mean that the rule of 5.1 about the effect of external PE refs could be ignored. Hmmmm -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 8 15:57:54 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:11 2004 Subject: SDD bogus In-Reply-To: <3553061D.BF3A3461@technologist.com> References: <3553061D.BF3A3461@technologist.com> Message-ID: <199805081351.JAA00618@unready.microstar.com> Paul Prescod writes: > Is the standalone document declaration bogus and perhaps dangerous? Yes and yes. The problem, I think, came from the mistaken idea that people (i.e. desperate Perl hackers) would write custom parsers for each XML application (like RDF), and that these people would not want to deal with seemingly difficult problems like external entity resolution. In the end, as one might have predicted, there is an impressive range of free XML processors available in several different programming languages: someone writing an RDF tool does not need to worry about the character and entity level of XML at all, and can work with XML easily through a more abstract interface such as the DOM or SAX. So, we should let the authors decide -- if an author creates a document referencing external entities (including an external DTD subset), then the XML parser should handle them; if the author does not want to use external entities, then she can simply avoid referencing any. As many XML parser writers have shown, resolving external entities is one of the easiest parts of XML (especially in higher-level languages like Java or Perl, and, I presume, Python). Allowing parsers to skip external entities -- rather than simplifying XML -- ended up making it much more complicated, and as you point it, the standalone declaration really doesn't help things. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Fri May 8 16:30:49 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:11 2004 Subject: XML & SGML Message-ID: <3.0.32.19980508092316.007091a4@postoffice.swbell.net> At 08:50 AM 5/8/98 -0400, Paul Prescod wrote: >I think that Eliot, on the other hand, felt it was his personal duty to >use every SGML feature possible so that he would have an excuse to tell >vendors that their products were broken. :) That is good for the rest of >us, because he kept raising the bar, but it must have been very painful >out there on the bleeding edge. I think I'll continue to duck in behind >him. I'm still trying to staunch the bleeding. But seriously, I use those features of SGML that offer value to my and my clients' applications. Most of the features of SGML were invented for a good reason to solve real problems that people have. And, like any sufficiently general system, facilities designed for one purpose turn out to be useful for other, unanticipated purposes. For example, the simple LINK feature, which is really hopeless at its intended task, is ideal for adding architectural mappings to documents without disturbing the original DTD declarations. I have never asked for facilities that aren't useful. For example, I have never berated a tool supplier for not supporting CONCUR, RANK, DATATAG, or explicit LINK. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 8 17:07:54 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:12 2004 Subject: SDD again In-Reply-To: <3553086A.3494D7F9@technologist.com> References: <3553086A.3494D7F9@technologist.com> Message-ID: <199805081500.LAA00857@unready.microstar.com> Paul Prescod writes: > Let me risk another step into the language courtroom. Validating > parsers must always read the whole DTD. So the SDD is only for > non-validating parsers. Non-validating parsers do not read element > type declarations. So what is the point of this line: Your first premise is correct, but your second one is not. The spec states that a validating parser must use the whole DTD; it does not state that a non-validating parser may not use the DTD. AElfred, for example, reads the DTD well enough that it can even flag ignorable whitespace base on an element type's content model, but it is non-validating. That said, I still agree that the standalone declaration is wrong. Perhaps some day, if there's an XML 1.1, we can think about fixing it. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 8 17:11:17 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:12 2004 Subject: SAX: Final Requests? Message-ID: <199805081508.LAA01003@unready.microstar.com> Could anyone else who has experimented with implementing SAX 1.0gamma (either in a parser or in an application) please let me know about any bugs or other serious problems in the interfaces, implementation, or documentation? Thank you again to all of you who have already sent in comments, suggestions, and bug reports. All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Fri May 8 17:19:08 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:12 2004 Subject: SDD bogus Message-ID: <3.0.32.19980508081709.00b57c60@pop.intergate.bc.ca> At 09:51 AM 5/8/98 -0400, David Megginson wrote: >As many XML parser writers have shown, resolving external entities is >one of the easiest parts of XML Yes, but as is well-documented, difficulty is *not* the reason we made their processing optional by non-validating processors. The prime mover behind this decision was a passionate presentation from Jean Paoli explaining that the auto-include semantic of parsed entities is just *wrong* for web browsers. I've attached an explanation of why at the end of this message, but if you want to see it context, go to section 4.4.3 of the annotated spec and click on the "H". Having said that, Paul did raise a valid concern about the SDD (too bad this issue wasn't pointed out before the spec was frozen). Having said *that*, I think, for reasons that are on the record in the same place, that the problem the SDD exists to solve will essentially never arise in real operational scenarios anyhow. -Tim ================= >From the annotated spec at http://xml.com/axml/axml.html Why Are External Entities Included Optionally? In discussion of external entities, we realized that the semantics of external text entities (compulsory inclusion at the point where they are encountered) are deeply incompatible with the desired behavior of Web browsers. Consider the following example of the beginning of an XML document: ]> Netscape today announced that &NSA;. In response, Microsoft issued the following statement: &MSA;. ... A Web browser is typically making an aggressive effort to display text to the user as soon as possible, in parallel with fetching it from the network. In the example above, if a browser were required to fetch and process all external entities, it could only display the first four words before starting another network fetch operation. To make things worse, bear in mind that the replacement text for the entity NSA could well include other external entities which in turn would need to be fetched. This type of situation is unacceptable. Hence the rule that non-validating parsers need not fetch external entities if they don't want to. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From lauren at sqwest.bc.ca Fri May 8 18:43:12 1998 From: lauren at sqwest.bc.ca (Lauren Wood) Date: Mon Jun 7 17:01:12 2004 Subject: #cdata? In-Reply-To: <3552B2DD.ADB174D7@step.de> References: <3.0.5.32.19980507124933.00a6f100@village.promanage-inc.com> Message-ID: <199805081642.JAA28647@sqwest.bc.ca> At 08/05/98 12:23 AM , Lars Marius Garshol wrote: >Then I see something that seems highly important: a method in the >Document >interface for returning the element with a given ID. Something like >this: > > Element getElementWithID(in wstring id); > >You've very likely already thought of and discussed this, so it would be >nice to hear what is likely to happen in this area. (I could subclass >the Document object and the DOM builder in the Python DOM implementation to >get this information, but it seems pointless if the DOM is going to include >it.) The DOM will include something like this (it's even listed expressly in our requirements!) but it won't be in Level 1, since we're concentrating on really basic functionality in Level 1. Lauren xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Fri May 8 20:11:47 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:12 2004 Subject: Character Encoding Detection In-Reply-To: (message from Chris Hubick on Thu, 7 May 1998 22:38:08 +0000 (GMT)) Message-ID: <199805081810.OAA18622@ruby.ora.com> [Chris Hubick] > In the section on autodetection of character encodings the XML spec > states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and > thus, strictly speaking, in error)" > > My question is, why is this an error rather than a perfectly > acceptable untransformed UCS-2 document? The XML spec states, by fiat, in 4.3.3, that "Entities encoded in UTF-16 must begin with the Byte Order Mark". So the reason the example is an error is because the spec says so. UCS-2 is identical to UTF-16, and so it is subject (presumably) to the same rule. As a side note, I was unsure until just now whether they were equivalent, but I finally found ISO 10646-1 clause 8: Plane 00 of Group 00 shall be the Basic Multilingual Plane (BMP). The BMP can be used as a two-octet coded character set in which case it shall be called UCS-2. From: Linkname: ISO/IEC 10646-1 including AMD 1 thru 4 URL: http://wwwold.dkuug.dk/JTC1/SC2/WG2/docs/N1396.doc -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cmatei at agora.ro Fri May 8 20:27:57 1998 From: cmatei at agora.ro (Crystian) Date: Mon Jun 7 17:01:12 2004 Subject: Comment and PI in a Mixed or children elements... Message-ID: <01BD7AC8.56166CC0@jackson.agora.ro> Hi, Can be comments or Processing Instructions in a element that his declaration matches Mixed or children? For example ] > is valid? Thank you in advance! Crystian xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 21:19:18 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:12 2004 Subject: SDD bogus References: <3.0.32.19980508081709.00b57c60@pop.intergate.bc.ca> Message-ID: <35535AE5.FFBAE939@technologist.com> Tim Bray wrote: > > Having said that, Paul did raise a valid concern about the SDD (too bad > this issue wasn't pointed out before the spec was frozen). Yes, I want to point out to those who do not know the dynamics here that I use the word "bogus" because I was in the SIG and it is as much my fault as anyone's that it got through. Were I talking about someone else's spec. I would be more tactful. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 21:34:59 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:12 2004 Subject: SDD bogus References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com> Message-ID: <35535E95.3B3F5B23@technologist.com> David Megginson wrote: > > In the end, as one might have predicted, there is an impressive range > of free XML processors available in several different programming > languages: someone writing an RDF tool does not need to worry about > the character and entity level of XML at all, and can work with XML > easily through a more abstract interface such as the DOM or SAX. That's a dangerous heresy, though one that I've promoted myself. If we presume that programmers are going to work through parsers, then why couldn't we leave GI's out of end tags and make XML substantially less verbose (qualitatively at least)? Anyhow, many people argue with some justification that regexp-based processing of the source files will still be very important and popular. I'm not convinced that the cost/benefit ratio is right, if we win over the awk hackers and annoy the document authors, but we will see. > So, we should let the authors decide -- if an author creates a > document referencing external entities (including an external DTD > subset), then the XML parser should handle them; if the author does > not want to use external entities, then she can simply avoid > referencing any. Although I agree with Tim that this is a separate issue, I agree with you that external entities should always be processed. It seems strange to me to put responsibility for managing performance in the hands of the parser. The parser writer has no information about the performance characteristics of the entity vs. the importance of the data. Ignoring content should not be the parser's perogative. Consider Tim's example: > Netscape today > announced that &NSA;. In > response, Microsoft > issued the following > statement: &MSA;. As an author, I would be pretty damn pissed off if a browser presented the document that way, even if it uses nice icons for the entities. What does the document mean with half of its text missing? If I've used external entities in that way, then I have probably decided to do it for a good reason. If I wanted a text inclusion that could be downloaded after a mouseclick, that sounds like a special behaviour that could be accomplished through XLink. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 8 21:43:53 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:12 2004 Subject: SDD again References: <3553086A.3494D7F9@technologist.com> <199805081500.LAA00857@unready.microstar.com> Message-ID: <355360A0.1C6900A6@technologist.com> David Megginson wrote: > > Your first premise is correct, but your second one is not. The spec > states that a validating parser must use the whole DTD; it does not > state that a non-validating parser may not use the DTD. AElfred, for > example, reads the DTD well enough that it can even flag ignorable > whitespace base on an element type's content model, but it is > non-validating. You are right. I guess when I think of parsers, I prefer to think of them as interchangable within a class, so I would be leery about trusting features that only overachievers like you implement. :) > That said, I still agree that the standalone declaration is wrong. > Perhaps some day, if there's an XML 1.1, we can think about fixing it. Probably better to deprecate it. And as soon as possible! Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From maillist at chris.hubick.com Fri May 8 22:06:31 1998 From: maillist at chris.hubick.com (Chris Hubick) Date: Mon Jun 7 17:01:12 2004 Subject: Character Encoding Detection In-Reply-To: <199805081810.OAA18622@ruby.ora.com> Message-ID: Thanks Chris! That led me down the path of enlightenment. On Fri, 8 May 1998, Chris Maden wrote: > UCS-2 is identical to UTF-16, and so it is subject (presumably) to the > same rule. Ahh, I hadn't read the details of UTF-16 encoding. So, as I understand it now, UTF-16 is only applicable to UCS-4 (not UCS-2 ??) documents, because it transforms a UCS-4 document into a UCS-2 document that uses a reserved code range (rows D8-DB and DC-DF of the BMP) to represent the extra characters (where only UCS-4 chars < 0011 0000 can be mapped). A normal UCS-2 document would not use these reserved codes (??), so in that respect a UTF-16 encoded UCS-4 document is not identical to a "native" UCS-2 document. What this means is that you can only tell a UTF-16 document from a UCS-2 document by the encoding declaration, or by encountering a character in the reserved UCS-2 range for UTF-16 encoding. And both UCS-2 and UTF-16 docs must start with a byte order mark. Is this byte order mark "#xFEFF" specific to XML documents, or is it part of UCS? > As a side note, I was unsure until just now whether they were > equivalent, but I finally found ISO 10646-1 clause 8: > > Plane 00 of Group 00 shall be the Basic Multilingual Plane (BMP). > The BMP can be used as a two-octet coded character set in which > case it shall be called UCS-2. Without reading the whole document (uhg, I see it coming) I didn't think this statement had anything to do with UTF, I thought it was just explaining how UCS-2 is a subset of UCS-4. Though I think the following does state what you say: "UCS Transformation Format 16 (UTF-16)" at http://www.stonehand.com/unicode/standard/wg2n1035.html > The following method transforms the coded representation of over a > million graphic characters of UCS-4 into a form that is compatible with > the two-octet BMP form of UCS-2 (section 14.1). This permits the > coexistence of those UCS-4 characters within coded character data that > is in accordance with UCS-2. > > In UTF-16 each graphic character from the UCS-2 repertoire retains its > UCS-2 coded representation. In addition, the coded representation of any > character from a single contiguous block of 16 Planes in Group 00 > (1,048,576 code positions) is transformed to pairs of two-octet > sequences, where each sequence corresponds to a cell in a single > contiguous block of 8 Rows in the BMP (2,048 code positions). These > codes are reserved for the use of this transformation method, and shall > not be allocated for any other purpose. As for the goal of it being relatively easy for the desperate Perl hacker, or myself, the desperate Java hacker, to code an XML parser...it's not that bad. It helps if you come into it with prior knowledge of stuff like character encodings. I think all I would have changed in the spec is to not allow PE's _within_ markup declarations in the external DTD, I still have to sort that out if I ever want to make my parser validating. I might also have been tempted to not allow recursive PE's. They could have left it out for 1.0 and added it for 1.1, but now you have to be backward compatible. I think if you find yourself needing that kind of thing, SGML might be the language for you. --- Chris Hubick mailto:chris@hubick.com http://www.hubick.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jarle.stabell at dokpro.uio.no Fri May 8 22:25:25 1998 From: jarle.stabell at dokpro.uio.no (Jarle Stabell) Date: Mon Jun 7 17:01:12 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no> Paul Prescod wrote: > If we > presume that programmers are going to work through parsers, then why > couldn't we leave GI's out of end tags and make XML substantially less > verbose (qualitatively at least)? Anyhow, many people argue with some > justification that regexp-based processing of the source files will still > be very important and popular. I'm not convinced that the cost/benefit > ratio is right, if we win over the awk hackers and annoy the document > authors, but we will see. (those who have found the discussion about short end tags tiresome years/months ago, please forgive me and don't read any further.) I would love to see empty end tags making it into the standard in the future. In many cases, one only marks up single words, and then empty end tags would justify having longer and more descriptive GI's, when forced to write both start and end tags fully, one may be too tempted to use "cryptic"/abbreviated GI's. Example: Let's say your're making a markup language for documentating source code, which would be embedded inside comments in the source (like JavaDoc). Then you would prefer the tags to be as "silent" as possible during development/maintaince of the source code itself. Compare: 1. The method M1 of the fantastic Class1 can be used in situation X. to the "thinner": 2. The method M1 of the fantastic Class1 can be used in situation X. I think variant 2 is faster to read than variant 1, and you don't have to check the end-tags for misspellings. The argument that compressing reduces/eliminates the size advantage of documents with empty end tags often doesn't apply, the document will often be stored uncompressed on users hard-disks, in databases and in memory. Cheers, Jarle Stabell xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Fri May 8 22:27:46 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:13 2004 Subject: Attribute type NMTOKEN In-Reply-To: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> (simpson@polaris.net) Message-ID: <199805082025.QAA23192@ruby.ora.com> [John E. Simpson] > As I understand it (would appreciate being corrected if wrong), > NMTOKEN essentially applies a constraint on the form of the > attribute's value (i.e. value must be a valid name token -- start > with either a letter or an underscore) without constraining the > *specific* value. So yes, your "open-ended or... too big to list" > assumption is correct, with the qualification that a type of NMTOKEN > would not allow a value such as "1ABC," "//ENGLISH," or > " WHITESPACE." A possible misunderstanding here: a NMTOKEN itself can't contain whitespace, but the attribute value specification, the quoted thing after the equals sign, can: so *is* a valid NMTOKEN attribute specification. The NMTOKEN in this example is only "WHITESPACE", though; the leading (and trailing) whitespace is trimmed. -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Fri May 8 22:44:53 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: <3.0.32.19980508154119.0077c1e8@postoffice.swbell.net> At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote: >I would love to see empty end tags making it into the standard in the >future. In many cases, one only marks up single words, and then empty end >tags would justify having longer and more descriptive GI's, when forced to >write both start and end tags fully, one may be too tempted to use >"cryptic"/abbreviated GI's. If you're going to ask for empty end tags for this, then why not go all the way and reinstate NET-enabled start tags? asd asdfasd Markup minimization is a slippery slope--in for a penny, in for a pound. Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 8 22:46:58 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:13 2004 Subject: SDD bogus In-Reply-To: <35535E95.3B3F5B23@technologist.com> References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com> <35535E95.3B3F5B23@technologist.com> Message-ID: <199805082043.QAA02512@unready.microstar.com> Paul Prescod writes: > David Megginson wrote: > > > > In the end, as one might have predicted, there is an impressive range > > of free XML processors available in several different programming > > languages: someone writing an RDF tool does not need to worry about > > the character and entity level of XML at all, and can work with XML > > easily through a more abstract interface such as the DOM or SAX. > > That's a dangerous heresy, though one that I've promoted myself. If we > presume that programmers are going to work through parsers, then why > couldn't we leave GI's out of end tags and make XML substantially less > verbose (qualitatively at least)? It's a matter of finding the right balance: XML must be simple enough to parse that there is a healthy competition among parser writers (and thus, a better quality and larger choice of tools for application writers), but simple enough to write that authors (including developers of tools that generate XML) are willing to learn and use it. SGML, with its bizarre tag-omission rules, delimiter-in-context recognition, shortrefs, etc., gave too much away to the authors, and as a result, there were very few good SGML parsers implemented (and never a one in Java, although many of us tried). > > So, we should let the authors decide -- if an author creates a > > document referencing external entities (including an external DTD > > subset), then the XML parser should handle them; if the author does > > not want to use external entities, then she can simply avoid > > referencing any. > > Although I agree with Tim that this is a separate issue, I agree with you > that external entities should always be processed. It seems strange to me > to put responsibility for managing performance in the hands of the parser. > The parser writer has no information about the performance characteristics > of the entity vs. the importance of the data. Ignoring content should not > be the parser's perogative. I'd take one step towards Tim's side, and allow the _application_ to specify whether (and which) external entities may be included, since they may have special knowledge that the parser lacks; you are quite right, though, that it should not be up to the parser, and that all parsers should be able to include external entities. In XML 1.1, I'd like to see one of those "at user discretion" clauses here. > Consider Tim's example: > > > Netscape today > > announced that &NSA;. In > > response, Microsoft > > issued the following > > statement: &MSA;. > > As an author, I would be pretty damn pissed off if a browser > presented the document that way, even if it uses nice icons for the > entities. What does the document mean with half of its text > missing? If I've used external entities in that way, then I have > probably decided to do it for a good reason. If I wanted a text > inclusion that could be downloaded after a mouseclick, that sounds > like a special behaviour that could be accomplished through XLink. Absolutely correct: an entity is a fundamental chunk of a document, not a link (web heads can think of it somelink sort-of like a client-side include). I'm sorry to see that this kind of misunderstanding arose as recently as last fall, especially since work was already well underway on XLink (aka XLL aka XML-Link), so (as you mention) people could see what kind of mechanisms would be available for linking to embedded information. In any case, couldn't a browser render around the entity references, then adjust the rendition as the entities were resolved, as HTML browsers generally do with images? All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Fri May 8 22:52:05 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: <3.0.32.19980508134903.00b49e30@pop.intergate.bc.ca> At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote: >I would love to see empty end tags making it into the standard in the >future. Speaking as one of those who opposed this, now I agree with you. Among other things, perl is going to have a built-in parser so no matter how D the PH is, this problem is just invisible. But it was a good decision at the time. -T. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Bryan_Gilbert at pml.com Fri May 8 22:58:54 1998 From: Bryan_Gilbert at pml.com (Bryan Gilbert) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: > -----Original Message----- > From: W. Eliot Kimber [SMTP:eliot@isogen.com] > Sent: Friday, May 08, 1998 1:41 PM > To: xml-dev@ic.ac.uk > Subject: Re: A little wish for short end tags (Was: RE: SDD > bogus) > > At 10:38 PM 5/8/98 +0200, Jarle Stabell wrote: > >I would love to see empty end tags making it into the standard in the > > >future. In many cases, one only marks up single words, and then empty > end > >tags would justify having longer and more descriptive GI's, when > forced to > >write both start and end tags fully, one may be too tempted to use > >"cryptic"/abbreviated GI's. > > If you're going to ask for empty end tags for this, then why not go > all the > way and reinstate NET-enabled start tags? > > asd asdfasd > marked > > Markup minimization is a slippery slope--in for a penny, in for a > pound. > > Cheers, > > E. > -- [BG] Why is it a slippery slope? Why go all the way and do these NET-enabled things. (They look gross.) My interest is supporting XML on embedded platforms where ROM, RAM, CPU, are all very limited. XML combined with XSL will be much better than HTML documents for this application but lecc verbosity would be good too. For those who need cryptic tags see the example DTDs that John Tigue (DataChannel) developed for the WebBRoker project. He presents a verbose DTD and then a terse version. Both use preliminary XML namespaces to scope the elements. The namespace helps document the terse version as does the comments in the DTD. http://www.datachannel.com/xml/dtd/index.html Bryan > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From simpson at polaris.net Fri May 8 23:03:05 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:13 2004 Subject: Attribute type NMTOKEN In-Reply-To: <199805082025.QAA23192@ruby.ora.com> References: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> Message-ID: <3.0.3.32.19980508170242.008c5870@nexus.polaris.net> At 04:25 PM 5/8/98 -0400, Chris Maden wrote: >[John E. Simpson] >> As I understand it (would appreciate being corrected if wrong), >> NMTOKEN essentially applies a constraint on the form of the >> attribute's value (i.e. value must be a valid name token -- start >> with either a letter or an underscore) without constraining the >> *specific* value. So yes, your "open-ended or... too big to list" >> assumption is correct, with the qualification that a type of NMTOKEN >> would not allow a value such as "1ABC," "//ENGLISH," or >> " WHITESPACE." > >A possible misunderstanding here: a NMTOKEN itself can't contain >whitespace, but the attribute value specification, the quoted thing >after the equals sign, can: so > >*is* a valid NMTOKEN attribute specification. The NMTOKEN in this >example is only "WHITESPACE", though; the leading (and trailing) >whitespace is trimmed. Thanks for that clarification, Chris! Best regards, | It's no disgrace t'be poor, John E. Simpson | but it might as well be. simpson@polaris.net | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From russ at thetrip.com Fri May 8 23:33:34 1998 From: russ at thetrip.com (Russ Heithoff) Date: Mon Jun 7 17:01:13 2004 Subject: Attribute type NMTOKEN Message-ID: <01BD7A96.6843D180.russ@thetrip.com> Also, if I remember right, a NMTOKEN can start with Name characters. This is in contrast to a Name which must start with Name Start characters followed by characters. In otherwords 1ABC is a legal NMTOKEN but not a legal NAME. Russ -----Original Message----- From: John E. Simpson [SMTP:simpson@polaris.net] Sent: Friday, May 08, 1998 3:25 PM To: xml-dev@ic.ac.uk Subject: Re: Attribute type NMTOKEN At 04:25 PM 5/8/98 -0400, Chris Maden wrote: >[John E. Simpson] >> As I understand it (would appreciate being corrected if wrong), >> NMTOKEN essentially applies a constraint on the form of the >> attribute's value (i.e. value must be a valid name token -- start >> with either a letter or an underscore) without constraining the >> *specific* value. So yes, your "open-ended or... too big to list" >> assumption is correct, with the qualification that a type of NMTOKEN >> would not allow a value such as "1ABC," "//ENGLISH," or >> " WHITESPACE." > >A possible misunderstanding here: a NMTOKEN itself can't contain >whitespace, but the attribute value specification, the quoted thing >after the equals sign, can: so > >*is* a valid NMTOKEN attribute specification. The NMTOKEN in this >example is only "WHITESPACE", though; the leading (and trailing) >whitespace is trimmed. Thanks for that clarification, Chris! Best regards, | It's no disgrace t'be poor, John E. Simpson | but it might as well be. simpson@polaris.net | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Fri May 8 23:45:01 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:13 2004 Subject: Comment and PI in a Mixed or children elements... In-Reply-To: <01BD7AC8.56166CC0@jackson.agora.ro> Message-ID: <3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk> Hi Crystian, At 21:29 08/05/98 +-300, Crystian wrote: >Hi, > >Can be comments or Processing Instructions in a element that his declaration matches Mixed or children? [... question snipped ...] When I have questions like this I generally tend to see what happens with a parser. The parser writers have spent a *lot* of time over the spec and the current generation of parsers are very high quality. It's worth installing 2-3 and seeing if they all agree. There are a (very) few places where some parsers don't match the current spec (especially those which are pre-XML1.0). But they (the parsers and their authors) are much smarter than me. So I often use a computer to 'do my thinking for me' P. When SAX1.0 is released JUMBO2.0 will be able to support a number of SAX-compliant parsers under it. It depends on the timescale that the authors are able to manage. You can then compare how each parser behaves. The areas of concern that use of multiple parsers will highlight are those whose semantics are fuzzy. PaulP highlighted one - the question of what a parser should do with SDD+internalSubset+externalSubset. This is an area where it is likely that different parser writers might make different implementations.. BTW I didn't answer the question, did I? :-) I think you will find it is yes, but trust a parser and not me. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Fri May 8 23:55:16 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:13 2004 Subject: Comment and PI in a Mixed or children elements... In-Reply-To: <3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk> References: <01BD7AC8.56166CC0@jackson.agora.ro> <3.0.1.16.19980508210135.753f3410@pop3.demon.co.uk> Message-ID: <199805082153.RAA02861@unready.microstar.com> Peter Murray-Rust writes: > BTW I didn't answer the question, did I? :-) I think you will find it is > yes, but trust a parser and not me. By all means, run problems like this through a good parser (don't use AElfred for this purpose, since it is too tolerant of errors). In the end, however, the only trustworthy source is the grammatical productions in the spec (together with constraints and other normative text): [39] element ::= EmptyElemTag | STag content ETag [43] content ::= (element | CharData | Reference | CDSect | PI | Comment)* All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From eliot at isogen.com Fri May 8 23:58:45 1998 From: eliot at isogen.com (W. Eliot Kimber) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> At 01:53 PM 5/8/98 -0700, Bryan Gilbert wrote: > [BG] Why is it [minimization] a slippery slope? Because if you can make an argument for empty end tags, you can probably make an argument for NET-enabled end tags, and minimized attribute specifications, and omitted tags, and so on and so on. Just because *your* requirements make it obvious that empty end tags are enough, that doesn't mean that everybody's do. Any decision short of no minimization or full minimization will be an arbitrary one that will cause those whose requirements are not met to wonder why. There is always full SGML, after all. The only argument I could see for empty tags involves the fact that you don't need explicit declarations in order to use it. But the same argument applies to NET-enabled start tags, so that razor would't help here. [NOTE: I'm using NET-enabled start tags as an example to make a point. I'm really aggitating for their inclusion in XML--they're handy, but not that much more handy than empty end tags, saving only one keystroke.] Cheers, E. --
W. Eliot Kimber, Senior Consulting SGML Engineer ISOGEN International Corp. 2200 N. Lamar St., Suite 230, Dallas, TX 95202. 214.953.0004 www.isogen.com
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From sjd at eps.inso.com Fri May 8 23:58:46 1998 From: sjd at eps.inso.com (Steven DeRose) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) In-Reply-To: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no> Message-ID: <3.0.5.32.19980508175003.00840e00@mailhost.eps.inso.com> At 10:38 PM 05/08/98 +0200, Jarle Stabell wrote: >2. The method M1 of the fantastic Class1 can >be used in situation X. > >I think variant 2 is faster to read than variant 1, and you don't have to >check the end-tags for misspellings. True, you don't have to check them; but the often forgotten corrolary is that you also *can't* check the end-tag for misspellings if you go that route. So if the data is erroneous you are far less likely to detect it *at all*, making for truly nasty debugging. This is an ancient information-theoretic tradeoff: you can always save space, but the more you save, the less chance you have of detecting errors. This is because when you reduce redundancy, you increase the % of all possible bit sequences that are syntactically correct. For example, imagine trying to communicate in a noisy room if every possible sequence of sounds was a legitimate English word. Or imagine programming in a language where every possible byte sequence is a syntactically correct program (APL and raw machine code are the only approximations I can think of to that -- guess why). > >The argument that compressing reduces/eliminates the size advantage of >documents with empty end tags often doesn't apply, the document will often >be stored uncompressed on users hard-disks, in databases and in memory. Sorry, but with Win98 rumored to demand 64MB of RAM just to run and with Moore's Law applying to memory prices, I can't muster much enthusiasm for an argument that it is too costly to shave bytes on markup. If you had to put *ten* full tags on every element you'd hardly ever notice any impact except on a 747 manual, and anything that big can't be handled practically in raw unparsed form anyway. I did a lot of statistics on this a few years ago: a fully-marked-up file with no minimization is still wayyyyy smaller than the equivalent word processor file in typical systems, so what's the big deal? I agree it would be handy when typing XML by hand or reading it raw. But it is not without adverse consequences too. I'd rather see better editing tools so I don't even have to know about such details. Steven J. DeRose, Ph.D. Visiting Chief Scientist, STG | Chief Scientist Adjunct Associate Professor | Inso Corp. EPS Steven_DeRose@brown.edu | sjd@eps.inso.com 401-863-3690 | 401-752-4438 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Bryan_Gilbert at pml.com Sat May 9 00:14:30 1998 From: Bryan_Gilbert at pml.com (Bryan Gilbert) Date: Mon Jun 7 17:01:13 2004 Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus)) Message-ID: From: Steven DeRose [SMTP:sjd@eps.inso.com] Sent: Friday, May 08, 1998 2:50 PM To: xml-dev@ic.ac.uk Subject: Re: A little wish for short end tags (Was: RE: SDD bogus) Sorry, but with Win98 rumored to demand 64MB of RAM just to run and with Moore's Law applying to memory prices, I can't muster much enthusiasm for an argument that it is too costly to shave bytes on markup. If you had to Work for a while in a system that has 1/2 MB ROM, maybe 1MB RAM, and a CPU that is managing intense calculations and I/O. Am I alone? Or are there others out there with similar constraints who want to use XML? I'd like to hear from you. Thanks Bryan Gilbert, B.Sc. M.Sc. Systems Engineer, ION Modules Team, Power Measurement Ltd 2195 Keating Cros Road, Saanichton, BC, Can, V8M 2A5 Phone: (250) 652-7100 ext 7570 Fax: (250) 652-0411 email: (mailto:bryan_gilbert@pml.com) WWW: (http://www.pml.com) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From adam at cyber-guru.com Sat May 9 00:26:35 1998 From: adam at cyber-guru.com (Adam M. Donahue) Date: Mon Jun 7 17:01:13 2004 Subject: EBNF question In-Reply-To: <199805082025.QAA23192@ruby.ora.com> References: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> (simpson@polaris.net) Message-ID: <9805082226.AA14674@acf2.nyu.edu> I just want to clear something up about reading the XML productions. Am I correct in the following assumptions: A character or sequence of characters between single quotation marks, e.g., 'ABC' is to be represented literally. There must be no space characters between the quotation marks and the entity in the production. So def ::= ' ABC ' would not be legal for representing " ABC " .. you'd need to use SP 'ABC' SP. However, when spaces do exist between the quotation marks, then the name(s) inside are to be treated as productions and the *quotation marks* as literals. For example: abc ::= 'ABC' ; literally ABC lit ::= ' abc ' ; literally ' ABC ' lit ::=' ABC ' ; undefined? Is this correct?  Adam mailto:adam@cyber-guru.com xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 9 00:34:48 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:13 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) In-Reply-To: <01BD7AD2.0F899E80.jarle.stabell@dokpro.uio.no> Message-ID: <3.0.1.16.19980508222800.747792f8@pop3.demon.co.uk> At 22:38 08/05/98 +0200, Jarle Stabell wrote: >Paul Prescod wrote: >> If we >> presume that programmers are going to work through parsers, then why >> couldn't we leave GI's out of end tags and make XML substantially less >> verbose (qualitatively at least)? Anyhow, many people argue with some >> justification that regexp-based processing of the source files will still >> be very important and popular. I'm not convinced that the cost/benefit >> ratio is right, if we win over the awk hackers and annoy the document >> authors, but we will see. > >(those who have found the discussion about short end tags tiresome >years/months ago, please forgive me and don't read any further.) > >I would love to see empty end tags making it into the standard in the >future. In many cases, one only marks up single words, and then empty end [... and others agreed/disagreed ...] This is not, of course, a new discussion - it has been vigorously debated on XML-SIG/WG and there were differences of opinion just as there are here. At this stage of XML it's important to agree that we should all adhere to XML1.0 *completely* whatever we feel its merits and demerits to be. The full spec has only been out ca 2 months and we are seeing and will continue to see a large amount of high quality software - it's critical that this conforms. When there has been more experience of what level of tools are actually used and what quality of documents are actually produced it may be time for revisions. The minimum-minimization principle is extremely valuable if there are a significant number of documents that are prone to well-formedness errors. I still create - and will continue to create - XML files by hand. I also write programs to create XML files. Both of these are error-prone processes :-) and it's extremely valuable to be able to locate the precise point of error. Using XML-DEV to push for immediate revision of the spec is probably unlikely to be a very productive process. Using it to highlight unnecessary - and therefore omissible - constructs (e.g. SDD) or areas of semantic fuzziness is probably more likely to be useful. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Sat May 9 02:23:50 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:14 2004 Subject: Character Encoding Detection Message-ID: <001a01bd7ae0$f8d2a7e0$a20b4ccb@NT.JELLIFFE.COM.AU> From: Chris Hubick >Now in UCS-2: > '<' is 00 3C > '?' is 00 3f > >So the start of a UCS-2 or UTF-16 encoded XML document would be 00 3C 00 >3F Not always. Because when storing a double-octet (i.e. two 8-bit bytes) different CPUs store them in different byte order: big-endian and little-endian. So if you read a big-endian file as a sequence of bytes you will get 003c003f, but if you read a little-ending file as a sequence of bytes you will get 3c003f00. This is why a binary file from an Intel machine will typically need to be byte-swapped for use on a Motorola 68K machine. I believe the Motorola PowerPC CPU has an instruction to switch between little-endian and big-endian reading modes. When you are storing 4 octet data there are 4 possibile arrangements, but in fact only 3 seem to have been used: PDP-11 use one, Motorola used another, Intel used another. To cope with this, network protocols and binary file formats will usually specify a particular byte order for storing or transfering 16 or 32 bit data. As an alternative, Unicode (UTF-16) reserves two characters near the high end of the codespace. From these characters, it is possible to determine whether the file is saved as big-endian or little endian. Unicode software will then resequence incoming bytes correctly, to construct the 16-but values.This is called the Byte Order Mark (BOM). The XML character encoding strategy is to encourage you, wherever possible, to make sure that all the information needed to parse your document can be marked-up in the document. Operating systems typically just provide a "text" type (.txt, Mac "TEXT") under the assumption, which was OK in the West when everyone had standalone computers, that all "text" on a computer would use the same character set or encoding. So, because operating systems and protocols fail in this key regard, XML provides its heuristic for determining text: this heuristic is reliable, in the sense that if you use it, your data is clearly labelled--it takes the guess work out of character detection. The heuristic is basically this: 1) If the operating system or transport protocol tells you the character set, and if it is reliable, use that. 2) If there is a BOM, then your data is Unicode. 3) Otherwise, use the XML declarations (i.e., the ) to determine the encoding (this is a little bit complex, but straightforward) 4) Otherwise, it is UTF-8 (7-bit ASCII conforms to UTF-8 already). The draft RFC for the MIME type text/xml will be made public soon. It gives some more policy on these issues. >In the section on autodetection of character encodings the XML spec >states "00 3C 00 3F: UTF-16, big-endian, no Byte Order Mark (and thus, >strictly speaking, in error)" > > My question is, why is this an error rather than a perfectly >acceptable untransformed UCS-2 document? > > The reason that the encoding declaration and BOM are not mandatory was to allow legacy documents and software systems to be used. But there seems to be a general concensus with the XML SIG/WG people that all new XML documents should have either a BOM or an explicit XML declaration (with the encoding attribute). As part of the XML-development process, some of us asked the question: "What are the intrinsic properties of text?", with the idea that if something was intrinsic ("prime metadata") then XML should provide a standard way of representing it. The things we came up with were notation (i.e. XML, including version and treatment of white-space xml:space), encoding, language (i.e., xml:lang). All these things are fundamental to any parsing of text files in a world wide web. (I also think that written-script is also an intrinsic, but the ISO standard for script codes was not finished and the RFC 1766 language format was flexible enough for most needs, it was thought.) Software developers should rise to this challenge. When you write out an XML file, make sure you write out the appropriate XML header: Don't treat it as an option but as a necessity. And if you are writing UTF-16/Unicode tools, always write out the BOM as the first character. With XML we have the chance to not get out of the character set and encoding maze: not by being forced to use a single encoding, but by disconnecting encoding from document character set (i.e. ISO 10646) and by clearly labelling which encoding is being used. Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Sat May 9 02:37:29 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:14 2004 Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus)) Message-ID: <003501bd7ae2$dd719d10$a20b4ccb@NT.JELLIFFE.COM.AU> If you are worried about file size, then compress your files. Tags, being short strings commonly used, compress really nicely with the common compression algorithms out there. I once did an experiment (to confirm one that Gavin Nicol had done) using a document with several thousand lines, each with one start-tag, end-tag pair and no content. I tried compressing this file with * no minimization * short end-tags * end-tag ommission The uncompressed file was something like 50K. The compressed files differed by only a few 100bytes from each other. The gains from short end-tags did not carry over into the compressed versions, and the compressed versions were so much smaller there seemed little contest. Having short-tag ommision can only compress a document by less than 50% at an improbable maximum (in the case of a document with not data, not attributes, no white-space, and incredibly long GIs). Compression is a far better approach. Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ricko at allette.com.au Sat May 9 02:56:16 1998 From: ricko at allette.com.au (Rick Jelliffe) Date: Mon Jun 7 17:01:14 2004 Subject: parser for xml-data? Message-ID: <006e01bd7ae5$7a49ac70$a20b4ccb@NT.JELLIFFE.COM.AU> From: Ron Bourret > The only major difference I have found so far for XML is that the elements in > XML documents are ordered, while data members in OO programming languages are > not. There is an error in the (January 5?) XML-data report, in the very first example. It gives the clear impression that XML-data does not constrain sequence for the element types it describes. In the example, the declarations for the element types which can appear as the content of an element types are given in one order, but the instance has them in another order. (A Microsoft representative pointed this out to me: I dont know why they haven't just reissued the note, since it is a fairly cricitical point for implementors.) Also note that the usage of ISO 8601 date formats seems to be wrong. ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and not 19980509, last time I looked. If anyone is thinking of implementing XML-data, I suggest they befriend the authors, because the report misses out on several key issues. (I have previously mentioned that is does not seem to make clear whether you can have an XML-data schema as part of a document, or whether it must be external. If it is internal, can it describe the document's root element? I suppose a close reading of the XML-data text might help, but it is not clear to me after dozens of readings, but I do not claim to be particularly brilliant in this area.) Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alex at veosystems.com Sat May 9 03:18:07 1998 From: alex at veosystems.com (alex@veosystems.com) Date: Mon Jun 7 17:01:14 2004 Subject: SDD again Message-ID: <19980509011757.19901.qmail@veosystems.com> A non-text attachment was scrubbed... Name: not available Type: text Size: 1866 bytes Desc: not available Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/a6a71159/attachment.bat From tbray at textuality.com Sat May 9 03:26:53 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:14 2004 Subject: SDD again Message-ID: <3.0.32.19980508182528.00b5c2b0@pop.intergate.bc.ca> At 06:17 PM 5/8/98 -0700, alex@veosystems.com wrote: >Currently, validating parsers are the only real *safe* bet. Whatever the limitations of the SDD, this statement is simply false. There are many useful applications for non-validating processors; in fact, I suspect, a very large majority of XML apps. -Tim xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alex at veosystems.com Sat May 9 03:52:23 1998 From: alex at veosystems.com (alex@veosystems.com) Date: Mon Jun 7 17:01:14 2004 Subject: SDD again In-Reply-To: <3.0.32.19980508182528.00b5c2b0@pop.intergate.bc.ca> from "Tim Bray" at May 8, 98 06:25:30 pm Message-ID: <19980509015211.20167.qmail@veosystems.com> A non-text attachment was scrubbed... Name: not available Type: text Size: 1934 bytes Desc: not available Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/5ff16937/attachment.bat From jnava at Adobe.COM Sat May 9 05:29:43 1998 From: jnava at Adobe.COM (Joel A. Nava) Date: Mon Jun 7 17:01:14 2004 Subject: FW: IDL in XML or SGML Message-ID: <001801bd7afa$87e20070$b83f2099@moebius.corp.adobe.com> I hope this is the first time this has come through, as the email was bounced from the Mail Delivery System [Mailer-Daemon@ic.ac.uk] the last 2 times I sent it. I am guessing that it was because I was subscribed to the digest and not the regular list. So, I am back on the regular list now. Joel -----Original Message----- From: Joel A. Nava [mailto:jnava@adobe.com] Sent: Thursday, May 07, 1998 10:32 PM To: xml-dev@ic.ac.uk Cc: Joel A. Nava Subject: IDL in XML or SGML I am forwarding the question of a colleague who is not on this list. I would appreciate being cc'ed, as I am getting xml-dev in digest form. Thanks, Joel ================================================================= Is anyone aware of an XML application which encodes an Interface Description Language, such as for DCE RPC, COM or CORBA? I'm particularly looking for an XML (or even SGML) application of encoding the Microsoft IDL (MIDL). I'd appreciate pointers. Thanks, Perry xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 9 06:33:03 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:14 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508154119.0077c1e8@postoffice.swbell.net> Message-ID: <3553DCB0.50431ADB@technologist.com> W. Eliot Kimber wrote: > > Markup minimization is a slippery slope--in for a penny, in for a pound. I don't think that slippery slope arguments are interesting in technical discussions. You and I are smart enough to hammer out reasonable compromises. XML is full of such reasonable compromises. XML *is* such a compromise. One could have made the argument that as soon as you allowed element type declarations (for example) you are inevitably on the slope down to including every kind of declaration and pseudo-declaration SGML allows. Or once you make one concession to bandwidth (e.g. the external entity optionality) you are going to end up making all such concessions (of which markup minimization would be one). Or once you made one concession to easy parsability (fixed delimiters), you are going to end up making them all (but we allowed alternate literal string delimiters). Etc. etc. etc. But of course reasonable people do not (and did not) go that route. We weighed the costs and benefits of *each* feature more or less individually, and kept some and threw out some. I believe that this process works just as well for minimization. Just because one language went overboard does not mean that all future ones must shun minimzation as if it were heroine. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tbray at textuality.com Sat May 9 06:36:37 1998 From: tbray at textuality.com (Tim Bray) Date: Mon Jun 7 17:01:14 2004 Subject: SDD again Message-ID: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca> At 06:52 PM 5/8/98 -0700, alex@veosystems.com wrote: >No, it is not false. I hightlighted the word 'safe'. If you absolutely >*must* know that everything was read and interpreted correctly, you *must* >use a validating parser. There are many applications where this is not >an absolute requirement and, thus, you may use a well-formed parser. No. This claim is without technical merit and I cannot let it pass unchallenged. It is trivially possible to achieve correctness and 100% unquestioned reliability without the use of a validating processor. If you either (a) construct your DTD so that standalone='true' or (b) don't have external markup declarations, then you can have NO DOUBT WHATSOEVER that the document is being parsed correctly, for any sane denotation or connotation of "correctly". So please stop making this demonstrably false assertion. Not only is your premise false in theory, it is vacuous in practice. If you think, for any real-world application, that its validation against some DTD guarantees "correctness" in any nontrivial sense, then I don't want to go anywhere near your software. Validity is a highly specific claim, one which is of great utility in many applications, but it does not equate to having "safety" or "correctness". Equally, lack of validity does not equate to lacking "safety" or "correctness". >In addition, there are some applications that need some level of guarantee >about whether external declaration subsets will be read and honored. It is >this class of applications that we cannot address today with the current >definition of well-formed. This statement is correct, except for the unnecessary temporizing about "some level of guarantee". "Guarantee" is a binary condition; if you need a guarantee that the organization to which you are sending information will have external declarations read, then you need to specify the use of a validating processor at that end. If not, then not. But please don't equate this particular guarantee with general concepts of "safety" or "correctness" - doing so gives the impression that the use of documents which are merely well-formed is in some way sloppy or irresponsible; such a claim is fatuous and very, very, very unhelpful. -Tim Bray xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Sat May 9 06:48:41 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:14 2004 Subject: SDD bogus References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com> Message-ID: <3553D056.5D6D5087@jclark.com> David Megginson wrote: > As many XML parser writers have shown, resolving external entities is > one of the easiest parts of XML It depends what sort of API your parser has. If it has an API like SAX (or Aelfred or XP) where the parser makes calls to get input, it is indeed easy. But that sort of API is not what is needed for convenient integration in Web browsers (at least not for Netscape). They need an API like expat's where the parser never blocks waiting for input, but instead the browser makes calls on the parser giving it input as it becomes available. Expat doesn't support parsing of external entities at all at the moment. I suspect it would be possible to make it support external general entities, and I'll probably do that at some point. It wouldn't be practical to support external parameter entities or external DTD subsets without switching to a different kind of API which be less convenient for Web browser integration. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Sat May 9 06:49:19 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:14 2004 Subject: SDD bogus References: <3553061D.BF3A3461@technologist.com> Message-ID: <3553DD4F.D3CAE0C6@jclark.com> Paul Prescod wrote: > I'm concerned because I believe > this to be a valid XML document: > > > > > ]> > > > In my opinion, section 5.1 will require the non-validating parser to skip > the attribute list declaration, even if memo.dtd is an empty file. This is very good point. Your example isn't quite right: the entity must be referenced. Also a non-validating parser only has to skip the ATTLIST declaration if it skips the entity reference. Apart from this, your interpretation of 5.1 is the obvious one. Expat behaves consistently this. I think this is a serious problem, because it breaks the principle that if you declare your document as standalone=yes and validate it, then you will get the same result when you parse it with any non-validating parser (which to me is the point of the SDD). I think a bit of creative interpretation would be in order. Section 5.1 says: [Non-validating processors] must not process entity declarations or attribute-list declarations encountered after a reference to a parameter entity that is not read, since the entity may have contained overriding declarations. The "since" clause is false when standalone=yes, so I think this can fairly be said to be an inconsistency in the spec (rather than simply a poor design choice), which should be resolved by not applying this requirement when standalone=yes. The other way to fix this would be to tweak the definition of standalone to say that declarations after the first reference to an external parameter entity count as external for the purposes of determining whether the document is standalone. This is clearly needs to be fixed one way or the other. > Despite its reputation to the contrary, > XML is intricate and deep and I may have missed something important. Yes. Entities and the SDD are both tricky: the interaction of the two is particularily so. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 9 07:00:09 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:14 2004 Subject: SDD bogus References: <3553061D.BF3A3461@technologist.com> <199805081351.JAA00618@unready.microstar.com> <35535E95.3B3F5B23@technologist.com> <199805082043.QAA02512@unready.microstar.com> Message-ID: <3553E309.ED168257@technologist.com> David Megginson wrote: > > It's a matter of finding the right balance: XML must be simple enough > to parse that there is a healthy competition among parser writers (and > thus, a better quality and larger choice of tools for application > writers), but simple enough to write that authors (including > developers of tools that generate XML) are willing to learn and use > it. I agree 100%. I just believe that XML does NOT have balance. > SGML, with its bizarre tag-omission rules, delimiter-in-context > recognition, shortrefs, etc., gave too much away to the authors, and > as a result, there were very few good SGML parsers implemented (and > never a one in Java, although many of us tried). This strikes me as similar to Eliot's argument. We've gone too far in the past so let's not go anywhere near there again. Rather, the opposite should be true: we know from experience the limits of what is reasonable and should standardize it for all of the same reasons we standardized many other good practices in XML. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From andrewl at microsoft.com Sat May 9 07:19:46 1998 From: andrewl at microsoft.com (Andrew Layman) Date: Mon Jun 7 17:01:15 2004 Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus)) Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA177@red-msg-08.dns.microsoft.com> This is not to debate the value of short end tags. That's been debated. But just as a technical matter, I also ran some tests a while ago, and found that files with short end tags compress 5 to 10 percent better than those with full end tags. I believe the reason is that with full end tags there are more unique pairs of end/start, while with short end tags there are only as many end/start pairs as types of start tag. So a compression scheme that first shortened end tags, then applied a standard compression would be more efficient size-wise. -----Original Message----- From: Rick Jelliffe [mailto:ricko@allette.com.au] Sent: Friday, May 08, 1998 5:39 PM To: 'XML-DEV' Subject: Re: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus)) If you are worried about file size, then compress your files. Tags, being short strings commonly used, compress really nicely with the common compression algorithms out there. I once did an experiment (to confirm one that Gavin Nicol had done) using a document with several thousand lines, each with one start-tag, end-tag pair and no content. I tried compressing this file with * no minimization * short end-tags * end-tag ommission The uncompressed file was something like 50K. The compressed files differed by only a few 100bytes from each other. The gains from short end-tags did not carry over into the compressed versions, and the compressed versions were so much smaller there seemed little contest. Having short-tag ommision can only compress a document by less than 50% at an improbable maximum (in the case of a document with not data, not attributes, no white-space, and incredibly long GIs). Compression is a far better approach. Rick Jelliffe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alex at veosystems.com Sat May 9 07:55:53 1998 From: alex at veosystems.com (alex@veosystems.com) Date: Mon Jun 7 17:01:15 2004 Subject: SDD again In-Reply-To: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca> from "Tim Bray" at May 8, 98 09:35:05 pm Message-ID: <19980509055543.21260.qmail@veosystems.com> A non-text attachment was scrubbed... Name: not available Type: text Size: 4729 bytes Desc: not available Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/2e364cef/attachment.bat From Jon.Bosak at eng.Sun.COM Sat May 9 09:30:31 1998 From: Jon.Bosak at eng.Sun.COM (Jon Bosak) Date: Mon Jun 7 17:01:15 2004 Subject: NOTE: Stylesheets and XML Message-ID: <199805090728.AAA25990@boethius.eng.sun.com> At the request of the W3C XML WG, James Clark has put into the form of a Note a proposal made some time ago for a simple method for associating stylesheets with XML documents. This method closely parallels the mechanism used for the same purpose in HTML 4.0. It is not an official recommendation, but the proposal has already been successfully implemented and should serve well for the first experiments with XML/CSS browsers. See http://www.w3.org/TR/NOTE-xml-stylesheet Jon Bosak Chairman, XML WG xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sat May 9 10:39:21 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:15 2004 Subject: SDD again In-Reply-To: <19980509055543.21260.qmail@veosystems.com> References: <3.0.32.19980508213503.00b5b650@pop.intergate.bc.ca> Message-ID: <3.0.1.16.19980509082154.7d6fb61e@pop3.demon.co.uk> At 22:55 08/05/98 -0700, [several people] wrote: [... important and occasionally emotive discussion of SDD snipped...] This discussion exemplifies one of the roles we hoped XML-DEV could address - that different people could make different semantic assumptions from the spec. I am not surprised that this particular problem has surfaced because the distinction between validity and well-formedness is new and because it has been clear that it is a difficult one to formalise. In the current situation it would appear that different parser-creators can create code which can sometimes produce different parse trees from the same document. [Please correct me if I'm wrong.] These writers are not incompetent - they take different interpretations of what the spec requires to be done and under what circumstances. If - as appears - we have a spec which is fuzzy in places then we it would be extremely useful to know - dispassionately - where the problems lie. If they can be identified then we might be able to: - agree on a common course of action and/or - agree that the document creator could encode her wishes in some way and/or - agree that the parserwriter could make various options available to the reader/client (e.g. through commandline switches). Some validating parsers (e.g. DXP) already have commandline switches (e.g. -v for validate) and I suspect that there will be increasing pressure to add more ('ignore standalone declaration' for example). If we could systematise these it could be one way to reduce unexpected effects of parsing. [But I'm not the right person to offer :-)] P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alex at veosystems.com Sat May 9 16:05:43 1998 From: alex at veosystems.com (alex@veosystems.com) Date: Mon Jun 7 17:01:15 2004 Subject: SDD again In-Reply-To: <3.0.1.16.19980509082154.7d6fb61e@pop3.demon.co.uk> from "Peter Murray-Rust" at May 9, 98 08:21:54 am Message-ID: <19980509140520.22138.qmail@veosystems.com> A non-text attachment was scrubbed... Name: not available Type: text Size: 3056 bytes Desc: not available Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980509/bbdc29ad/attachment.bat From simpson at polaris.net Sat May 9 16:26:03 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:15 2004 Subject: Attribute type NMTOKEN In-Reply-To: <3.0.3.32.19980508095011.008ce420@nexus.polaris.net> References: <199805081213.OAA01196@berlin.dvs1.tu-darmstadt.de> Message-ID: <3.0.3.32.19980509102520.008be510@nexus.polaris.net> At 09:50 AM 5/8/98 -0400, in a stuporous moment I wrote: >a type of NMTOKEN would not allow a value such as >"1ABC," "//ENGLISH," or " WHITESPACE." At 04:25 PM 5/8/98 -0400, Chris Maden clarified: >A possible misunderstanding here: a NMTOKEN itself can't contain >whitespace, but the attribute value specification, the quoted thing >after the equals sign, can: so > >*is* a valid NMTOKEN attribute specification. The NMTOKEN in this >example is only "WHITESPACE", though; the leading (and trailing) >whitespace is trimmed. Then at 03:31 PM 5/8/98 -0600, Russ Heithoff further clarified: >Also, if I remember right, a NMTOKEN can start with Name characters. This >is in contrast to a Name which must start with Name Start characters >followed by characters. In otherwords 1ABC is a legal NMTOKEN but not a >legal NAME. Sheesh -- 2 flubs out of 3 swings at it! So to summarize: (1) The purpose of the NMTOKEN attribute type is to constrain the content of attribute values by their *form* (rather than by their actual value, as would be done with an enumeration). (I think this general statement was true even though I muffed the examples!) (2) The *form* of an attribute value declared as type NMTOKEN is such that it may contain only valid NAME characters. The valid name characters are "letters" (alphabetics as expressed in ASCII Latin, ISO Latin, and so on); digits (also including ASCII and various international sets); limited punctuation (full stop (.), hyphen (-), colon (:)); and various combining and extending characters required for internationalization. Internal whitespace is not allowed in NMTOKEN-type attribute values. [Constraints in item (2) established by productions #4, 7, and 84-89 in REC-xml-19900210.] (3) Leading and trailing whitespace is trimmed/ignored before determining whether the value is valid for a NMTOKEN-type attribute. Thanks to Chris (again) and Russ for correcting my examples. Best regards, | It's no disgrace t'be poor, John E. Simpson | but it might as well be. simpson@polaris.net | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From robin at ACADCOMP.SIL.ORG Sat May 9 20:03:42 1998 From: robin at ACADCOMP.SIL.ORG (Robin Cover) Date: Mon Jun 7 17:01:15 2004 Subject: XML conformance and test suites Message-ID: <199805091810.NAA03382@ACADCOMP.SIL.ORG> Alex Milowski said: > Such a test suite and conformance to it needs to be governed by some > standards and/or consortium. OASIS springs to mind... OASIS (formerly SGML Open) has set up an XML Conformance Subcommittee, currently under the direction of G. Ken Holman. Ken overviewed the work plans of the subcommittee at the Seattle XML Conference, and slides from his presentation are to be posted on the OASIS web site. See other referrences/hints in the "XML Conformance" section of the SGML/XML Web Page: http://www.sil.org/sgml/xml.html#conformance Robin Cover ------------------------------------------------------------------------- Robin Cover Email: robin@acadcomp.sil.org 6634 Sarah Drive Dallas, TX 75236 USA >>> The SGML/XML Web Page <<< Tel: +1 (972) 296-1783 (h) http://www.sil.org/sgml/sgml.html Tel: +1 (972) 708-7346 (w) FAX: +1 (972) 708-7380 ========================================================================= xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Sun May 10 01:13:30 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:15 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> Message-ID: <3554E2E6.6E13B796@allette.com.au> W. Eliot Kimber wrote: > Just because *your* requirements make it obvious that empty end tags are > enough, that doesn't mean that everybody's do. Any decision short of no > minimization or full minimization will be an arbitrary one that will cause > those whose requirements are not met to wonder why. Agreed. It would only be a matter of time before someone asked for: paragraph 1 <>paragraph 2 <>paragraph 3 to be made legal, (as it is in SGML) for exactly the same reasons as were originally proposed. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From adam at cyber-guru.com Sun May 10 01:55:58 1998 From: adam at cyber-guru.com (Adam M. Donahue) Date: Mon Jun 7 17:01:15 2004 Subject: EBNF again... In-Reply-To: <3554E2E6.6E13B796@allette.com.au> Message-ID: <9805092355.AA09694@acf2.nyu.edu> So no one out there knows the answer to my question? Adam mailto:adam@cyber-guru.com xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 10 03:02:58 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:15 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> Message-ID: <3554FCF0.D08AB8FB@technologist.com> Marcus Carr wrote: > > Agreed. It would only be a matter of time before someone asked for: Thankfully, whatever language the conversation is conducted in is likely to have the word "no" in it. We've got quite proficient at using the English version in the XML effort, which is why slippery slope arguments strike me as bizarre. XML is precisely about achieving compromises -- about walking halfway down the slope without losing your balance. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Sun May 10 03:17:20 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:15 2004 Subject: NOTE: Stylesheets and XML Message-ID: Jon Bosak wrote: >At the request of the W3C XML WG, James Clark has put into the form of >a Note a proposal made some time ago for a simple method for >associating stylesheets with XML documents. This method closely >parallels the mechanism used for the same purpose in HTML 4.0. It is >not an official recommendation, but the proposal has already been >successfully implemented and should serve well for the first >experiments with XML/CSS browsers. See > > http://www.w3.org/TR/NOTE-xml-stylesheet After taking a look at the note, I have a concern. The note proposes as the equivalent for the HTML (There are, of course, many more examples...) The same information, for the most part, is contained in both statements, and a processing instruction ducks the issues raised by using an element to implement the link. Still, I'm wondering if XML isn't already becoming loaded down with different ways to reference external material. External entities provide one mechanism for including material. The DOCTYPE declaration provides another mechanism. This processing instruction is yet another mechanism. XLink provides another set of mechanisms which at least 'transclude' information if not 'include' it. I realize that these things are are connecting material of different types in different contexts, but it seems too many mechanisms are providing similar functionality in different contexts. I'm definitely _not_ saying that this way of implementing stylesheets is a bad idea - it's probably the most convenient way to do it in many circumstances. I'm hoping strongly that we don't find any other needs that require another reference of a different type. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Sun May 10 03:31:46 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:15 2004 Subject: EBNF again... Message-ID: <01bd7bcd$3ee43fe0$2facdccf@uspppBckman> Adam, I'm not quite sure I followed the gist of your original question, but the following are legitimate characters in XML Sect 2.2 Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] and as you can see the ASCII equivelents of 9,10,13,and 32(i.e. White space) are all called legitimate characters. On the other hand if you look at name char, and check out the letter, the digit, the CombiningChar and the extender productions you will see that they do not include whitespace. [4] NameChar ::= Letter | Digit | '.' | '-' | '_' | ':' | CombiningChar | Extender Whether you can include white space or not depends on the notation. On the other hand the literal string " abc " is ASC(32) +ASC(97) +ASC(98) +ASC(99) + ASC(32) Does this answer your question? Frank -----Original Message----- From: Adam M. Donahue To: xml-dev@ic.ac.uk Date: Saturday, May 09, 1998 4:59 PM Subject: EBNF again... >So no one out there knows the answer to my question? > >Adam > >mailto:adam@cyber-guru.com > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Mon May 11 00:01:11 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:15 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> Message-ID: <35562374.DF2B9B4A@allette.com.au> Paul Prescod wrote: > Marcus Carr wrote: > > > Agreed. It would only be a matter of time before someone asked for: > > Thankfully, whatever language the conversation is conducted in is likely > to have the word "no" in it. We've got quite proficient at using the > English version in the XML effort, which is why slippery slope arguments > strike me as bizarre. XML is precisely about achieving compromises -- > about walking halfway down the slope without losing your balance. I'd be interested to hear how you would justify allowing short end tags and not short start tags; after all, they're trying to accomplish exactly the same thing, so any argument for one surely supports the other. Without adequate justification for short end tags and against short start tags, any decision will appear to be arbitrary rather than a balanced compromise. This is what the slippery slope scenario means to me; your "halfway" won't be everyone elses. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cys at arbortext.com Mon May 11 01:21:14 1998 From: cys at arbortext.com (Cynthia Lenora Shern) Date: Mon Jun 7 17:01:15 2004 Subject: Is anybody working on ... Message-ID: <98May10.191726edt.26887@thicket.arbortext.com> This makes sense to me too. And, do you envision this particular family of markup residing internal or external to XML instances (as a perferred model)? At 03:13 PM 4/29/98 -0400, Paul Prescod wrote: >I would think that in the case where the formatting is an intrinsic part >of the meaning of a document, it should be represented in the markup just >like everything else. XML encourages you to separate formatting from the >abstraction, but it does not require you to. As someone else pointed out, >Precision Graphics Markup Language is an XML-based language designed >explicitly for formatting -- but it is still XML. > > Paul Prescod - http://itrc.uwaterloo.ca/~papresco > >"Perpetually obsolescing and thus losing all data and programs every 10 >years (the current pattern) is no way to run an information economy or >a civilization." - Stewart Brand, founder of the Whole Earth Catalog >http://www.wired.com/news/news/culture/story/10124.html > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > > Cynthia Shern (cshern@arbortext.com) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Mon May 11 01:23:43 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:15 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> Message-ID: <3556370D.FC18BF6E@technologist.com> Marcus Carr wrote: > > I'd be interested to hear how you would justify allowing short end tags and not > short start tags; Simple: short end tags are trivial to understand, trivial to parse, non-obfuscatory and context-free. They also have plenty of precedent in many other languages. Short start tags behave differently depending on context, and thus are not trivial to understand, are not conducive to good document maintenance and are not implemented in any non-SGML language I know of. > after all, they're trying to accomplish exactly the same thing, > so any argument for one surely supports the other. Without adequate justification > for short end tags and against short start tags, any decision will appear to be > arbitrary rather than a balanced compromise. XML is a relatively arbritrary mix of SGML's most interesting features. I certainly think that the decision to include some things and exclude others seems arbitrary. C'est la vie. That's how you get a workable subset. > This is what the slippery slope scenario means to me; your "halfway" won't > be everyone elses. I didn't claim it had to be, nor should be. That's why the XML working group holds votes to decide the outcome of decisions. Each person chooses their halfway and the majority rules. This situation is no different than any other. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cys at arbortext.com Mon May 11 01:41:56 1998 From: cys at arbortext.com (Cynthia Lenora Shern) Date: Mon Jun 7 17:01:15 2004 Subject: Is anybody working on ... Message-ID: <98May10.193809edt.26885@thicket.arbortext.com> I (questioner) am wondering whether, the markup in question wouldn't be used to drive multiple processes or presentations. So I guess I'm assuming that if (for example) formatting needs to "provide_for" references to objects (graphics, sound, video, code, and "other_stuff_to_be_described"); and if these referenced objects would be processed differently, or not all, or multiply, depending_on_the_processor and media the processor was presenting the data to; then would an external file would allow for more portability of instances? 08:44 PM 4/29/98 -0400, Paul Prescod wrote: >James K. Tauber wrote: >> >> > Is anybody working on creating an abstration for preserving formatting >> > information inside or along with SGML documents, in cases where format >> > really is part_of the information. >> >> I'm wondering if something like this could be done with the Precision >> Graphics Markup Language (see http://www.jtauber.com/xml/appl/pgml.html) >> with out-of-line links asserting relationships between the precise >> formatting expressed in PGML and the document in a more application-specific >> (eg survey) DTD. > >It isn't clear to me why information that the questioner feels is >intrinsic to the document should be linked instead of embedded. > > Paul Prescod - http://itrc.uwaterloo.ca/~papresco > >"Perpetually obsolescing and thus losing all data and programs every 10 >years (the current pattern) is no way to run an information economy or >a civilization." - Stewart Brand, founder of the Whole Earth Catalog >http://www.wired.com/news/news/culture/story/10124.html > > > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > > Cynthia Shern (cshern@arbortext.com) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cys at arbortext.com Mon May 11 02:06:21 1998 From: cys at arbortext.com (Cynthia Lenora Shern) Date: Mon Jun 7 17:01:15 2004 Subject: Is anybody working on ... Message-ID: <98May10.200235edt.26885@thicket.arbortext.com> Exactly - semantically significant is what I'm searching for here. In my first message I gave the example of kindergarten through grade 12 text books and marketing surveys as two examples that might require such types of abstractions. For K-12 the markup would probably have some sort of pedagogical significance. The marketing survey type might have some sort of statistical or perhaps psychological semantic. I think they are probably XML application specific. And probably some extension or addendum to an existing document type definition. So, based on what is hopefully more clarification, is anyone aware of some markup models that are addressing this type information. At 09:19 PM 4/29/98 -0400, David Megginson wrote: >Paul Prescod writes: > > > Still, it isn't wrong to use Java in a way that is tied to a > > particular platform nor to use SGML in a way that is tied to a > > particular formatter. I think that we agree on that central point. > >I don't think that that's the suggestion (though I agree that it would >be a possible application of SGML/XML). Rather, the suggestion is >that people may need to encode physical information about a text when >that information is required for useful processing, possibly with a >wide range of XML processing tools. More controversially, you could >say that the formatting information is semantically-significant. > > >All the best, > > >David > >-- >David Megginson ak117@freenet.carleton.ca >Microstar Software Ltd. dmeggins@microstar.com > http://home.sprynet.com/sprynet/dmeggins/ > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > > Cynthia Shern (cshern@arbortext.com) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Mon May 11 08:10:47 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:16 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) In-Reply-To: <35562374.DF2B9B4A@allette.com.au> References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> Message-ID: <199805110611.CAA00265@unready.microstar.com> Paul Prescod has quite rightly objected to a simplistic slippery-slope argument against short end tags. I have thought of what might be a stronger argument: as an (unstated?) design principle, XML provides exactly one alternative for nearly every markup item. For example, the following are some of the possible start tags in SGML (the second-last one is an example of a shortref): "" "" "|" "" The following are some of the possible end tags in SGML (the second-last one, again, is a shortref): "" "/" "" "|" "" In XML, start tags with no attributes look like "" and end tags with no attributes look like "". That saves a chapter or so from every XML text, a month or so from each parser-writer's calendar, a few days from each intro to XML course, etc. etc. Of course, any good pedant will point out that "" (with trailing whitespace)is a sort-of variant in XML, and that XML does provide what might be called variant or redundant features (such as the choice of "'" or '"' as literal delimiters, and pre-declared entities like "&" where "&" would serve as well). All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Mon May 11 10:41:28 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:16 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> Message-ID: <3556B982.2A0C2B68@allette.com.au> Paul Prescod wrote: > > I'd be interested to hear how you would justify allowing short end tags and not > > short start tags; > > Simple: short end tags are trivial to understand... > Short start tags behave differently depending on context, and thus are not trivial to > understand, If you mean that they behave differently based on whether OMITTAG is set to 'yes' or 'no', that wouldn't be an issue for XML. > are not conducive to good document maintenance I couldn't agree more, but short end tags can be abused in the same way. > and are not implemented in any non-SGML language I know of. Me either, though I'm not sure if that matters. I don't want short start tags, I don't like short start tags and I have never (intentionally) used short start tags, but unless I have misunderstood you, the reasons you have given above don't seem sufficient basis for voting them down while supporting short end tags. If (as I'm sure most of us believe) these issues are decided primarily on the basis of intelligent discourse rather than gut feeling, it would appear that there is not yet enough to separate short start tags from short end tags. If gut feeling does come into it, then so be it - I can accept that, I just wasn't aware that was the way the process works. This is getting a mile off topic... -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Mon May 11 11:27:42 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:16 2004 Subject: Attribute type NMTOKEN Message-ID: <002e01bd7cbf$2eba4900$1e09e391@mhklaptop.bra01.icl.co.uk> >A possible misunderstanding here: a NMTOKEN itself can't contain >whitespace, but the attribute value specification, the quoted thing >after the equals sign, can: so > >*is* a valid NMTOKEN attribute specification. The NMTOKEN in this >example is only "WHITESPACE", though; the leading (and trailing) >whitespace is trimmed. You're right of course, but I had a devil of a job discovering why! This is one example of the spec being terribly unhelpful: it says quite clearly (in section 3.3.1): "Values of type NMTOKEN must match the Nmtoken production...", and then qualifies this in section 3.3.3, by saying in effect: "the value I was talking about in section 3.3.1 isn't the "attribute value" defined in section 3.1 as you might have thought, rather it's the "normalized attribute value" defined as follows..." Next time round could we please have a bit more rigour in the spec!? Mike Kay xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Mon May 11 11:39:35 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:16 2004 Subject: ISO Date (was: parser for xml-data?) Message-ID: <006d01bd7cc0$cf7bbd00$1e09e391@mhklaptop.bra01.icl.co.uk> Rick Jelliffe: >Also note that the usage of ISO 8601 date formats seems to be wrong. >ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and not 19980509, >last time I looked. It allows both. The first form is preferred where the text is intended to be human-readable, but the second is permitted where it is primarily for machine use. [That's from memory, I don't have the spec to hand]. Mike Kay xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From edz at bsn.com Mon May 11 13:51:26 1998 From: edz at bsn.com (Edward C. Zimmermann) Date: Mon Jun 7 17:01:16 2004 Subject: ISO Date (was: parser for xml-data?) In-Reply-To: <006d01bd7cc0$cf7bbd00$1e09e391@mhklaptop.bra01.icl.co.uk> from "Michael Kay" at May 11, 98 10:40:18 am Message-ID: <199805111151.NAA07681@gils.bsn.com> > > Rick Jelliffe: > >Also note that the usage of ISO 8601 date formats seems to > be wrong. > >ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and > not 19980509, > >last time I looked. > > It allows both. The first form is preferred where the text > is intended to be human-readable, but the second is > permitted where it is primarily for machine use. [That's > from memory, I don't have the spec to hand]. ISO date formats: YYYYMMDDTHHMMSS[Z] YYYYMMDDTHH:MM:SS[Z] YYYY-MM-DDTHHMMSS[Z] YYYY-MM-DDTHH:MM:SS[Z] (eg. 1996-07-16T13:19:51Z) Z := UTC (aka GMT) else unspecified (resp. local) > > Mike Kay > > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk > Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ > To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; > (un)subscribe xml-dev > To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; > subscribe xml-dev-digest > List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > -- ______________________ Edward C. Zimmermann Basis Systeme netzwerk/Munich xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Mon May 11 13:52:40 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:16 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> <3556B982.2A0C2B68@allette.com.au> Message-ID: <3556E683.EC0BE1F1@technologist.com> Marcus Carr wrote: > > Me either, though I'm not sure if that matters. > I don't want short start tags, I don't like short start tags and I have never > (intentionally) used short start tags, but unless I have misunderstood you, the reasons > you have given above don't seem sufficient basis for voting them down while supporting > short end tags. If (as I'm sure most of us believe) these issues are decided primarily > on the basis of intelligent discourse rather than gut feeling, it would appear that > there is not yet enough to separate short start tags from short end tags. If gut > feeling does come into it, then so be it - I can accept that, I just wasn't aware that > was the way the process works. Call it gut feeling. Call it years of experience. Call it common sense. Call it a simple matter of drawing a line somewhere just for the sake of drawing the line. It doesn't matter. In short: the perceived and observed usability and usefulness of short-end tags is high and that of short start-tags is low for a variety of reasons. If nobody on the list disagrees with that fundamental point then I don't really see why we should prepare an argument against *just in case*. Nobody wants short start tags, and even if they did, the working group wouldn't put them in. I'm sure a simple poll would reveal unanimous agreement on that issue. It's really that simple. Markup language design is not at all precise, and is influenced much more strongly by subjective factors than by definitive arguments. I expect that designing languages to manage the interface between human beings and computers can only be done subjectively. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rajesh_n1 at verifone.com Mon May 11 16:16:07 1998 From: rajesh_n1 at verifone.com (Rajesh N) Date: Mon Jun 7 17:01:16 2004 Subject: freewares.. Message-ID: <3.0.5.32.19980511194649.00808cf0@blr-nt-mail1.verifone.com> Hi all, could anyone give me some pointers regarding the availability of any freewares with/without source code for the following.. i) An XML parser written in C++ ii) An XML parser written in C but should be DTD aware iii) Any XML generator API that uses a DTD while building an XML doc ? (ie, performs validations during the manipulation operations of the tree nodes ) thanks in advance, Rajesh N. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alaly at inlink.com Mon May 11 16:56:19 1998 From: alaly at inlink.com (Michael Alaly) Date: Mon Jun 7 17:01:16 2004 Subject: XML parser in C++ Message-ID: <03d701bd7ced$502901c0$06001cac@mike> I believe that Microsoft released a C++ parser (no code) as well as a Java Parser which included code. They can both be found at www.microsoft.com/xml hope this helps, Michael Alaly alaly@inlink.com | 314.878.6474 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980511/331869c4/attachment.htm From Bryan_Gilbert at pml.com Mon May 11 17:48:04 1998 From: Bryan_Gilbert at pml.com (Bryan Gilbert) Date: Mon Jun 7 17:01:16 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) Message-ID: Yes don't forget that XML is meant for Humans too. Short end tags are a pain when there is lots of intermediate text. C programmers are familar with this when they have to deal with nested #ifdef #endif statements: the #endif part is a short end tag. For non-C programmers consider trying to find out which start tag belongs to after you have read pages and pages of text. So the best solution is the current solution: each start tag has a complete matching end tag. IMHO Bryan Gilbert PS Formerly I wanted short end tags for embedded processors but human readability is more important to me. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From maillist at chris.hubick.com Mon May 11 21:13:09 1998 From: maillist at chris.hubick.com (Chris Hubick) Date: Mon Jun 7 17:01:16 2004 Subject: SAX Parsers - standalone Message-ID: Up until now I have been using MSXML for my project at work. I am now converting to use SAX. I have documents with the following: The thing is, right now, "/dtd/JDF.dtd" doesn't exist. This is a browsing application, and I don't want to do validation. And I need to feed the parser using an input stream. Using MSXML I can do "document.setLoadExternal(false);" before I call load. With SAX, I thought I would just be able to return null in my entityResolver, and I would achieve the same functionality. Aelfred throws "no protocol: /dtd/JDF.dtd". What can I do to keep the parser from looking for external documents? Since all the documents are standalone, I could remove the DOCTYPE's from each, but I may want to do validation under some circumstances in the future, and would like to avoid this if possible. I also use the SystemLiteral to identify the document type (I guess I could use the root element name = JDF). Much Thanks! --- Chris Hubick mailto:chris@hubick.com http://www.hubick.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From distobj at acm.org Mon May 11 22:08:52 1998 From: distobj at acm.org (Mark Baker) Date: Mon Jun 7 17:01:16 2004 Subject: Less verbose XML (Was: A little wish for short end tags (Was: RE: SDD bogus)) In-Reply-To: Message-ID: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca> At 03:09 PM 8/5/98 -0700, Bryan Gilbert wrote: >Work for a while in a system that has 1/2 MB ROM, >maybe 1MB RAM, and a CPU that is managing >intense calculations and I/O. > >Am I alone? Or are there others out there with similar >constraints who want to use XML? I'd like to hear from >you. I'm developing for small devices. And I must say, as a developer, it certainly keeps you honest. But I don't have a burning desire for short end tags. At Beduin, we've learned the hard way about how difficult it is to render HTML because some tags don't require end tags (in combo with the general crappy state of HTML generators). I'd guess that our parser is twice as large as it needed to be because of these problems. With short end tags, in order for us to render XML, we'd have to have a similar amount of code to render broken XML. So IMHO, short end tags would make my code larger (though runtime memory requirements *might* drop - not sure). BTW, while on the topic of verbosity, I thought the following position paper by Rohit Khare for the "Future of HTML" conference might be of interest; http://xent.ics.uci.edu/FoRK-archive/apr98/0465.html, "Requirements for Interactive Access to HTML and XML Documents", aka "YML Requirements"; "XML has been accused of a great many sins, but brevity is not among them. >From the 'waste' of to non-normative whitespace, many a character can be considered extraneous. However, rather than fragment off into new archival formats using compact end-tags, rediscovering markup minimization, or paren-based S-expressions, we're best off approaching the Shannon limit directly rather than hacking around it." Well said. MB -- Mark Baker. CTO, Beduin Communications Corp Ottawa, Ontario, CANADA http://www.beduin.com xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Tue May 12 00:04:11 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:16 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> <3556B982.2A0C2B68@allette.com.au> <3556E683.EC0BE1F1@technologist.com> Message-ID: <355775A9.9358FC20@allette.com.au> Paul Prescod wrote: > Markup language design is not at all precise, and is influenced much more > strongly by subjective factors than by definitive arguments. I expect that > designing languages to manage the interface between human beings and > computers can only be done subjectively. Thanks for the explanation. As one with a passing interest in mathematics, I probably just assumed that more formal approaches would be employed in the first instance, but never having participated in such an endeavour there isn't any basis for that. I'm sure an element of subjectivity is both expeditious and valuable when faced with some of these issues. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Tue May 12 02:46:51 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:16 2004 Subject: Less verbose XML In-Reply-To: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca> References: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca> Message-ID: <199805120047.UAA00374@unready.microstar.com> Mark Baker writes: > At Beduin, we've learned the hard way about how difficult it is to > render HTML because some tags don't require end tags (in combo with > the general crappy state of HTML generators). I'd guess that our > parser is twice as large as it needed to be because of these > problems. With short end tags, in order for us to render XML, we'd > have to have a similar amount of code to render broken XML. So > IMHO, short end tags would make my code larger (though runtime > memory requirements *might* drop - not sure). I think that Mark is quite right. Just for interest, here are some statistics, using Jon Bosak's ot.xml (with CRLF line-ends, and with a different XML declaration at the top): With full end tags: 3,880,187 bytes With short end tags: 3,773,844 bytes (2.7% saving) Here are the same files compressed with gzip -9 (savings are relative to the uncompressed version with full end tags): With full end tags: 994,835 bytes (74.4% saving) With short end tags: 988,976 bytes (74.5% saving) All the best, David -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Tue May 12 07:30:56 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:16 2004 Subject: Announcement: SAX 1.0 is finished Message-ID: <199805120530.BAA03034@unready.microstar.com> I am happy to announce public release 1.0 of SAX (the Simple API for XML), a collaborative project of the members of the XML-DEV discussion group. The first release of SAX is in Java, but versions in other programming languages may follow. SAX is free for both commercial and non-commercial use. SAX is a common, event-based API for parsing XML documents. The first draft of SAX was supported by IBM's XML for Java, DataChannel's DXP, James Clark's XP, and Microstar's AElfred parsers. Support for Tim Bray's Lark parser and Microsoft's MSXML parser was provided through third-party drivers. SAX fills the same role for XML that the JDBC fills for SQL: with SAX, a Java application can work with any XML parser, as long as the parser has a SAX 1.0 driver available. SAX 1.0 drivers for the major parsers will be appearing shortly. For more information, or to download SAX 1.0 with two sample drivers (Lark and MSXML), please visit the following URL: http://www.megginson.com/SAX/ Thanks and congratulations to everyone, David Megginson -- David Megginson ak117@freenet.carleton.ca Microstar Software Ltd. dmeggins@microstar.com http://home.sprynet.com/sprynet/dmeggins/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From akitoshi.yoshida at sap-ag.de Tue May 12 10:45:45 1998 From: akitoshi.yoshida at sap-ag.de (Akitoshi Yoshida) Date: Mon Jun 7 17:01:16 2004 Subject: Less verbose XML (Was: A little wish for sh Message-ID: <199805120831.KAA27617@hs2114.wdf.sap-ag.de> standard text compression algorithms can generally compress files with short end-tags than those with full end-tags because these standard algorithms can't build the right source model for XML (in practice). to increase the compression factor, one should model the input source more accurately using the XML specs and the DTD of the input file. but in this case, encoding/decoding needs more work. regards, aki yoshida -----Original Message----- From: Andrew Layman This is not to debate the value of short end tags. That's been debated. But just as a technical matter, I also ran some tests a while ago, and found that files with short end tags compress 5 to 10 percent better than those with full end tags. I believe the reason is that with full end tags there are more unique pairs of end/start, while with short end tags there are only as many end/start pairs as types of start tag. So a compression scheme that first shortened end tags, then applied a standard compression would be more efficient size-wise. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rajesh_n1 at verifone.com Tue May 12 12:11:14 1998 From: rajesh_n1 at verifone.com (Rajesh N) Date: Mon Jun 7 17:01:16 2004 Subject: Qn. reg. Expat parser Message-ID: <3.0.5.32.19980512154051.007c5550@blr-nt-mail1.verifone.com> Hi all, Does the James Clark's expat parser use external DocType declarations , what i mean is something like this Does it do parameter entity substitution if it does and also General Entity substitution?? So far i have been unable to observe this using the expat parser!!! Any help ? Thanks, Rajesh N. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From robin at ACADCOMP.SIL.ORG Tue May 12 17:02:15 1998 From: robin at ACADCOMP.SIL.ORG (Robin Cover) Date: Mon Jun 7 17:01:17 2004 Subject: SGML/XML and literate programming Message-ID: <199805121509.KAA05497@ACADCOMP.SIL.ORG> Today I created a document entitled "SGML/XML and Literate Programming" as part of the SGML/XML Web Page. It is intended to be a collection of references to literature programming (publications and ongoing work) within the context of descriptive markup language applications. See: http://www.sil.org/sgml/xmlLitProg.html and the rationale for this new section, in the "What's New" page, http://www.sil.org/sgml/sgmlnew.html Readers subscribed to this list are invited to contact me via email to report on other candidate URLs or bibliographic references. I have not launched a major investigation to ascertain what activity might now be underway, or what references to previous work might be available. So: your comments/corrections/additions are welcome. Best wishes, Robin Cover ------------------------------------------------------------------------- Robin Cover Email: robin@acadcomp.sil.org 6634 Sarah Drive Dallas, TX 75236 USA >>> The SGML/XML Web Page <<< Tel: +1 (972) 296-1783 (h) http://www.sil.org/sgml/sgml.html Tel: +1 (972) 708-7346 (w) FAX: +1 (972) 708-7380 ========================================================================= xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From James.Anderson at mecom.mixx.de Tue May 12 17:24:34 1998 From: James.Anderson at mecom.mixx.de (james anderson) Date: Mon Jun 7 17:01:17 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: Message-ID: <35586A0E.4A785960@mecom.mixx.de> maybe i've been writing lisp too long, and i've even come to trust my text editor's ability to balance parens, but even before the days of emacs and co., reading (and indenting, if need be) for balancing became second nature. i won't repeat paul graham's argument from "on lisp" here, but it came down to "you end up reading shapes, not parentheses. if you get to need to count the parentheses, the cause is lost anyway..." besides, the editors i've seen for c and java these days not only blink braces, they even display them in pretty colors. shouldn't one expect the same of xml editors - to the extent that they're not wysiwyg? Bryan Gilbert wrote: > > Yes don't forget that XML is meant > for Humans too. Short end tags are a pain > when there is lots of intermediate text. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From James.Anderson at mecom.mixx.de Tue May 12 17:26:17 1998 From: James.Anderson at mecom.mixx.de (james anderson) Date: Mon Jun 7 17:01:17 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> Message-ID: <35586A2C.C77EF45E@mecom.mixx.de> although i agree with mr prescod's intent here, the details are not quite complete Paul Prescod wrote: > > Marcus Carr wrote: > > > > I'd be interested to hear how you would justify allowing short end tags and not > > short start tags; > > Simple: short end tags are trivial to understand, trivial to parse, > non-obfuscatory and context-free. They also have plenty of precedent in ----------------------^? > many other languages. Short start tags behave differently depending on > context, and thus are not trivial to understand, are not conducive to good > document maintenance and are not implemented in any non-SGML language I > know of. > they both depend on state. and although i would suggest, off the top of my head, that the amount of state required for short end tags is less than that required for short start tags, that all depends on the storage semantics implicit in ones xml parser/processor. in my case, this relation holds and there would be two clear steps involved- one notably larger than the other and a slippery slope by no means. on the hand, given the absence of such a semantics for xml in general, any consideration of the implications of minimization find themselves not on a slipery slope but rather in an amorphous swamp... bye for now, xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bsteele at tdiinc.com Tue May 12 19:04:22 1998 From: bsteele at tdiinc.com (Bob Steele) Date: Mon Jun 7 17:01:17 2004 Subject: Less verbose XML References: <3.0.2.32.19980511160727.00724810@pop1.sympatico.ca> <199805120047.UAA00374@unready.microstar.com> Message-ID: <355880D2.94F60EA@tdiinc.com> David Megginson wrote: > Here are the same files compressed with gzip -9 (savings are relative > to the uncompressed version with full end tags): > > With full end tags: 994,835 bytes (74.4% saving) > With short end tags: 988,976 bytes (74.5% saving) I suggest you also experiment with DTD's designed for the data transfer of objects. With tall inheritance hierarchies in my samples, my generated XML documents grow very large and very repetitive. This is exactly what compression algorithms such as LZW are designed to eliminate. I have found compressions as high as 700%. This does not mean that the resulting compressed documents are small but it does help make practical the transfer of data in XML. Note: The size of the document is influenced by my choice of element over XML attribute encoding for simple types. This was done for symmetry. bob -- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From michael at textscience.com Tue May 12 20:58:51 1998 From: michael at textscience.com (Michael Leventhal) Date: Mon Jun 7 17:01:17 2004 Subject: Announcement of a book for XML developers References: <3.0.5.32.19970326203913.008ff790@192.168.1.1> Message-ID: <35589860.95ED3C17@shell1.aimnet.com> We have written a book directed to XML developers. A description of it and the table of contents can be found below. The book is available at Amazon and should be appearing in bookstores. It will also be available in the GCA bookstore at the SGML/XML conference in Paris next week. Michael Leventhal ----------------------------------------------------------------------- Designing XML Internet Applications Michael Leventhal, David Lewis, Matthew Fuchs with contributions from Stuart Culshaw and Gene Kan 582 pages, CD-ROM included. Published by Prentice Hall PTR in the Charles F. Goldfarb Series on Open Information Management, ISBN 0-13-616822-1. "Designing XML Internet Applications" is divided into five parts. In the first part we will introduce you to the XML universe. Here you will find a discussion of the role of XML in the internet and a quick-start on the XML recommendation and XML tools. We don't assume prior knowledge of either XML or SGML but our task here is not to provide an extended tutorial or reference on the language syntax. What we do do is develop the perspective of the XML internet application designer and provide any background that is needed to comprehend the subsequent chapters. The next three parts consist of a series of projects using XML in actual internet applications. Working through the projects the reader will gain concrete experience in the design of XML applications, DTDs, and programming. We also delve into standards related to XML and the internet wherever relevant. The first project spans five chapters as the construction of several types of components is involved including a bulletin board, forms processing tools, a search engine, and transformation filters. Most of the work is done in Perl and the approach is less rigorous than that used in subsequent projects. Our intention here is to introduce XML programming in the most simple and "exposed" form possible. We have chosen to use Perl in this first part for various reasons. It is the closest thing we know of to a lingua franca for internet programmers, it is extremely compact allowing us to construct complete examples in relatively few lines of code, and, most significantly, Perl is the most versatile XML scripting language. The second project implements SGML/XML email and digs into the topics of entity management, catalogs, MIME, and full- scale SGML/XML parsing. Code is presented in Perl and C++. Lest the reader think we are Perl bigots the third project plunges us into Java and XML, building an application based on the Document Object Model and making use of a Java XML parser API. Java is the language in which most of the new XML internet infrastructure is being built. The fifth and final section of the book takes a rigorous, formal look at the role of XML in software architectures and agents based on the paradigm of negotiation. Full source code for all the projects has been included on the CD-OM as have all the public domain tools used in the book. TABLE OF CONTENTS PART I Internets, XML, and Tools Chapter 1 Internets 1.1 Introduction 1.2 Why XML 1.3 Structure of the Book 1.4 Let's Talk: Internets are for Communicating 1.5 The Velocity of Information 1.6 Into the Smart Network 1.7 Current Approaches-Can the Web Help? HTML - Java 1.8 Where the Web Needs Help 1.9 Beyond the Traditional Document 1.10 Toward the Active Document 1.11 Down to the Nitty-Gritty 1.12 What Do We Do with Documents? 1.13 DTDs and Content Specifications-A Short Excursion Search - Retrieve - Store - Send/Receive - Import/Export - Type Transformation - Hyperlinking - Compare - Interpret - Define - Create 1.14 Conclusion Chapter 2 XML, Data, and Documents 2.1 XML-What It Is, What It Does, SGML Ancestry SGML Essential Description - The XML Subset and HTML - How XML Simplifies SGML - Valid versus Well-Formed XML 2.2 XML and Data-Driven Architecture 2.3 XML and Documents Using XML to Deliver Large and Complex Documents Efficiently - Taming the Chaos - Production of Multiple Information Products from a Single Source - Reuse and Preservation - Information Interchange Standards - Cost of Production - Safety, Regulatory, and Other Legal Documentation Requirements, Advanced Hypertext, Collaborative Authoring?, Advanced Information Management-Connecting Databases Chapter 3 XML and SGML Tools 3.1 Tool Information Sources The SGML/XML Web Page - The Whirlwind Guide to SGML & XML Tools and Vendors - SGML Buyer's Guide 3.2 Evolution of SGML and XML Tools 3.3 Software Parsers - Programming Languages - Browsers - Search Engines - Document and Component Management - Authoring - Conversion, Capture, and Export of XML - DTD Design Tools - Down Conversion from XML - HyTime 3.4 DTDs as Development Resources Part II Perl and XML Chapter 4 The Desperate Perl Hacker and Internet Applications: Overview 4.1 Apropos of Perl and the Desperate Perl Hacker Java Versus Perl - Perl and XML Compliance 4.2 System Components Applications - Functionality - Software 4.3 System Operation Chapter 5 An XML Bulletin Board 5.1 Overview 5.2 XML Document Types Messages - Password Docuemnt - User State Documents 5.3 Reading and Writing XML in the Bulletin Board Writing Messages - Reading Messages - Password and User State Documents - Transformation from XML to HTML Chapter 6 An XML Contact Database 6.1 Overview 6.2 XML Data Formats 6.3 Reading and Writing XML in the Customer Database XMLForms - Using an XML Editor and CSS to Create Contact Database Records - Chapter 7 Structure-based Search 7.1 Overview: Structure- and Property-Driven Search Search Tools in Internet Applications 7.2 sgrep's Query Language A Web Interface to sgrep Chapter 8 Type Transformation, Import, and Export 8.1 Overview Import, Type Transformation, Export 8.2 Approaches to Transformation Event Stream, Groves, DSSSL/XSL Transformation 8.3 Event Stream Transformation with Perl Core Routine - Element-in-Context Subroutines? - Generation of Subroutine Stubs - Bulletin Board Type Transformation - Contact Database Type Transformation - Bulletin Board Export to RTF - Delayed Output and Forward References Part III XML/SGML E-mail Chapter 9 XML E-mail 9.1 Overview 9.2 Why XML/SGML is Hard to E-mail 9.3 Entity Catalogs Entity Catalog Structure - Catalog Entry Syntax - Building an E-mail Message from a Catalog 9.4 MIME Parts of a MIME message - Handling MIME Messages 9.5 Building the SGMaiL Agent 9.6 The Sending Agent Modifying SP for "createCatalog" 9.7 Parsing the Catalog and Creating the E-mail Message 9.8 The Receiving Agent 9.9 Conclusion Part IV XML and Java - Parsers and APIs Chapter 10 XML Parsers and Application Programmer Interfaces 10.1 Introduction 10.2 Parser Capabilities and Applications Well-formedness and Validity Verification - Document Instance Decomposition - Parser Applications 10.3 XML Parser Interfaces Command-Line and ESIS Event Stream Interfaces - Event Callback Interfaces - Object Model Interfaces 10.4 Sample XML Parsers NXP XML Parser - AElfred XML Parser 10.5 The SAX Event Callback API 10.6 The W3C Document Object Model andAPI Support and Implementation - Acquiring Specifications - Overview of the W3C's DOM Level-1 Interface 10.7 Sample DOM Implementation DOM Interface Definition - DOM Interface Implementations - Intergrating the DOM Implementation - The XSpecViser Application 10.8 Chapter Summary Part V Future - Agents and all that Chapter 11 Input Gathering and Negotiation using XML 11.1 Introduction The Ubiquitous Problem of Input Gathering - Input Gathering and Negotiation - Negotiation Processes from 20,000 Feet 11.2 Negotiation and Language-Agent Architectures Negotiation Problem Specification - Specification Problems and Language-Agent Architecture - Optimal Specification Environments 11.3 Description of Negotiation Problems Negotiation Problem Output and Structure - Negotiation Problem Output-Agreements - Recurring Negotiation Problems - Output Specification Languages - Constraints on Valid Agreement - Practical Enforcement of Output Constraints - Examples from Current Systems Practice 11.4 Manufacturing Negotiator Behavior Overview of Information Used by a Negotiator - Introductory and Ancillary Information - Information about Negotiable Variables - Negotiation Structure - Output Specification Language and Instance Generation 11.5 Chapter Summary xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Tue May 12 21:05:19 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:17 2004 Subject: Announcing SAXON Message-ID: <001b01bd7dd9$0337b680$0c11e391@mhklaptop.bra01.icl.co.uk> With SAX now stable, I feel ready to expose SAXON to the world. SAXON is a Java package that provides a layer of services on top of SAX. These services include: * support for specific element handlers for each element type, allowing your application to be more modular * providing context information about each element, saving your application from having to maintain this itself * management of multiple output streams linked to particular elements in the input document, useful when you are splitting an XML document or loading the data into a database * an ActiveX interface to make it easy to do an HTML rendition of an XML document from server-side VBScript code on an ASP page I have used SAXON to do a variety of XML->XML and XML->HTML transformations. It includes a number of standard element handlers that are convenient for doing common HTML renditions as well as more advanced functions such as auto-numbering and sorting. More details, and downloadable software, on http://home.iclweb.com/icl2/mhkay/saxon.html Why "SAXON"? Well, because it's a layer on top of SAX. (And originally it was closely linked to AElfred)... Have fun. Mike Kay, ICL M.H.Kay@eng.icl.co.uk xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alain at cabinfo.com Tue May 12 23:24:22 1998 From: alain at cabinfo.com (Alain DESEINE) Date: Mon Jun 7 17:01:17 2004 Subject: innovation Partners and CEI announce IRIS XML EDITOR beta 1 Message-ID: <3558BD6E.3D86@cabinfo.com> innovation Partners and CEI announce IRIS XML EDITOR beta 1 The first release (beta 1) of IRIS XML EDITOR is now available for download. This first beta allow the user to create XML projects based on a DTD generated (saved) with our IRIS XML DTD GENERATOR software. Futures version will permit to open DTDs that was not saved with IRIS XML DTD GENERATOR. So, if you have provide additionnal informations (like Hint, Description, Toolbar, and icon) in IRIS XML DTD GENERATOR, you'll get in IRIS XML EDITOR dynamics toolbar that contains buttons which can insert easily elements you describe in your DTD. You'll get dynamic element wizzards too, if you provide some attributes for elements. Obviously, IRIS XML EDITOR is a full XML oriented editor with many improved editing functionnalities like : - Toggle between Upper and lower case with shift + F3 - Glossaries - Full text search and replace - Intelligent inserting of element - multi pages editing windows to edit large collections of files - etc. You can download IRIS XML EDITOR from the following URL. http://www.cabinfo.com/download.htm We currently working on improving support of other DTDs (those not saved with IRIS XML DTD GENERATOR, please see the readme.txt file in the distribution for more informations on how to use others DTDs). We are also working on an internal XSL engine for visualisation purpose. All this work will appear in the next beta version (beta 2 or beta 3). Well known in FRANCE for his HTML EDITOR ODYSSEE ( see http://www.mondial-telecom.com/odyssee ), Innovation Partners has start to think about XML many months ago. This thought tell us to develop 3 XML software. The first one, IRIS XML DTD GENERATOR, is a tool for building XML DTDs. The second is IRIS XML EDITOR for editing XML files. And the third one is a XSL stylesheet EDITOR for presenting and processing XML files. These 3 softwares can naturally work together, building a complete sofware suite for XML editing. For limitations, supported functionnalities, and bugs of beta versions, please see the readme file of the downloaded software. Now lets have a look to the DTD functionnalities. IRIS XML DTD GENERATOR - Editing existing DTDs - Creating DTDs - Logical and Text view mode for editing DTDs - Wizzard for creating ELEMENTS both in text an logical view - Wizzard for creating ATTRIBUTES both in text an logical view - Wizzard for creating ENTITIES both in text an logical view - Wizzard for creating COMMENTS both in text an logical view - Support of additionnals informations for IRIS XML EDITOR (saved as comment in the DTD) - Generation of XML DATA schemas - Find an replace both in logical and text view - Wizzard for creating xml standard attributes (like xml:lang for example) - support of XLL linking attributes - Glossaries - Printing a DTD - multi language software (currently French and English only - You can change the language in the file menu) - Automatic save capabilities - backup on save capabilities - MDI environment NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. IRIS XML EDITOR - Creating XML files according to a DTD generated by IRIS XML DTD GENERATOR, or generated by something else. - Project notion for managing XML file collections (like in the ODYSSEE HTML EDITOR). - glossaries - Dynamic toolbars with generated icons that can insert ELEMENTS, ATTRIBUTES, or ENTITIES you define in your DTD - Dynamic Wizzard generated automaticly for ELEMENTS with attributes, that ask user for informations. - Text view for editing XML files - Hierachical view for for editing the XML file (the tree object flow view) - Visualisation view for seeing the rendering of the XML file : - If an XSL file is specified in your XML document these one is use to generate the visualisation view. - if none is provide, a minimal built in style sheet is use to render your document. - Automatic save capabilities - backup on save capabilities - XML document printing - XML Visualisation view printing - FTP publication of XML files - Specific Wizzard adding capabilities through an Standardize DLL API (like in ODYSSEE HTML EDITOR) - Friend application capabilities - multi language software (currently French and English only - You can change the language in the file menu) - Find an replace both in tree and text view - Insertion of XML Standard Attributes (like xml:lang) NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. IRIS XSL STYLER - Creating XSL stylesheet - full support of XSL proposal and future ones. - glossaries - Text view for editing XSL files - Hierachical view for for editing the XSL rules - Import a DTD to re-use its elements, attributes, etc. in the style sheet - Visualisation view for seeing the rendering produce with the XSL file : - You can provide your own XML file to see the rendering. - if none is provide, a minimal built in XML file based on the target-elements is used. - Automatic save capabilities - backup on save capabilities - XSL document printing - XSL Visualisation view printing - Specific Wizzard adding capabilities through an Standardize DLL API (like in ODYSSEE HTML EDITOR) - Friend application capabilities - multi language software (currently French and English only - You can change the language in the file menu) - Find an replace both in tree and text view NOTE : All of these functionnalities are not implemented in the beta 1 release. This list describe what the software can be for the version 1.00 release. Some of these functionnalities will perharps disappear for the final release. Currently, only the IRIS XML DTD GENERATOR beta 1 and IRIS XML EDITOR beta 1 are available. IRIS XSL STYLER beta 1 will be available in June. All theses software are commercial, so they are copyright. The beta version of these software are freely usable for evaluation purpose, bugs report, and functionnality discussions. Once the final release (Version 1.00) launch up, you ***MUST*** register the software if you want to use it. For Bugs reports, Functionnalities discussion, technical questions, XML and XSL discussion, you can use the DTD GENERATOR mailling list, for details about the mailling list, please see the readme file include in the beta 1 distribution. You can download the Beta 1 distribution of IRIS XML EDITOR at : http://www.cabinfo.com/download.htm Paris the May 4th 1998 Alain DESEINE, Technical contact. ******************************************************************** Innovation Partners Commercial contact : M. Emmanuel CHAMBON innovationpartners@hol.fr Technical contact : M. Alain DESEINE alain@cabinfo.com http://www.mondial-telecom.com/odyssee http://www.cabinfo.com Commercial department 148, rue Boucicaut 92260 Fontenay-aux-Roses Tel : 33 01 43 50 95 25 or 33 06 09 80 10 16 (GSM) Postal Address : 9, bis rue des Besnards 92260 FONTENAY AUX ROSES FRANCE Tel : 33 01 41 41 00 89 or 33 01 41 41 91 91 or 33 06 09 80 10 16 (GSM) Fax : 33 01 41 41 02 34 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Tue May 12 23:46:31 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:17 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) In-Reply-To: <355775A9.9358FC20@allette.com.au> References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <3556370D.FC18BF6E@technologist.com> <3556B982.2A0C2B68@allette.com.au> <3556E683.EC0BE1F1@technologist.com> Message-ID: <3.0.1.16.19980512195759.09575568@pop3.demon.co.uk> At 08:03 12/05/98 +1000, [lots of people] wrote: [...lots of stuff about how nice it would/wouldn't be to have short end, short start-tags, other revisions to the spec, etc... all snipped...] Just in case any newcomer to XML-DEV is confused, short end-tags, short start-tags, no tags at all, etc. are illegal in XML V1.0 and all conforming parsers will throw well-formedness errors. There is - as far as I know - no formal or informal note being taken of these discussions for any future revision of the spec, whether or not such revision exists. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Tue May 12 23:51:58 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:17 2004 Subject: LISTRIVIA (Re: XML parser in C++) In-Reply-To: <03d701bd7ced$502901c0$06001cac@mike> Message-ID: <3.0.1.16.19980512195111.09076a24@pop3.demon.co.uk> At 09:58 11/05/98 -0500, [an XML-DEV member] wrote: [... helpful message snipped ...] > >Attachment Converted: "c:\eudora\attach\XMLparse.htm" It's not a good idea to attach things to XML-DEV postings - they take up bandwidth (costs me money) and eat up my disk space. Also they do terrible things to the hypermail. I don't know if this was the problem, but my browser went a horrid shade of turquoise on reading this message. (It didn't actually throw a Draconian error, but I think it came close). P. > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Tue May 12 23:52:38 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:17 2004 Subject: freewares.. In-Reply-To: <3.0.5.32.19980511194649.00808cf0@blr-nt-mail1.verifone.com > Message-ID: <3.0.1.16.19980512194742.09078402@pop3.demon.co.uk> At 19:46 11/05/98 +0500, Rajesh N wrote: >Hi all, > could anyone give me some pointers regarding the availability >of any freewares with/without source code for the following.. Rajesh, There are now a *lot* or very good resources describing XML software - try Robin Cover's XML page to start (www.sil.org/sgml/xml.html) It will give you a better idea than getting random (or null) replies here and has *all* the pointers to other resources. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Tue May 12 23:57:08 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:17 2004 Subject: Announcing SAXON In-Reply-To: <001b01bd7dd9$0337b680$0c11e391@mhklaptop.bra01.icl.co.uk> Message-ID: <3.0.1.16.19980512200245.0957c554@pop3.demon.co.uk> At 20:06 12/05/98 +0100, Michael Kay wrote: >With SAX now stable, I feel ready to expose SAXON to the >world. [... details snipped...] > >Mike Kay, ICL >M.H.Kay@eng.icl.co.uk Thanks very much Mike - looks great fun - haven't downloaded it yet. This interoperability of tools developed in a virtual environment is exactly our original vision of one way that XML-DEV might work. It is extremely gratifying to see it happen. > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jeremy at omsys.com Wed May 13 03:15:52 1998 From: jeremy at omsys.com (Jeremy H. Griffith) Date: Mon Jun 7 17:01:17 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) In-Reply-To: <35586A0E.4A785960@mecom.mixx.de> References: <35586A0E.4A785960@mecom.mixx.de> Message-ID: <355ff2c4.244161315@mail.together.net> On Tue, 12 May 1998 17:26:09 +0200, james anderson wrote: >maybe i've been writing lisp too long, and i've even come to trust my text >editor's ability to balance parens, but even before the days of emacs and co., >reading (and indenting, if need be) for balancing became second nature. >i won't repeat paul graham's argument from "on lisp" here, but it came down to >"you end up reading shapes, not parentheses. if you get to need to count the >parentheses, the cause is lost anyway..." >besides, the editors i've seen for c and java these days not only blink >braces, they even display them in pretty colors. >shouldn't one expect the same of xml editors - to the extent that they're not wysiwyg? Wouldn't that be nice! Of course, asking an editor to balance based on string comparison of the tag names isn't quite as easy as matching parens... Too bad XML was defined like this: ... instead of this: < ... > or even this: < ... > which would work with every programming editor around, right now... --Jeremy H. Griffith http://www.omsys.com/jeremy/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From h.rzepa at ic.ac.uk Wed May 13 09:30:01 1998 From: h.rzepa at ic.ac.uk (Rzepa, Henry) Date: Mon Jun 7 17:01:17 2004 Subject: LISTRIVIA (Re: XML parser in C++) In-Reply-To: <3.0.1.16.19980512195111.09076a24@pop3.demon.co.uk> References: <03d701bd7ced$502901c0$06001cac@mike> Message-ID: >At 09:58 11/05/98 -0500, [an XML-DEV member] wrote: > >[... helpful message snipped ...] >> >>Attachment Converted: "c:\eudora\attach\XMLparse.htm" > >It's not a good idea to attach things to XML-DEV postings - they take up >bandwidth (costs me money) and eat up my disk space. Also they do terrible >things to the hypermail. I don't know if this was the problem, but my >browser went a horrid shade of turquoise on reading this message. (It >didn't actually throw a Draconian error, but I think it came close). Attachments (and v-cards) are normally bad form for lists. Majordomo does handle the attachment, but Hypermail (ancient as it is) does not. More modern combinations might make a better job, but I would not recommend using this list to send attachments (not modem friendly for one). If there is a real need to exchange such things, probably the easiest solution is to mount your document on a server (ftp, http, etc) somewhere, and just send the list the URL. Dr Henry Rzepa, Dept. Chemistry, Imperial College, LONDON SW7 2AY; mailto:rzepa@ic.ac.uk; Tel (44) 171 594 5774; Fax: (44) 171 594 5804. URL: http://www.ch.ic.ac.uk/rzepa/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 13 13:53:59 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:17 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <3.0.32.19980508164517.00766e2c@postoffice.swbell.net> <3554E2E6.6E13B796@allette.com.au> <3554FCF0.D08AB8FB@technologist.com> <35562374.DF2B9B4A@allette.com.au> <199805110611.CAA00265@unready.microstar.com> Message-ID: <3559889A.C7C59C7B@technologist.com> David Megginson wrote: > > Paul Prescod has quite rightly objected to a simplistic slippery-slope > argument against short end tags. I have thought of what might be a > stronger argument: as an (unstated?) design principle, XML provides > exactly one alternative for nearly every markup item. I think that you are correct that this is a stronger point. But XML does allow alternative encodings. Consider defaulted attributes. Or vs. . Consider < vs. < . vs. . Each of these was consciously added to XML as a usability feature. Nevertheless all of them cause implementors more work. But in my mind, not one of them has the cost to implementors vs. benefit to users ratio of short end tags. Tim is right that the critical issue is the ease of processing of XML software by "stupid" (regexp-based) software. He feels that this situation has changed since we debated it. I don't agree. I thought that it was evident then that there would be parsers for XML for Perl, Python and any other regular-expression friendly. At the very least perlsgml, sgmlspl and sgmllib would always exist. Nevertheless the situation has changed in some ways. XML is now widely hyped and destined for success. It is far more popular with implementors than with users. This is the right time to try to rebalance the scales so that XML becomes a megahit and not a hacker favourite. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 13 13:55:09 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:18 2004 Subject: A little wish for short end tags (Was: RE: SDD bogus) References: <35586A0E.4A785960@mecom.mixx.de> Message-ID: <35598924.AC686153@technologist.com> > Bryan Gilbert wrote: > > > > Yes don't forget that XML is meant > > for Humans too. Short end tags are a pain > > when there is lots of intermediate text. james anderson wrote: > > maybe i've been writing lisp too long, and i've even come to trust my text > editor's ability to balance parens, but even before the days of emacs and co., > reading (and indenting, if need be) for balancing became second nature. I think that even more important than this argument is that nobody is proposing mandatory short end tags. XML with optional short end tags offers the advantages of languages with uniform, short end markers but also allows you to "be redundant" where that will help. I've proposed in the past that full SGML should take optional redundancy farther to allow something like this:
....
> they both depend on state. and although i would suggest, off the top > of my head, that the amount of state required for short end tags is > less than that required for short start tags, that all depends on > the storage semantics implicit in ones xml parser/processor. I do not believe that there is any way ot implementat a legal XML parser without keeping around all of the information required to implement short end tags. Checking that an end-tag matches its start-tag (the current situation) is no easier than not checking. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From alaly at inlink.com Wed May 13 14:27:42 1998 From: alaly at inlink.com (Michael Alaly) Date: Mon Jun 7 17:01:18 2004 Subject: No subject Message-ID: <023701bd7e6a$6f5de6e0$06001cac@mike> I am new to the list, but I have followed the short tag discussion for about a week now. I am in agreement with those who believe it is not necessary. To respond to Paul's argument about 'optional' XML standards, I will quote the W3C's XML recommendation (uh oh!) "4. It shall be easy to write programs which process XML documents. 5. The number of optional features in XML is to be kept to the absolute minimum, ideally zero." Page 4. I believe that human readability and ease of use is more important than saving the few K that short tags could offer (in some cases more). XML should be as standardized as possible and as readable as possible. Optional short tags would undermine both of those characteristics which make XML the future. My two cents, Michael Alaly alaly@inlink.com Paul Prescod ---------------------------------------------------------------------------------------------------------------------- I think that even more important than this argument is that nobody is proposing mandatory short end tags. XML with optional short end tags offers the advantages of languages with uniform, short end markers but also allows you to "be redundant" where that will help. I've proposed in the past that full SGML should take optional redundancy farther to allow something like this:
....
------------------------------------------------------------------------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980513/e218ef97/attachment.htm From alaly at inlink.com Wed May 13 14:46:36 1998 From: alaly at inlink.com (Michael Alaly) Date: Mon Jun 7 17:01:18 2004 Subject: Short Tags Message-ID: <024b01bd7e6d$728b2aa0$06001cac@mike> Just a small note I forgot to add in my last message: An XML parser does have all of the information to be able to process a document with short tags if you don't care about debugging. However, if I have a document with 10 000 start tags and with 9 999 end tags, the only information a parser can give is that the document isn't valid. This is because it won't know which elements are complete and which are still open. If it made some futile attempt to detect by checking with the DTD and debugging that way, it still isn't exact. Whereas, using long tags the parser can search for exactly which tag isn't closed and output its starting line position. This makes debugging cause a lot less headaches. Another part of my two cents, Cheers, Michael Alaly alaly@inlink.com | 314.878.6474 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980513/81b90df1/attachment.htm From amitr at abinfosys.com Wed May 13 15:18:08 1998 From: amitr at abinfosys.com (Amit Rekhi) Date: Mon Jun 7 17:01:18 2004 Subject: Calling external DTD from MSXML Message-ID: <3.0.5.32.19980513185226.00963dd0@192.168.1.1> Hello All, I am trying to validate my XML files with their corresponding DTD's using MS validating XML parser in Java but the MSXML is throwing an exception :- Could Not find External DTD. Let me explain the scenario:- The XML file (Books.xml) I am trying to validate starts with and it's corresponding DTD (Books.dtd) starts with and ) Yet when I start to load the Books.xml file using the MSXML validating parser the following exception is thrown :- "Error Loading XML Document 'Books.xml' com.xml.ms.xml.parser ParseException: Could Not Find External DTD 'Books.dtd'" (The Books.xml and Books.dtd files are in the same folder) How to correct this problem??Any Suggestions??? Regards, AMIT xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Wed May 13 16:15:08 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:18 2004 Subject: LISTRIVIA In-Reply-To: <023701bd7e6a$6f5de6e0$06001cac@mike> Message-ID: <3.0.1.16.19980513150632.6f27bae4@pop3.demon.co.uk> At 07:27 13/05/98 -0500, Michael Alaly wrote: >I am new to the list [... subject matter snipped ...] I have recently posted about etiquette on this list - maybe some people have missed it - under the portmanteau heading 'LISTRIVIA'. Henry has also added his comments. Among the fairly few aspects of discipline we like people to keep are: - please include a subject line "Re:" is not helpful and may cause indexing problems - please keep meaningful thread subjects > >Attachment Converted: "c:\eudora\attach\Untitled.htm" PLEASE do not add attachments of any kind. It costs me money that I can't afford. It annoys other list members. It fouls up the hypermail. P. > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Wed May 13 16:20:31 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:18 2004 Subject: LISTRIVIA (was Re: Short Tags) In-Reply-To: <024b01bd7e6d$728b2aa0$06001cac@mike> Message-ID: <3.0.1.16.19980513151007.6bb74440@pop3.demon.co.uk> At 07:48 13/05/98 -0500, Michael Alaly wrote: >Just a small note I forgot to add in my last message: > >Attachment Converted: "c:\eudora\attach\ShortTag.htm" And just an attachment you forgot to omit from this one :-) I am afraid I'm quite persistent. P. > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From yosr.hmaied at inria.fr Wed May 13 16:39:15 1998 From: yosr.hmaied at inria.fr (Yosr HMAIED) Date: Mon Jun 7 17:01:18 2004 Subject: Calling external DTD from MSXML References: <3.0.5.32.19980513185226.00963dd0@192.168.1.1> Message-ID: <3559B067.4EE6@inria.fr> Amit Rekhi wrote: > > Hello All, > I am trying to validate my XML files with their corresponding DTD's > using MS validating XML parser in Java but the MSXML is throwing an > exception :- Could Not find External DTD. > may be it's problem of incoherence between the file name book.dtd declared in book.xml and it's corresponding Book.dtd ( because in UNIX book.dtd and Book.dtd are different) -- Yosr HMAIED mailto:yosr.hmaied@inria.fr tel :01 39 63 51 71 Projet Rodin INRIA Rocquencourt xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From schampeo at hesketh.com Wed May 13 16:51:24 1998 From: schampeo at hesketh.com (Steven Champeon) Date: Mon Jun 7 17:01:18 2004 Subject: LISTRIVIA (was Re: Short Tags) In-Reply-To: <3.0.1.16.19980513151007.6bb74440@pop3.demon.co.uk> Message-ID: On Wed, 13 May 1998, Peter Murray-Rust wrote: > At 07:48 13/05/98 -0500, Michael Alaly wrote: > >Just a small note I forgot to add in my last message: > > > >Attachment Converted: "c:\eudora\attach\ShortTag.htm" > > And just an attachment you forgot to omit from this one :-) I am afraid I'm > quite persistent. Peter, he's not sending attachments in the usual sense - he's using an HTML emailer, which is sending both text and HTML version of the message to the list - Eudora is reading the text version and saving the HTML version. In pine, all I see are notices of MIME attachments, so I usually just delete them without reading the message at all. Email is about bare bones communication. Michael, get a better mailer (Eudora used to be my favorite, but I've gone back to pine when they started shipping 16MB worth of MSIE/OS upgrades with what used to be a 4MB mail package) or learn how to configure the one you're using so that it doesn't annoy the rest of the world with needless formatting cruft. Thanks, Steve -- Steven Champeon | "Yeah, but come on. Push http://hesketh.com/schampeo/ | is dead. Nobody cares." http://a.jaundicedeye.com | - Tim Bray on CDF xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at ifi.uio.no Wed May 13 19:56:14 1998 From: larsga at ifi.uio.no (Lars Marius Garshol) Date: Mon Jun 7 17:01:18 2004 Subject: Free XML software list Message-ID: I'm pleased to announce that my list of XML software is now available in categorized form with descriptions at For those of you who have made the software on this list: I'm trying out an experimental auto-update feature with this list. If you want me to be automatically notified when you release a new version of your software (or when it moves to a new URL) you can now achieve that by writing 6 lines of XML about your product and putting it online. The necessary auto-update software is already written and tested, so you can start using this immediately. I may eventually make a Java client (or a CGI script) which end-users can run to be informed of new releases of chosen products, but for the moment that's just an idea. Further instructions (with DTD and an example file) are available from -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jackpark at thinkalong.com Wed May 13 20:47:05 1998 From: jackpark at thinkalong.com (Jack Park) Date: Mon Jun 7 17:01:18 2004 Subject: JavaCC Message-ID: Just a curiosity; perhaps this has been discussed before. Has anyone looked at using JavaCC to build a parser that handles XML plus other namespaces (e.g. CML) in one uniform parser? There is an example source for HTML3.2. I'm wondering where that could lead. The work appears to be in mapping dtds into JavaCC (perhaps JJTree) source form. Jack Park xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Jon.Bosak at eng.Sun.COM Wed May 13 21:23:27 1998 From: Jon.Bosak at eng.Sun.COM (Jon Bosak) Date: Mon Jun 7 17:01:18 2004 Subject: WD of XSL requirements released Message-ID: <199805131921.MAA26860@boethius.eng.sun.com> The first working draft of the requirements for XSL has just been made public: http://www.w3.org/TR/1998/WD-XSLReq Jon xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Wed May 13 21:48:09 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:18 2004 Subject: Free XML software list In-Reply-To: Message-ID: <3.0.1.16.19980513194554.5bb7db3e@pop3.demon.co.uk> At 19:54 13/05/98 +0200, Lars Marius Garshol wrote: > >I'm pleased to announce that my list of XML software is now available >in categorized form with descriptions at > > Brilliant - many thanks. It's tremendous to have these resources which are a great help to the 'XML-DEV effort'. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From trafford at interlog.com Wed May 13 21:49:48 1998 From: trafford at interlog.com (Ben Trafford) Date: Mon Jun 7 17:01:18 2004 Subject: WD of XSL requirements released References: <199805131921.MAA26860@boethius.eng.sun.com> Message-ID: <3559FAE0.C491DA17@interlog.com> Jon Bosak wrote: > The first working draft of the requirements for XSL has just been made > public: > > http://www.w3.org/TR/1998/WD-XSLReq Oops! This URL is incorrect. The actual URL is: http://www.w3.org/TR/WD-XSLReq --->Ben Trafford xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Wed May 13 21:54:13 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:18 2004 Subject: Xlink semantics Message-ID: <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> I have been hacking an application (the VHG DTD) using Xlink and I'd like to check on some semantics. Since the spec has contentSpecs of ANY for all three link-types the formal situation is that anything is permitted. Should my software complain at the following examples (notation should be obvious), or should it try to do something clever? ... ... ... ... ... The problem is that all of these throw no error in the parser as they are probably impossible to constrain except in very spartan DTDs. I suspect most are not productive, but some might be valuable on occasions I have or haven't thought of. This is an important occasion that there is a clear requirement for applications to apply semantics to parts of one of the specs. We already have to write an attribute processor and I'm interested in knowing how much additional processing any conforming Xlink software is going to have to do. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Arun_Varshney at intuit.com Wed May 13 23:29:34 1998 From: Arun_Varshney at intuit.com (Arun Varshney) Date: Mon Jun 7 17:01:19 2004 Subject: EDI to XML Message-ID: <002B15AE.1826@intuit.com> I am new to the XML world, but do lot of EDI-VAN based transactions with our trading community & in a process of defining long-term strategy to do these transactions over the Web/internet. I have started hearing that XML is a replacement of EDI over the Web. could anyone suggest me, how XML would be used to replace EDI in terms of what all componenets/tools, steps to be taken in developers & planners point of view. Is that some product available in the market which could be used now etc. any comment would be appreciated. -Arun V. ______________________________ Reply Separator _________________________________ Subject: Free XML software list Author: Lars Marius Garshol at Internet Date: 5/13/98 7:54 PM I'm pleased to announce that my list of XML software is now available in categorized form with descriptions at For those of you who have made the software on this list: I'm trying out an experimental auto-update feature with this list. If you want me to be automatically notified when you release a new version of your software (or when it moves to a new URL) you can now achieve that by writing 6 lines of XML about your product and putting it online. The necessary auto-update software is already written and tested, so you can start using this immediately. I may eventually make a Java client (or a CGI script) which end-users can run to be informed of new releases of chosen products, but for the moment that's just an idea. Further instructions (with DTD and an example file) are available from -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) -------------- next part -------------- Received: from intuit.com (199.2.34.8) by intugate.intuit.com with SMTP (IMA Internet Exchange 2.12 Enterprise) id 002B07BF; Wed, 13 May 98 11:02:05 -0700 Received: by intuit.com; id KAA25305; Wed, 13 May 1998 10:47:15 -0700 (PDT) Received: from bowmore.cc.ic.ac.uk(155.198.5.22) via SMTP by fw2.intuit.com, id smtpd025270; Wed May 13 10:47:06 1998 Received: from majordom by bowmore.cc.ic.ac.uk with smtp (Exim 1.58 #2) id 0yZfll-0004Nm-00; Wed, 13 May 1998 18:56:45 +0100 Received: by ic.ac.uk (bulk_mailer for ic.ac.uk v1.7); Wed, 13 May 1998 18:56:19 +0100 Received: from majordom by bowmore.cc.ic.ac.uk with local (Exim 1.58 #2) id 0yZflG-0004Lo-00; Wed, 13 May 1998 18:56:14 +0100 Received: from punch.ic.ac.uk [155.198.5.17] by bowmore.cc.ic.ac.uk with smtp (Exim 1.58 #2) id 0yZfl8-0004Lf-00; Wed, 13 May 1998 18:56:06 +0100 Received: from ifi.uio.no [129.240.64.2] by punch.ic.ac.uk with esmtp (Exim 1.62 #1) id 0yZfl8-0000fV-00; Wed, 13 May 1998 18:56:06 +0100 Received: from BIRK105.birk105.studby.uio.no (birk105.studby.uio.no [129.240.214.135]) by ifi.uio.no (8.8.8/8.8.7/ifi0.2) with SMTP id TAA21333 for ; Wed, 13 May 1998 19:56:05 +0200 (MET DST) To: xml-dev@ic.ac.uk Subject: Free XML software list From: Lars Marius Garshol Date: Wed, 13 May 1998 18:56:14 +0100 Message-ID: Lines: 29 X-Mailer: Gnus v5.4.9/Emacs 19.34 Sender: owner-xml-dev@ic.ac.uk Precedence: bulk Reply-To: Lars Marius Garshol From alaly at inlink.com Wed May 13 23:59:58 1998 From: alaly at inlink.com (Michael Alaly) Date: Mon Jun 7 17:01:19 2004 Subject: Short Tags and my Email Client Message-ID: <02ff01bd7eba$dd551740$06001cac@mike> I apologize for my previous posts, I am using a Microsoft Outlook Express (uh oh!) and had it configured to send as HTML which obviously was a bad idea. In any case, I fail to see how short tags does anything but reduce the readability of a file. Paul: ---------------------------------------------------------------------------- -------------------- After marking up several megabytes of data by hand over the last few weeks, I have come to the conclusion that this is more readable than the fully tagged equivalent. Of course it is all about context, which is why I think that people would (and do!) want to mix short versions and long versions just as they do in SGML, LaTeX and some programming languages. ---------------------------------------------------------------------------- -------------------- I too mark up many megabytes of documents by hand and I find that for "Readability and ease of use" I much prefer long tags. This is because when faced with:
.... the list goes on for ever I would MUCH prefer to see when a specific tag ends rather than just seeing an end. I often look through files which were marked up in short tag (proprietary format) and I will see 20 's in a row. Which are still open? When debugging a file that was incorrectly tagged and doesn't parse there is NO way to tell which tags are open and which are closed! If I spend 30 mins going through and attempting to find which elements were closed and which ones were missed it is hardly an easy process. Long tags make debugging/proofreading much less laborious and I would contend that most people can read and process a long end tag as quickly, if not more quickly than a short tag. This text is more readable than the next example, and is easier for a human to process and follow. This text is LESS readable than the previousexample, and is easier for a human to process and follow. In the latter case the human has to look back and see which tags are open in order to understand which are being closed. In the former, it is clear which tags are being opened and which are being closed. Once again, sorry for the HTML format I was using, I didn't even realize until Peter pointed it out... a number of times. :) Michael Alaly alaly@inlink.com | 314.878.6474 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Thu May 14 00:04:04 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:19 2004 Subject: WD of XSL requirements released In-Reply-To: <199805131921.MAA26860@boethius.eng.sun.com> Message-ID: <3.0.1.16.19980513230042.5fe7a896@pop3.demon.co.uk> At 12:21 13/05/98 -0700, Jon Bosak wrote: >The first working draft of the requirements for XSL has just been made >public: > > http://www.w3.org/TR/1998/WD-XSLReq-19980511 [amended] This is impressive! There is clearly a great deal of work to be done and we wish the editors and their support group a lot of energy and good humour. Please do *NOT* discuss this document on XML-DEV; I assume that xsl-list will give pointers as to how feedback can be obtained. I can see that in a few cases (e.g. interactivity, objects) discussion on XML-DEV (and, better, prototypes :-) might be useful - but we should wait to be asked in the first instance. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From kent at trl.ibm.co.jp Thu May 14 02:00:53 1998 From: kent at trl.ibm.co.jp (TAMURA Kent) Date: Mon Jun 7 17:01:19 2004 Subject: IBM XML for Java has been updated. Message-ID: <199805132359.IAA15499@ns.trl.ibm.com> XML for Java, an XML processor in Java, has been updated. http://www.alphaworks.ibm.com/formula/xml Changes: 16-Apr-1998 to 13-May-1998 o Updated DOM spec. [16-Apr-1998] o Supports SAX 1.0 o Supports UTF-16 encoding o New factories: com.ibm.xml.parser.util.TreeFactory and com.ibm.xml.parser.util.XHFactory Added new sample: TreeView for TreeFactory o Restructed some methods in TXElement o Subclasses of Child request some jobs to ElementFactory o And fixed some bugs -- TAMURA, Kent @ Tokyo Research Laboratory, IBM Japan xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Thu May 14 07:15:52 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:19 2004 Subject: XMLTest updated Message-ID: <355A75BE.D306F160@jclark.com> I've updated by XMLTest program (http://www.jclark.com/xml/XMLTest.java) to support SAX 1.0. XMLTest uses SAX to generate canonical XML, allowing any Java XML parser with a SAX driver to be tested with any XML test suite that includes canonical XML output for its test cases. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Thu May 14 07:15:58 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:19 2004 Subject: Expat updated Message-ID: <355A7AE8.DAA82112@jclark.com> A new version of expat, my XML parser in C, is now available. See http://www.jclark.com/expat.html for more information. This version adds support for parsing external general entities. There's a new -x option on xmlwf that enables this. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Thu May 14 07:16:08 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:19 2004 Subject: XP 0.3 available Message-ID: <355A76B7.4A73038@jclark.com> XP 0.3 is now available. See http://www.jclark.com/xml/xp/index.html for more information. Apart from bug fixes, the changes in this release are: - Support for SAX 1.0. - Efficient support for large CDATA sections (previously these were not handled very efficiently). - New approach to exceptions in the interface. If you have code using XP 0.2, you can get it to work with XP 0.3 just by changing it to import Application, ApplicationImpl, Parser and ParserImpl from com.jclark.xml.parse.io instead of com.jclark.xml.parse. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From gotou at flab.fujitsu.co.jp Thu May 14 07:24:03 1998 From: gotou at flab.fujitsu.co.jp (Masatomo Goto) Date: Mon Jun 7 17:01:19 2004 Subject: Xlink semantics In-Reply-To: <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> Message-ID: <199805140517.OAA28951@kmailserv.akashi.flab.fujitsu.co.jp> Hello, I'm now working on design and implementation of a XLink facility. My approach is to provide the XLink facility as an engine. At 20:45 98/05/13, Peter Murray-Rust wrote: > I have been hacking an application (the VHG DTD) using Xlink and I'd like > to check on some semantics. Since the spec has contentSpecs of ANY for all > three link-types the formal situation is that anything is permitted. Should > my software complain at the following examples (notation should be > obvious), or should it try to do something clever? In my Xlink engine, these examples are recognized as follows: > > > > > - One extended link which has no or only inline resource (depends on inline attribute value) - Two simple links which are in the extended link content > ... > > > > > > - One extended link which has two or three(inline) resources. - One simple link which is in the extended link content > ... > > > > - One extended link which has one or two (inline) resources. > ... > > > > > > > > > > > - One extended link which has no or only inline resources - two extended links which have two or three (inline) resources in the parent extended link's content > ... > > > > > > ... - no meaning. no processing. > The problem is that all of these throw no error in the parser as they are > probably impossible to constrain except in very spartan DTDs. I suspect > most are not productive, but some might be valuable on occasions I have or > haven't thought of. If the XLink processing facilities are separated from the application, It is possible to throw some errors from the "XLink processor". > This is an important occasion that there is a clear requirement for > applications to apply semantics to parts of one of the specs. We already > have to write an attribute processor and I'm interested in knowing how much > additional processing any conforming Xlink software is going to have to do. FYI, I will give a speach and demonstration about my XLink engine in the HyTime at work session of SGML/XML Europe '98. Best regards, - Masatomo Goto --- Masatomo Goto Information Service Architecture lab. Fujitsu Laboratories Ltd. Tel: +81-78-934-8249 Fax: +81-78-934-3312 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Thu May 14 08:43:56 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:19 2004 Subject: Expat updated References: <355A7AE8.DAA82112@jclark.com> Message-ID: <355A9130.17D3D6F7@jclark.com> James Clark wrote: > > A new version of expat, my XML parser in C, is now available. See > > http://www.jclark.com/expat.html > > for more information. Sorry, that should have been: http://www.jclark.com/xml/expat.html James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Thu May 14 09:16:52 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:19 2004 Subject: XP 0.3 available In-Reply-To: <355A76B7.4A73038@jclark.com> Message-ID: <3.0.1.16.19980514075908.5bb7db9e@pop3.demon.co.uk> At 11:44 14/05/98 +0700, James Clark wrote: >XP 0.3 is now available. See > > http://www.jclark.com/xml/xp/index.html > >for more information. > >Apart from bug fixes, the changes in this release are: > >- Support for SAX 1.0. > Many thanks. Astute readers will have seen that this is just one of three announcements today of updated software from James, and the XML-DEV community owes him an enormous debt for his commitment. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Thu May 14 09:25:36 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:19 2004 Subject: Xlink semantics In-Reply-To: <199805140517.OAA28951@kmailserv.akashi.flab.fujitsu.co.jp> References: <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980514080952.514748b8@pop3.demon.co.uk> At 14:20 14/05/98 +0900, Masatomo Goto wrote: > >Hello, > >I'm now working on design and implementation of a XLink facility. >My approach is to provide the XLink facility as an engine. This is wonderful! From time to time I have suggested such an engine on XML-DEV and it is great to see someone as well advanced as this. Do you plan to make this available? If not, are you able to publish the semantics? [... examples and replies snipped ...] This was very helpful. From your replies, you appear to attach some meaning to all except the last (which I think we would all agree was a semantic error). I'd be very interested to know if other Xlink specialists (including the authors?) take the same view as you. >If the XLink processing facilities are separated from the application, >It is possible to throw some errors from the "XLink processor". I agree with this strategy. I have been campaigning for an XLink processor :-) > >> This is an important occasion that there is a clear requirement for >> applications to apply semantics to parts of one of the specs. We already >> have to write an attribute processor and I'm interested in knowing how much >> additional processing any conforming Xlink software is going to have to do. > >FYI, I will give a speach and demonstration about my XLink engine in >the HyTime at work session of SGML/XML Europe '98. I have recently met a good fairy which means that I shall be at XML98 for part of the time (probably Tuesday - Thursday). It would be nice to meet. P. > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Thu May 14 09:27:53 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:19 2004 Subject: IBM XML for Java has been updated. In-Reply-To: <199805132359.IAA15499@ns.trl.ibm.com> Message-ID: <3.0.1.16.19980514075624.0ac78214@pop3.demon.co.uk> At 08:59 14/05/98 +0900, TAMURA Kent wrote: > >XML for Java, an XML processor in Java, has been updated. > >http://www.alphaworks.ibm.com/formula/xml > > >Changes: 16-Apr-1998 to 13-May-1998 > o Updated DOM spec. [16-Apr-1998] > o Supports SAX 1.0 > o Supports UTF-16 encoding > o New factories: com.ibm.xml.parser.util.TreeFactory > and com.ibm.xml.parser.util.XHFactory > Added new sample: TreeView for TreeFactory > o Restructed some methods in TXElement > o Subclasses of Child request some jobs to ElementFactory > o And fixed some bugs > >-- >TAMURA, Kent @ Tokyo Research Laboratory, IBM Japan > > Many thanks - both for making your software SAX-compliant and for a very well presented posting with exactly the right amount of information. I have not snipped it so that others can see the volume of information that is sufficient for most readers to decide whether they wish to download a README or the whole software. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Thu May 14 09:44:21 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:19 2004 Subject: EDI to XML Message-ID: <199805140737.JAA21449@berlin.dvs1.tu-darmstadt.de> See http://www.geocities.com/WallStreet/Floor/5815/ > I am new to the XML world, but do lot of EDI-VAN based transactions > with our trading community & in a process of defining long-term > strategy to do these transactions over the Web/internet. > > I have started hearing that XML is a replacement of EDI over the Web. > could anyone suggest me, how XML would be used to replace EDI in terms > of what all componenets/tools, steps to be taken in developers & > planners point of view. Is that some product available in the market > which could be used now etc. > > any comment would be appreciated. > > -Arun V. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Thu May 14 12:37:29 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:20 2004 Subject: White space after last element Message-ID: <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> I just tested SAXON with the new version of IBM's xml4j parser: it crashes because SAXON doesn't expect white space to be reported after the end of the outermost element. I'll fix SAXON so it doesn't crash, but it raises a wider point, because we now have several SAX-conformant parsers reporting white space differently, and I'm not sure whether the spec says clearly which of them is right (perhaps they all are). Any comments from the experts? Mike xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tms at ansa.co.uk Thu May 14 13:16:02 1998 From: tms at ansa.co.uk (Toby Speight) Date: Mon Jun 7 17:01:20 2004 Subject: A little wish for short end tags In-Reply-To: Paul Prescod's message of "Wed, 13 May 1998 07:51:00 -0400" References: <35586A0E.4A785960@mecom.mixx.de> <35598924.AC686153@technologist.com> Message-ID: Paul> Paul Prescod Paul> I do not believe that there is any way ot implementat a legal XML Paul> parser without keeping around all of the information required to Paul> implement short end tags. Checking that an end-tag matches its Paul> start-tag (the current situation) is no easier than not checking. But there are plenty of (non-parsing) applications that benefit from XML standard end-tags. An obvious one is selection of an element from a document; a regexp search for the start-tag, and then just match start and end tags *for that element type*, keeping track of depth *for that element type* (we don't even need to do that if the element type is known not to be nestable in itself). That application need not even notice tags for other element types. -- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Thu May 14 13:56:26 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:20 2004 Subject: Parser benchmark result Message-ID: <004e01bd7f2f$758542a0$1e09e391@mhklaptop.bra01.icl.co.uk> At the risk of making myself unpopular by looking gift horses in their mouths, I report the results of running a simple SAX 1.0 application with five different parsers. They were all run using the default SAXON Renderer application (which simply reconstitutes the XML input file supplied) against an XML file containing 291089 bytes, or 11500 elements, each on the same machine (a reasonably powerful Windows NT server, with the SUN Java VM), which was otherwise idle (as far as one can tell). I measured elapsed time by calling java.util.Date#getTime() at the start and end of the run. Each was run twice, I report both results. Elapsed time in milliseconds: AElfred: 8203, 8215 Lark: 10422, 10453 MSXML: 13250, 12250 xml4j: 18125, 18313 xp: 8156, 7907 These were obtained after some performance tuning on SAXON: the issued version takes about twice as long. Of course, there are many attributes to a piece of software other than its raw speed, and the results cannot necessarily be extrapolated to a different application or a different data file. The really good news is that (with the one caveat noted in an earlier message) the application worked unchanged on all five parsers. Mike Kay, ICL xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Thu May 14 14:42:49 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:20 2004 Subject: SAX: White space after last element In-Reply-To: <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> References: <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> Message-ID: <199805141242.IAA00235@unready.microstar.com> Michael Kay writes: > I just tested SAXON with the new version of IBM's xml4j > parser: it crashes because SAXON doesn't expect white space > to be reported after the end of the outermost element. I'll > fix SAXON so it doesn't crash, but it raises a wider point, > because we now have several SAX-conformant parsers reporting > white space differently, and I'm not sure whether the spec > says clearly which of them is right (perhaps they all are). The JavaDoc comment for DocumentHandler.ignorableWhitespace begins with the line Receive notification of ignorable whitespace in element content. ^^^^^^^^^^^^^^^^^^ In other words, the method should be invoked only between the first startElement event and the last endElement event (and even then, only when there is a DTD and the parent element type is declared with element-only content). SAX is very new, and I am very tired, after a difficult week (of which SAX 1.0 was only a tiny part). When I am better rested, I will take some time to test the different SAX implementations and to work with the authors to resolve any problems, or to take suggestions for clarifying the interface. While Java interfaces and JavaDoc comments are useful, they are no substitute for a proper written specification, which I owe to all of you as soon as I can manage it. All the best, David -- David Megginson david@megginson.com http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From amitr at abinfosys.com Thu May 14 14:53:05 1998 From: amitr at abinfosys.com (Amit Rekhi) Date: Mon Jun 7 17:01:20 2004 Subject: Accessing default attributes from DTD Message-ID: <3.0.5.32.19980514145544.00972d20@192.168.1.1> Hello All, I want to access default attributes of elements of an XML file(say A.xml) defined in A.xml's DTD (A.dtd) in an HTML file (say A.html) which invokes the MSXML java parser and loads the A.xml file. Let me explain my scenario The A.dtd contains :- ....... ......... The A.xml file contains :- ..... .... ...... In my A.html file I want to access 's default attribute ATTR-1 (defined in A.dtd) after I parse the A.xml file into a tree using MSXML java parser. How would I do that??? Any suggestions would help, Thanx in Advance, AMIT xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at ifi.uio.no Thu May 14 15:30:55 1998 From: larsga at ifi.uio.no (Lars Marius Garshol) Date: Mon Jun 7 17:01:20 2004 Subject: Accessing default attributes from DTD In-Reply-To: <3.0.5.32.19980514145544.00972d20@192.168.1.1> References: <3.0.5.32.19980514145544.00972d20@192.168.1.1> Message-ID: * Amit Rekhi | | The A.dtd contains :- | ....... | | ......... | | In my A.html file I want to access 's default attribute ATTR-1 | (defined in A.dtd) after I parse the A.xml file into a tree using | MSXML java parser. | | How would I do that??? A conforming XML parser that reads the DTD will behave as if the ATTR-1 attribute had been present on every TEMP element, so you don't need access to the DTD or any other kind of magic. If MSXML doesn't do this I suggest you use a parser that does. You can find a list of parsers at (Also note that XML doesn't have a "NUMBER" attribute type. You'll have to use CDATA.) -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Thu May 14 15:49:45 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:20 2004 Subject: SAX 1.0: Parsers and Applications Message-ID: <199805141349.JAA00505@unready.microstar.com> I've updated the Parsers and Applications page for SAX 1.0, after the flurry of announcements over the past couple of days (most of which only just arrived because of problems with my POP server). You will find the page at the following URL: http://www.megginson.com/SAX/applications.html Note that AElfred does not appear there yet because it has not been updated from SAX 1.0gamma -- only minor changes will be necessary, though. Thanks to everyone for the support. All the best, David -- David Megginson david@megginson.com http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jriedese at jnana.com Thu May 14 16:05:46 1998 From: jriedese at jnana.com (Joel Riedesel) Date: Mon Jun 7 17:01:20 2004 Subject: SAX, DTDs, parsing... In-Reply-To: <199805141349.JAA00505@unready.microstar.com> Message-ID: <005201bd7f41$244a1ac0$0132fac1@ipusushi> Please excuse my ignorance; I'm just starting to dive into code for XML stuff... Would someone direct me to a good overview of SAX? Also, I'm looking for a decent java validating XML parser that lets me get at the DTD for parsed XML files. Any suggestions? As far as I can tell, it looks as the only thing close is Lark or DXP, is that correct? Thanks. --- Joel Riedesel Jnana Technologies Corporation http://www.jnana.com mailto:jriedese@jnana.com 303 805 8275 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From digitome at iol.ie Thu May 14 16:20:57 1998 From: digitome at iol.ie (Sean Mc Grath) Date: Mon Jun 7 17:01:20 2004 Subject: NFF - Notes Flat File Format Initiative + Freely available software Message-ID: <199805141419.PAA09759@GPO.iol.ie> Announcement ============ A New XML Application Initiative : Notes Flat File Format (NFF) NFF is an XML based interchange format for the Lotus Notes/Domino platform. The DTD, basic documentation, sample + software are available for download at : http://www.digitome.com/download.htm NFF is an XML based application intended to act as an interchange format for the Lotus Notes/Domino Publishing platform. The NFF DTD supports the majority of the constructs that can occur in Lotus Notes data such as structured fields, rich text, doclinks, import objects and so on. Once data is in XML conforming to the NFF DTD it can be imported into Lotus Notes using a simple "File-Import". The download package includes the necessary software along with a sample application - Timon Of Athens by William Shakespeare in NFF format. The software is freely available and comes with absolutely no warranty whatsoever. All constructive feedback welcome! Sean Mc Grath http://www.digitome.com/sean.htm County Sligo, Ireland, Tel: +353 96 47391 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Thu May 14 17:20:42 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:20 2004 Subject: SAX, DTDs, parsing... Message-ID: <199805141418.QAA24119@berlin.dvs1.tu-darmstadt.de> MSXML is a validating parser written in Java. You can get at the DTD, but only as XML-Data elements. Of course, they ship the code, so you could probably go straight to their internal structures. -- Ron Bourret > Also, I'm looking for a decent java validating XML parser that > lets me get at the DTD for parsed XML files. Any suggestions? > As far as I can tell, it looks as the only thing close is > Lark or DXP, is that correct? xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jriedese at jnana.com Thu May 14 17:37:35 1998 From: jriedese at jnana.com (Joel Riedesel) Date: Mon Jun 7 17:01:20 2004 Subject: SAX, DTDs, parsing... In-Reply-To: <199805141418.QAA24119@berlin.dvs1.tu-darmstadt.de> Message-ID: <005801bd7f4d$e1b4e520$0132fac1@ipusushi> I didn't find it that simple to get at the DTD data. And, it appeared to me that the regular expression parts of DTD elements weren't really accessible in any clean API fashion. Yes, I can muck with the source, but I'd rather not; especially since I thought I've seen reports of bugginess on the part of MSXML - I don't want to have to start maintaining it in order to do what I want to do. Ideally, I'd like to throw a DTD file at a validating XML parser, in addition to a normal XML file which references an external DTD; and in both cases, get details of the DTD out of the parser. (Lots and lots of details!) I also noticed, that with MSXML, if the DTD is external to the XML; you can't get to the DTD elements from the parsed document. That 'node' isn't available to them main tree once parsing is done. I found that to be a bit annoying too. > > MSXML is a validating parser written in Java. You can get at > the DTD, but only > as XML-Data elements. Of course, they ship the code, so you > could probably go > straight to their internal structures. > > -- Ron Bourret > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Thu May 14 18:03:33 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:20 2004 Subject: SAX, DTDs, parsing... In-Reply-To: <005201bd7f41$244a1ac0$0132fac1@ipusushi> References: <199805141349.JAA00505@unready.microstar.com> <005201bd7f41$244a1ac0$0132fac1@ipusushi> Message-ID: <199805141602.MAA00202@unready.microstar.com> Joel Riedesel writes: > > Please excuse my ignorance; I'm just starting to dive into code for XML > stuff... > > Would someone direct me to a good overview of SAX? Here's an intro: http://www.megginson.com/SAX/quickstart.html > Also, I'm looking for a decent java validating XML parser that > lets me get at the DTD for parsed XML files. Any suggestions? > As far as I can tell, it looks as the only thing close is > Lark or DXP, is that correct? AElfred provides (non-SAX) methods for accessing the DTD, and I think that IBM's XML for Java might as well. All the best, David -- David Megginson david@megginson.com http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From James.Anderson at mecom.mixx.de Thu May 14 18:24:17 1998 From: James.Anderson at mecom.mixx.de (james anderson) Date: Mon Jun 7 17:01:20 2004 Subject: Short Tags and (ui standards) References: <02ff01bd7eba$dd551740$06001cac@mike> Message-ID: <355B1B06.18DF4604@mecom.mixx.de> Michael Alaly wrote: > > This text is more readable than the > next example, and is easier > for a human to process and follow. > > This text is LESS readable than the > previousexample, and is easier for a human to > process and follow. > IF i have to read marked up source, then i try to make sure that it looks more like This text is more readable than either example, and is easier for a human to process and follow. which is not markedly easier to read than this This text is NO less readable than the previous example and easier to read than either of the running-textexamples, should it be necssary for a human to process and follow it. and, in either case, significantly better than running text with strictly width-constrained line-breaks. if you really have to read marked up source, you're working against the odds unless it's the shape of the text body, not the content of the end tag, which matters. the more general point is that, the argument for long tags for the purpose of human readability has little credibility, since one should place minimal standards on the tools which present the marked up source to do so in a way (ie with meaningful justification) which, as a side-effect, renders the end tag content redundant. i don't have to do with many editors, but the one which i do use (visual page) tends to enforce such ("pretty printing") conventions. which i think was the correct decision and well worth the extra implementation effort. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Jon.Bosak at eng.Sun.COM Thu May 14 21:11:53 1998 From: Jon.Bosak at eng.Sun.COM (Jon Bosak) Date: Mon Jun 7 17:01:20 2004 Subject: WD of XSL requirements released In-Reply-To: <3559FAE0.C491DA17@interlog.com> (message from Ben Trafford on Wed, 13 May 1998 15:56:16 -0400) Message-ID: <199805141908.MAA27070@boethius.eng.sun.com> [Ben Trafford:] | Jon Bosak wrote: | | > The first working draft of the requirements for XSL has just been made | > public: | > | > http://www.w3.org/TR/1998/WD-XSLReq | | Oops! This URL is incorrect. The actual URL is: | | http://www.w3.org/TR/WD-XSLReq Sorry, I was forwarding an announcement without checking. Thanks for catching this. Jon xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjaakkol at cs.Helsinki.FI Thu May 14 22:05:56 1998 From: jjaakkol at cs.Helsinki.FI (Jani Jaakkola) Date: Mon Jun 7 17:01:20 2004 Subject: A little wish for short end tags In-Reply-To: Message-ID: On 14 May 1998, Toby Speight wrote: > Paul> Paul Prescod > > Paul> I do not believe that there is any way ot implementat a legal XML > Paul> parser without keeping around all of the information required to > Paul> implement short end tags. Checking that an end-tag matches its > Paul> start-tag (the current situation) is no easier than not checking. > > But there are plenty of (non-parsing) applications that benefit from > XML standard end-tags. An obvious one is selection of an element from > a document; a regexp search for the start-tag, and then just match > start and end tags *for that element type*, keeping track of depth > *for that element type* (we don't even need to do that if the element > type is known not to be nestable in itself). That application need > not even notice tags for other element types. Yes. I've got now an indexing and SGML/XML aware version of sgrep under development (it works, but isn't ready for release yet). It does not do real XML-parsing; it only scans files for indexes of start and end tags other markup. Therefore it is also very fast. It wouldn't work (at least as well) with short end tags. However, writing a normalizer for short end tags in XML would be trivial, unlike writing a normalizer for all possible short tags in SGML (thank you James). So if someone wants to use short end tags while authoring or internally in some organization i'd say that go for it. But please, keep the standard small and simple. - Jani xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From kent at trl.ibm.co.jp Fri May 15 03:03:45 1998 From: kent at trl.ibm.co.jp (TAMURA Kent) Date: Mon Jun 7 17:01:20 2004 Subject: SAX: White space after last element In-Reply-To: David Megginson's message of "Thu, 14 May 1998 08:42:33 -0400" <199805141242.IAA00235@unready.microstar.com> References: <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> <199805141242.IAA00235@unready.microstar.com> Message-ID: <199805150101.KAA37087@ns.trl.ibm.com> > > I just tested SAXON with the new version of IBM's xml4j > > parser: it crashes because SAXON doesn't expect white space > > to be reported after the end of the outermost element. I'll > > fix SAXON so it doesn't crash, but it raises a wider point, > > because we now have several SAX-conformant parsers reporting > > white space differently, and I'm not sure whether the spec > > says clearly which of them is right (perhaps they all are). > The JavaDoc comment for DocumentHandler.ignorableWhitespace begins > with the line > Receive notification of ignorable whitespace in element content. > ^^^^^^^^^^^^^^^^^^ I'm sorry for the bug. It is fixed by the following patch. --- SAXDriver.19980512 Fri May 15 09:49:49 1998 +++ SAXDriver.java Fri May 15 09:51:46 1998 @@ -240,8 +240,11 @@ return 0 > ind ? null : getValue(ind); } + + int depth = 0; // parser.TagHandler public void handleStartTag(TXElement el, boolean empty) { + this.depth ++; m_attributes = el.getAttributeArray(); try { m_documenthandler.startElement(el.getName(), this); @@ -252,6 +255,7 @@ } // parser.TagHandler public void handleEndTag(TXElement el, boolean empty) { + this.depth --; try { m_documenthandler.endElement(el.getName()); } catch (SAXException e) { @@ -261,11 +265,13 @@ // parser.DefaultElementFactory public TXText createText(String data, boolean ignorable) { try { + if (0 < this.depth) { char[] ac = data.toCharArray(); if (ignorable) m_documenthandler.ignorableWhitespace(ac, 0, ac.length); else m_documenthandler.characters(ac, 0, ac.length); + } } catch (SAXException e) { throw new ExceptionWrapper(e); } -- TAMURA, Kent @ Tokyo Research Laboratory, IBM Japan xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From kendal at interlog.com Fri May 15 03:41:50 1998 From: kendal at interlog.com (Rolande Kendal) Date: Mon Jun 7 17:01:20 2004 Subject: do(duration){display video} Message-ID: <3.0.32.19980514213837.015e6bb0@interlog.com> I wish to include SMIL/PGML like encoded characteristics in a file; however, in addition I require some means to include rudamentary programatic control. For example, I need some method of saying things like repeat showing this resource for ten minutes, or until this event. In addition, if the user makes a selection then the cycle may have to abruptly stop and proceed to something else. Your know - browser stuff. I threw in a loop below: Is there some established means to do this? Thanks Rolande Still waiting for Jumbo2... xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Fri May 15 05:31:41 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:21 2004 Subject: WD of XSL requirements released References: <199805141908.MAA27070@boethius.eng.sun.com> Message-ID: <355BB6A2.353A@hiwaay.net> My favorite part: "Easy things should be easy." Amen. "Absolute/relative positioning, Layering, and Transparency ... Z-order for overlapping areas" Does this include creating 3D objects such as indexed face sets, graphic primitives, or only layering? "Animation: support for identifying and including encapsulated, animated objects." Loose. Traditionally, that is a client plugin. I understand the idea, but avi, VRML, gif89 are all animation and very different environments. Animation can be a hub (eg, VRML anchors) so, an example here would be appreciated. "Callouts: Linking hotspots to items in the flow of text. Another aspect would be manipulating presentation of text within a CGM graphic." Callouts? From the IETM community, I understand why you want this, but it is an application content type and should be defined in the DTD, not the style language although the style language specifies the rendering. The CGM example seems odd since that is a presentation language with text primitives. Why are you you are applying XSL style rules to CGM text types? Just curious. "Tables: Support for the table models of CSS and DSSSL. Ability to easily format popular source table models such as HTML and CALS." Vague. Is this interpreted that 1. XSL processors shall always process CSS and DSSSL table models to a feature set TBD 2. XSL processors may optionally support HTML and CALS table models "Data Types: Scalar types, units of measure, Flow Objects, XML Objects" What are XML Objects? "Scripting" Ok. This appears to limit XSL to transforms. If so, architecturally, how do scripting languages in the other categories relate to XSL? The "out of scope" requirements are hard to interpret in light of current scripting practice, eg, inline JavaScript. "Acessibility" What are audio stylesheets? What are "accessibility mechanisms"? "Persistent headers/footers w/scrolling body: ability to specify headers and/or footers that should be fixed at the borders of a scrolling text area. This would provide the functionality most commonly achieved with frames in HTML browsers today." Loose. Frames also encapsulate plugins which are active hubs. This feels like WinHelp, not an Internet browser, so I am wondering what "functionality most commonly achieved" is indicated, ie, just scrolling? **************************************** Overall, very good start. It appears to be a good sampling of all of the necessary techniques. Len Bullard Intergraph xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From gotou at flab.fujitsu.co.jp Fri May 15 08:55:12 1998 From: gotou at flab.fujitsu.co.jp (Masatomo Goto) Date: Mon Jun 7 17:01:21 2004 Subject: Xlink semantics In-Reply-To: <3.0.1.16.19980514080952.514748b8@pop3.demon.co.uk> References: <199805140517.OAA28951@kmailserv.akashi.flab.fujitsu.co.jp> <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> Message-ID: <199805150649.PAA22430@kmailserv.akashi.flab.fujitsu.co.jp> At 08:09 98/05/14, Peter Murray-Rust wrote: > >I'm now working on design and implementation of a XLink facility. > >My approach is to provide the XLink facility as an engine. > > This is wonderful! From time to time I have suggested such an engine on > XML-DEV and it is great to see someone as well advanced as this. Do you > plan to make this available? If not, are you able to publish the semantics? I have no idea around these things NOW. But, I hope something can be available or published. > I have recently met a good fairy which means that I shall be at XML98 for > part of the time (probably Tuesday - Thursday). It would be nice to meet. I'm looking forward to meeting and having a discussion. Best regards. -Masatomo Goto --- Masatomo Goto Information Service Architecture lab. Fujitsu Laboratories Ltd. Tel: +81-78-934-8249 Fax: +81-78-934-3312 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From yosr.hmaied at inria.fr Fri May 15 08:59:44 1998 From: yosr.hmaied at inria.fr (Yosr HMAIED) Date: Mon Jun 7 17:01:21 2004 Subject: root entities Message-ID: <355BE7D2.1CCA@inria.fr> in XML 1.0 specification it's said that -- Yosr HMAIED mailto:yosr.hmaied@inria.fr tel :01 39 63 51 71 Projet Rodin INRIA Rocquencourt xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rajesh_n1 at verifone.com Fri May 15 09:20:29 1998 From: rajesh_n1 at verifone.com (Rajesh N) Date: Mon Jun 7 17:01:21 2004 Subject: test cases for xml parser & dom Message-ID: <3.0.5.32.19980515125132.007e1cf0@blr-nt-mail1.verifone.com> Hi all, can anyone give me a pointer to test cases for xml parser (including data validation) and the DOM API ? thanks in advance for any help, Rajesh N. -------------------------------------------------------------- There is no right way to do wrong. -------------------------------------------------------------- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at ifi.uio.no Fri May 15 09:44:58 1998 From: larsga at ifi.uio.no (Lars Marius Garshol) Date: Mon Jun 7 17:01:21 2004 Subject: test cases for xml parser & dom In-Reply-To: <3.0.5.32.19980515125132.007e1cf0@blr-nt-mail1.verifone.com> References: <3.0.5.32.19980515125132.007e1cf0@blr-nt-mail1.verifone.com> Message-ID: * Rajesh N. | | can anyone give me a pointer to | test cases for xml parser (including data validation) James Tauber has a good list at (BTW: List crossposting is not a good idea. I'm following your example so people know your question has been answered.) -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From akitoshi.yoshida at sap-ag.de Fri May 15 13:42:44 1998 From: akitoshi.yoshida at sap-ag.de (Akitoshi Yoshida) Date: Mon Jun 7 17:01:21 2004 Subject: question: NodeIterator in DOM Message-ID: <199805151131.NAA14413@hs2114.wdf.sap-ag.de> Hi, I have a question on NodeIterator in DOM-19980416. Excerpt from the spec: If a node is inserted before the node just after the iterator position, it will be returned by getNextNode(); likewise if a node is inserted after the node just previous to the iterator position, it will be return by the next getPrevNode() call. The first part is clear. When a Node is inserted by insertBefore() at the node just before the iterator position, the iterator position is set in front of this new node. The second part starting "likewise.." is not clear to me. Are we assuming the existance of the insertAfter() method? It sounds like the insertion position (relative to the nodes) is the same as in the first case. If we only have the insertBefore() method in the Node inteface, how can we distinguish the above two cases? thanks in advance for any clarification aki xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From David.Halls at cl.cam.ac.uk Fri May 15 13:51:47 1998 From: David.Halls at cl.cam.ac.uk (David Halls) Date: Mon Jun 7 17:01:21 2004 Subject: demo of client-side XML stylesheets in Standard ML available Message-ID: The first release of Persimmon's Standard ML to Java bytecode compiler, MLJ, was made on May 13th, 1998. Please see the posting in comp.lang.functional. We have since added a new on-line demo which demonstrates some of our work using Standard ML to write stylesheets that transform SGML and XML into HTML. MLJ allows us to compile our stylesheets into applets that run in browsers. We are able to send XML to browsers which is processed by our stylesheets. The generated HTML is displayed to users with no further network access back to the server. Please feel free to try the demo out - go to the MLJ home page (http://research.persimmon.co.uk/mlj/) and follow the link to on-line demos. Our stylesheet demo installs a sample stylesheet applet in your browser and downloads a sample XML document. The HTML generated by the stylesheet is then displayed. You will also be able to control the style of the generated HTML through the applet's user interface. Links are provided enabling you to view the Standard ML source of the stylesheet (for comparison with other stylesheet languages). You can also view the original SGML document, its DTD and the XML that is automatically generated from it. You'll also find some general links to documents describing the benefits of SGML, XML and stylesheets. Email: mlj@persimmon.co.uk mlj@persimmon.com Web: http://research.persimmon.co.uk/mlj/ http://www.persimmon.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From amitr at abinfosys.com Fri May 15 16:16:22 1998 From: amitr at abinfosys.com (Amit Rekhi) Date: Mon Jun 7 17:01:21 2004 Subject: XML and EDI Message-ID: <3.0.5.32.19980515190728.0097b600@192.168.1.1> Hello! > There is a special XML/EDI list itself for this purpose where you can get > a lot more information. > I dont think that XML would REPLACE EDI , in fact it very well COMPLIMENTS > EDI. > XML is enabling EDI on the Web in a very smooth way where in EDI data > could be exchanged in XML wrapped EDI File , the Transaction Sets / > Messages would be EDI compliant DTDs. The developement is thus helping EDI > to be done on the Web taking into account the work already put in to > desgign the EDI structure and the Messages. > I not very sure of the products available on XML\EDI as its quite into the > discussion stages . But its not very long when there will be products out > as XML seems to be extremely helpful for the cause. > > All the Best!! > Aditya > A.B.Infosys Pvt. Ltd. > New Delhi , India > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mcc at arbortext.com Fri May 15 16:58:03 1998 From: mcc at arbortext.com (Mike Champion) Date: Mon Jun 7 17:01:21 2004 Subject: question: NodeIterator in DOM In-Reply-To: <199805151131.NAA14413@hs2114.wdf.sap-ag.de> Message-ID: <98May15.105314edt.26883@thicket.arbortext.com> At 07:31 AM 5/15/98 -0400, Akitoshi Yoshida wrote: >Hi, >I have a question on NodeIterator in DOM-19980416. > >Excerpt from the spec: > > If a node is inserted before the node just after > the iterator position, it will be returned by getNextNode(); > likewise if a node is inserted after the node just previous > to the iterator position, it will be return by the next > getPrevNode() call. > >The first part is clear. When a Node is inserted by insertBefore() >at the node just before the iterator position, the iterator position >is set in front of this new node. The second part >starting "likewise.." is not clear to me. Are we assuming the >existance of the insertAfter() method? It sounds like the >insertion position (relative to the nodes) is the same as >in the first case. If we only have the >insertBefore() method in the Node inteface, >how can we distinguish the above two cases? It's better to post DOM questions on www-dom@w3.org. I'll answer here and cc: to ww-dom so please look there for any followup. But anyway, Good Point! We can't distinguish the two cases and I have no idea what I was thinking when I typed it. I'll remove the clause starting with "likewise" from the spec. It should work as follows. Consider an iterator through nodes A, B, C. The last node returned was B, so the "current position" signified by the "^" is after B. A --------B--^-----C If a new node X is inserted between B and C, the current position remains after B, so X will be returned by getNextNode(), and B returned by getPrevNode(). A --------B--^-----X--------C Sorry for the confusion, Mike Champion xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From akitoshi.yoshida at sap-ag.de Fri May 15 17:35:19 1998 From: akitoshi.yoshida at sap-ag.de (Akitoshi Yoshida) Date: Mon Jun 7 17:01:21 2004 Subject: question: NodeIterator in DOM Message-ID: <199805151519.RAA02352@hs2114.wdf.sap-ag.de> Hi, Another question regarding NodeIterator: You have an NodeIterator instance with the iterator position set somewhere in the middle. When you remove three nodes: first, the node just after the iterator position, second the node after this removed node, and finally the node just before the original iterator position, then what should toNextNode() and toPrevNode() return? in example: we have nodes A B C D E. the iterator is positioned just before C. remove C, D, and B. then we have A E and the iterator position should be positioned just before E. so toNextNode() and toPrevNode() should return E and A, respectively. NodeIterator may mark its current iterator position by an integer offset or by an object reference. but it must be notified for certain insert and remove operations to preserve the above semantics. Is it correct? Thanks aki ---- original message Hi, I have a question on NodeIterator in DOM-19980416. Excerpt from the spec: If a node is inserted before the node just after the iterator position, it will be returned by getNextNode(); likewise if a node is inserted after the node just previous to the iterator position, it will be return by the next getPrevNode() call. The first part is clear. When a Node is inserted by insertBefore() at the node just before the iterator position, the iterator position is set in front of this new node. The second part starting "likewise.." is not clear to me. Are we assuming the existance of the insertAfter() method? It sounds like the insertion position (relative to the nodes) is the same as in the first case. If we only have the insertBefore() method in the Node inteface, how can we distinguish the above two cases? thanks in advance for any clarification aki xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mcc at arbortext.com Fri May 15 18:20:47 1998 From: mcc at arbortext.com (Mike Champion) Date: Mon Jun 7 17:01:22 2004 Subject: question: NodeIterator in DOM In-Reply-To: <199805151519.RAA02352@hs2114.wdf.sap-ag.de> Message-ID: <98May15.121627edt.26888@thicket.arbortext.com> At 11:19 AM 5/15/98 -0400, Akitoshi Yoshida wrote: >in example: >we have nodes A B C D E. >the iterator is positioned just before C. >remove C, D, and B. >then we have A E and the iterator position should be >positioned just before E. >so toNextNode() and toPrevNode() should return E and A, respectively. Yes. > >NodeIterator may mark its current iterator position by >an integer offset or by an object reference. but it must >be notified for certain insert and remove operations to >preserve the above semantics. Is it correct? I'm not sure. There's been a lot of discussion of how to implement NodeIterators on www-dom; see the archives at http://lists.w3.org/Archives/Public/www-dom/ I believe different designs exist, some of which require iterators to be notified on insert and delete operations, and others which maintain "markers" in the tree that move around as the tree is edited. I have not followed this closely enough to give you a definitive answer, but look at the www-dom archives, and especially Don Park's SAXDOM implementation, because Don participated in the discussions of this a couple of weeks ago and I believe his latest release reflects what he learned. Good luck, Mike xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From elm at arbortext.com Fri May 15 18:22:37 1998 From: elm at arbortext.com (Eve L. Maler) Date: Mon Jun 7 17:01:22 2004 Subject: Xlink semantics In-Reply-To: <98May14.012336edt.26885@thicket.arbortext.com> References: <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> Message-ID: <3.0.5.32.19980515120249.00a257b0@village.promanage-inc.com> This is a really good issue. Obviously, the interactions among the XLink-aware elements are underspecified! I agree with Masatomo Goto's interpretation of the various element configurations. A few other comments below... At 01:20 AM 5/14/98 -0400, Masatomo Goto wrote: > >Hello, > >I'm now working on design and implementation of a XLink facility. >My approach is to provide the XLink facility as an engine. (This is wonderful!) >At 20:45 98/05/13, Peter Murray-Rust wrote: >> I have been hacking an application (the VHG DTD) using Xlink and I'd like >> to check on some semantics. Since the spec has contentSpecs of ANY for all >> three link-types the formal situation is that anything is permitted. Should >> my software complain at the following examples (notation should be >> obvious), or should it try to do something clever? > >In my Xlink engine, these examples are recognized as follows: > >> >> >> >> >> > > - One extended link which has no or only inline resource > (depends on inline attribute value) > - Two simple links which are in the extended link content > >> >> >> >> >> >> > > - One extended link which has two or three(inline) resources. > - One simple link which is in the extended link content > >> ... >> >> >> >> > > - One extended link which has one or two (inline) resources. > >> ... >> >> >> >> >> >> >> >> >> >> >> > > - One extended link which has no or only inline resources > - two extended links which have two or three (inline) resources > in the parent extended link's content > >> ... >> >> >> >> >> >> ... > > - no meaning. no processing. > > >> The problem is that all of these throw no error in the parser as they are >> probably impossible to constrain except in very spartan DTDs. I suspect >> most are not productive, but some might be valuable on occasions I have or >> haven't thought of. > >If the XLink processing facilities are separated from the application, >It is possible to throw some errors from the "XLink processor". This is how I envision linking support. Each XML-related specification suggests the creation of a "processor" for that level, with any other "applications" overlaying it. Of course, if you do encapsulate the relevant XLink awareness into your DTD, you will get some validation "for free." >> This is an important occasion that there is a clear requirement for >> applications to apply semantics to parts of one of the specs. We already >> have to write an attribute processor and I'm interested in knowing how much >> additional processing any conforming Xlink software is going to have to do. > >FYI, I will give a speach and demonstration about my XLink engine in >the HyTime at work session of SGML/XML Europe '98. I look forward to seeing it! Eve xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Jon.Bosak at eng.Sun.COM Fri May 15 23:37:44 1998 From: Jon.Bosak at eng.Sun.COM (Jon Bosak) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags In-Reply-To: (message from Toby Speight on 14 May 1998 12:15:26 +0100) Message-ID: <199805152135.OAA27215@boethius.eng.sun.com> [Toby Speight:] | But there are plenty of (non-parsing) applications that benefit from | XML standard end-tags. An obvious one is selection of an element from | a document; a regexp search for the start-tag, and then just match | start and end tags *for that element type*, keeping track of depth | *for that element type* (we don't even need to do that if the element | type is known not to be nestable in itself). That application need | not even notice tags for other element types. This is precisely the scenario that I had in mind when I invented the figure of the Desperate Perl Hacker -- someone who has no idea how to build a parser but can do very powerful operations on large quantities of XML using simple pattern matches if the presence of full end-tags is guaranteed. Jon xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From greyno at mcs.com Sat May 16 06:23:09 1998 From: greyno at mcs.com (Gregg Reynolds) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags References: <199805152135.OAA27215@boethius.eng.sun.com> Message-ID: <355D0208.3B00@mcs.com> Jon Bosak wrote: > > [Toby Speight:] > > | But there are plenty of (non-parsing) applications that benefit from > | XML standard end-tags. > This is precisely the scenario that I had in mind when I invented the > figure of the Desperate Perl Hacker -- someone who has no idea how to > build a parser but can do very powerful operations on large quantities > of XML using simple pattern matches if the presence of full end-tags > is guaranteed. > Given: 1. Short tags 2. Some non-trivial number of docs marked up with short-tags 3. Some non-trivial number of DPH's desperate to hack at these docs; Isn't it likely that some non-trivial number of XML normalizers will become at least as widespread as perl? Thereby relieving our lonely hackers of some non-trivial measure of their desperation? xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jtauber at jtauber.com Sat May 16 07:21:03 1998 From: jtauber at jtauber.com (James K. Tauber) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags In-Reply-To: <199805152135.OAA27215@boethius.eng.sun.com> Message-ID: <000a01bd808a$142122c0$be6118cb@caleb> > This is precisely the scenario that I had in mind when I invented the > figure of the Desperate Perl Hacker -- someone who has no idea how to > build a parser but can do very powerful operations on large quantities > of XML using simple pattern matches if the presence of full end-tags > is guaranteed. I still think life would have been easier for the DPH if > had had the same prohibitions as <. James -- James Tauber / jtauber@jtauber.com Perth, Western Australia XML Pages: http://www.jtauber.com/xml/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Sat May 16 07:30:46 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags Message-ID: <01bd808c$7c07baa0$71431ecc@uspppBckman> It is so easy to use script and code to isolate element text when the full end tag is included, and it is really difficult (though possible) to do this when 'short' endtags are employed. To my way of thinking this fact alone is enough to justify the survival and existance of the full end tag. Frank -----Original Message----- From: Frank Boumphrey To: Gregg Reynolds Date: Saturday, May 16, 1998 1:32 AM Subject: Re: A little wish for short end tags >It is so easy to use script and code to isolate element text when the full >end tag is included, and it is really difficult (though possible) to do this >when 'short' endtags are employed. > >To my way of thinking this fact alone is enough to justify the survival and >existance of the full end tag. > >Frank > >-----Original Message----- >From: Gregg Reynolds >To: xml-dev@ic.ac.uk >Date: Saturday, May 16, 1998 12:35 AM >Subject: Re: A little wish for short end tags > > >>Jon Bosak wrote: >>> >>> [Toby Speight:] >>> >>> | But there are plenty of (non-parsing) applications that benefit from >>> | XML standard end-tags. >>> >> This is precisely the scenario that I had in mind when I invented the >>> figure of the Desperate Perl Hacker -- someone who has no idea how to >>> build a parser but can do very powerful operations on large quantities >>> of XML using simple pattern matches if the presence of full end-tags >>> is guaranteed. >>> >> >>Given: >> 1. Short tags >> 2. Some non-trivial number of docs marked up with short-tags >> 3. Some non-trivial number of DPH's desperate to hack at these docs; >> >>Isn't it likely that some non-trivial number of XML normalizers will >>become at least as widespread as perl? Thereby relieving our lonely >>hackers of some non-trivial measure of their desperation? >> >> >>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >>(un)subscribe xml-dev >>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following >message; >>subscribe xml-dev-digest >>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) >> >> > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From lisarein at finetuning.com Sat May 16 07:36:32 1998 From: lisarein at finetuning.com (Lisa Rein) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags References: <01bd808c$7c07baa0$71431ecc@uspppBckman> Message-ID: <355D2BFB.C337803C@finetuning.com> LONG LIVE THE FULL END TAG! lisa Frank Boumphrey wrote: > > It is so easy to use script and code to isolate element text when the full > end tag is included, and it is really difficult (though possible) to do this > when 'short' endtags are employed. > > To my way of thinking this fact alone is enough to justify the survival and > existance of the full end tag. > > Frank > > -----Original Message----- > From: Frank Boumphrey > To: Gregg Reynolds > Date: Saturday, May 16, 1998 1:32 AM > Subject: Re: A little wish for short end tags > > >It is so easy to use script and code to isolate element text when the full > >end tag is included, and it is really difficult (though possible) to do > this > >when 'short' endtags are employed. > > > >To my way of thinking this fact alone is enough to justify the survival and > >existance of the full end tag. > > > >Frank > > > >-----Original Message----- > >From: Gregg Reynolds > >To: xml-dev@ic.ac.uk > >Date: Saturday, May 16, 1998 12:35 AM > >Subject: Re: A little wish for short end tags > > > > > >>Jon Bosak wrote: > >>> > >>> [Toby Speight:] > >>> > >>> | But there are plenty of (non-parsing) applications that benefit from > >>> | XML standard end-tags. > >>> > >> This is precisely the scenario that I had in mind when I invented the > >>> figure of the Desperate Perl Hacker -- someone who has no idea how to > >>> build a parser but can do very powerful operations on large quantities > >>> of XML using simple pattern matches if the presence of full end-tags > >>> is guaranteed. > >>> > >> > >>Given: > >> 1. Short tags > >> 2. Some non-trivial number of docs marked up with short-tags > >> 3. Some non-trivial number of DPH's desperate to hack at these docs; > >> > >>Isn't it likely that some non-trivial number of XML normalizers will > >>become at least as widespread as perl? Thereby relieving our lonely > >>hackers of some non-trivial measure of their desperation? > >> > >> > >>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk > >>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ > >>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; > >>(un)subscribe xml-dev > >>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following > >message; > >>subscribe xml-dev-digest > >>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > >> > >> > > > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk > Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ > To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; > (un)subscribe xml-dev > To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; > subscribe xml-dev-digest > List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From liamquin at interlog.com Sat May 16 10:04:19 1998 From: liamquin at interlog.com (Liam Quin) Date: Mon Jun 7 17:01:22 2004 Subject: A little wish for short end tags In-Reply-To: <355D0208.3B00@mcs.com> Message-ID: On Fri, 15 May 1998, Gregg Reynolds wrote: > Given: > 1. Short tags > 2. Some non-trivial number of docs marked up with short-tags > 3. Some non-trivial number of DPH's desperate to hack at these docs; > > Isn't it likely that some non-trivial number of XML normalizers will > become at least as widespread as perl? I wish people would stop this. There are no minimisation features in XML. This is a major reason why it is succeeding. Yes, perl will include an XML parser. None the less, there are a lot of good reasons for keeping the full end tags, and these have been discussed carefully and at great length. Handling requires keeping a stack and processing the entire document from the beginning. There are no constraints on the possible depth of the stack. Yes, you can keep a stack in perl. Yes, you can read the document into memory too, if you have enough memory. <> * Why stop there? * this is a perfectly valid SGML bullet list, with the asterisks getting replaced by ITEM tags automatically, and OMITTAG filling in the end tags. * note that you can map just about any character to a tag, except an upper case B, which cannot be used or escaped from its special meaning. * with the RANK feature, you can save a few more bytes, and it's often fairly straight forward to add once you have all the other features. * and with LINK, you can have attributes added automatically, so you don't need to put them in the DTD or the document directly. In other words, every SGML feature has uses, and if XML had them all, it would no longer be a subset suitable for widespread use. There is already software to mormalise null end tags. It didn't make SGML catch on. "Full SGML" is too complex. It is widely used, but XML looks like it will be used massively more widely than its parent. Stop trying to add complexities. XML 1.0 is published. Use it. Lee -- Liam Quin -- the barefoot typographer -- Toronto lq-text: freely available Unix text retrieval IRC: discuss XML/SGML/XSL/XLL/DSSSL Mondays irc.technonet.net in #XML email address: l i a m q u i n, at host: i n t e r l o g dot c o m xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjc at jclark.com Sat May 16 10:19:26 1998 From: jjc at jclark.com (James Clark) Date: Mon Jun 7 17:01:23 2004 Subject: XML test suite updated Message-ID: <355D4A53.1D8A1252@jclark.com> I've updated my collection of XML test cases (ftp://ftp.jclark.com/pub/xml/xmltest.zip). I've reorganized it a little and added some more test cases. James xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Patrice.Bonhomme at loria.fr Sat May 16 14:32:57 1998 From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme) Date: Mon Jun 7 17:01:23 2004 Subject: A Content Model Question ? Message-ID: <199805161231.OAA10248@chimay.loria.fr> Is the Content Model for the Element D valid in XML 1.0 ? ]> Thanks. Pat. -- ============================================================== bonhomme@loria.fr | Office : B.228 http://www.loria.fr/~bonhomme | Phone : 03 83 59 30 52 -------------------------------------------------------------- * Serveur Silfide : http://www.loria.fr/projets/Silfide * Projet Aquarelle : http://aqua.inria.fr ============================================================== xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at ifi.uio.no Sat May 16 15:38:38 1998 From: larsga at ifi.uio.no (Lars Marius Garshol) Date: Mon Jun 7 17:01:23 2004 Subject: A Content Model Question ? In-Reply-To: <199805161231.OAA10248@chimay.loria.fr> References: <199805161231.OAA10248@chimay.loria.fr> Message-ID: * Patrice Bonhomme | | Is the Content Model for the Element D valid in XML 1.0 ? | | No. You must write it like this: See section 3.2.2 of the spec, production 51. -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 16 16:34:26 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:23 2004 Subject: A little wish for short end tags References: Message-ID: <355DA41C.E99FF4AE@technologist.com> I had hoped to allow this thread to die, but I can't allow this incorrect statement to pass. These are the sorts of things that become dogma: Liam Quin wrote: > > There are no minimisation features in XML. 1. Empty-element tags (XML actually ADDED this to SGML) 2. Doctype declarations of the form Plus there are many alternative representations geared totally toward usability: 3. Predefined entities 4. Alternate literal quoting characters 5. Alternate unicode number syntax (XML actually ADDED this to SGML) As long as I've already put myself into disrepute by perpetuating the thread: Another myth that I see floating around often is that XML is simple. Anybody who believes that has not studied the various types of entities, their allowed occurrences, order of replacement and interaction with the standalone declaration. Essentially these features cannot be expressed in prose text that could be read and understood by a typical reader. That means that XML's syntax is more complicated than most programming languages which *can* be (and sometimes are!) described completely in prose text. I have never looked at the grammars for Python or Java, for example, but I have a pretty clear idea of what is legal and what is not. Of course, XML's central concepts are simple, just as SGML's were. But neither language is syntactically simple. Compared to the complexity of these features short end tags would make the specification essentially no more complex. You would add a single question mark to the EBNF as opposed to hundreds of percent signs for parameter entities. The "keep XML simple" argument is a non-starter. "Minimization is a slippery slope" is also a non-starter. We've already got minimization and any move towards more is strongly resisted (which is good...we always need some people to argue against features). The XML working group is full of people who have the ability to make decisions on a case by case basis. I think it is an insult to them to propose otherwise. The logical end-point of the slippery slope argument is "If we try to make a subset of SGML, we'll start adding SGML features until we end up with SGML" which obviously did not happen. "Full end tags help hackers" is a completely valid point. It is the central point. It is the point that forced the decision in the first place. Personally, I think that it is quite easy to type: expandTags myFile.sgm | awk .... But I recognize that others disagree. It is arguably the case that downloading and compiling "expandTags" is an unacceptable burden on Desperate Perl Hackers. Paul Prescod - http://itrc.uwaterloo.ca/~papresco Can we afford to feed that army, while so many children are naked and hungry? Can we afford to remain passive, while that soldier-army is growing so massive? - "Gabby" Barbadian Calpysonian in "Boots" xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 16 16:51:47 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:23 2004 Subject: Correction: Re: A little wish for short end tags References: <355DA41C.E99FF4AE@technologist.com> Message-ID: <355DA818.513C141E@technologist.com> Paul Prescod wrote: > > You would add a single question mark to the EBNF as opposed > to hundreds of percent signs for parameter entities. Correction: I forgot that the final draft uses a much more elegant rule for parameter entities. They are still much more complicated to understand and use than short end-tags, but there is not as much spec-space dedicated to them as there once was. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Jon.Bosak at eng.Sun.COM Sat May 16 18:52:05 1998 From: Jon.Bosak at eng.Sun.COM (Jon Bosak) Date: Mon Jun 7 17:01:23 2004 Subject: A little wish for short end tags In-Reply-To: <355D0208.3B00@mcs.com> (message from Gregg Reynolds on Fri, 15 May 1998 23:03:36 -0400) Message-ID: <199805161649.JAA27414@boethius.eng.sun.com> [Gregg Reynolds:] | Given: | 1. Short tags | 2. Some non-trivial number of docs marked up with short-tags | 3. Some non-trivial number of DPH's desperate to hack at these docs; | | Isn't it likely that some non-trivial number of XML normalizers will | become at least as widespread as perl? Thereby relieving our lonely | hackers of some non-trivial measure of their desperation? Nothing can match the brute simplicity of a one-line perl regexp operating over unlimited amounts of data within a ksh or bash command-line loop. People with a programming background tend to find this hard to understand, but there are a lot of folks out there in publishing and everyday business management who know exactly what I'm talking about. A perl regexp is the *upper bound* of sophistication for this constituency. Please try, if you can, to imagine being faced with the job of doing an element-specific mass search-and-replace over two years' worth of company reports when all you know about XML is what you can see by looking at the source, you've never heard of the concept of a normalizer, and the only scripting tool you know how to use is the Word or WordPerfect macro language. You may never find yourself in this position, but there are hundreds of thousands of ordinary users who aren't going to be so lucky. This is one of the reasons that many corporate SGML users made it a policy years ago to normalize all SGML documents to expand the end tags and why most SGML editors do this automatically every time a file is saved. SGML gives you the option of using empty end tags, and the historical fact is that most large users, given this option and a sufficient amount of experience with it, choose not to use it. XML simply enforces what many people faced with the management of large amounts of tagged text adopted as good practice a long time ago and provides the same guarantee of safe tagging across organizations that has generally existed within them. If you really like using shortcuts, then go all the way: get a genuine SGML tool and define a DTD that allows not just end-tag minimization but full omission of both start-tags *and* end-tags. Knock yourself out. Just make sure to normalize the result before you call it XML and ship it out to the rest of us to work with. Jon xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 16 21:58:32 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:23 2004 Subject: A little wish for short end tags References: <199805161649.JAA27414@boethius.eng.sun.com> Message-ID: <355DF024.FF2557F2@technologist.com> Jon Bosak wrote: > > A perl regexp is the *upper bound* of sophistication for this > constituency. Please try, if you can, to imagine being faced with the > job of doing an element-specific mass search-and-replace over two > years' worth of company reports when all you know about XML is what > you can see by looking at the source, you've never heard of the > concept of a normalizer, and the only scripting tool you know how to > use is the Word or WordPerfect macro language. I do not believe that a person with the knowledge level you have described is going to succeed at the task you have set for him or her. Entities are going to kill them. Whitespace in end-tags is going to toast them. CDATA sections are going to confuse them. Elements (and tags!) broken across lines are going to destroy them. This person can only succeed if a) the data is already normalized, probably due to a corporate standard such as the one you mention. b) they download a normalizer. If I am wrong, it would be easy to prove me so. All someone has to do is provide a regular expression that can (for instance) change all occurrences of the GI "FOO" into "BAR" in any XML document corresponding to a DTD of their choice (but which I can extend in the internal subset). On the other hand, I can do this *trivially* in a regular expression on data that has been normalized. > SGML gives you the option of using empty end tags, and the > historical fact is that most large users, given this option and a > sufficient amount of experience with it, choose not to use it. These "large users" have expensive SGML editors that they have paid someone thousands of dollars to customize to perfection. Under those conditions, I would legislate redundancy also -- not just fully expanded end-tags, but probably redundant IDs in comments of end-tags, public identifiers on all entity declarations, perhaps even unique identifiers on all elements. But XML is about a different world than that. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From schampeo at hesketh.com Sat May 16 22:16:16 1998 From: schampeo at hesketh.com (Steven Champeon) Date: Mon Jun 7 17:01:23 2004 Subject: A little wish for short end tags In-Reply-To: <355DF024.FF2557F2@technologist.com> References: <199805161649.JAA27414@boethius.eng.sun.com> Message-ID: <3.0.5.32.19980516161020.00995ea0@wasabi.hesketh.com> At 03:59 PM 5/16/98 -0400, Paul Prescod graced us with: > I do not believe that a person with the knowledge level you have described > is going to succeed at the task you have set for him or her. The tasks and knowledge level Jon describes are exactly what I had when I first started doing SGML back in 1993. I knew a bit about what the specific tagsets I was using could contain, I'd picked up the regex search/replace then supplied with my editor, SoftQuad's Author/Editor, and I'd started to maintain a few Perl scripts which were designed to convert text files (the output from our proprietary workflow conversion system) into SGML. It wasn't uncommon for me to use the following sort of ugliness, which I inherited from a programmer before me: $/=""; $*=1; # slurp mode while(<>) { s/$absurdly_long_regular_expression/what_ought_to_go_there/g; # ... repeat one per line for fifty or more regexes } I had regexes which were longer than pico could handle, so I used Sun textedit, which wrapped them onscreen. Eventually, I learned emacs and vi and discovered the pure joy of the UNIX command line, and picked up a few more Perl tricks, like formatted code, comments, and not using absurdly long regular expressions in a multiple-pass global search and replace in order to mark up incomplete text files. ;) I saved myself countless hours of manual tagging in A/E this way. It'd make your hair stand on end to see some of the scripts I used in my daily work, but it made my life easier and provided a break from the tedium of hand-tagging, and was also a challenge. Try to remember that not everyone has time or the background to absorb intricacies, and that perfection falls a far second after getting the job done in almost every context outside the realm of pure thought. S -- "All the good geek things, schampeo@hesketh.com only without all the http://a.jaundicedeye.com bad geek things." http://hesketh.com/schampeo/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 17 04:20:56 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:23 2004 Subject: A little wish for short end tags References: <199805161649.JAA27414@boethius.eng.sun.com> <3.0.5.32.19980516161020.00995ea0@wasabi.hesketh.com> Message-ID: <355E49BE.CB166F4A@technologist.com> Steven Champeon wrote: > > The tasks and knowledge level Jon describes are exactly what I had when I > first started doing SGML back in 1993. So what you're saying is that this sort of thing can work, even with all of the minimization features of SGML, because you knew the general layout of your data and/or you had a tool that could normalize the weird stuff for you. You succeeded at your task because you approached it (perhaps unknowingly) with either: a) the right data set: more or less already normalized SGML or b) the right tool -- a normalizer: AE. That's exactly what I've been saying also. If you are going to do regular expression hacking on XML it had better have been already marked-up in some corporate standard (which would probably exclude short end-tags, confusing entities, confusing whitespace, confusing newlines, etc.) or you should have a tool that can normalize it for you. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 17 10:11:15 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:24 2004 Subject: Clarification: (was: A little wish...) References: <355DA41C.E99FF4AE@technologist.com> Message-ID: <355E9BEB.2206DDFA@technologist.com> Paul Prescod wrote: > > good...we always need some people to argue against features). The XML > working group is full of people who have the ability to make decisions on > a case by case basis. I think it is an insult to them to propose > otherwise. I did not mean that Liam intended to insult the working group. After all, Eliot Kimber made the same argument, and he is on the group. It just seems to me that it is an inadvertant insult, as if I said to a friend: "You better not start eating those shrip, you'll never be able to stop." If they are not a chronic over-eater or shrimp-addict, then it would be strange to suggest that in *this one case* they are likely to not know when they are full. My friend would probably come to the conclusion that I thought that he did not know what was best for himself. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cbullard at hiwaay.net Sun May 17 10:39:46 1998 From: cbullard at hiwaay.net (len bullard) Date: Mon Jun 7 17:01:24 2004 Subject: A little wish for short end tags References: <199805161649.JAA27414@boethius.eng.sun.com> Message-ID: <355EA1E9.55A9@hiwaay.net> Jon Bosak wrote: > > SGML gives you the option of using empty end tags, and the > historical fact is that most large users, given this option and a > sufficient amount of experience with it, choose not to use it. Jon is right. I found it easier to read when the SGML instances were large and the designer used a lot of content types in deeper hierachies than one gets used to with HTML. For DTDs like the MIL Content Data Model, or the IADS DTDs where the indexes are named types mapped to stylesheet tables, one often wants to reach in and grab pieces of the tree easily. a good editor helps, but a lot of people won't have what they really need unless some serious price reductions happen in that market. Moving elements with highlighting is easier with the endtags, ihmo, too, because when one hits the end of a deep branch, a lot of grouped short end tags are hard to count by eye. sgml: people always say no one will edit it by hand, and almost everyone does. len bullard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Philippe.Le_Hegaret at sophia.inria.fr Sun May 17 18:32:26 1998 From: Philippe.Le_Hegaret at sophia.inria.fr (Philippe Le Hégaret) Date: Mon Jun 7 17:01:24 2004 Subject: SAX 1.0 : handleData and processingInstruction Message-ID: <355F110F.E39AD593@sophia.inria.fr> charecters takes an array of characters for the data : public abstract void characters(char ch[], int start, int length) throws SAXException processingInstruction takes a String for the data : public abstract void processingInstruction(String target, String data) throws SAXException Why is there this difference in SAX ? Philippe. --------- Philippe Le Hegaret Philippe.Le_Hegaret@sophia.inria.fr -- http://www.inria.fr/koala/plh/ KOALA/DYADE/BULL @ INRIA (Stagiaire) - Sophia Antipolis xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From schampeo at hesketh.com Sun May 17 18:38:45 1998 From: schampeo at hesketh.com (Steven Champeon) Date: Mon Jun 7 17:01:24 2004 Subject: A little wish for short end tags In-Reply-To: <355E49BE.CB166F4A@technologist.com> References: <199805161649.JAA27414@boethius.eng.sun.com> <3.0.5.32.19980516161020.00995ea0@wasabi.hesketh.com> Message-ID: <3.0.5.32.19980517123239.00afcde0@wasabi.hesketh.com> At 10:21 PM 5/16/98 -0400, Paul Prescod graced us with: > So what you're saying is that this sort of thing can work, even with all > of the minimization features of SGML, because you knew the general layout > of your data and/or you had a tool that could normalize the weird stuff > for you. You succeeded at your task because you approached it (perhaps > unknowingly) with either: > > a) the right data set: more or less already normalized SGML or > b) the right tool -- a normalizer: AE. Um, no. I'm sorry - you didn't read what I said. I didn't say that I had been running my SGML through A/E, I had been running Perl scripts on text files that were output from a conversion system whose input was irregular at best - "this page intentionally left blank", typos, and so forth were the order of the day. This previous sentence may be translated as "the input was not normalized". Or did I not make this abundantly clear? If so, my apologies. I should have used terms more appropriate to your dialog, like "non-normalized", "poorly ordered data sets" or some such nonsense - which, if I understand Jon Bosak correctly, the average numbnut with Perl and a job to do wouldn't know. I never thought I'd ever show anyone this crud, but in the interests of keeping XML accessible, here's an extremely horrid script I wrote, based on other horrid scripts I inherited. I knew nothing about Perl at the time, having only a passing understanding of the following while() construct, and my only real useful knowledge was of regular expressions, which I'd taught myself using A/E's search/replace function. The script below fixes a number of problems with the output from another filter, whose input was the text output from a proprietary conversion system. Illustrated, Garbage on Paper -> Garbage in Electronic Format -> ASCII Trash -> SGML This script was the last in the pipeline. The references to 'paragrapgh' and 'loping' below are jokes, based on a misspelling in the original script I rec'd from the programmer :) #!/aie/newgateway/tools/perl # enable paragrapgh mode, multiline mode $/ = ""; $* =1; # get rid of hard returns, spaces before and after tags # add an "a" to board no. Take out endpara-beginpara in # emphasis tagset. gets deg& right. gets rid of empty tagsets. # puts a space after end sup/subscrpt and end emphasis tags. # joins seqlists that ought to be joined. fixes numstyle attrib. # rejoins paras broken by page or hyphen. # substitutes para0 title tags for emphasis tags found in paras. # puts figures on the outside of notes, warnings, and cautions. # Attempts to fix the seqlist nesting problem. # gets nested emphasis tags to titles of para0s. # start loping while (<>) { # remove para tags from within emphasis tags. s:(.*)\n:\1\n:g; # join hyphenated lines, take out hardreturns, replace multispaces with one space. # s:-\n::g; s/\n+/ /g; s/( )+/\1/g; # removes spaces after and before tags. s/> />/g; s/ ::g; s:::g; s:::g; s:::g; s:::g; s:::g; s:::g; # adds space after emphasis and s-script tags. s:():\1 :g; s:(): \1:g; s:():\1 :g; # gets the entities right. s:°ree;:°:g; s:&ree;:&:g; s:@reg;:®:g; # joins seqlists broken by paras. # s:::g; # dehyphenates mistakenly broken paras. s:-?([a-z]):\1:g; # puts seqlists inside the previous para. s#(:|.)[\032]*()#\1\2#g; # turns emphasis tags just inside paras into titles of the previous para0. # works on nested emphasis tags as well, to two levels. s#([0-z \-\(\)/]*)\. #\2.#g; s#([0-z \-\(\)/]*)\. #\2.#g; # puts in numstyle attrib. in seqlists. if(/<\/ITEM>[^<>\/]*<\/PARA><\/ITEM>[^<>/]*)<\/NOTE>/) { s:([^<>/]*)(
[^<>/]*
)+(
):\1\3\2:g; } if(/<\/FIGURE><\/CAUTION>/) { s:([^<>/]*)(
[^<>/]*
)+(
):\1\3\2:g; } if(/<\/FIGURE><\/WARNING>/) { s:([^<>/]*)(
[^<>/]*
)+(
):\1\3\2:g; } # puts untagged FIGURE inside tags. if(/FIGURE [0-9 \-\.]* [^<>]*<\/PARA>/) { s:FIGURE ([0-z\-\.]*\.) ?([^<>]*):
\2
:g; } # check for FIGURES with part of the label inside the title. if(/
[0-9\.]+ /) { s:(<FIGURE LABEL=")([^"]*)("><TITLE>)([0-9\.]+) :\1\2\4\3:g; } # puts cautions, warnings, notes inside the previous item tag. if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA><NOTE>/) { s:(</ITEM></SEQLIST></PARA>)(<NOTE><PARA>[^<>]*</PARA></NOTE>):\2\1:g; } if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA><WARNING>/) { s:(</ITEM></SEQLIST></PARA>)(<WARNING><PARA>[^<>]*</PARA></WARNING>):\2\1:g; } if(/<\/PARA><\/ITEM><\/SEQLIST><\/PARA><CAUTION>/) { s:(</ITEM></SEQLIST></PARA>)(<CAUTION><PARA>[^<>]*</PARA></CAUTION>):\2\1:g; } if(/"><PARA>[A-Z \-\/\(\)\.]*\. /) { s:(">)(<PARA>)([A-Z \-\/\(\)\.]*\.) :\1<TITLE>\3\2:g; } # joins seqlists that were created by bad tagging of cautions, warnings, and notes. if(/<\/CAUTION><\/ITEM><\/SEQLIST><\/PARA>):\1:g; } if(/<\/NOTE><\/ITEM><\/SEQLIST><\/PARA>):\1:g; } if(/<\/WARNING><\/ITEM><\/SEQLIST><\/PARA>):\1:g; } # fix erring para0s (figures, lb-in.) s:(lb in.):\1 \2:g; s:(lb in.):\1 \2:g; s:(lb in.):\1 \2:g; s:(lb in.):\1 \2:g; s:figure
:figureS \1:g; s:figure
:figureS \1:g; s:figure:figureS \1:g; s:figure:figureS \1:g; s:figure:figureS \1:g; # change the dtd header from docgasturb to dcgastep -- for meter books s:\[:[]>:; s:([^YH])*::; print; } The bitch of it is, this script worked fairly well and saved us enormous amounts of time. I hope that this ugliness demonstrates that Perl and SGML-like text files can allow a complete neophyte to do wonders, and that any hifalutin changes to the relative simplicity of the current XML spec would be detrimental to the average 'frustrated perl programmer'. Steve (I'm so ashamed) -- "All the good geek things, schampeo@hesketh.com only without all the http://a.jaundicedeye.com bad geek things." http://hesketh.com/schampeo/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sun May 17 21:39:29 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:24 2004 Subject: A little wish for short end tags References: <199805161649.JAA27414@boethius.eng.sun.com> <3.0.5.32.19980516161020.00995ea0@wasabi.hesketh.com> <3.0.5.32.19980517123239.00afcde0@wasabi.hesketh.com> Message-ID: <355F3D36.AADDE804@technologist.com> Sorry, Steven. I still don't understand. The output of the scripts was irregular, but it didn't use any of SGML's hundreds of "hard" features like entities, whitespace in tags etc. The data may not have been normalized in any formal sense, but it used a predictable set of SGML's easier features. This is "good enough" to allow processing with regexps and basically partial normalization. If the data had used entities, whitespace in tags, comments etc., then I would say it was completely unnormalized. But anyhow, I don't understand how you can point to your *success* at taming a system built around a language with hundreds of minimizations as proof that one of those minimizations should not be allowed in another language. Had you failed, solely because of short end-tags, that would have been a persuasive argument. My belief is that that would not happen, because there are so many other factors. Most data falls either into the category of predictable and simple (which yours seems to have) or unpredictable and requiring normalization (which most data authored by people with text editors will look like -- short end-tags or not). Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From digitome at iol.ie Mon May 18 12:58:20 1998 From: digitome at iol.ie (Sean Mc Grath) Date: Mon Jun 7 17:01:24 2004 Subject: A little wish for short end tags Message-ID: <199805181058.LAA12969@GPO.iol.ie> [Jon Bosak] >If you really like using shortcuts, then go all the way: get a genuine >SGML tool and define a DTD that allows not just end-tag minimization >but full omission of both start-tags *and* end-tags. Knock yourself >out. Just make sure to normalize the result before you call it XML >and ship it out to the rest of us to work with. I just had to reply and express my wholehearted aggreement with Jon's posting. SGML and SGML power tools can make excellent XML production systems. James Clarks SGML to XML tool - SX - for example is built on top of the core SP library and thus you get basically fully blown SGML power. All the minimization power you can shake a stick at. Sean Sean Mc Grath http://www.digitome.com/sean.htm County Sligo, Ireland, Tel: +353 96 47391 xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tyler at infinet.com Mon May 18 14:32:45 1998 From: tyler at infinet.com (Tyler Baker) Date: Mon Jun 7 17:01:24 2004 Subject: question: NodeIterator in DOM References: <199805151519.RAA02352@hs2114.wdf.sap-ag.de> Message-ID: <35602AC2.60993E3A@infinet.com> Akitoshi Yoshida wrote: > Hi, > Another question regarding NodeIterator: > You have an NodeIterator instance with the iterator > position set somewhere in the middle. When you > remove three nodes: first, the node just after the > iterator position, second the node after this removed > node, and finally the node just before > the original iterator position, > then what should toNextNode() and toPrevNode() return? Essentially a similiar problem occurs in the JDK 1.2 Collection classes. What JavaSoft suggests is that for all Iterators (a new interface in the JDK 1.2 Collection classes) you synchronize on the object being iterated so that no internal changes to the object's state are made while iterating. Since DOM is supposed to be architectural and language neutral, languages which lack synchronization support may find some difficulty in making sure that an object which returns an Iterator object is not mutated while iteration is occurring in a multithreaded environment. If your application is single threaded, you should not have to worry about your object (such as a List) that returns the Iterator object being mutated except for programming error where you add and remove objects to the object returning the Iterator object while iterating. Sorry to be so convoluted here but I got 3 hours of sleep last night so I hope the above made sense. Tyler xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Amitr at abinfosys.com Mon May 18 16:42:46 1998 From: Amitr at abinfosys.com (Amit) Date: Mon Jun 7 17:01:24 2004 Subject: Loading an External DTD for an XML using MSXML Message-ID: <9805181515.AB25790@del2.vsnl.net.in> Hello All, I am invoking the MSXML java parser through the XMLDSO class which I am embeding in the tag in my HTML page. '' + '' + ''; In my tag itself I am loading the XML file (magic.xml). But I also want to call the DTD associated with the magic.xml file so in the beginning of my XML(magic.xml) I gave But, now when I run my HTML page which embeds the MSXML in the tag the MSXML throws the following exception :- "Cannot find External DTD Magic.dtd" (The Magic.dtd file is present and is the same direc. as magic.xml) Why is the MSXML not being able to access the external DTD (Magic.dtd)? Thanx in advance, Regards, AMIT xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From larsga at ifi.uio.no Mon May 18 17:42:00 1998 From: larsga at ifi.uio.no (Lars Marius Garshol) Date: Mon Jun 7 17:01:24 2004 Subject: SAX 1.0 : handleData and processingInstruction In-Reply-To: <355F110F.E39AD593@sophia.inria.fr> References: <355F110F.E39AD593@sophia.inria.fr> Message-ID: * Philippe Le Hégaret | | charecters takes an array of characters for the data : | | public abstract void characters(char ch[], | int start, | int length) throws SAXException | | processingInstruction takes a String for the data : | | public abstract void processingInstruction(String target, | String data) throws SAXException | | Why is there this difference in SAX ? I wasn't "present" when this decision was made, but I'd guess characters is the way it is because this way is faster in most implementations, since most parsers keep a character buffer during parsing and this avoids unecessary buffer copying. However, doing it this way is obviously less convenient for the user, who would prefer to just get a String and be done with it. I guess this is why the current form of processingInstruction was chosen, since processing instructions rarely occur often in documents and thus performance is less of an issue. See also: -- "These are, as I began, cumbersome ways / to kill a man. Simpler, direct, and much more neat / is to see that he is living somewhere in the middle / of the twentieth century, and leave him there." -- Edwin Brock http://www.stud.ifi.uio.no/~larsga/ http://birk105.studby.uio.no/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From syost at rational.com Mon May 18 21:50:18 1998 From: syost at rational.com (Steve Yost) Date: Mon Jun 7 17:01:24 2004 Subject: Diff/Merge tools? Message-ID: <00a901bd8296$38fb71f0$ad395ec7@ratskellar.atria.com> Is anyone aware of existing or in-development diff/merge tools, GUI-based or command-line based, for XML? I'd like to make use of one if it exists, and possibly build one otherwise. -Steve Yost xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 18 22:32:30 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:25 2004 Subject: do(duration){display video} In-Reply-To: <3.0.32.19980514213837.015e6bb0@interlog.com> Message-ID: <3.0.1.16.19980518211743.28973dec@pop3.demon.co.uk> At 21:38 14/05/98 -0400, Rolande Kendal wrote: >Still waiting for Jumbo2... RSN - my modem crashed. (Lightning hit it). I hacked a simple distribution today. I had hoped to get it out on the Net before Paris but shall proably have to wait till next weekend. Have tested it on a 10,000 element file (600 Kbytes) and it runs OK on my 166/32Mbytes. I have some minor problems on largish files - DXP seems to run out of memory (does it build a tree?) and I have a problem with another parser. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 18 22:34:47 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:25 2004 Subject: Xlink semantics In-Reply-To: <3.0.5.32.19980515120249.00a257b0@village.promanage-inc.com > References: <98May14.012336edt.26885@thicket.arbortext.com> <3.0.1.16.19980513204547.5bdf6c46@pop3.demon.co.uk> Message-ID: <3.0.1.16.19980518210830.2a5f7e80@pop3.demon.co.uk> At 12:02 15/05/98 -0400, Eve L. Maler wrote: >This is a really good issue. Obviously, the interactions among the >XLink-aware elements are underspecified! I agree with Masatomo Goto's >interpretation of the various element configurations. A few other comments >below... This is most helpful. After thinking about Masatomo Goto's reply I realised that some of the things I had thought were semantically incorrect were useful. I have been thinking that we need something rather like an API - or at least a guide - for people writing link processor tools. I will try to work out what is - and what isn't - allowed in the spec and maybe if we meet in Paris have a discussion. BTW I think XLink is incredibly powerful and if we can develop abstract processing machinery will revolutionise a lot of what I and others want to do. My personal prejudice will be to look at XLink first before other ways of solving problems of inheritance, relations, networks, etc. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Mon May 18 22:38:20 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:25 2004 Subject: SAX: White space after last element In-Reply-To: <199805141242.IAA00235@unready.microstar.com> References: <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> <000a01bd7f24$02de1160$1e09e391@mhklaptop.bra01.icl.co.uk> Message-ID: <3.0.1.16.19980518210036.304749ea@pop3.demon.co.uk> At 08:42 14/05/98 -0400, David Megginson wrote: >SAX is very new, and I am very tired, after a difficult week (of which Sympathies. I know what it's like. Came back expecting a few hours before moving on only to find that lightning had blown the modem. Bought a new one. Spent half the night trying to make it work. No luck. Had to swop it the next day. Exhausted. >SAX 1.0 was only a tiny part). When I am better rested, I will take >some time to test the different SAX implementations and to work with >the authors to resolve any problems, or to take suggestions for >clarifying the interface. While Java interfaces and JavaDoc comments >are useful, they are no substitute for a proper written specification, >which I owe to all of you as soon as I can manage it. This is a novel use of the word 'owe'. We owe you a great deal. I was talking to some people in real life and they were saying how much they appreciated SAX because it provided just what they wanted. Documentation is never fun while you are writing it, but there is a little bit of pleasure when you have finished it and can - metaphorically - put it in a binder. P. > > > >All the best, > > >David > >-- >David Megginson david@megginson.com > http://www.megginson.com/ > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cfranks at microsoft.com Tue May 19 01:54:43 1998 From: cfranks at microsoft.com (Charles Frankston) Date: Mon Jun 7 17:01:25 2004 Subject: parser for xml-data? Message-ID: rbourret@dvs1.informatik.tu-darmstadt.de wrote: > One possibility is that something in your XML document, such > as an attribute at > the root, would refer to the XML document containing the > XML-Data definition of > your grammar: > > > ... > > Another (uglier) possibility is that you use namespaces: the > XML-Data namespace > and the namespace your XML-Data data defines. I haven't > looked enough at either > the namespaces or XML-Data specs to be sure how this would > work, but it seems > the object structure might be something like: > > ... > Namespaces are intended to what you're asking for. I.e.: I don't see why using a standardized solution is uglier than inventing your own namespace tag. This does require you to namespace qualify your instance information. I.e. the tags that come from the urn:mycompany:MyRootSchema would be something like . xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cfranks at microsoft.com Tue May 19 03:50:41 1998 From: cfranks at microsoft.com (Charles Frankston) Date: Mon Jun 7 17:01:25 2004 Subject: parser for xml-data? Message-ID: > -----Original Message----- > From: Rick Jelliffe [mailto:ricko@allette.com.au] > Sent: Friday, May 08, 1998 5:58 PM > To: xml-dev@ic.ac.uk > Subject: Re: parser for xml-data? > > > From: Ron Bourret > > > The only major difference I have found so far for XML is > that the elements > in > > XML documents are ordered, while data members in OO > programming languages > are > > not. > > There is an error in the (January 5?) XML-data report, in the > very first > example. > It gives the clear impression that XML-data does not > constrain sequence for > the element types it describes. In the example, the > declarations for the > element > types which can appear as the content of an element types are > given in one > order, but the instance has them in another order. (A Microsoft > representative > pointed this out to me: I dont know why they haven't just > reissued the note, > since it > is a fairly cricitical point for implementors.) Well, it's not actually a note, Rick, it's a submission to the W3C. But yes, the errors should be corrected, presumably by a re-submission. There are more, mostly less serious, errors and typos that should be corrected as well. > > Also note that the usage of ISO 8601 date formats seems to be wrong. > ISO 8601 date format is yyyy-mm-dd, e.g. 1998-05-09, and not 19980509, > last time I looked. Both 1998-05-09 and 19980509 are legal in ISO 8601 (there's a "full" and a "basic" format, or something like that). However, my current inclination is always to use the full form, i.e. 1998-05-09, as per Misha Wolf's and Charles Wickstead's note: http://www.w3c.org/TR/NOTE-datetime-970915.html. > > If anyone is thinking of implementing XML-data, I suggest > they befriend the > authors, because the report misses out on several key issues. (I have > previously > mentioned that is does not seem to make clear whether you can have an > XML-data schema as part of a document, or whether it must be > external. If it > is internal, can it describe the document's root element? I > suppose a close > reading of the XML-data text might help, but it is not clear > to me after > dozens of readings, but I do not claim to be particularly > brilliant in this > area.) Befriending the authors is always a good idea :-), as is allowing schema information in a document instance. I think the next revision should try to define this. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From murata at apsdc.ksp.fujixerox.co.jp Tue May 19 08:50:53 1998 From: murata at apsdc.ksp.fujixerox.co.jp (MURATA Makoto) Date: Mon Jun 7 17:01:25 2004 Subject: Example XML documents in Japanese Message-ID: <199805190652.AA01126@murata.apsdc.ksp.fujixerox.co.jp> Example XML documents in Japanese are available. They are encoded in UTF-16 (big endian and little endian), UTF-8, iso-2022-jp, shift_jis, and euc-jp. One document is the translation of the XML PR, and it will soon be replaced with that of the XML recommendation. The other document is my weekly report and its DTD is also available. This document uses a number of element type names in Japanese. The URL is: http://www.fxis.co.jp/DMS/sgml/xml/charset/xml-japan.zip Makoto Fuji Xerox Information Systems Tel: +81-44-812-7230 Fax: +81-44-812-7231 E-mail: murata@apsdc.ksp.fujixerox.co.jp xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Tue May 19 10:44:05 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:25 2004 Subject: parser for xml-data? Message-ID: <199805190833.KAA09534@berlin.dvs1.tu-darmstadt.de> Charles Frankston wrote: > rbourret@dvs1.informatik.tu-darmstadt.de wrote: > > > One possibility is that something in your XML document, such > > as an attribute at > > the root, would refer to the XML document containing the > > XML-Data definition of > > your grammar: > > > > > > ... > > > > Another (uglier) possibility is that you use namespaces: the > > XML-Data namespace > > and the namespace your XML-Data data defines. I haven't > > looked enough at either > > the namespaces or XML-Data specs to be sure how this would > > work, but it seems > > the object structure might be something like: > > > > ... > > > > Namespaces are intended to what you're asking for. I.e.: > > prefix="myschema" src="http://something/MyRootSchema.xml"?> > > I don't see why using a standardized solution is uglier than inventing your > own namespace tag. This does require you to namespace qualify your instance > information. I.e. the tags that come from the urn:mycompany:MyRootSchema > would be something like . It looked ugly to me not because of using namespaces, but because I couldn't figure out which namespace owned the root tag. My first guess was something like this, which I think is a bit ugly, not to mention invalid: Root object XML-Data root XML-data data... Your data root Your data... I hadn't thought of your solution because I assumed namespace declarations pointed to DTDs, not to any general schema mechanism. Several other questions / comments about namespaces: 1) How do you use multiple namespaces in a valid document? That is, if you have two separate DTDs (schemas), neither of which references elements in the other, how do you build a single valid document with both of them? Elements from the first DTD can't nest inside elements from the second DTD (because they aren't in the second DTD's grammar) and vice versa. The example in section 3.1 of the namespaces spec is well-formed, but the spec doesn't explain how it can be valid. Presumably, it doesn't match any of the DTDs presented as namespaces. 2) The src attribute in your namespace declaration does not point to a DTD; it points to an XML-Data file. While the namespace spec does not prohibit this, I had simply assumed that the schema would be a DTD. It would be nice if the namespace spec clarified that it does not impose any rules on the format of a namespace schema. This is important for validating parsers, as it means that namespace declarations are dependent on the parser's ability to read the particular schema format that is used. (And if a parser can read multiple schema formats, how does it know which one to use?) 3) Why is production [1] in the namespace spec: [1] NamespacePI ::= '' instead of: [1] NamespacePI ::= '' Is the ambiguity of the production, which needs to be qualified with the Required Parts constraint, worth the flexibility in the order of PrefixDef, NSDef, and SrcDef? My opinion is no. -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jjaakkol at cs.Helsinki.FI Tue May 19 16:46:14 1998 From: jjaakkol at cs.Helsinki.FI (Jani Jaakkola) Date: Mon Jun 7 17:01:25 2004 Subject: Report on the elimination of SGML AND groups (fwd) Message-ID: I'm sure that there are people in this list, who might find this report interesting. However, be prepared for some theoretically oriented computer science. You have been warned. ---------- Forwarded message ---------- Date: Tue, 19 May 1998 12:44:54 +0300 (EET DST) From: Pekka Kilpelainen To: Anne.Brueggemann-Klein@informatik.tu-muenchen.de, dwood@cs.ust.hk, cmsmcq@tigger.cc.uic.edu, marcy@world.std.com, ak117@freenet.carleton.ca Cc: Helena.Ahonen@cs.helsinki.fi, Barbara.Heikkinen@cs.helsinki.fi, Oskari.Heinonen@cs.helsinki.fi, Jani.Jaakkola@cs.helsinki.fi, Pekka.Kilpelainen@cs.helsinki.fi, Greger.Linden@cs.helsinki.fi, Jyrki.Niemi@cs.helsinki.fi, Kimmo.Paasiala@cs.helsinki.fi Subject: Report on the elimination of SGML AND groups Dear colleagues, FYI: I have written a report, where I analyze the possibilities and the lengtehening effect of replacing AND groups of SGML content models by equivalent XML model groups. The report is available as gnu-zipped Postscript through its abstract page at the address http://www.cs.helsinki.fi/~kilpelai/C-1998-12.html . I include at the bottom the abstract of the report. I would be thankful for any comments. Yours, Pekka Kilpelainen Pekka Kilpelainen, University of Helsinki, Dept. of Computer Science Email: Pekka.Kilpelainen@cs.helsinki.fi phone: +358 9 7084 4227, fax: +358 9 7084 4441 http://www.cs.helsinki.fi/~kilpelai --------------------------- SGML & XML Content Models Pekka Kilpel?inen University of Helsinki, Department of Computer Science Report C-1998-12, May 1998 16 pages http://www.cs.helsinki.fi/TR/C-1998-12/ The SGML and XML standards use a variation of regular expressions called content models for modeling the markup structures of document elements. SGML content models may include so called AND groups, which are excluded from XML. An AND group, which is a sequence of subexpressions separated by an &-operator, denotes the sequential catenation of its subexpressions in any possible order. If one wants to shift from SGML to XML in document production, one has to translate SGML content models to corresponding XML content models. The allowed content models in both SGML and XML are restricted by a requirement of determinism, which means that a parser recognizing document element contents has to be able to decide without lookahead, which content model token to match with the current input token, while processing the document from left to right. It is known that not all SGML content models can be expressed as an equivalent XML content model. It is also known that transforming an SGML content model into an equivalent XML content model may cause an exponential growth in the length of the content model. We discuss methods of eliminating AND groups and analyze the circumstances where they can be applied. We derive a tight bound of $e n!$ on the number of symbols in the result of eliminating an AND group of $n$ symbols, where $e = 2.71828...$is the base of natural logarithms. We present the analysis in a pedagogical manner, emphasizing mathematical methods which are typical to the analysis of algorithms. We also show that minimal deterministic automata for recognizing an AND group of $n$ distinct element names contain $2^{n}$ states and $n 2^{n-1}$ transitions, excluding the failure state and transitions leading to it. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jcupp at essc.psu.edu Tue May 19 17:03:21 1998 From: jcupp at essc.psu.edu (Jason R. Cupp) Date: Mon Jun 7 17:01:25 2004 Subject: XML & Postmodernism, was: Separation of formatting... Message-ID: <35618CB3.DA3035C8@essc.psu.edu> Gregg Reynolds wrote: > > Personally I've come around to a pragmatist position on this, after some > time as a radical Free The Text purist. It's just not possible to > encode information without also encoding something about what we're > supposed to do with it, any more than it's possible to draw a "real" > line segment. But we can certainly do some very useful things by > trying. > > An interesting paper on a similar topic is at > http://www.sil.org/sgml/ohco1.html, "Refining Our Notion of What Text > Really Is: The Problem of Overlapping Hierarchies." > -- That's a great paper! Was/is there any attempt to address these issues in XML? -- the idea that there is no unique logical privileged perspective as it relates to the encoding of a text (or at least there shouldn't be); that multiple hierarchical sub perspectives can exist orthogonally within a non-hierarchical perspective. The work of any parser would be to deconstruct whatever perspective he/she wished (through a stylesheet perhaps). What immediately came to mind were namespaces: Mr. Thurston J. Howell , III Where P2,P1 are sub perspectives of P. With a perspective aware parser, this no longer becomes mixed content. If you follow the logic through then there should really be no unique logical API for XML such as SAX or DOM... -- Jason R. Cupp (jcupp@essc.psu.edu) The Pennsylvania State University xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Tue May 19 17:20:29 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:25 2004 Subject: Empty End Tags Considered Confusing Message-ID: <199805191516.LAA07380@ruby.ora.com> I think it was xml-dev that was discussing this; the archive is down (Henry, are you listening?). I was pretty agnostic on the matter; I won't use them, but can cope with them. But I got this message from O'Reilly's German office today. They're a pretty clued group of people, but a book rife with empty end-tags had them stumped: ------- Start of forwarded message ------- Date: Tue, 19 May 1998 12:27:56 +0200 (MET DST) To: Chris Maden From: ... <...@ora.de> Subject: (SGML) end tags in Linux Device Drivers Hi Chris, in "Linux Device Drivers" the sgml files contain as end tags for everything instead of specific end tags (like , , etc.) Is this only because this book was written in SGML by the author or will we have this kind of end tag in future sgml files from FrameMaker->SGML conversion. I ask because we have a simple Perl script which produces readable HTML files (for reviewers, proof readers etc.). It has worked fine for us during the last projects. With as an unspecific end tag we cannot use this script. It's not a problem for our translation of Device Drivers (the typesetter can handle it) - I just would like to know which end tags will be used for the next projects (and if it would be worth the effort to update our Perl script). Regards, ... ------- End of forwarded message ------- Now, the Perl script could have been updated, but to handle the general case (even in an XML+empty-end-tag world) would have added considerable complexity to what's currently a pretty simple script. They can use spam to get around this, or hack up a custom solution for this simpler case (which is what one of their people did). But it's confusing and even potentially dangerous, without a pretty large chunk of SGML knowledge (ref. _The SGML FAQ Book_; you need to know a lot to avoid *accidentally* using a feature). I have to side with Jon Bosak et al. that empty end-tags are potentially a sizable amount of trouble, and that the perceived value is negligible in a compressed-transfer world. -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From cfranks at microsoft.com Tue May 19 20:20:16 1998 From: cfranks at microsoft.com (Charles Frankston) Date: Mon Jun 7 17:01:25 2004 Subject: parser for xml-data? Message-ID: > -----Original Message----- > From: rbourret@dvs1.informatik.tu-darmstadt.de > [mailto:rbourret@dvs1.informatik.tu-darmstadt.de] > Sent: Tuesday, May 19, 1998 1:34 AM > To: xml-dev@ic.ac.uk; Charles Frankston > Subject: RE: parser for xml-data? > 1) How do you use multiple namespaces in a valid document? > That is, if you have > two separate DTDs (schemas), neither of which references > elements in the other, > how do you build a single valid document with both of them? > Elements from the > first DTD can't nest inside elements from the second DTD > (because they aren't in > the second DTD's grammar) and vice versa. The example in > section 3.1 of the > namespaces spec is well-formed, but the spec doesn't explain > how it can be > valid. Presumably, it doesn't match any of the DTDs > presented as namespaces. DTDs are not well-equipped to handle namespaces. It can technically be done: for example, you could allow your outer DTD to have 'ANY' content. XML-Data schemas are designed to integrate with namespaces: > > 2) The src attribute in your namespace declaration does not > point to a DTD; it > points to an XML-Data file. While the namespace spec does > not prohibit this, I > had simply assumed that the schema would be a DTD. It would > be nice if the > namespace spec clarified that it does not impose any rules on > the format of a > namespace schema. This is important for validating parsers, > as it means that > namespace declarations are dependent on the parser's ability > to read the > particular schema format that is used. (And if a parser can > read multiple > schema formats, how does it know which one to use?) > Most of the XML-Data spec describes the rules of the XML-Data schema file. The schema happens to use XML instance syntax, rather than the separate grammar approach of DTDs. We think this is a big advantage -- you can use all the tools you have for editing XML instance data to edit XML-Data schemas. > 3) Why is production [1] in the namespace spec: > > [1] NamespacePI ::= ' | SrcDef))+ '?>' > > instead of: > > [1] NamespacePI ::= ' (S SrcDef)? '?>' > > Is the ambiguity of the production, which needs to be > qualified with the > Required Parts constraint, worth the flexibility in the order > of PrefixDef, > NSDef, and SrcDef? My opinion is no. The ns, prefix, and src parameters to a namespace PI look a lot like attributes (although they are not in a formal sense). Since attributes in XML do not have to be in a particular order, it would certainly be surprising for people to discover that attributes in a namespace have to be a particular order. You're suggesting that the syntax be made harder to use in order to make the productions easier to author. I think this is a bad tradeoff. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From simpson at polaris.net Wed May 20 18:43:50 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:25 2004 Subject: Comments, parsers, XPointers In-Reply-To: Message-ID: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> >From previous (sometimes contentious) discussions on this list, I gather that there's no provision in SAX for passing document to a downstream application. The XPointer WD, production 12, allows for relative addressing by node type (element, PI, and so on), *including* comments. Does the downstream invisibility of comments imply that a URL like this will always fail? somedoc.xml#child(1,#comment) Thanks for any insights, John John E. Simpson | It's no disgrace t'be poor, simpson@polaris.net | but it might as well be. | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From crism at ora.com Wed May 20 19:27:41 1998 From: crism at ora.com (Chris Maden) Date: Mon Jun 7 17:01:25 2004 Subject: Comments, parsers, XPointers In-Reply-To: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> (simpson@polaris.net) Message-ID: <199805201726.NAA16393@ruby.ora.com> [John E. Simpson] > From previous (sometimes contentious) discussions on this list, I > gather that there's no provision in SAX for passing document to a downstream application. > > The XPointer WD, production 12, allows for relative addressing by > node type (element, PI, and so on), *including* comments. > > Does the downstream invisibility of comments imply that a URL like > this will always fail? > somedoc.xml#child(1,#comment) > > Thanks for any insights, Insight: XML != SAX. A purely SAX implementation will not see comments, so an XPointer implementation based on it will fail with that pointer. However, a different XML parser may still have the comments, and an XPointer based on it may well find them. A complete XPointer implementation would need to have access to the comments, though I'm not entirely sure what the purpose would be (except possibly for editing applications). -Chris -- http://www.oreilly.com/people/staff/crism/ +1.617.499.7487 90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Wed May 20 19:32:37 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers In-Reply-To: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> Message-ID: <198001011834.NAA00512@unready.megginson.com> John E. Simpson writes: > >From previous (sometimes contentious) discussions on this list, I gather > that there's no provision in SAX for passing document to > a downstream application. > > The XPointer WD, production 12, allows for relative addressing by node type > (element, PI, and so on), *including* comments. > > Does the downstream invisibility of comments imply that a URL like this > will always fail? > somedoc.xml#child(1,#comment) With SAX level 1, it does. I had a very interesting e-mail discussion with Eve and Steve about the whole issue of pointing at comments, and it will interesting to see how that develops in the next draft. The DOM includes the comments to support authoring tools, but I imagine that many (most?) DOM builders will simply discard the comments before constructing the tree (comments have no purpose on the production/browsing side). In the longer term, what we need is an official definition of an XML information set, specifying (for example) that reporting comments is optional, while reporting the start and end of elements is required. Once such a beastie exists, many vexing questions about (and inconsistencies among) the DOM, XPointers, SAX, XSL, etc. will disappear. All the best, David -- David Megginson david@megginson.com http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From M.H.Kay at eng.icl.co.uk Wed May 20 19:52:43 1998 From: M.H.Kay at eng.icl.co.uk (Michael Kay) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers Message-ID: <017601bd8418$40dabb00$1e09e391@mhklaptop.bra01.icl.co.uk> >The XPointer WD, production 12, allows for relative addressing by node type >(element, PI, and so on), *including* comments. > Even more contentious is that it allows access to chunks of CDATA. XPointer (and to some extent the DOM as well) in my view fails to recognise that XML defines two object models, a logical model and a physical model. SAX quite clearly and explicitly gives you access to the logical model only, whereas XPointer and DOM are rather ambiguous about the distinction. In my view, if XPointer is intended as a mechanism for defining relationships and underpinning hyperlinks, then it should only allow reference to objects in the logical view only. Mike Kay xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Wed May 20 20:38:08 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:26 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: I'd like to announce the posting of an unofficial and incomplete proposal for the representation of XML DTDs as XML documents at: http://members.aol.com/simonstl/xml While it lacks _any_ official standing, I hope folks will take a look at it and consider its possibilities. It has some overlap (and I think eventual compatibility) with XML-Data, but it aims at a much smaller target. This proposal is more interested in making XML a friendlier and more self-consistent environment than in creating complex schemas for XML content. All comments, positive and negative, are welcome. Unlike my other essays, this one is explicitly _not_ copyrighted; I'd really like to see these ideas flower, even as part of someone else's proposal (as is probably necessary). If anyone is interested in contributing, I'd like very much to expand this. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From simpson at polaris.net Wed May 20 21:23:31 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers In-Reply-To: <198001011834.NAA00512@unready.megginson.com> References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> Message-ID: <3.0.3.32.19980520152214.00bd52d4@nexus.polaris.net> At 01:34 PM 1/1/80 -0500, you wrote: > [various helpful stuff] >(comments have no purpose on the production/browsing side). I was wondering about that. Specifically, I was wondering about a "view source"-type feature in XML browsers, perhaps with a show/hide comments toggle. I've no experience with SGML, but I (and a lot of -- most? -- other HTMLites) have frequently made use of other developers' comments for learning purposes, outside the context of formal training. Assume there were some convention for associating a comment with a particular element or other component of the logical (even physical) model (probably a big assumption!); some facility for locating element X's comment, if any, would be extremely helpful for such purposes. A docname.xml#child(1,elementname).(1,#comment) sort of XPointer seems like a natural construct in this case. Of course excessive comments add to network load, invite abuse and so on (I still get the willies when I see scripting embedded in HTML comments); but they aren't always noise, and aren't always useful only to their author. >In the longer term, what we need is an official definition of an XML >information set, specifying (for example) that reporting comments is >optional, while reporting the start and end of elements is required. >Once such a beastie exists, many vexing questions about (and >inconsistencies among) the DOM, XPointers, SAX, XSL, etc. will >disappear. Yes to all! Thanks, David. John John E. Simpson | It's no disgrace t'be poor, simpson@polaris.net | but it might as well be. | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From simpson at polaris.net Wed May 20 21:28:57 1998 From: simpson at polaris.net (John E. Simpson) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers In-Reply-To: <199805201726.NAA16393@ruby.ora.com> References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> Message-ID: <3.0.3.32.19980520152809.00bd1580@nexus.polaris.net> At 01:26 PM 5/20/98 -0400, Chris Maden wrote: >[John E. Simpson] >> Thanks for any insights, > >Insight: XML != SAX. Heh. Thanks. >A complete XPointer >implementation would need to have access to the comments, though I'm >not entirely sure what the purpose would be (except possibly for >editing applications). Well, as I just mentioned in a reply to DavidM, it's certainly not *critical* that comments be made available to downstream apps other than editors. But they seem to serve some valid purposes as well -- not necessarily just of the "remind the author what this next thing does." John John E. Simpson | It's no disgrace t'be poor, simpson@polaris.net | but it might as well be. | -- "Kin" Hubbard xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From ak117 at freenet.carleton.ca Wed May 20 21:52:20 1998 From: ak117 at freenet.carleton.ca (David Megginson) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers In-Reply-To: <3.0.3.32.19980520152214.00bd52d4@nexus.polaris.net> References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> <198001011834.NAA00512@unready.megginson.com> <3.0.3.32.19980520152214.00bd52d4@nexus.polaris.net> Message-ID: <198001012046.PAA01198@unready.megginson.com> John E. Simpson writes: > At 01:34 PM 1/1/80 -0500, you wrote: > >(comments have no purpose on the production/browsing side). > I was wondering about that. Specifically, I was wondering about a > "view source"-type feature in XML browsers, perhaps with a > show/hide comments toggle. I've no experience with SGML, but I > (and a lot of -- most? -- other HTMLites) have frequently made use > of other developers' comments for learning purposes, outside the > context of formal training. I can imagine as many as three "view" features for XML browsing beyond the default formatted presentation: 1) View Source: see the raw, unparsed source of document entity, possibly with options for viewing the source of other entities. 2) View Logical Structure: see a tree control showing the element structure of the document, probably based on a DOM. 3) View Physical Structure: see a tree control showing the physical structure of the document, starting at the document entity; 'embed' XLinks might also be included here. It should be possible to invoke "View Source" for any tree node. For the first, there is no need to use an XML parser at all: just show the original entity, comments, whitespace, and all. For the second, comments (and CDATA section boundaries s, etc.) would simply confuse the view by adding too many tree nodes -- I'd want them filtered out even if the underlying DOM builder supported them. For the third, I'd probably be interested only in external entities or XLinks. All the best, David -- David Megginson david@megginson.com http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 20 22:08:50 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:26 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <35633897.13A71868@technologist.com> Simon St.Laurent wrote: > > I'd like to announce the posting of an unofficial and incomplete proposal for > the representation of XML DTDs as XML documents at: We discussed this in the XML working group. The idea does have the benefits you list in your proposal. But there are more disadvantages than you have listed, and some hard problems to be solved. Nobody has ever done the work to flesh it out to the point where an XML-document notation is as expressive as the current DTD notation. I believe that people who have not been using the DTD notation for a long time underestimate the extent to which common use depends on weird stuff like parameter entities specifying partial content models, element types in element type declarations and so forth. It might be possible to build an XML-document notation that was as expressive as XML DTDs (or at least expressive enough) and didn't use any text substitution wizardry, but nobody has done that yet. This is not a problem if you are content having your XML-syntax used only by those who build simple DTDs. There are other, more subtle problems. Having a single notation appeals mathematically to those who like consistent, recursive structures, but it is not clear that end-users fall into that category. A user interface (markup languages *are* user interfaces) can be too consistent, if it obscures the differences between things. In the case of documents and DTDs, I expect many users would get confused about the distinction between documents and DTDs if DTDs *were* documents. There is also the issue of compatibility. If the DTD for DTDs is extensible and open, as most proponents argue it should be, then Microsoft, Netscape and Sun can all take shots at "extending" it in the way that they "extended" HTML. If the DTD DTD was specifically designed to be extensible, then we could not complain about that. Depending on the level of extensibility, XML documents could actually parse differently depending on which browser you were using. If it was designed NOT to be extensible, then we have to cross one of the benefits of this alternate notation off of the list. Another problem is "specification encapsulation." XML 1.0 is specifically designed NOT to depend on XLink or XPointer. Your proposal depends on them. It seems to me that there is some sort of circularity problem there. Several of my complaints about your proposal stem from the fact that DTDs both change the parse and validate the document. In other words, they are both schemata and "parse information providers". If your XML-instance DTDs only validated, then many of the complaints would go away. But if they only validated, I don't think it would be accurate to call them DTDs anymore. Then they would be just "schemas" since they would accomplish only one of the DTD's two functions. I suspect that these functions will, in fact, become more and more distinct as time goes by. So proposals that keep them together should probably not succeed. Our current DTD syntax is quite reasonable and efficient for the task of declaring entities, setting default attributes and so forth. If we want extensible, XML-instance notation schemata, then we should probably forget about replacing DTDs and just define extensible, XML-instance schemata (i.e. XML-Data) and leave DTDs to do the other tasks. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 20 22:12:17 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:26 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <3563395C.8C348359@technologist.com> Simon St.Laurent wrote: > > I'd like to announce the posting of an unofficial and incomplete proposal for > the representation of XML DTDs as XML documents at: We discussed this in the XML working group. The idea does have the benefits you list in your proposal. But there are more disadvantages than you have listed, and some hard problems to be solved. Nobody has ever done the work to flesh it out to the point where an XML-document notation is as expressive as the current DTD notation. I believe that people who have not been using the DTD notation for a long time underestimate the extent to which common use depends on weird stuff like parameter entities specifying partial content models, element types in element type declarations and so forth. It might be possible to build an XML-document notation that was as expressive as XML DTDs (or at least expressive enough) and didn't use any text substitution wizardry, but nobody has done that yet. This is not a problem if you are content having your XML-syntax used only by those who build simple DTDs. There are other, more subtle problems. Having a single notation appeals mathematically to those who like consistent, recursive structures, but it is not clear that end-users fall into that category. A user interface (markup languages *are* user interfaces) can be too consistent, if it obscures the differences between things. In the case of documents and DTDs, I expect many users would get confused about the distinction between documents and DTDs if DTDs *were* documents. There is also the issue of compatibility. If the DTD for DTDs is extensible and open, as most proponents argue it should be, then Microsoft, Netscape and Sun can all take shots at "extending" it in the way that they "extended" HTML. If the DTD DTD was specifically designed to be extensible, then we could not complain about that. Depending on the level of extensibility, XML documents could actually parse differently depending on which browser you were using. If it was designed NOT to be extensible, then we have to cross one of the benefits of this alternate notation off of the list. Another problem is "specification encapsulation." XML 1.0 is specifically designed NOT to depend on XLink or XPointer. Your proposal depends on them. It seems to me that there is some sort of circularity problem there. Several of my complaints about your proposal stem from the fact that DTDs both change the parse and validate the document. In other words, they are both schemata and "parse information providers". If your XML-instance DTDs only validated, then many of the complaints would go away. But if they only validated, I don't think it would be accurate to call them DTDs anymore. Then they would be just "schemas" since they would accomplish only one of the DTD's two functions. I suspect that these functions will, in fact, become more and more distinct as time goes by. So proposals that keep them together should probably not succeed. Our current DTD syntax is quite reasonable and efficient for the task of declaring entities, setting default attributes and so forth. If we want extensible, XML-instance notation schemata, then we should probably forget about replacing DTDs and just define extensible, XML-instance schemata (i.e. XML-Data) and leave DTDs to do the other tasks. I would propose the following levels: XML parser (works with document and standard XML DTD) XML Link/XML Pointer engine (annotates parse tree with linking info) XML Schema Engine 1 (validates and modifies annotated parse tree) XML Schema Engine 2 (validates and modifies annotated parse tree) ... XML Schema Engine N (validates and modifies annotated parse tree) XML Application There is no level circularity here. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Wed May 20 22:25:17 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:26 2004 Subject: Comments, parsers, XPointers References: <017601bd8418$40dabb00$1e09e391@mhklaptop.bra01.icl.co.uk> Message-ID: <35633C33.B60CF1CD@technologist.com> Michael Kay wrote: > > XPointer (and to some extent the DOM as well) in my view > fails to recognise that XML defines two object models, a > logical model and a physical model. SAX quite clearly and > explicitly gives you access to the logical model only, > whereas XPointer and DOM are rather ambiguous about the > distinction. XML has no semantics and thus no object models. :) I am only half kidding. People are confused because the XML REC is confused. Where does it say that CDATA sections and comments are part of the physical and not logical structure? I agree that they *should be*, but where does it say that? Until XML's semantics are defined precisely, we must guess at them. John E. Simpson wrote: > > Well, as I just mentioned in a reply to DavidM, it's certainly not > *critical* that comments be made available to downstream apps other than > editors. But they seem to serve some valid purposes as well -- not > necessarily just of the "remind the author what this next thing does." I think that the semantics of comments should be precisely "remind the author what this next thing means." No more and no less. Any use for machine processing is abuse because it is bound to cause the problems you are complaining about. Some parsers will strip them out (as is their right). If you depend on that comment for anything more than source file maintenance, then you are in trouble. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Kenneth.J.Meltsner at jci.com Wed May 20 22:47:08 1998 From: Kenneth.J.Meltsner at jci.com (Meltsner, Kenneth J) Date: Mon Jun 7 17:01:27 2004 Subject: Alternative syntaxes (was RE: Proposal Announcement) Message-ID: <8625660A.0070EF73.00@Corpnotes.JCI.Com> There's a great prototype for "Adaptive Forms" at: http://www.isi.edu/~mm-proj/ The developers put together a Java program that takes a grammar for a "natural language-ish" statement, form, rule, query, etc. and provides a form-based interface that dynamically changes depending on the terms entered by the user. It seems like this would be a good way to enter information for some applications with complex DTDs. I suspect the grammar could be automatically generated (in many cases) directly from the DTD (or vice-versa?). In one of their papers, the authors briefly mention the Texas Instruments natural language shell from more than a decade ago that allowed users to frame complicated queries by selecting appropriate text fragments from a set of lists. This was an exceptionally powerful shell that never reached widespread popularity; perhaps this will be more successful. Ken Meltsner >From the first page: "Adaptive Forms is a tool for producing context-sensitive form-based interfaces. The system initially displays an overview of the main sections of a form, and an initial set of fields for the user to fill in. Depending on the values that the user enters, Adaptive Forms progressively adds new fields to the form. For example, a form for entering household information would show the user fields for entering the spouse's name only if the user had entered "married" in the "marital status" field. "The main design goal for Adaptive Forms is in entering structured information rapidly and without errors. One of our target applications was the specification of air campaign objectives, which are structured objects consisting of a verb (e.g., deny, gain), an aspect (e.g., what to deny or gain), an actor (e.g., country, a branch of armed forces), a location (e.g., a country or a region) and a time period. Each of the parts is itself a structured object whose substructure and possible values depend on the values specified for the other parts. For example, the aspects that can be gained are different from the aspects that can be denied, so the interface needs to compute the menus for the "aspect" field dynamically based on the fillers of other fields. Similar requirements arise in virtually any other application domain. " xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Wed May 20 22:59:49 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: It's good to get feedback. Paul Prescod stated: >Another problem is "specification encapsulation." XML 1.0 is specifically >designed NOT to depend on XLink or XPointer. Your proposal depends on >them. It seems to me that there is some sort of circularity problem there. I think this circularity problem is not too difficult to cope with in practice. It could be reduced, if not disposed of, by decomposing XML 1.0 into three different specs: 1) A document syntax specification (a simplified version of well-formed documents) 2) A syntax for linking to DTDs (and perhaps schemas) internal or external (which would depend on XLink) 3) A syntax for DTDs providing rules for validation. (Schema definitions could rest on top of #3 or beside it.) Even if these seems inadequate, I think it would be useful to reduce the number of ways users and developers reference external resources from XLinks, general entities, parameter entities, notations, and assorted other goodies to a single URI-based spec with consistent notation, preferably XLink. >Nobody has ever done the work to flesh it out to the point where an >XML-document notation is as expressive as the current DTD notation. I >believe that people who have not been using the DTD notation for a long >time underestimate the extent to which common use depends on weird stuff >like parameter entities specifying partial content models, element types >in element type declarations and so forth. I realize that these are significant challenges, and that conversion to an XML document format for DTDs doesn't whisk them away. What I'd like to see is a determined effort to map the 'weird stuff' to XML DTDs. Note, for instance, that I kept the SGML-like content model in my examples. It's sort of a half-way step, leaving in some of the old that the new may prosper. As for parameter entities, I think they can be expressed as easily through the model I presented. Fleshing this out is admittedly a large task that I don't think I can manage alone. >A user interface >(markup languages *are* user interfaces) can be too consistent, if it >obscures the differences between things. In the case of documents and >DTDs, I expect many users would get confused about the distinction between >documents and DTDs if DTDs *were* documents. This is a significant consideration I hadn't noticed, and I'll address it more directly in the next revision. I don't think it's insuperable. >There is also the issue of compatibility. If the DTD for DTDs is >extensible and open, as most proponents argue it should be, then >Microsoft, Netscape and Sun can all take shots at "extending" it in the >way that they "extended" HTML. If the DTD DTD was specifically designed to >be extensible, then we could not complain about that. Depending on the >level of extensibility, XML documents could actually parse differently >depending on which browser you were using. If it was designed NOT to be >extensible, then we have to cross one of the benefits of this alternate >notation off of the list. I would certainly want this to be extensible; parsers that didn't understand an extended portion of this DTD could simply ignore that portion, provided the document met the basic rules. I don't see this as a significant problem, especially after looking at several of the schemas other people are proposing for XML documents. XML is going to fragment to a certain extent; I'd like to see the core made more extensible to provide an orderly framework for such extensions rather than letting them run off on their own. >Several of my complaints about your proposal stem from the fact that DTDs >both change the parse and validate the document. In other words, they are >both schemata and "parse information providers". If your XML-instance DTDs >only validated, then many of the complaints would go away. But if they >only validated, I don't think it would be accurate to call them DTDs >anymore. Then they would be just "schemas" since they would accomplish >only one of the DTD's two functions. I think you miss part of the point - that this is a representation intended to completely represent the same data that is provided currently by XML DTDs. You seem to assume that there will be data lost in the transition, without pointing out where it would be lost. I think you could build the same schemas and parse information with this system as you could with the current XML DTD structure, while providing the extensibility that several other proposals seem to need. I would rather _not_ provide full schema information in this proposal - moving XML DTDs to a new format seems like enough of a task to start with. In the long run, of course, DTD would be an inadequate term to describe these. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Thu May 21 00:24:20 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <356357DE.47BA453E@allette.com.au> Simon St.Laurent wrote: > What I'd like to see is a > determined effort to map the 'weird stuff' to XML DTDs. Note, for instance, > that I kept the SGML-like content model in my examples. It's sort of a > half-way step, leaving in some of the old that the new may prosper. As for > parameter entities, I think they can be expressed as easily through the model > I presented. What about issues such as redefining a parameter entity in the external subset? Or multiple identically named parameter entities within the DTD? These seem like issues where you would either have to lose information from the DTD or define behaviour for the XML DTD that doesn't apply to other XML documents. If that were the case, I would question the benefit of the proposed syntax. Sticking with parameter entities, what if the same entities behaved as a content model and an attribute name? How and why would you make the distinction with the proposed syntax? I think that the syntax that describes the structure of documents can validly be different from the syntax that frames data because they're trying to accomplish very different things. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Thu May 21 01:45:21 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: >What about issues such as redefining a parameter entity in the external subset? Or >multiple identically named parameter entities within the DTD? These could both be defined by specifying the behavior for the DTD-as-document, just as they are now for DTDs. I see no difficulty here. >Sticking with >parameter entities, what if the same entities behaved as a content model and an >attribute name? How and why would you make the distinction with the proposed >syntax? How and why would you make the distinction with the current syntax? Why would this be so difficult to do in a document rather than the current syntax? Are you saying that it would be crossing element boundaries and therefore break the well-formedness requirements? The syntax is currently (obviously) incomplete; I'll see what I can do to address this issue. I've obviously done an inadequate job here. Parameter entities are admittedly my least favorite part of XML, a necessary evil and a powerful tool. There may well be limits on how well they can map to this model - but is that a significantly worse limitation than the abolition of the & content model? I think the manageability you'd gain with this representation of XML DTDs would more than compensate for any loss incurred by the enforced simplification of parameter entities. >I think that the syntax that describes the structure of documents can validly be >different from the syntax that frames data because they're trying to accomplish >very different things. To a certain extent, this is certainly true. However, I think there's a strong case to be made for using a single syntax - see the advantages listed in the Rationale. I'm very happy with the document syntax XML inherited from SGML, particularly as XML made that syntax much more strictly enforced. I'm not as happy with the DTD syntax - and this seems like a good way to take advantage of the power of XML's document syntax. The current DTD syntax is workable - but not very extensible. I see a lot of effort being put into schemas and other projects that seem to add additional layers of complexity, and require applications to implement all kinds of extra linkages. By standardizing the linkage mechanism and the format for these extensions (as XLink or a derivative and XML documents, respectively), I hope to see a lot less EBNF and a lot more XML. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 21 02:26:32 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <35637063.FE3EF5A7@technologist.com> Simon St.Laurent wrote: > > It's good to get feedback. > > 1) A document syntax specification (a simplified version of well-formed > documents) > 2) A syntax for linking to DTDs (and perhaps schemas) internal or external > (which would depend on XLink) > 3) A syntax for DTDs providing rules for validation. > (Schema definitions could rest on top of #3 or beside it.) So at what level do I get the equivalent of internal entities and defaulted attributes? And what levels are required of all XML processors vs. optional? > I would certainly want this to be extensible; parsers that didn't understand > an extended portion of this DTD could simply ignore that portion, provided the > document met the basic rules. The important point is that you aren't talking about an alternate notation for XML DTDs, but a complete change in the relationship between DTDs and documents. In XML, DTDs can affect the interpretation (not just validation) of a document, through entities, defaulted attributes, element-content vs. mixed-content and so forth. If DTDs can both change the way a document is parsed *and* be extensible, then two parsers could get completely different information out of the same document. For example: One company's DTD extension could add in SGML tag ommission. The start- and end-tag of an element could be implied, without violating well-formedness. So then you could use that company's parser through SAX and get a completely different set of events than if you used someone else's parser. After all, changing the parse is one of the responsibilities of the DTD. > I would rather _not_ provide full schema information in this > proposal - moving XML DTDs to a new format seems like enough of a task to > start with. I don't know what you mean by full schema information. DTDs serve as schemas (in addition to changing the parse). If you propose to replace DTDs, then you are in part designing a new schema language. My suggestion is to develop a new schema language *without* changing DTDs. In other words, I am suggesting you make your project smaller, not larger. I would suggest you forget about entities, defaulted attributes, etc. Leave those to DTDs. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Thu May 21 03:06:29 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: Paul Prescod wrote: >> 1) A document syntax specification (a simplified version of well-formed >> documents) >> 2) A syntax for linking to DTDs (and perhaps schemas) internal or external >> (which would depend on XLink) >> 3) A syntax for DTDs providing rules for validation. > >So at what level do I get the equivalent of internal entities and >defaulted attributes? And what levels are required of all XML processors >vs. optional? At what level do you get defaulted attributes now? Do you get defaulted attributes in a well-formed document without a DTD? Right now, it doesn't look like it. This could be in level 1, if default attributes were deemed necessary to document syntax, but I'd expect to see it in level 3. Internal entities could be defined much as they are now, at the start of a document, within a structure set aside for that purpose using (or whatever develops) instead of . This would indeed need to be covered in level 1, unless you could live without internal entities. In the past you seemed quite happy about forcing scripts to be external to a document, so I can't see why it would be so terrible to exile entities - and DTDs as well - to separate documents either. I don't think it would be necessary, though, any more than it's necessary now. As for requiring levels, level 1 would serve a similar purpose to well-formed documents today. 2 would be a prerequisite for 3, of course. >For example: One company's DTD extension could add in SGML tag ommission. >The start- and end-tag of an element could be implied, without violating >well-formedness. So then you could use that company's parser through SAX >and get a completely different set of events than if you used someone >else's parser. After all, changing the parse is one of the >responsibilities of the DTD. I think this is overstating your case rather dramatically. I could do something similarly brutal by creating a PI at the start of a regular XML document and using the implied tags. No one else could read my documents, but I sure could. Not only that, but I already proposed separating the document syntax - which includes full start- and end-tags - from the DTD. There's no reason this proposal would allow the DTD to modify the basic document syntax and markup, period. >I don't know what you mean by full schema information. DTDs serve as >schemas (in addition to changing the parse). If you propose to replace >DTDs, then you are in part designing a new schema language. We can argue about the meaning of the word schema all you like; it's not that exciting for me. XML-Data performs similar mapping, but attempts to add a lot more, parts which I see more as data schemas. If this is a schema, then so be it. >My suggestion >is to develop a new schema language *without* changing DTDs. In other >words, I am suggesting you make your project smaller, not larger. I would >suggest you forget about entities, defaulted attributes, etc. Leave those >to DTDs. My suggestion is that DTD's present a significant problem in their current format, and that they could be improved significantly. I would enjoy being able to focus on elements and attributes, the core of XML (and SGML) document syntax, and worry less about the rest. This project already is an attempt to be smaller, but to provide a place for new things to grow. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mrc at allette.com.au Thu May 21 04:20:25 1998 From: mrc at allette.com.au (Marcus Carr) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <35638F2E.502A50D@allette.com.au> Simon St.Laurent wrote: > > What about issues such as redefining a parameter entity in the > > external subset? Or multiple identically named parameter > > entities within the DTD? > > These could both be defined by specifying the behavior for the > DTD-as-document, just as they are now for DTDs. I see no difficulty here. It seems that you are proposing ascribing characteristics to an entire class of XML documents, that is, when you encounter a particular element you ignore it based on the existence of a previous element's characteristics. That's essentially the behaviour of parameter entities, but is a semantic not automatically imposed on other XML documents. In my opinion, this difference is significant grounds for not making DTDs XML. > > Sticking with parameter entities, what if the same entities > > behaved as a content model and an attribute name? > > How and why would you make the distinction with the proposed > > syntax? > > How and why would you make the distinction with the current syntax? Why would > this be so difficult to do in a document rather than the current syntax? You wouldn't currently make the distinction, but you don't need to. In your example you indicated that a the parameter entity resolved to a content model (sorry, I'm working from memory). Assuming that a parameter entity is just a straight text replacement, you probably shouldn't make any determination about it's use at the time of declaration, though this may just be a syntactic issue. > Parameter entities are admittedly my least favorite part of XML, a necessary > evil and a powerful tool. There may well be limits on how well they can map > to this model - but is that a significantly worse limitation than the > abolition of the & content model? I think the manageability you'd gain with > this representation of XML DTDs would more than compensate for any loss > incurred by the enforced simplification of parameter entities. Having done some work with a Docbook derivative lately, I would argue that yes, messing with parameter entities could be more damaging for less gain than the abolition of the & model. I suppose I come from the camp that prefers no change, but I don't really find manageability a major issue - if I want a different view of the DTD, I could always convert it to HTML or use any number of applications to present it nicely. If you mean to extend the information stored in a DTD, I would also prefer to see this done within the existing framework as much as is feasible. > To a certain extent, this is certainly true. However, I think there's a > strong case to be made for using a single syntax - see the advantages listed > in the Rationale. I did read it, but I accidently deleted the mail before recording the URL - could you send it to me again? Thanks. -- Regards Marcus Carr email: mrc@allette.com.au _______________________________________________________________ Allette Systems (Australia) email: info@allette.com.au Level 10, 91 York Street www: http://www.allette.com.au Sydney 2000 NSW Australia phone: +61 2 9262 4777 fax: +61 2 9262 4774 _______________________________________________________________ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 21 12:48:54 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <356406D3.F69A123F@technologist.com> Simon St.Laurent wrote: > > At what level do you get defaulted attributes now? Do you get defaulted > attributes in a well-formed document without a DTD? No. But if you are talking about replacing the DTD, then I don't see how a comparison to documents without a DTD are relevant. > Internal > entities could be defined much as they are now, at the start of a document, > within a structure set aside for that purpose using (or whatever > develops) instead of . This would indeed need to be covered in level > 1, unless you could live without internal entities. Okay. So is in level 1 -- the document level, just like with DTDs. But presumably can use XLink. So now you've dragged XLink into level 1. Now we are back to specification circularity. Am I missing something here? > In the past you seemed > quite happy about forcing scripts to be external to a document, so I can't see > why it would be so terrible to exile entities - and DTDs as well - to separate > documents either. I don't think it would be necessary, though, any more than > it's necessary now. I have never argued in favour of forcing scripts to be external to a document. I argued that everything I know about text processing says that putting scripts in textual documents is a bad idea -- and is in fact a regression to the technique that SGML was invented to replace. But not everybody uses XML or SGML for text processing, so I do not believe that the *language* should restrict them from embedding scripts. XSL is a perfect example of an appropriate mix of scripts and markup....but you'll notice that there is essentially no text in an XSL stylesheet. Anyhow, even if we exile entities, you still have the cirularity problem. How can a level 1 parser process entities (as they do now) if the syntax for declaring entities depends on XLink, XPointer, and other specifications that are suppoed to be separate from XML itself. Let me make this concrete (using a random DTDs in XML notation, with old-syntax comments for clarity): foo.xdtd: TEST2 foo.xml: &foo; Does the processor have to go and fetch foo.xdtd, read it and understand it before it can know the contents of this document? > As for requiring levels, level 1 would serve a similar purpose to well-formed > documents today. 2 would be a prerequisite for 3, of course. Well-formed documents can have entities. In fact, all XML documents that have entities are well-formed. > >For example: One company's DTD extension could add in SGML tag ommission. > >The start- and end-tag of an element could be implied, without violating > >well-formedness. So then you could use that company's parser through SAX > >and get a completely different set of events than if you used someone > >else's parser. After all, changing the parse is one of the > >responsibilities of the DTD. > > I think this is overstating your case rather dramatically. I could do > something similarly brutal by creating a PI at the start of a > regular XML document and using the implied tags. No you could not. The semantics of XML DTDs are *fixed*, not extensible. Any parser that interpreted processing instructions as commands to change the parse would be *wrong*. But you propose that DTDs should become extensible. Since DTDs can change the parse (radically, in some cases), your proposal would allow DTD extensions to make documents specific to particular processors, unless an amended proposal explicitly disallows that. > No one else could read my > documents, but I sure could. Not only that, but I already proposed separating > the document syntax - which includes full start- and end-tags - from the DTD. > There's no reason this proposal would allow the DTD to modify the basic > document syntax and markup, period. DTD's don't modify the document syntax and markup, but they do modify the parse tree created by the document. In other words, they modify its semantics. If you replace DTDs with something "extensible", you must expect them to be able to modify the parse tree in extensible ways, unless you explicitly disallow this in your proposal. Just as today's DTDs can have "implied attributes", Microsoft could invent one with "implied elements". Netscape could go in the opposite direction and give us "transparent elements" that do not show up in the parse tree at all. You could use the two parsers through SAX and get completely different parse trees. > My suggestion is that DTD's present a significant problem in their current > format, and that they could be improved significantly. I would enjoy being > able to focus on elements and attributes, the core of XML (and SGML) document > syntax, and worry less about the rest. This project already is an attempt to > be smaller, but to provide a place for new things to grow. If all you are interested in is elements and attributes then you are proposing a new schema language for XML, not a replacement for DTDs (which do more). It is clear that you associate the word schema with complexity, and I can't force you to use it. Schemata constrain the structure of data models (databases, documents, etc.) DTDs are schemata. They also do more. They can change the parse. That's what makes them complex and part of what makes *XML* complex. If you try to do everything that DTDs do, then your new language will also be needlessly complex. If you do not, then you are not replacing DTDs but rather inventing something new. It sounds like a new schema language to me. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 21 12:54:41 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:27 2004 Subject: Proposal Announcement - XML DTDs to XML docs References: Message-ID: <35640818.9251DC46@technologist.com> Simon St.Laurent wrote: > Parameter entities are admittedly my least favorite part of XML, a necessary > evil and a powerful tool. There may well be limits on how well they can map > to this model - but is that a significantly worse limitation than the > abolition of the & content model? I don't believe that I have ever written a DTD that *did* use the & occurrence indicator in a content model. I also don't believe that I have ever written a DTD (to be used more than once) that did NOT use parameter entities. If you invent enough subclassing/inheritance/content model reuse features, you may be able to get away without them. But you can't just take them out if you expect people to do serious document modelling in XML. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Thu May 21 13:33:58 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:27 2004 Subject: parser for xml-data? References: Message-ID: <35640BD5.F781F0BF@technologist.com> Charles Frankston wrote: > > DTDs are not well-equipped to handle namespaces. It can technically be > done: for example, you could allow your outer DTD to have 'ANY' content. This example can be done in DTD syntax in the same way that you do it with namespaces. In fact, namespaces were specifically designed to not break validation. (also, note that the ns pseudo-attribute is supposed to be a URL) > XML-Data schemas are designed to integrate with namespaces: > > > > > > > > > > > > > > > > > > xyz.dtd: zyx.dtd: instance.xml: You don't need "ANY" to use namespaces. > The ns, prefix, and src parameters to a namespace PI look a lot like > attributes (although they are not in a formal sense). Since attributes in > XML do not have to be in a particular order, it would certainly be > surprising for people to discover that attributes in a namespace have to be > a particular order. You're suggesting that the syntax be made harder to use > in order to make the productions easier to author. I think this is a bad > tradeoff. Note that the XML declaration has a required order of pseudo-attribute occurrence. It would be best if the XML-family of language were consistent. [23] XMLDecl ::= '' Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Thu May 21 15:27:07 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: >Okay. So is in level 1 -- the document level, just like with >DTDs. But presumably can use XLink. So now you've dragged XLink >into level 1. Now we are back to specification circularity. > >Am I missing something here? There are several possible answers. You could allow XML Level 1 parsers to ignore external entities if they choose - something similar is already in the spec right now (in a more limited case, section 4.1) for well-formed documents. You could hard-wire the href attribute's interpretation in a DTD - parsers are already dealing with references in the context of DTDs, and it doesn't seem that hard to make sense of href. Another option is to allow the circularity. This message was brought to you (at least partway) by the Internet Protocol, IP, defined in RFC 791. IP includes, and indeed requires, the services of ICMP (defined in RFC 792). ICMP uses IP to get from one place to another. Circular? Yep. Workable? Certainly. IP isn't allowed to generate extra ICMP messages about the delivery of an ICMP message. There is no circle in practice. Nor would there be a circle in _practice_ by allowing the level 1 spec to refer to the hrefs described in XLink, or to simply use href without further consideration. >foo.xdtd: > >TEST2 > > > > > > > > > >foo.xml: > > >&foo; > > >Does the processor have to go and fetch foo.xdtd, read it and understand >it before it can know the contents of this document? No more than it needs to in the current system, as stated in section 4.1: X>Note that if entities are declared in the external subset X>or in external parameter entities, a non-validating processor X>is _not_ _obligated_ _to_ read and process their declarations; X>for such documents, the rule that an entity must be declared X>is a well-formedness constraint only if _standalone='yes'_. >Well-formed documents can have entities. In fact, all XML documents that >have entities are well-formed. In fact, technically, X>A data object is an XML document if it is _well-formed_, X>as defined in this specification >The semantics of XML DTDs are *fixed*, not extensible. >Any parser that interpreted processing instructions as commands to change >the parse would be *wrong*. But you propose that DTDs should become >extensible. Since DTDs can change the parse (radically, in some cases), >your proposal would allow DTD extensions to make documents specific to >particular processors, unless an amended proposal explicitly disallows >that. I think you're dramatically misreading my argument, deliberately making this a bogeyman when it isn't. I see no reason why malicious DTDs would be allowed to 'change the parse' any more than current DTDs would be. Extensible DTDs do _not_ mean that anything goes. Behavior can be proscribed, rules can be set. A DTD in this proposal would be allowed to add things to the the parse, not change the fundamental rules set in level 1. Perhaps I should make this more explicit in the proposal - since the proposal is to 'map' XML DTD syntax to XML document syntax, it seemed reasonable to me that the same strictures demanded for processing an XML DTD would apply here. >DTD's don't modify the document syntax and markup, but they do modify the >parse tree created by the document. In other words, they modify its >semantics. If you replace DTDs with something "extensible", you must >expect them to be able to modify the parse tree in extensible ways, unless >you explicitly disallow this in your proposal. I don't think this is difficult; the types of extensions allowed can be limited to a reasonable set (data types, for instance) and expanded through the standards process when it appears necessary. Not everyone may want to wait, of course, but they'd find a way to get around DTDs anyway. I think you're going to see plenty of ersatz XML in practice anyway - one of the great things about SAX is that people can put _any_ kind of parser underneath it and watch it spit out nice-looking XML on top. As Chris Maden pointed out on another topic, CM>Insight: XML != SAX. >DTDs are schemata. They also do more. They can change the parse. That's >what makes them complex and part of what makes *XML* complex. If you try >to do everything that DTDs do, then your new language will also be >needlessly complex. If you do not, then you are not replacing DTDs but >rather inventing something new. It sounds like a new schema language to >me. Well, we'll see what happens. This proposal is only starting out, and complexities always look simpler at the beginning. Anyone who would like to help me figure out ways of expressing DTDs in XML document syntax is welcome to join this project - and suggestions for making sure that these DTDs don't change the parse in violent ways are also welcome. Even if this solution isn't perfect, it opens up a lot of questions that are worth asking about the current way of doing things. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Thu May 21 15:30:40 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: >I also don't believe that I have ever written a DTD (to be used more than >once) that did NOT use parameter entities. > >If you invent enough subclassing/inheritance/content model reuse features, >you may be able to get away without them. But you can't just take them out >if you expect people to do serious document modelling in XML. And once again, you're making a much stronger claim than I am. Simply because _I_ don't like parameter entities _doesn't_ mean they're being abolished. Making them as capable in this model as they are in the current DTDs will take some doing, but a basic model is already presented in the proposal. Suggestions for expanding that model - from anyone - are welcome. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 22 00:02:15 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Critique - XML DTDs to XML docs References: Message-ID: <3564A44A.FB0A686A@technologist.com> Simon St.Laurent wrote: > > There are several possible answers. You could allow XML Level 1 parsers to > ignore external entities if they choose - something similar is already in the > spec right now (in a more limited case, section 4.1) for well-formed > documents. Well, it's a mistake now, and not something I personally think should be perpetuated. Parsers should not choose what they look at or what they do not. This badly violates XML's goal of having no optional features. "External entity parsing" is an optional feature in XML. It doesn't make any sense to me that parsers should be able to decide what parts of an authors document to process, and I would not encourage you to perpetuate it into a new DTD replacement. > You could hard-wire the href attribute's interpretation in a DTD - > parsers are already dealing with references in the context of DTDs, and it > doesn't seem that hard to make sense of href. I thought that you wanted to use XLink and XPointer? > Another option is to allow the circularity. This message was brought to you > (at least partway) by the Internet Protocol, IP, defined in RFC 791. IP > includes, and indeed requires, the services of ICMP (defined in RFC 792). > ICMP uses IP to get from one place to another. Circular? Yep. Workable? > Certainly. IP isn't allowed to generate extra ICMP messages about the > delivery of an ICMP message. There is no circle in practice. Nor would there > be a circle in _practice_ by allowing the level 1 spec to refer to the hrefs > described in XLink, or to simply use href without further consideration. There is a reason that we usually choose not to have circular specifications. First, reading and writing them is often a pain. Second, the two become interdependent. As it is now, we could invent "XLink-Em" in five years, and deprecate XLink without affecting XML. This makes progress much easier. In fact, it is the primary reason that we split these things into different specifications in the first place -- so that they can grow separately. > I think you're dramatically misreading my argument, deliberately making this a > bogeyman when it isn't. I see no reason why malicious DTDs would be allowed > to 'change the parse' any more than current DTDs would be. Extensible DTDs do > _not_ mean that anything goes. Behavior can be proscribed, rules can be set. What would the rules be? What would extensions be allowed to do and not do? > A DTD in this proposal would be allowed to add things to the the parse, not > change the fundamental rules set in level 1. I guess I don't understand the difference between adding things and changing the fundamental rules of the "level 1" parse. DTDs DO change the fundamental rules of the fundamental parse. What could be more fundamental than this: ]> &foo; Now if DTD's were extensible, then I would expect to be able to do something like this: ]> &foo; And I would expect to be able to provide the behaviour for MY-ENTITY-DECLARATION (somehow). We could restrict DTD extension to data typing, but that strikes me as a step backwards. Verification is going to be (and should be) increasingly the job of non-DTD schemata. There is no good reason, in my mind, that verification of data types, or even element and attribute types, should be the responsibility of the parser. XML makes them the responsibility of the parser for historical reasons (but goes as far in separating the responsibility out as was possible). I would encourage you not to perpetuate that confusing conflation of responsibility in a DTD replacement. Verification should be handled at a different level and by a different piece of software than the parser. In other words, I think that we should be reducing the responsibilities of the DTD, rather than expanding them. A whole new syntax for a core part of the language would make XML much more complicated than it is now. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From jdg at midl.co.jp Fri May 22 00:56:44 1998 From: jdg at midl.co.jp (Joel de Guzman) Date: Mon Jun 7 17:01:28 2004 Subject: unsubscribe Message-ID: <199805212256.HAA26883@balthasar.hsnt.or.jp> unsubscribe xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From tommybranch at mindspring.com Fri May 22 01:02:03 1998 From: tommybranch at mindspring.com (Tommy) Date: Mon Jun 7 17:01:28 2004 Subject: unsubscribe Message-ID: <000c01bd850b$79d24600$102d56d1@ymmot> unsubscribe -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980522/e9d9ab98/attachment.htm From SimonStL at classic.msn.com Fri May 22 02:27:18 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Critique - XML DTDs to XML docs Message-ID: >There is a reason that we usually choose not to have circular >specifications. First, reading and writing them is often a pain. Second, >the two become interdependent. Yes, there is the 'ingrown toenail' metaphor for standards that rely on each other too closely and turn into a mess. > [re: using href declarations for references] >I thought that you wanted to use XLink and XPointer? Of course I want to use XLink and XPointer. The href declaration is the tiniest piece of the XLink standard, and seems fairly well established, if not indeed set in stone. I'd be happy to use the full XLink spec, but realize that not everyone needs it. Fine. Make href a part of the 'Level 1' spec and pray that XLink doesn't migrate to entirely different terminology. It's no worse than SYSTEM and PUBLIC are now, certainly. >What would the rules be? What would extensions be allowed to do and not >do? For now, because this is simply a 'representation', I expected the same rules to hold for these DTDs with regard to document syntax as apply now. Maybe I should have written a complete section on behavior; maybe I will. >I guess I don't understand the difference between adding things and >changing the fundamental rules of the "level 1" parse. DTDs DO change the >fundamental rules of the fundamental parse. What could be more fundamental >than this: Here we begin to see where the communications breakdown has set in, and maybe we can unravel it. You see entities as modifying the rules of the 'fundamental parse'. I see entities as riding along on the rules of the 'fundamental parse' to make their changes. To me, the basic rules for parsing establish a syntax for documents, including a set of rules for including entities. Using an entity is just taking advantage of those rules, _not_ modifying them in any way. I see the distinction between expanding an entity and including (or transcluding) information from a link as a minor technical skirmish that should have been settled long ago, not a major battle over the fundamental shape of documents. Maybe that's what I get for working in HyperCard and HTML all these years... >We could restrict DTD extension to data typing, but that strikes me as a >step backwards. Verification is going to be (and should be) increasingly >the job of non-DTD schemata. >... >Verification should be handled at a different level and by a different >piece of software than the parser. I think this philosophy reflects SGML's heritage in document management. Developers who'd like to apply XML to other tasks may find this heritage distracting or indeed disturbing, giving the DTD's current lack of extensibility. It's not hard to imagine database developers who need to use XML coming up with a really simple schema like: Then they could just use a PI to tell their application to check their well-formed document ("Who the hell needs a DTD anyway? Like who came up with _that_?") against this schema. Something like: This doesn't really do any harm; part of the joy of well-formed documents is that you can chuck all the rest of the goodies in XML and build it yourself. Still, to me, this loses a lot. I'd like to see developers use DTDs, and I think that describing the structure of these documents is important for many reasons: easier use with editors, easier-built storage systems, and, of course, error-checking. Making DTDs extensible in clearly defined ways (and not your critter) seems lke a good way to bring these folks in. By providing a structure that developers can use to ensure interoperability of their documents, as well as extend to include data-type verfication, I think we'd be able to keep more developers in the habit of using DTDs. Which brings us to the core of the issue: >In other words, I think that we should be reducing the responsibilities of >the DTD, rather than expanding them. A whole new syntax for a core part >of the language would make XML much more complicated than it is now. Right now, the options for including verification on top of the DTD structure look pretty ugly. Namespaces, schemas, and PIs pile on top of each other to drive documents into the ground. These sort of extensions are going to sprout. I'd like to give them a good place to grow, a single document that provides a complete picture of a document model's content. Do you really want stacks of schemas floating around as well as the style sheets, scripts, link group documents, and the DTD? I don't feel the need to put _everything_ in one place - style sheets, scripts, and link information seem better managed outside this framework and don't cause endless repetition of the document structure. Does it really make sense to define the DTD once for XML 1.0 validation and define an entirely separate but redundant structure for data type validation? If SGML compatibility is your highest aspiration, it certainly may. To me, it doesn't make sense. Maybe the XML-Data crew will get their ubercombination to work. I'd rather start by getting DTD's made extensible and more easily managed first, and then add the schemas later, without requiring redundant structures. This doesn't seem like that bizarre a goal. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 02:46:01 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: W3C XML activities Message-ID: Would it be possible for someone involved to give a brief rundown on XML projects the W3C is actually working on? >From what I can gather, XML, XLink, XPointer, and XML Namespaces are all projects of the XML Working Group under the XML Activity. XSL appears to have its own Working Group under the Style activity. RDF appears to have its own Working Group under the Metadata activity. XML-Data, WIDL, and a number of other proposals are just that - proposals. Is this an accurate picture? Is there a roadmap on the W3C site that I simply haven't found? Thanks. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 22 16:03:31 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: <199805220931.LAA21896@berlin.dvs1.tu-darmstadt.de> Simon St. Laurent wrote: > I'd like to announce the posting of an unofficial and incomplete proposal for > the representation of XML DTDs as XML documents at: > > http://members.aol.com/simonstl/xml > I think one thing that has gotten lost in the discussion as to whether Simon's proposal is even implementable is the usefulness of it if it is. It will always be the case that more people are interested in the data in XML documents than in the DTDs. Hence, the number of tools available for exploring the data and the standards relating to these tools (e.g. SAX and DOM) will always be greater than those relating to DTDs. Simon's proposal, assuming it can be realized, erases this difference, which is exceedingly useful to those of us who want to explore DTDs. While I think something like XML-Data is needed in the long run for data type description and so on, I agree with Simon that a less ambitious subset is very useful in the short run. -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 22 16:14:21 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:28 2004 Subject: parser for xml-data? Message-ID: <199805220905.LAA21599@berlin.dvs1.tu-darmstadt.de> Paul Prescod wrote: > Charles Frankston wrote: > > The ns, prefix, and src parameters to a namespace PI look a lot like > > attributes (although they are not in a formal sense). Since attributes in > > XML do not have to be in a particular order, it would certainly be > > surprising for people to discover that attributes in a namespace have to be > > a particular order. You're suggesting that the syntax be made harder to use > > in order to make the productions easier to author. I think this is a bad > > tradeoff. > > Note that the XML declaration has a required order of pseudo-attribute > occurrence. It would be best if the XML-family of language were > consistent. > > [23] XMLDecl ::= '' Chris has a point, but my intention was to make the production easier to read (benefits the user), not write (benefits the writer). I've been wading through a lot of specs recently and I tend to appreciate anything that simplifies them. Maybe I'll change my tune when I'm further along the learning curve, but right now, the added flexibility on an instruction I won't use much (even if all my XML documents use namespaces) isn't worth the added complexity. -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 22 16:40:05 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Critique - XML DTDs to XML docs References: Message-ID: <35658E59.F0B90F34@technologist.com> Simon St.Laurent wrote: > > For now, because this is simply a 'representation', I expected the same rules > to hold for these DTDs with regard to document syntax as apply now. Maybe I > should have written a complete section on behavior; maybe I will. If I understand this correctly, then you are saying that at first you allow no extensions, just as DTDs allow no extensions. > Here we begin to see where the communications breakdown has set in, and maybe > we can unravel it. You see entities as modifying the rules of the 'fundamental > parse'. I see entities as riding along on the rules of the 'fundamental > parse' to make their changes. To me, the basic rules for parsing establish a > syntax for documents, including a set of rules for including entities. I misspoke. I meant that DTD's change the fundamental parse. This is not a very interesting observation: given the same document instance, and two different DTDs, you can get two radically different document parse trees. Is there any good reason that the ability to change the parse tree should be conflated with the responsibility for verifying schema-compliance as they are in DTDs. Is there any good reason to perpetuate this conflation in your proposed replacement for DTDs? > Using > an entity is just taking advantage of those rules, _not_ modifying them in any > way. I see the distinction between expanding an entity and including (or > transcluding) information from a link as a minor technical skirmish that > should have been settled long ago, not a major battle over the fundamental > shape of documents. It may or may not be a major battle over the fundamental shape of documents, but it *is* a major battle over the fundamental shape of XML. The differences between textual inclusion and structural inclusion are quite deep and subtle. They affect hyperlinking, well-formedness, validity, character set issues and almost everything else in the XML specification. At some level of abstraction the distinction may be minor, but in the details of the specification, it is humungous. > Maybe that's what I get for working in HyperCard and HTML all these years... HTML doesn't really support either. I don't know HyperCard. > >Verification should be handled at a different level and by a different > >piece of software than the parser. > > I think this philosophy reflects SGML's heritage in document management. I'm not sure if you understand that I am suggesting a model that is fundamentally different from SGML's. > Then they could just use a PI to tell their application to check their > well-formed document ("Who the hell needs a DTD anyway? Like who came up with > _that_?") against this schema. Something like: > > > > This doesn't really do any harm; part of the joy of well-formed documents is > that you can chuck all the rest of the goodies in XML and build it yourself. > > Still, to me, this loses a lot. I'd like to see developers use DTDs, and I > think that describing the structure of these documents is important for many > reasons: easier use with editors, easier-built storage systems, and, of > course, error-checking. Like anything else, I think that people should use the "standard mechanisms" when that makes sense, and avoid them otherwise. I believe that DTDs are inappropriate for some types of data, and would rather not see them used in those cases. Some data types are not supposed to be edited in editors. Sometimes the storage system and error-checking are better driven by non-DTD schema languages. > Making DTDs extensible in clearly defined ways (and not your > critter) seems lke a good way to bring these folks in. By providing a > structure that developers can use to ensure interoperability of their > documents, as well as extend to include data-type verfication, I think we'd be > able to keep more developers in the habit of using DTDs. Data type verification is only one way that DTDs fall short of some types of applications. Also, I don't believe that data type verification requires "extensible DTDs". > Does it really make sense to define the DTD once for XML 1.0 validation and > define an entirely separate but redundant structure for data type validation? > If SGML compatibility is your highest aspiration, it certainly may. To me, > it doesn't make sense. The element and attribute type verification provided by DTDs *are* data type validation. I am not arguing that data type validation should be separate from them. I am arguing that both element type validation and all other types of verification should be completely separate from issues of parsing, entity management and so forth. > Maybe the XML-Data crew will get their ubercombination to work. I'd rather > start by getting DTD's made extensible and more easily managed first, and then > add the schemas later, without requiring redundant structures. This doesn't > seem like that bizarre a goal. I don't know what you mean by "add schemas later." DTDs are schemata. That their verification responsibilities are mixed up with their parsing responsibilities is unfortunate, but over the long term, correctable. But only if we recognize that it is done in the wrong place and choose not to perpetuate it in new schema languages like the one you propose. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 16:59:25 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: Ron Bourret wrote: >Hence, the number of tools available for exploring the data and >the standards relating to these tools (e.g. SAX and DOM) will >always be greater than those relating to DTDs. Simon's proposal, >assuming it can be realized, erases this difference, which is >exceedingly useful to those of us who want to explore DTDs. >While I think something like XML-Data is needed in the long >run for data type description and so on, I agree with Simon that >a less ambitious subset is very useful in the short run. This cuts right to the chase, making the clearest argument for a less ambitious proposal that moves XML DTDs to document syntax. I'm considering rewriting the proposal less as a concrete proposal for syntax (which just seems to generate endless arguments, and which I'm not an expert at anyway) and more as an exploration of the implications of (and possibilities for) using XML document syntax for DTDs. As an outsider, albeit one who's digging through _The SGML Handbook_, I'm fairly concerned that a lot of the specs are attempting to do too much. I'm very glad, for instance, that XML-Linking was broken down into a spec for Linking and a spec for XPointers. XML-Data seems to me to do far too much in one place, providing at once an XML syntax for DTDs and a powerful set of data schemas. While I can't see the battles directly, XML-Data seems to have a lot of people rather annoyed for a variety of widely different reasons. Still, in the long run, many projects need its capabilities. Slowing down seems like one answer to this. Build the foundation and then build the skyscraper. I'm not totally delighted with the current foundation - DTD syntax - but I can imagine it being a whole lot worse. Maybe the speed of the standards process is too slow for developers who want it all _now_, but I think we'd do well to be less ambitious but improve on the foundations we have now. Reducing the number of ways to achieve the same result is a good way to do this, as is making the foundation extensible. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 17:15:35 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Critique - XML DTDs to XML docs Message-ID: >If I understand this correctly, then you are saying that at first you >allow no extensions, just as DTDs allow no extensions. DTDs aren't allowed to change document syntax - the use of tags for elements and attributes, the use of '&' for general entities, etc. The same rules apply in this representation, as I will state more explicitly. This representation would, however, allow _additional_ rules - with data schemas the first issue to be addressed. This really isn't that difficult. >Is there any good reason that the ability to change the parse tree should >be conflated with the responsibility for verifying schema-compliance as >they are in DTDs. Is there any good reason to perpetuate this conflation >in your proposed replacement for DTDs? I'd like to see a structure that's: a) easily interpreted, edited, and stored, without the need for multiple toolsets b) capable of containing a complete set of information about a document, including structure and data What's so difficult about that? I can't think of any good reason (besides SGML compatibility) to oppose either of those goals. Why on earth would I want to keep multiple sets of document descriptions (schemas, whatever) around that share the task of defining the same document set? It seems like a management mess, a processing mess, a waste of bandwidth and storage because of redundant information, and just generally a nuisance. Making DTDs extensible is a good way, in my view, to address this issue, and several others. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From Bryan_Gilbert at pml.com Fri May 22 17:25:11 1998 From: Bryan_Gilbert at pml.com (Bryan Gilbert) Date: Mon Jun 7 17:01:28 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: > -----Original Message----- > From: Simon St.Laurent [SMTP:SimonStL@classic.msn.com] > Ron Bourret wrote: > ... cut ... > >While I think something like XML-Data is needed in the long > >run for data type description and so on, I agree with Simon that > >a less ambitious subset is very useful in the short run. > > This cuts right to the chase, making the clearest argument for a less > ambitious proposal that moves XML DTDs to document syntax. I'm > considering > rewriting the proposal less as a concrete proposal for syntax (which > just > seems to generate endless arguments, and which I'm not an expert at > anyway) > and more as an exploration of the implications of (and possibilities > for) > using XML document syntax for DTDs. >... cut .. ---------------- YES! The discussion was losing direction and momentum. Both Ron's and Simon's comments are right on. People will become more familiar with XML syntax than DTD syntax and any tools that work with XML should be capable of working with the document type definition (which may be the DTD or may be the subset of XML-Data that deals with the document structure.) I hope this idea grows. What about a style sheet like transformation to/from DTD syntax to XML? (Sorry that's a naive question.) But if such a transformation could be defined then any XML tool that wished to display or work with the document type definition could do so using XML instead of DTD syntax. Bryan Gilbert email: (mailto:bryan_gilbert@pml.com) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mintert at irb.informatik.uni-dortmund.de Fri May 22 17:51:36 1998 From: mintert at irb.informatik.uni-dortmund.de (Stefan Mintert) Date: Mon Jun 7 17:01:28 2004 Subject: Q: SW for parsing DTD's Message-ID: <199805221551.RAA01630@brown.informatik.uni-dortmund.de> Hi! For the project I'm currently working on, I need to parse DTD's and query information like this: - which elements are valid in a given context? - which elements can contain an element of a certain type? Now I'm looking for software that does the parsing for me (Java classe are preferred) and provides a high-level interface. The best I've found is IBM's "XML for Java". Everything I need is in the class com.ibm.xml.parser.DTD. But unfortunately the license agreement ends 90 days after downloading :-( Since I'm writing a program for my thesis it must run for more than 90 days ;-) Do you know of any software that has similar functions as XML4J and that is free for unlimited (non-profit) use? Thanks. Bye, Stefan. PS: I've already checked the docs of Ælfred and XP, but that isn't exactly what I'm looking for. XP looks fine if I use the low-level interface and build on top of it; but that's still some work to do... +-----------------------------------------------------------+ Stefan Mintert UniDo: mintert@irb.informatik.uni-dortmund.de private: stefan@mintert.com WWW: http://www.informatik.uni-dortmund.de/~sm/ +-----------------------------------------------------------+ "let the music keep our spirits high..." (Jackson Browne) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 17:56:42 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:29 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: >YES! The discussion was losing direction and momentum. As a significant participant, I have to agree with you there, and I'm glad to see a spark is returning. >What about a style sheet like transformation to/from DTD syntax to >XML? (Sorry that's a naive question.) But if such a transformation >could be defined then any XML tool that wished to display >or work with the document type definition could do so >using XML instead of DTD syntax. That's actually what I had in mind when I first started writing this proposal, which explains to some extent why I didn't go into detail explaining behavior. I've heard of a few SGML tools that do this - Marcus Carr noted that OmniMark includes something called DTD2DTD. I think an equivalent tool for making such transformations in XML would be a great start. Of course, we'd have to figure out what that transformation would look like, but that's all part of the fun. >I hope this idea grows. Glad to hear it! I certainly hope it grows, whether or not it has anything to do with this particular proposal. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From rbourret at dvs1.informatik.tu-darmstadt.de Fri May 22 18:58:13 1998 From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret) Date: Mon Jun 7 17:01:29 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: <199805220945.LAA21915@berlin.dvs1.tu-darmstadt.de> A couple of comments about the example on your Web page: IMAGE,CAPTION? CDATA #IMPLIED Content model should contain sub-elements, such as , not text. You don't want to force applications to parse text. On the other hand, attribute descriptions are probably better stored in attributes: The reason is that the possible choices are limited and work very well as enumerated attributes. Note that this is what XML-Data does. If you are defining some sort of XML-Data-Lite, XML-Data is probably a pretty good starting place. -- Ron Bourret xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From mcc at arbortext.com Fri May 22 19:03:31 1998 From: mcc at arbortext.com (Mike Champion) Date: Mon Jun 7 17:01:29 2004 Subject: Q: SW for parsing DTD's In-Reply-To: <199805221551.RAA01630@brown.informatik.uni-dortmund.de> Message-ID: <98May22.125851edt.26881@thicket.arbortext.com> At 11:51 AM 5/22/98 -0400, Stefan Mintert wrote: >For the project I'm currently working on, I need to parse DTD's and query >information like this: > >- which elements are valid in a given context? >- which elements can contain an element of a certain type? > >Do you know of any software that has similar functions as XML4J and that is >free for unlimited (non-profit) use? See http://www.docuverse.com/personal/freedom/index.html I don't have personal experience with this package, but from xml-dev and www-dom postings, I know that FREE-DOM uses the SAX interface to support any of several Java XML parsers, and produces a set of data structures that can be accessed with the DOM APIs. So in principle, you should be able to do more or less what you've done with XML4JAVA with FREE-DOM. I'm also excited to hear about your project because the DOM WG and mailing lists have discussed whether/when the DOM should support XML validation APIs. I've thought that this would be a good APPLICATION for the DOM, e.g., a XMLValidate JavaBean that uses the DOM to access the DTD information and the document structure in order to answer the kinds of questions you pose, and more generally, to allow the kinds of dynamic validation that XML editors will need to do. Good luck, Mike Champion xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 20:14:56 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:29 2004 Subject: Proposal Announcement - XML DTDs to XML docs Message-ID: Ron Bourret wrote: >A couple of comments about the example on your Web page: Okay, I admit it. I wrote the examples at the beach. I tend to be the kind of person who prefers to see content stored as text - empty elements as a way of life doesn't fit the way I think about data. But for the most part, you're right, and thanks for the suggestion. I had a difficult time deciding whether to leave the examples in the proposal, because I'm quite aware that the folks working on XML-Data have a lot more experience than I do. I left them in there primarily to make it clear to readers familiar with XML-Data that this proposal was _not_ XML-Data, hoping that they would see that it comes from a different orientation (at least as far as I can gather from the XML-Data proposal.) The syntax is less important to me than the shift to a document format. In the next version, the examples will likely disappear, at least for now, as the document shifts from being a proposal to an examination of the implications of such a model. All suggestions for the next version are still welcome - anyone wanting to participate in the development of this proposal is welcome to contact me. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Fri May 22 20:43:50 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:29 2004 Subject: Proposal Critique - XML DTDs to XML docs References: Message-ID: <3565C7A5.9BBE99CA@technologist.com> Simon St.Laurent wrote: > > >Is there any good reason that the ability to change the parse tree should > >be conflated with the responsibility for verifying schema-compliance as > >they are in DTDs. Is there any good reason to perpetuate this conflation > >in your proposed replacement for DTDs? > > I'd like to see a structure that's: > a) easily interpreted, edited, and stored, without the need for multiple > toolsets > b) capable of containing a complete set of information about a document, > including structure and data The word "structure" is too vague for me to be able to argue for or against. Are you talking about a single *language* (or specification) that incorporates a) instance syntax b) textual replacement c) external text embedding d) extensible validation XML 1.0 incorporates all of them. I think that that made sense for XML 1.0, in order to be SGML compatible, but for future versions I would rather see the first three completely separate from the fourth. The reason I feel that the last should be separated is that the types of validation (or "verification") that people have to do can be quite varied. XML made the DTD optional for this reason. I don't see that making the XML specification substantially larger with an alternative encoding for DTDs can really make that specification simpler. > Why on earth would I > want to keep multiple sets of document descriptions (schemas, whatever) around > that share the task of defining the same document set? It seems like a > management mess, a processing mess, a waste of bandwidth and storage because > of redundant information, and just generally a nuisance. > > Making DTDs extensible is a good way, in my view, to address this issue, and > several others. That sounds attractive, and I encourage you to try and make it work. If you succeed, I will be happy to use it. But, to be honest, I don't think it will succeed. It's like in the early days of computer programming when people thought that it was possible to invent a single, "extensible" programming language (or "meta programming language") that would serve all needs. Every attempt didn't quite do everything that everybody needed, and the harder people worked to make languages "extensible", the more complex (C++) or merely unpopular (Lisp) the language became. I personally don't believe that one extensible schema/DTD language can serve all of our diverse validation needs. The set of "extensions" will be unlimited and approach the complexity of a full programming language. Look at RDF schemata. They are miles and miles away from DTDs. I've had document types where I was modeling OO systems and wanted to verify things like "base class is not inherited more than once." Some OO-modeling schema language would handle that, but DTDs (even extensible ones) could never do so. I tend to think that a strategy that is more likely to be successful is one that layers schema languages. At the bottom level you have something like XML DTDs without all of the stuff related to entities and notations (in XML element notation). That layer might include data type validation. In levels above that you have RDF and other schemata that are more interested in relationships than in positional occurrence. It seems like you are interested in that bottom layer schema. I think that it would be good to formalize an XML element notation for the bottom layer. But if you try to make it a replacement for DTDs, then it must do everything that DTDs do and inherit all of the problems that the conflation of features in DTDs causes. > What's so difficult about that? I can't think of any good reason (besides > SGML compatibility) to oppose either of those goals. It is quite likely that SGML will soon be changed to allow you to use whatever notation you want for XML DTDs. SGML compatibility is not a problem. The question is what is the right design. You can make a slightly better version of a bad design, or you can try to start again with a good design. Let me ask this plainly: Does it make sense a) that textual substitutions should be specified in a part of a document called a "document type definition". b) that the "document type definition" should also be responsible for declaring media types and attaching them to non-XML entities. c) that the language for verifying element and attribute occurrence must be in the same specification (XML 1.x) as that for creating elements and attributes themselves? I don't think that those three things (among others) make sense anymore. Hence, I don't think that inventing a new notation for this inappropriate concept is a good idea. If we are to replace DTDs, let us replace them with something simpler and more specific to the task of validation, instead of transliterating them into another syntax, warts and all. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From bckman at ix.netcom.com Fri May 22 20:58:28 1998 From: bckman at ix.netcom.com (Frank Boumphrey) Date: Mon Jun 7 17:01:29 2004 Subject: Proposal Critique - XML DTDs to XML docs Message-ID: <01bd85b4$43645be0$LocalHost@uspppBckman> >>Every attempt didn't quite do everything that everybody needed, and >>the harder people worked to make languages "extensible", the more complex >>(C++) or merely unpopular (Lisp) the language became. I think this is a good point and an excellent argument for Simon's proposal. This language will be a very useful tool for a subset of users who dont want the hassel of learning a DTD language. There are always going to be things that a DTD is necessary for, but that doesnt mean that this is the only tool one should use. On the other hand if a language tries to be something to every one it is in danger of becoming like a swiss army knife. It will perhaps do the job, but not as well as a specialized tool. Frank -----Original Message----- From: Paul Prescod To: Xml-Dev (E-mail) Date: Friday, May 22, 1998 2:46 PM Subject: Re: Proposal Critique - XML DTDs to XML docs >Simon St.Laurent wrote: >> >> >Is there any good reason that the ability to change the parse tree should >> >be conflated with the responsibility for verifying schema-compliance as >> >they are in DTDs. Is there any good reason to perpetuate this conflation >> >in your proposed replacement for DTDs? >> >> I'd like to see a structure that's: >> a) easily interpreted, edited, and stored, without the need for multiple >> toolsets >> b) capable of containing a complete set of information about a document, >> including structure and data > >The word "structure" is too vague for me to be able to argue for or >against. Are you talking about a single *language* (or specification) that >incorporates > > a) instance syntax > b) textual replacement > c) external text embedding > d) extensible validation > >XML 1.0 incorporates all of them. I think that that made sense for XML >1.0, in order to be SGML compatible, but for future versions I would >rather see the first three completely separate from the fourth. The reason >I feel that the last should be separated is that the types of validation >(or "verification") that people have to do can be quite varied. XML made >the DTD optional for this reason. I don't see that making the XML >specification substantially larger with an alternative encoding for DTDs >can really make that specification simpler. > >> Why on earth would I >> want to keep multiple sets of document descriptions (schemas, whatever) around >> that share the task of defining the same document set? It seems like a >> management mess, a processing mess, a waste of bandwidth and storage because >> of redundant information, and just generally a nuisance. >> >> Making DTDs extensible is a good way, in my view, to address this issue, and >> several others. > >That sounds attractive, and I encourage you to try and make it work. If >you succeed, I will be happy to use it. But, to be honest, I don't think >it will succeed. It's like in the early days of computer programming when >people thought that it was possible to invent a single, "extensible" >programming language (or "meta programming language") that would serve all >needs. Every attempt didn't quite do everything that everybody needed, and >the harder people worked to make languages "extensible", the more complex >(C++) or merely unpopular (Lisp) the language became. > >I personally don't believe that one extensible schema/DTD language can >serve all of our diverse validation needs. The set of "extensions" will be >unlimited and approach the complexity of a full programming language. Look >at RDF schemata. They are miles and miles away from DTDs. I've had >document types where I was modeling OO systems and wanted to verify things >like "base class is not inherited more than once." Some OO-modeling schema >language would handle that, but DTDs (even extensible ones) could never do >so. > >I tend to think that a strategy that is more likely to be successful is >one that layers schema languages. At the bottom level you have something >like XML DTDs without all of the stuff related to entities and notations >(in XML element notation). That layer might include data type validation. >In levels above that you have RDF and other schemata that are more >interested in relationships than in positional occurrence. > >It seems like you are interested in that bottom layer schema. I think that >it would be good to formalize an XML element notation for the bottom >layer. But if you try to make it a replacement for DTDs, then it must do >everything that DTDs do and inherit all of the problems that the >conflation of features in DTDs causes. > >> What's so difficult about that? I can't think of any good reason (besides >> SGML compatibility) to oppose either of those goals. > >It is quite likely that SGML will soon be changed to allow you to use >whatever notation you want for XML DTDs. SGML compatibility is not a >problem. The question is what is the right design. You can make a slightly >better version of a bad design, or you can try to start again with a good >design. > >Let me ask this plainly: > >Does it make sense > >a) that textual substitutions should be specified in a part of a document >called a "document type definition". > >b) that the "document type definition" should also be responsible for >declaring media types and attaching them to non-XML entities. > >c) that the language for verifying element and attribute occurrence must >be in the same specification (XML 1.x) as that for creating elements and >attributes themselves? > >I don't think that those three things (among others) make sense anymore. >Hence, I don't think that inventing a new notation for this inappropriate >concept is a good idea. If we are to replace DTDs, let us replace them >with something simpler and more specific to the task of validation, >instead of transliterating them into another syntax, warts and all. > >Paul Prescod - http://itrc.uwaterloo.ca/~papresco > >"A writer is also a citizen, a political animal, whether he likes it or >not. But I do not accept that a writer has a greater obligation >to society than a musician or a mason or a teacher. Everyone has >a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From donpark at quake.net Fri May 22 22:51:34 1998 From: donpark at quake.net (Don Park) Date: Mon Jun 7 17:01:29 2004 Subject: Q: SW for parsing DTD's Message-ID: <005701bd85c2$53c43880$2ee044c6@arcot-main> >>For the project I'm currently working on, I need to parse DTD's and query >>information like this: >> >>- which elements are valid in a given context? >>- which elements can contain an element of a certain type? >> >>Do you know of any software that has similar functions as XML4J and that is >>free for unlimited (non-profit) use? > >See http://www.docuverse.com/personal/freedom/index.html I am afraid that Free-DOM currently does not provide access to DTD information primarily because SAX currently does not. DTD support will be provided in the near future (< 2 months). Don Park http://www.docuverse.com/personal/index.html xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Fri May 22 23:01:47 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:29 2004 Subject: Comments, parsers, XPointers In-Reply-To: <198001011834.NAA00512@unready.megginson.com> References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> Message-ID: <3.0.1.16.19980522215632.3f3f7e10@pop3.demon.co.uk> At 13:34 01/01/80 -0500, David Megginson wrote: > >In the longer term, what we need is an official definition of an XML >information set, specifying (for example) that reporting comments is >optional, while reporting the start and end of elements is required. >Once such a beastie exists, many vexing questions about (and >inconsistencies among) the DOM, XPointers, SAX, XSL, etc. will >disappear. > Agreed. But I think we need it in the shorter term :-) Over the last day Eliot Kimber has spent a lot of time (thanks :-) helping to clear up some of my ideas about addressing and linking. (I have been much concerned about how to build software that is unambiguous). I believe I'm quoting Eliot correctly in saying that it is impossible to implement Xpointer rigorously until we agree on the abstract model of an XML document. I suspect we don't, all, yet... P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Fri May 22 23:04:15 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:29 2004 Subject: LISTRIVIA (was Re: Empty End Tags Considered Confusing) In-Reply-To: <199805191516.LAA07380@ruby.ora.com> Message-ID: <3.0.1.16.19980522215452.34af4846@pop3.demon.co.uk> At 11:16 19/05/98 -0400, Chris Maden wrote: >I think it was xml-dev that was discussing this; the archive is down >(Henry, are you listening?). I have been away in Paris...so I haven't made contact recently with Henry. Do you mean the hypermail server on which the HTML resides is down? or that the hypermailer isn't adding to the archive? In any case poor Henry can't do anything himself as he doesn't actually run the machine - even though he looks after the large amount of work the list generates. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Fri May 22 23:06:48 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:29 2004 Subject: W3C XML activities In-Reply-To: Message-ID: <3.0.1.16.19980522215513.328f4328@pop3.demon.co.uk> At 00:45 22/05/98 UT, Simon St.Laurent wrote: >Would it be possible for someone involved to give a brief rundown on XML >projects the W3C is actually working on? > >From what I can gather, XML, XLink, XPointer, and XML Namespaces are all >projects of the XML Working Group under the XML Activity. XSL appears to have >its own Working Group under the Style activity. RDF appears to have its own >Working Group under the Metadata activity. XML-Data, WIDL, and a number of >other proposals are just that - proposals. > >Is this an accurate picture? Is there a roadmap on the W3C site that I simply >haven't found? > >Thanks. I think this would be very valuable. I have just come back from Paris where John Bosak gave exactly such a report. I intend to comment on some of Paris, but specifically excluded Jon's report in case I got something wrong. As always it was very carefully presented. There were no surprises to those who talk regularly about these things but it formed up a good deal I was unclear about. I know Jon is very busy - I don't know whether others are prepared to summarise? P. > >Simon St.Laurent >Dynamic HTML: A Primer / XML: A Primer / Cookies > > >xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk >Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ >To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; >(un)subscribe xml-dev >To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; >subscribe xml-dev-digest >List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) > > Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Fri May 22 23:26:43 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:30 2004 Subject: Proposal Critique - XML DTDs to XML docs Message-ID: I thought I was arguing with a conservative, but now I see that you're a radical! Very impressive, and I think we're reaching some points where we (at last) both agree. > a) instance syntax > b) textual replacement > c) external text embedding > d) extensible validation > >XML 1.0 incorporates all of them. I think that that made sense for XML >1.0, in order to be SGML compatible, but for future versions I would >rather see the first three completely separate from the fourth. The reason >I feel that the last should be separated is that the types of validation >(or "verification") that people have to do can be quite varied. Actually, I'd like to see all four of these separated, except for b and c, which I feel should at least use common syntax. a is the basic XML document syntax, b and c provide entity and linking-type services, and d is today's point of contention. Still, this breakdown of the standard is a very good start. >XML made >the DTD optional for this reason. I don't see that making the XML >specification substantially larger with an alternative encoding for DTDs >can really make that specification simpler. You're right here as well. I'd recommend disconnecting d, the validation, from the core standard (which I really think is just a). Making validation a separate standard would open the way a re-examination of validation that wasn't deeply intertwined with existing syntax, and could well lead the way to a syntax better than either the current standard or the XML document syntax. At least it would keep the core standard from growing warts. >I tend to think that a strategy that is more likely to be successful is >one that layers schema languages. At the bottom level you have something >like XML DTDs without all of the stuff related to entities and notations >(in XML element notation). That layer might include data type validation. >In levels above that you have RDF and other schemata that are more >interested in relationships than in positional occurrence. I'm very cautious right now about adding too many layers of schemas, mostly because they seem to require very high levels of redundancy. (XML-Data, a DTD, and an RDF representation of a single document? And they all have to stay consistent? Whoa...) Still, a layered model like this might make sense in the long run, provided that we can stay away from truckloads of redundant information. >But if you try to make it a replacement for DTDs, then it must do >everything that DTDs do and inherit all of the problems that the >conflation of features in DTDs causes. >... >The question is what is the right design. You can make a slightly >better version of a bad design, or you can try to start again with a good >design. I fear this is a symptom of the conservatism of the proposal, which began merely as an effort to map DTD syntax to an XML document syntax. It inherits the warts along with the beauty. A more radical solution would excise many of the warts, but didn't seem like a wise idea given the conservatism of many in the XML community. Smaller steps on the way to a radical change seemed more appropriate in this environment, but that may not be the best way to go. >Does it make sense > >a) that textual substitutions should be specified in a part of a document >called a "document type definition". No. Entities were what bothered me most about XML DTDs in the first place. I'd move them elsewhere quite happily. >b) that the "document type definition" should also be responsible for >declaring media types and attaching them to non-XML entities. No. This seemed like a significant departure from current practice on the Web (which I support strongly) - MIME types. >c) that the language for verifying element and attribute occurrence must >be in the same specification (XML 1.x) as that for creating elements and >attributes themselves? Must be? No. Could be, and would offer significant advantages? Yes. I still think XML document syntax has significant advantages over current syntax. Could there be a better syntax? Of course. >If we are to replace DTDs, let us replace them >with something simpler and more specific to the task of validation, >instead of transliterating them into another syntax, warts and all. Good idea. Who, where, when? I'll buy beer... I think I'll continue the proposal as an implications document, while looking out for something more radical. In the meantime, I've got a lot of XML to build! Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 23 06:53:52 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:30 2004 Subject: Comments, parsers, XPointers References: <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> <3.0.3.32.19980520124217.00bd928c@nexus.polaris.net> <3.0.1.16.19980522215632.3f3f7e10@pop3.demon.co.uk> Message-ID: <356656A8.DABB51C3@technologist.com> Peter Murray-Rust wrote: > > Agreed. But I think we need it in the shorter term :-) Over the last day > Eliot Kimber has spent a lot of time (thanks :-) helping to clear up some > of my ideas about addressing and linking. (I have been much concerned about > how to build software that is unambiguous). I believe I'm quoting Eliot > correctly in saying that it is impossible to implement Xpointer rigorously > until we agree on the abstract model of an XML document. I suspect we > don't, all, yet... I think that Eliot understands this better than most people, and it's a lesson I'm glad I didn't have to learn for myself. HyTime was mostly rewritten from scratch between versions 1 and 2 because without the formalism underlying the markup (the "low-level semantics"), the first version was built on a base of sand. By the time DSSSL and HyTime 2 came around, the sand was starting to crumble, and the grove is the concrete basis that the new family of standards used. I really hope that we can get concensus on this issue so that the W3C will move quickly on it, before the Web standards follow the same path and must all be rewritten (and unified) for version 2.0. I personally believe that this should be a higher priority than XSL, the DOM or anything else. What's the use in rushing to build an apartment subdivision before laying the foundation? Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 23 07:01:20 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:30 2004 Subject: Proposal Critique - XML DTDs to XML docs References: Message-ID: <35665863.ED94D88B@technologist.com> Simon St.Laurent wrote: > > Actually, I'd like to see all four of these separated, except for b and c, > which I feel should at least use common syntax. Well, I'm not that much of a radical yet. I am not yet convinced that structural inclusion (e.g. get that tree from that document and place it *here*) can completely and conveniently replace XML/SGML's text substitution model. I'm ready to be convinced, however. Once XLink is done and widely implemented, we will have an opportunity to pit the two methods head to head against each other, and the market will decide. It's because the jury is still out on that issue that I argue(d) for entities etc. in XML. I wasn't willing to bet XML's usability on the faith that XLink would come along and would do everything that entities do. Presumably the people who could actually vote felt that way also. > I'm very cautious right now about adding too many layers of schemas, mostly > because they seem to require very high levels of redundancy. (XML-Data, a > DTD, and an RDF representation of a single document? And they all have to > stay consistent? Whoa...) Well, XML allows you to skip the DTD. XML-Data is supposed to verify everything the DTD does, so you can skip the verification (schema) parts of the DTD if you use XML-Data. You may still need the DTD for entities and notations. (XML-Data does allow you to declare entities, which makes no sense to me, so I'll just ignore it and hope it goes away) I believe that by the time RDF and XML-Data are done, there should be little overlap between them. RDF schemata constrain relationships between elements of particular types. XML-Data constrains where they can occur. So it there is still a question about managing multiple files and layers, but should NOT be a question of duplication of services. I try to think of the situation as analogous to stylesheets. You probaby wouldn't have one RDF schema for every XML Data schema you have, and one XML Data schema for every document you have. (you can't help but have one DTD per document...that's the way XML is defined) Instead, you would use a single XML Data schema for dozens of documents, and perhaps a single RDF schema for dozens of document *types*. There is some conceptual overlap here with architectural forms, as they are also meant to be layered in the way I describe. But archforms allow multiple layers of purely positional verification. You need some other kind of schema language to do link/relationship verification. > I fear this is a symptom of the conservatism of the proposal, which began > merely as an effort to map DTD syntax to an XML document syntax. It inherits > the warts along with the beauty. A more radical solution would excise many of > the warts, but didn't seem like a wise idea given the conservatism of many in > the XML community. Smaller steps on the way to a radical change seemed more > appropriate in this environment, but that may not be the best way to go. Well, there are no more conservatives anymore. ISO seems willing to go far beyond what you or I am proposing. As I understand it, they are moving to a situation where DTDs can be in *any notation* whatsoever. I could understand it wrong, because I was not at the meeting, but as I understand it, you could invent a new binary notation for DTDs, and it could turn on tag ommission and shortref features that would make your SGML documents unreadable to anyone without a parser for your binary DTD notation. If you approach it the way I am suggesting, then that isn't a problem. If schemata are separated from syntax, then Microsoft may not be able to validate documents created with Netscape's editor, but at least they will be able to *read* them. Having a proprietary schema language/engine would be just like having a proprietary style language/engine -- not ideal, but sometimes necessary. I should mention that I have mostly cribbed this "ignore DTDs and work directly on schemata" approach from various people. The only one whose name I can think of right now is Eliot Kimber, who wants to move away from DTDs for different reasons than you do (syntax) or I do (conflation of features). He dislikes the fact that they are controlled by, and can be overridden by, the document. Although this also bothers me, it bothers him a lot more. See: http://www.sil.org/sgml/n1957Note.html Of course the non-DTD schemata that Eliot is most interested in are architectures, which look a lot like DTDs. Also, http://www.sil.org/sgml/thompsonSchemata.html is useful. Henry uses the phrase "document structure definition" to avoid getting stuck in the "DTD" rut. > >c) that the language for verifying element and attribute occurrence must > >be in the same specification (XML 1.x) as that for creating elements and > >attributes themselves? > > Must be? No. Could be, and would offer significant advantages? Yes. I > still think XML document syntax has significant advantages over current > syntax. Could there be a better syntax? Of course. I wasn't asking about syntax, but about actual specifications. Should the validation language be specified in the same standards document as the language syntax? I think that we agree that it should not. > >If we are to replace DTDs, let us replace them > >with something simpler and more specific to the task of validation, > >instead of transliterating them into another syntax, warts and all. > > Good idea. Who, where, when? I'll buy beer... Well, I think that this is what XML-Data is about, but it is only a rough sketch. I also think that the W3C is supposed to create a working group that will address these sorts of issues. We could work out a concrete proposal in this mailing list, or offline, but I'm not sure if it would move us beyond all of the other DTDs for DTDs. In other words, I think that your basic idea will eventually get implemented, hopefully as modified according to my comments. I don't know how to expedite that, however. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From papresco at technologist.com Sat May 23 07:13:24 1998 From: papresco at technologist.com (Paul Prescod) Date: Mon Jun 7 17:01:30 2004 Subject: Q: SW for parsing DTD's References: <005701bd85c2$53c43880$2ee044c6@arcot-main> Message-ID: <35665B36.AFC3BD9A@technologist.com> Don Park wrote: > > I am afraid that Free-DOM currently does not provide access to DTD > information primarily because SAX currently does not. DTD support will be > provided in the near future (< 2 months). I would like to take this opportunity to stress the point that there are two ways of unifying DTD and XML document instance processing (or *any* processing) at the syntactic level and at the API level. The SGML grove formalism does the latter: given a grove with information about both the DTD and the instance (such as that provided by a version of Jade that should be coming out early next month), you can use the same functions or methods to navigate that information. Free-DOM will also provide this service in a couple of months. I believe that Jumbo already converts DTDs into something XML instance-ish internally. If every tool did this, then it wouldn't matter whether DTDs were in instance syntax or DTD syntax. Yes, I recognize that this means that every tool should have an "extra" parser and grove/DOM builder. Yes, there is a room for argument as to whether that is an acceptable price to pay for DTD's compact and distinctive syntax. But the important thing to recognize is that this is an option. If your grove/DOM API server doesn't provide access to DTDs and instances through the same API, then its creator can correct that in that API server without the W3C changing a single byte of the XML specification. Paul Prescod - http://itrc.uwaterloo.ca/~papresco "A writer is also a citizen, a political animal, whether he likes it or not. But I do not accept that a writer has a greater obligation to society than a musician or a mason or a teacher. Everyone has a citizen's commitment." - Wole Soyinka, Africa's first Nobel Laureate xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From SimonStL at classic.msn.com Sat May 23 15:40:24 1998 From: SimonStL at classic.msn.com (Simon St.Laurent) Date: Mon Jun 7 17:01:30 2004 Subject: Proposal Critique - XML DTDs to XML docs Message-ID: >I am not yet convinced that >structural inclusion (e.g. get that tree from that document and place it >*here*) can completely and conveniently replace XML/SGML's text >substitution model. I'm ready to be convinced, however. Once XLink is done >and widely implemented, we will have an opportunity to pit the two methods >head to head against each other, and the market will decide. I think letting the market decide here is a wise idea, but I remain concerned that XLink hasn't yet stepped up to the challenge. Given the variety of interpretations surrounding the behavior of EMBED, and the number of people who told me last time I asked that EMBED _wasn't_ about text substitution, I'm not sure the market will get a chance to decide. I hope this is made explicit by the time the standard reaches stability. >I believe that by the time RDF and XML-Data are done, there should be >little overlap between them. RDF schemata constrain relationships between >elements of particular types. XML-Data constrains where they can occur. So >it there is still a question about managing multiple files and layers, but >should NOT be a question of duplication of services. > >I try to think of the situation as analogous to stylesheets. I must admit I remain skeptical; RDF and XML-Data seem to want to do too much of the same thing at this point. It doesn't help that they look radically different. I don't mind telling people to use one or the other, but using both seems like a lot. Again, I hope this is cleared up by the time the standards reach stability. The analogy to stylesheets unfortunately makes me wonder if we're going to see documents using both CSS and XSL, leaving applications to puzzle out which to use and how/if they should interact. A uniform standard defining how resources (schemas, DTDs, stylesheets, extended link documents, etc.) should be linked to documents and with what priority would go a long way toward easing my concerns. The current soup of PIs, DOCTYPE, and XLink elements is messy at best. >Well, there are no more conservatives anymore. ISO seems willing to go far >beyond what you or I am proposing. As I understand it, they are moving to >a situation where DTDs can be in *any notation* whatsoever. Looks like I'm turning into the conservative. *Any notation* sounds like an expansion into the world of 'how many options can you conceivably overload this system with?' One of the most important things I liked about XML was its insistence on single mechanisms with no options. It might make sense to break validation and schema issues into a 'family' of standards, connected by the uniform linking standard mentioned above, but at this point I think we have plenty of chaos. >I wasn't asking about syntax, but about actual specifications. Should the >validation language be specified in the same standards document as the >language syntax? I think that we agree that it should not. Completely right. This opens the way to a family of standards and hopefully will reduce the number of bullets whizzing by. >Well, I think that this is what XML-Data is about, but it is only a rough >sketch. I also think that the W3C is supposed to create a working group >that will address these sorts of issues. We could work out a concrete >proposal in this mailing list, or offline I'd like to see that rough sketch grow into a usable set of standards. I wrote the 'Representation' proposal to get some public discussion started, and that seems to have succeeded. I'm hoping this weekend to modify the proposal, making clear that the syntax presented is only for illustration of implications. Perhaps it can serve as one of many springboards for this discussion in the more formal standards process, which I'll trust to work out the concrete proposal. Simon St.Laurent Dynamic HTML: A Primer / XML: A Primer / Cookies xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 24 00:40:01 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:30 2004 Subject: Proposed process for DTDs in XML (was Re: Q: SW for parsing DTD's, etc.) In-Reply-To: <35665B36.AFC3BD9A@technologist.com> References: <005701bd85c2$53c43880$2ee044c6@arcot-main> Message-ID: <3.0.1.16.19980523224835.0d570982@pop3.demon.co.uk> At 01:14 23/05/98 -0400, Paul Prescod wrote: >service in a couple of months. I believe that Jumbo already converts DTDs >into something XML instance-ish internally. Yes. And it's because I'd like this to be *syntactically* compatible with any other similar tool that I'm keen on exploring this idea. > >If every tool did this, then it wouldn't matter whether DTDs were in >instance syntax or DTD syntax. Yes, I recognize that this means that every Agreed. BUT we would have to agree on a set of elementTypes (DTD-speak) or property-set (Grove-speak). As a very simple example, do we use 'ATTRIBUTE' or 'Attribute' or AttName or whatever. Agreement here would go a long way towards interoperability. I have raised this idea periodically and SimonStL has persevered over the last 2 weeks and my feeling that there is a critical mass of people who would like to see if something could be formalised out of this. I encouraged Simon to keep posting so take responsibility for the continued discussion. This posting includes a proposal as to how we go forward. NOTE: Objections have been raised on the basis that: - such a proposal is impossible and we are bound to fail. In my mind the operations we are prosing are simply syntactic transformations with an agreed vocabulary and therefore almost trivial and automatable. I think that these objectors think we propose something far more ambitious that we actually do. - the proposal is not worthwhile, because the DOM/WG/XML-data/RDF/etc. are working on this and we are simply duplicating work that they are/will_be doing. This is a potentially valid objection, but I suspect that our effort will be valuable in any case. Even if later subsumed by other efforts, experience gained will be valuable in and may help those efforts. - the proposal is irresponsible because it will encourage people to do things they didn't ought to be doing. By creating a DTD syntax that is potentially extensible, it actually will be extended. I think that this community has shown itself responsible, and I shall suggest that our proposal is outlined in such a way as to encourage responsibility. Motivation ---------- There seems to be a feeling that the current DTD syntax does not meet a number of needs. *** In all discussion that follows I am NOT suggesting that XML DTD syntax should be replaced *** . I hope that the suggestion will enhance it. Current limitations seem to be: 1. There is no mechanism for adding human-readable semantics to a DTD. The point has been made strongly on XML-DEV that DTDs must be documented, but the method of documentation is undefined. Even in a simple example like: it is impossible to know whether the comment is associated with the element, the attribute, both or neither. [I am in the last stages of releasing the next VHG DTD and feel very strongly the lack of a mechanism for documenting it.] This problem alone is enough to convince me that a mechanism would be desirable. We all agree that a DTD per se cannot carry semantic information but there is an urgent need to be able to associate semantic information with a DTD. 2. There is no mechanism for associating machine-readable semantics with a DTD. This is also a serious problem for me. I need to be able to link elements and attributes to behavior (at present through Java classes). I am not suggesting that we develop universal mechanisms for doing this, but that we choose a syntax which allows it. 3. There are no defined tools or other processes for analysing a DTD in XML format. The attraction of a DTD-in-XML is that we can use the very large number of XML tools for manipulating, filtering, rendering, etc. For example, JUMBO1 can create a tree from the DTD and can therefore express it as a JUMBO object just like a document. XSL and CSS could apply to DTDs as well as documents. Help could be created from DTDs, etc. It is possible that the XML property set (if it is anywhere defined) might meet part of this need. If so, and if it can be expressed in a tree structure, then it could be isomorphous with an XML representation of a DTD. 4. There is a need for an additional level of semantic validation using concepts not expressible in a DTD [cardinality, data typing, etc.] I think this is one of the more sensitive areas of this proposal because it encourages the creations of schemas as opposed to DTDs. Proposal -------- I think we can proceed by the methodology that David Megginson developed so successfully for SAX. The Megginsonic 'dialogue' consists of posting succinct questions and gathering feedback - as a result of answers gained a new round of questions are proposed. I personally do not intend to play this central role although I am happy to try to provide a subsidiary role as before. It seems that we: - need to agree on goals (and especially to limit their scope) - define the limits of use of the resulting document. - define timescales. For myself I would suggest the following criteria and hope that others would be added: - the DTDXML DTD should be algorithmically derivable from the XML 1.0 spec - a DTDXML for a given DTD should represent the DTD after normalisation (i.e. no support for PEs and other lexical operations). It should correspond to information potentially available after parsing and should correspond roughly to the goals of SAX (i.e. be simple and not try to represent everything in a DTD) - the DTDXML DTD must support Help and documentation. Individual attributes should be documentable. - a DTDXML should be internally addressable by Xlink (unlike DTDs). The granularity should allow addressing of attributes and elements. - the DTDXML should NOT devise methods for extending the range of semantic validation. - from a DTDXML it should be possible to recreate the corresponding (normalised) DTD without loss. I do not know whether Simon can take on the central role himself or whether volunteers are needed. If he were to devise some procedural questions we could get a feel of whether and how the process could proceed further. P. Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic net connection VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary http://www.venus.co.uk/vhg xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk) From peter at ursus.demon.co.uk Sun May 24 00:40:05 1998 From: peter at ursus.demon.co.uk (Peter Murray-Rust) Date: Mon Jun 7 17:01:30 2004 Subject: SGML/XML 98 Paris Message-ID: <3.0.1.16.19980523233912.1b9738a2@pop3.demon.co.uk> A fairy godmother made it possible for me to attend the latter half of the Paris meeting and this is a brief report for other XML-DEVers who weren't so fortunate. I only get about one chance a year to meet real-life XML people - a year ago in Barcelona, and occasionally some visiting London. This report is not comprehensive - I missed anything before Wed a.m. The conf was wound up by Tim Bray who asked 'What is a document?' This is important as Tim now categorises himself - and most of us - as indulging in 'document computing'. Tim threw up slides of objects that may or may not be documents - music, a book with no words, a signed baseball, etc. The essential message was that in working with XML we are working with the material of human culture - in many forms - and that we can both enjoy it and have a responsibility. The responsibility is the trust that is put in us to manage information for the benefit of humankind. There is no doubt that 'XML has arrived' and is here to stay. Unfortunately I missed Jon Bosak's plenary (and I missed Jon). His plenary was highly praised and - I believe- again stressed the responsibility that we have to make XML work by always bearing in mind that we are part of a greater community. In a later session Jon outlined the next stages of the XML process. I date not attempt my transcription here as Jon chooses his phrases very carefully and with great meaning. He explained how the namespace proposal had come from the requirement of many W3-members and the narrow path that the WG had to tread. It had been essential to be simple at the outset to avoid committing to something that later might not be found to be workable. He stressed the need for the different W3 groups to work together (e.g. he did not feel a separate - and therefore potentially isolated - group for XML-data would be a good idea.) For many people the joint plenaries on Wednesday (Jean Paoli, Microsoft, and Bernard Feinmann, Netscape) were the key piece of take-home evangelism. Both represented their companies as committed wholeheartedly to XML. JP summarised some of the themes: - 'Data should be free' (Charles Goldfarb) - 'Give the user [power] (Jon Bosak) [I forget the exact words]. and emphasized the critical power of text-lovers and free speech on the WWW. He summarised the many initiatives of vendors and others as showing that XML had really arrived (including CML :-). He talked about XML and HTML interoperating (see a URL at w3.org/TR/NOTE-xh-19980511.html if I got it right) and how