From mrc at allette.com.au  Tue Sep  1 02:32:23 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun  7 17:04:17 2004
Subject: Validating IDREFS...
References: <01BDD40A.50BDF630@wingate>
Message-ID: <35EB406E.1D9BA771@allette.com.au>

Attila Torcsvari wrote:

> Recently I wanted to make more checkable my SGML/XML source,
> and I have similar troubles mentioned on this thread.
> What should I do if my IDs and IDREFs contain non-name characters? Why is it so?!
> (My pages should be reachable for external databases, which use identifiers with the same syntax, so I can not just convert the values to something else.)

With SGML it's easy - you just need to extend the SGML declaration to allow the characters to be valid name characters. With XML, the only thing I can think
of off hand is to define the attributes as CDATA rather than ID/IDREF and write something to check that the values match up, possibly as a post process.
ID/IDREF processing can't be resolved until the end of the document has been reached anyway, so this might not be a huge disadvantage.


--
Regards,

Marcus Carr                 email:  mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email:  info@allette.com.au
Level 10, 91 York Street    www:    http://www.allette.com.au
Sydney 2000 NSW Australia   phone:  +61 2 9262 4777
                            fax:    +61 2 9262 4774
_______________________________________________________________


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From stevew at access.com.au  Tue Sep  1 09:52:02 1998
From: stevew at access.com.au (Steve Withall)
Date: Mon Jun  7 17:04:17 2004
Subject: ANNOUNCEMENT: XML Testbed software
Message-ID: <3.0.5.32.19980901174220.007f69e0@mail.access.com.au>

I am pleased to announce the release of an XML application environment
written in Java which I have been developing over the last eleven months or
so - as presented at the recent Montreal XML Developers' Conference. For
historical reasons it's called 'XML Testbed', but there's now more to it
that that.

The software uses an XML configuration file to define the (Swing-based)
user interface. It includes its own non-validating XML parser (though it
can use any SAX parser instead), a nascent XSL engine (to the old standard
- just in time to be out of date), and a few other odds and ends.

The key feature of the infrastructure is that it is intended to be easily
expandable, to allow application-specific functionality to be slotted in
dynamically. This is achieved by registering the classes to be instantiated
for given named elements, and invoking special behaviour in a generic way
by invoking a method called verify() on each element as soon as it has been
parsed.

The software is freely available for non-commercial use and can be
downloaded, with all source code, from the following sites:

   http://www.xml.com
   http://www.w3.org/XML/#9808withall
   http://www.gca.org/conf/xmldev98            (soon!)

Please don't hesitate to send me any comments you may have - criticisms,
suggestions, anything.

I'd like to thank xml.com, W3C and GCA for being so kind as to host this
software. Thanks also to my employer, Access Systems, for being so tolerant
in allowing me to pursue this activity, which is an independent, personal
endeavour.

Enjoy!

Steve Withall.


________________________________________________________________________
Steve Withall
Systems Architect                            Tel: 61 2 9957 1036
Access Systems Research Pty. Limited         Fax: 61 2 9959 5111
Level 10, 20 Berry Street
North Sydney NSW 2060                        Email: stevew@access.com.au
Australia                                    http://www.access.com.au

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep  1 10:39:37 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:17 2004
Subject: Validating IDREFS...
Message-ID: <005401bdd584$93b73a20$1e09e391@mhklaptop.bra01.icl.co.uk>

>The whole point of ID and IDREF is to have a simple
intra-document
>link, so that documents that have non-hierarchical natural
structures
>can be fitted into SGML/XML hierarchies

Indeed so. And the real difficulty with them is not the
inconvenience to parser writers, but the fact that they are
incompatible with XPointer.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep  1 11:06:24 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:17 2004
Subject: ANN: New SAXON Release (3.03)
Message-ID: <00da01bdd588$4d500400$1e09e391@mhklaptop.bra01.icl.co.uk>

A new version of SAXON (3.03) is available for download at
http://home.iclweb.com/icl2/mhkay/saxon.html

SAXON is a java class library that sits on top of a
SAX-compliant XML parser, providing additional services to
aid document manipulation and transformation. In general it
is designed to help you write applications that need to
process a *specific* document type, rather than for
general-purpose XML tools.

The distribution includes as a sample application
DTDGenerator, a tool that takes an XML document as input and
produces as output a DTD to which it conforms.

Principal changes in this version:
* Improved mechanisms for performing multiple document
passes when using the DOM
* Improved ParserManager for controlling which SAX parser to
use (now uses a Java properties file and incorporates a
starter list of known parsers)
* Updated to work (optionally) with Docuverse DOM-SDK.

(The previous unannounced version 3.01 worked with Free-DOM
3 and was on the web for fully three hours before Don Park
announced the replacement of Free-DOM by DOM-SDK. Is this a
record for software obsolescence?)

There are two classes included in SAXON which are
free-standing and which can probably add value to any SAX
application:
- ParserManager, which allows you to maintain a list of
installed parsers and to control which one should be
instantiated
- ExtendedInputSource, which subclasses the SAX InputSource
class to allow a java File to be supplied as the XML source

Terms of use have not changed: essentially free to use but
not to include in a commercial product. Source is included.
If you find SAXON useful, please let me know, it tells me my
time was not wasted!

Thanks to the correspondents, whose names I have forgotten,
who suggested the improvements in this version.

Michael Kay
M.H.Kay@eng.icl.co.uk
ICL Electronic Business Services


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From David.Rosenborg at xsse.se  Tue Sep  1 11:08:14 1998
From: David.Rosenborg at xsse.se (David Rosenborg)
Date: Mon Jun  7 17:04:17 2004
Subject: XML ATTRIBUTES : Are they attribute Names or Type Names?
References: <199808311925.OAA21202@foyt.indyrad.iupui.edu>
Message-ID: <35EBB8FB.B159989@xsse.se>

Hi,

Yes, this is exactly what I'm aiming at.

Mark Tucker wrote:
> 
> Hi David,
> 
>         I'm not sure what you said about typenames and attribute
> names, but I wonder if it is this problem. XML ELEMENT names are
> confused about whether they are acting as programming language "type
> names", or as programming language "field names".

[snip]

> <RECTANGLE>     -- type
>   <lower-left>          -- field
>       <POINT>           -- type
>           <x>3</x>      -- field
>           <y>4</y>
>       <POINT>
>   </lower-left>
...
> </RECTANGLE>

In a strongly typed system you could also write it like this

<rectangle>
  <lower-left><x>3</x><y>4</y></lower-left>
  ...
</rectanlge>

And the fact that lower-left is of type point would be expressed
in the schema. I guess that architectural forms
provide the basic machinery to express these relations at the
XML level (that is structurally), but I would also like
to formally express the relations of types and names in the
application level with some schema.

Cheers,

</David>

___________________________________________________________________
David.Rosenborg@xsse.se                      OM Exchange Technology

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep  1 12:15:37 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:17 2004
Subject: XML-QL
Message-ID: <00e501bdd591$faef4180$1e09e391@mhklaptop.bra01.icl.co.uk>

>I just discovered the XML Query Language proposal at
>http://www.w3.org/TR/NOTE-xml-ql/, and find it very
interesting.  It looks
>a lot like SQL, which could be handy, but also somewhat
limiting.  What do
>you'all think about it?


Thanks for drawing this to my attention.

My immediate reaction is to compare this not with SQL, but
with the new XSL "tree construction" facilities which
essentially provide an XML transformation language. I don't
have time to do a detailed point-by-point comparison but it
would certainly be a useful exercise. Conceptually they have
many similarities but there are many points of detail where
one is stronger than the other. I would think it is entirely
possible to devise a language that combines the power of
both without a significant loss of usability.

Generally XSL seems more oriented to the "document" paradigm
(an XML stream consists of sequential content interspersed
with markup) while XML-QL is more oriented to the "data"
paradigm (an XML stream is a serialisation of a database).
So XML-QL has much better facilities for operations such as
sort, join, aggregation, and IDREF dereferencing,  while XSL
is stronger on detecting patterns based on ordering of input
elements (e.g. the first-of-type() predicate).

Both proposals seem to concentrate primarily on transforming
the structure of the tree, with little emphasis on
transforming the character strings in its leaf nodes;
neither seems to be capable of doing something as elementary
as converting an attribute value to upper case. Also,
neither has matching operators oriented to free text
searching, e.g. linguistic word matching.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  1 12:21:56 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:17 2004
Subject: New SAXON Release (3.03)
Message-ID: <001001bdd590$f52b3b60$2ee044c6@arcot-main>

>A new version of SAXON (3.03) is available for download at
>http://home.iclweb.com/icl2/mhkay/saxon.html


Great.

>(The previous unannounced version 3.01 worked with Free-DOM
>3 and was on the web for fully three hours before Don Park
>announced the replacement of Free-DOM by DOM-SDK. Is this a
>record for software obsolescence?)


Well, you did it to me this time because I was about to release the PR2
version of the DOM SDK when I saw your announcement.  I am going to delay
the release to make sure it works with SAXON 3.03.  For your information,
PR2 includes the HTML layer support (lots of typing folks).

Best,

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ruchig at iitk.ac.in  Tue Sep  1 15:22:33 1998
From: ruchig at iitk.ac.in (ruchig)
Date: Mon Jun  7 17:04:18 2004
Subject: New SAXON Release (3.03)
In-Reply-To: <001001bdd590$f52b3b60$2ee044c6@arcot-main>
Message-ID: <Pine.HPP.3.96.980901185006.5886A-100000@apah.cc.iitk.ernet.in>


Really Great.
Thanks Don.

Regards
Ruchig

On Tue, 1 Sep 1998, Don Park wrote:

> >A new version of SAXON (3.03) is available for download at
> >http://home.iclweb.com/icl2/mhkay/saxon.html
> 
> 
> Great.
> 
> >(The previous unannounced version 3.01 worked with Free-DOM
> >3 and was on the web for fully three hours before Don Park
> >announced the replacement of Free-DOM by DOM-SDK. Is this a
> >record for software obsolescence?)
> 
> 
> Well, you did it to me this time because I was about to release the PR2
> version of the DOM SDK when I saw your announcement.  I am going to delay
> the release to make sure it works with SAXON 3.03.  For your information,
> PR2 includes the HTML layer support (lots of typing folks).
> 
> Best,
> 
> Don Park
> Docuverse
> 
> 
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
> 
> 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From amitr at abinfosys.com  Tue Sep  1 15:26:04 1998
From: amitr at abinfosys.com (Amit Rekhi)
Date: Mon Jun  7 17:04:18 2004
Subject: Is there a size limitation on XML file given to MSXSL as input?
Message-ID: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>

Hello,

ENVIRONMENT :-

Web Browser :- IE 4.01
XSL Processor :- MSXSL ActiveX Control

        Is there a restriction on the size of the XML file I give as input
to the MSXSL ActiveX control?
        I am using the MSXSL ActiveX control to display a XML file using the
following :-
    .
    .
    .
<OBJECT ID="XSLControl"
CLASSID="CLSID:2BD0D2F2-52EC-11D1-8C69-0E16BC000000"
                            codebase="msxsl.cab" width="100" height="100">
(***)  <PARAM NAME="documentURL" VALUE="Test.xml">
  <PARAM NAME="styleURL" VALUE="Test.xsl">
</OBJECT>
    .
    .
    .


PROBLEM

    Now when I give the documentURL property of the MSXSL (see *** above) an
XML file (say test.xml) which contains a large amount of data (say about
100-150 instances each element) , MSXSL does not render the XML file
(test.xml).

    Infact IE 4.01 hangs!

QUESTIONS

1) Is there a limitation on the amount of XML elements that can be present
in a XML file given as input to the MSXSL processor?

2) If so what is it?

3) If there exists a length limitation then ,how to send XML files
containing large amounts of data to MSXSL ActiveX control?

                    Any help would be appreciated,
                    Thanks in advance,

AMIT REKHI

Software Engineer,

A.B. Infosys. P. Ltd,

New Delhi,

INDIA.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Tue Sep  1 15:38:04 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:18 2004
Subject: XML-QL
References: <00e501bdd591$faef4180$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <35EBF5E5.1BB74120@technologist.com>

Michael Kay wrote:
> 
> My immediate reaction is to compare this not with SQL, but
> with the new XSL "tree construction" facilities which
> essentially provide an XML transformation language. I don't
> have time to do a detailed point-by-point comparison but it
> would certainly be a useful exercise. 

It would also be useful to compare XPointer, which is a sort of query that
returns a single node.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

Everything I touch turns into Python.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From macherius at darmstadt.gmd.de  Tue Sep  1 16:45:28 1998
From: macherius at darmstadt.gmd.de (Ingo Macherius)
Date: Mon Jun  7 17:04:18 2004
Subject: Is there a size limitation on XML file given to MSXSL as input?
In-Reply-To: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>
Message-ID: <199809011442.QAA29400@sonne.darmstadt.gmd.de>

Amit Rekhi <amitr@abinfosys.com> wrote at 1 Sep 98, 18:26:

>         Is there a restriction on the size of the XML file I give as input
> to the MSXSL ActiveX control?
>         I am using the MSXSL ActiveX control to display a XML file using the
> following :-

I used systematically tested free XML tools and found many of them 
choking on large input. Especially IE5b1 seems to have a crash-
guarantee when dealing with > 25.000 elements. The most common death 
is exponential time consumption while garbage collecting string 
fragments. Many Java-based tools suffer from that.

My afterall impression is that most available tools do well with toy 
examples, but any input being in the MB range easily blasts them. At 
least that's true for what came from MS so far.
 
> 1) Is there a limitation on the amount of XML elements that can be present
> in a XML file given as input to the MSXSL processor?

There is a way in SGML, namely SGML declarations. Maybe that wasn't 
an all-so-bad idea. Vendors should publish formal declarations for 
the capacity of their products. 
 
	++im
--
Ingo Macherius//Dolivostrasse 15//D-64293 Darmstadt//+49-6151-869-882
GMD-IPSI German National Research Center for Information Technology
mailto:macherius@gmd.de http://www.darmstadt.gmd.de/~inim/
Information!=Knowledge!=Wisdom!=Truth!=Beauty!=Love!=Music==BEST (Zappa)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Tue Sep  1 17:20:26 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:18 2004
Subject: Validating IDREFS...
In-Reply-To: <005401bdd584$93b73a20$1e09e391@mhklaptop.bra01.icl.co.uk>
	(M.H.Kay@eng.icl.co.uk)
Message-ID: <199809011518.LAA19289@ruby.ora.com>

[Michael Kay]
> >The whole point of ID and IDREF is to have a simple intra-document
> >link, so that documents that have non-hierarchical natural
> >structures can be fitted into SGML/XML hierarchies
> 
> Indeed so. And the real difficulty with them is not the
> inconvenience to parser writers, but the fact that they are
> incompatible with XPointer.

That's not true.  XPointer doesn't explicitly provide support for
IDREFs, but it doesn't need to, since they are built in to XML
itself.  And it's not clear what you meant by "they", but IDs are at
the heart of robust use of XPointer.

The first XSL draft provides a way, using the link and
link-end-locator formatting objects, to turn IDREFs into links using
XPointers.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Tue Sep  1 17:27:47 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:18 2004
Subject: Validating IDREFS...
In-Reply-To: <199809011518.LAA19289@ruby.ora.com>
Message-ID: <Pine.SUN.3.91.980901112116.23624C-100000@cito.uwaterloo.ca>

On Tue, 1 Sep 1998, Chris Maden wrote:

> The first XSL draft provides a way, using the link and
> link-end-locator formatting objects, to turn IDREFs into links using
> XPointers.

Terminological nit: A particular element either is or is not a link.
Link-ness is as inherent as paragraph-ness or title-ness. XSL provides a
way to format links (and other elements) as clickable hotspots. 

 Paul Prescod


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Tue Sep  1 17:36:37 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:04:18 2004
Subject: XML-QL
In-Reply-To: <00e501bdd591$faef4180$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.3.32.19981031113623.00c537c0@pop.mindspring.com>

At 11:19 AM 9/1/98 +0100, Michael Kay wrote:
 
>My immediate reaction is to compare this not with SQL, but
>with the new XSL "tree construction" facilities which
>essentially provide an XML transformation language. I don't
>have time to do a detailed point-by-point comparison but it
>would certainly be a useful exercise. Conceptually they have
>many similarities but there are many points of detail where
>one is stronger than the other. I would think it is entirely
>possible to devise a language that combines the power of
>both without a significant loss of usability.
 
In fact, at Metastructures 98 I presented a language called XQL that uses a
syntax very similar to XSL Patterns. This language was developed primarily
by Joe Lapp of webMethods and me. Like XML-QL, XQL is declarative.

One of the significant differences between XML-QL and XQL is that XQL can
do both hierarchy and sequence. The fundamental structural relationships in
XQL are:

o	hierarchy

	o	parent/child
	o	ancestor/descendant

o	sequence

	o	precedes
	o	immediately precedes

o	position

	o	subscripts
	o	ranges

I think sequence is pretty important in documents, though it is not
important in many data-oriented systems. XML-QL's heritage in relational
theory has caused it to ignore sequence.

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  1 18:02:48 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:18 2004
Subject: XML-QL
References: <3.0.3.32.19981031113623.00c537c0@pop.mindspring.com>
Message-ID: <35EC23CB.9C589B4D@finetuning.com>

Jonathan:

We can sink our teeth into this elusive spec so we can compare them head
to head?  Is there a URL available?  I'm doing a query language round up
for xml.com

So far I've got:

SQL (of course)
XPointer
XML-QL
XQL?
Appel...

more?

thanks everybody,

lisa

Jonathan Robie wrote:
> 
> At 11:19 AM 9/1/98 +0100, Michael Kay wrote:
> 
> >My immediate reaction is to compare this not with SQL, but
> >with the new XSL "tree construction" facilities which
> >essentially provide an XML transformation language. I don't
> >have time to do a detailed point-by-point comparison but it
> >would certainly be a useful exercise. Conceptually they have
> >many similarities but there are many points of detail where
> >one is stronger than the other. I would think it is entirely
> >possible to devise a language that combines the power of
> >both without a significant loss of usability.
> 
> In fact, at Metastructures 98 I presented a language called XQL that uses a
> syntax very similar to XSL Patterns. This language was developed primarily
> by Joe Lapp of webMethods and me. Like XML-QL, XQL is declarative.
> 
> One of the significant differences between XML-QL and XQL is that XQL can
> do both hierarchy and sequence. The fundamental structural relationships in
> XQL are:
> 
> o       hierarchy
> 
>         o       parent/child
>         o       ancestor/descendant
> 
> o       sequence
> 
>         o       precedes
>         o       immediately precedes
> 
> o       position
> 
>         o       subscripts
>         o       ranges
> 
> I think sequence is pretty important in documents, though it is not
> important in many data-oriented systems. XML-QL's heritage in relational
> theory has caused it to ignore sequence.
> 
> Jonathan
> 
> jonathan@texcel.no
> Texcel Research
> http://www.texcel.no
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  1 18:13:16 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:18 2004
Subject: XML-QL
References: <3.0.3.32.19981031113623.00c537c0@pop.mindspring.com> <35EC23CB.9C589B4D@finetuning.com> <f5bemtvx1ch.fsf@cogsci.ed.ac.uk>
Message-ID: <35EC2633.5531F7C6@finetuning.com>

Henry I am a little confused about perceiving XSL as a query language.

It was my understanding that it aims to be a transformation language,
like DSSSL, that works with a query or scripting language to
transform/process and format data.

Am I confused again :-)?

thanks,

lisa


Henry S. Thompson wrote:
> 
> I'd hope you will include the XSL query language
> (http://www.w3.org/TR/1998/WD-xsl-19980818#AEN310) and the LT-query,
> the query language from LT XML, our toolkit:
> http://www.ltg.ed.ac.uk/corpora/xmldoc/release/c335.htm
> 
> Cheers
> 
> ht
> --
>   Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
>      2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
>             Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
>                      URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  1 18:15:36 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:18 2004
Subject: xsl query or transformation language
Message-ID: <35EC2682.EB9C9E38@finetuning.com>

hey sorry about that I should have started a new thread.

Henry Thompson suggested XSL for my query language roundup, and, well
you know the rest...

lisa

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Tue Sep  1 18:27:51 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:18 2004
Subject: xsl query or transformation language
Message-ID: <3.0.32.19980901122818.006cc2d4@gw1.webmethods.com>

At 09:53 AM 9/1/98 -0700, Lisa Rein wrote:
>hey sorry about that I should have started a new thread.
>
>Henry Thompson suggested XSL for my query language roundup, and, well
>you know the rest...

I suspect that he was referring to the pattern language portion of
XSL -- the match and selection strings.

XSL patterns is an excellent query language; be sure to cover it.
--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Tue Sep  1 18:30:11 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:18 2004
Subject: xsl query or transformation language
In-Reply-To: Lisa Rein's message of Tue, 01 Sep 1998 09:53:22 -0700
References: <35EC2682.EB9C9E38@finetuning.com>
Message-ID: <f5bbtozx0f2.fsf@cogsci.ed.ac.uk>

Lisa Rein <lisarein@finetuning.com> writes:

> Henry I am a little confused about perceiving XSL as a query language.

> It was my understanding that it aims to be a transformation language,
> like DSSSL, that works with a query or scripting language to
> transform/process and format data.

Well, XSL defines its own query syntax for walking the input document
tree, 

a) for the purposes of deciding which style rules to apply to
which input document components (in which case you should think of a
query as returning a 'yes this matches' or a 'no this doesn't match'
result when applied to a node in the tree;

b) for the purposes of finding one or more bits of the tree to process
next, given a starting point (the node we're processing now), in which
case you should think of a query as returning a set of nodes in the
tree given a starting point.

A variant of type (b) where you just want the first element of the set
occurs as well.

Sounds like what I mean by a query language, how about you?

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Tue Sep  1 19:07:11 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:18 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
In-Reply-To: <199809011442.QAA29400@sonne.darmstadt.gmd.de>
References: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>
	<199809011442.QAA29400@sonne.darmstadt.gmd.de>
Message-ID: <199809011655.MAA00993@unready.megginson.com>

Ingo Macherius writes:

 > My afterall impression is that most available tools do well with
 > toy examples, but any input being in the MB range easily blasts
 > them. At least that's true for what came from MS so far.

I don't think that that's true in general.  Most of the Java-based XML
parsers I've tried seem to be able to handle Jon Bosak's XML Old
Testament (nearly 4MB) just fine, if somewhat slowly -- I used ot.xml
for routine testing and profiling while developing AElfred, and
AElfred barely kicked up a sweat.

The problem comes if the parser tries to build a tree rather than
simply reporting an event stream.  Depending on the implementation,
document trees tend to be very large.  With a naive tree
implementation, a 10MB document might use 100MB or more of virtual
memory for the document tree -- that'll bring most current desktop
systems to a screeching halt.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep  1 19:22:27 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:18 2004
Subject: Validating IDREFS...
Message-ID: <013601bdd5cd$9e1c2ae0$1e09e391@mhklaptop.bra01.icl.co.uk>

MK>> ... the real difficulty with [IDREFs] is ... the fact
that they are
MK>> incompatible with XPointer.
>
Chris Maden>That's not true.

I can use either an IDREF or an XPointer (within an XLink,
or otherwise) to define a reference by ID to another element
in the same XML document, but there are differences:

* The syntax is different ["idval" versus "ID(idval)" or
"#idval"]
* I can have several IDREF attributes in an element, but
only one XLink attribute
* A dangling IDREF is an error; a dangling XPointer is not

That is what I mean by saying the two facilities are
incompatible. Or to put it another way, once I have made a
design choice to use IDREF or to use XPointer for the links
in my documents, I am stuck with my choice.

This is one of several situations in the XML family of
standards where there is more than one way of doing the same
thing, and no obvious way to choose between them. As the
frequency of questions about the element-vs-attribute choice
shows, this confuses users no end; it also complicates
software tools.

MK


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  1 19:46:48 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:18 2004
Subject: Validating IDREFS...
References: <013601bdd5cd$9e1c2ae0$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <35EC3C2F.3C5FC7FC@finetuning.com>

Aren't these the kinds of inconsistencies that need to be "normalized"
(for lack of a better word) so that you could interoperate more easily
with disparate systems irregardless of which technique they chose?

In a perfect world, let's say, you can run a script and convert 
IDREF attributes to their XPointer equivalents, yes?   

Otherwise I sense danger!  Danger will robinson!

lisa

Michael Kay wrote:
> 
> MK>> ... the real difficulty with [IDREFs] is ... the fact
> that they are
> MK>> incompatible with XPointer.
> >
> Chris Maden>That's not true.
> 
> I can use either an IDREF or an XPointer (within an XLink,
> or otherwise) to define a reference by ID to another element
> in the same XML document, but there are differences:
> 
> * The syntax is different ["idval" versus "ID(idval)" or
> "#idval"]
> * I can have several IDREF attributes in an element, but
> only one XLink attribute
> * A dangling IDREF is an error; a dangling XPointer is not
> 
> That is what I mean by saying the two facilities are
> incompatible. Or to put it another way, once I have made a
> design choice to use IDREF or to use XPointer for the links
> in my documents, I am stuck with my choice.
> 
> This is one of several situations in the XML family of
> standards where there is more than one way of doing the same
> thing, and no obvious way to choose between them. As the
> frequency of questions about the element-vs-attribute choice
> shows, this confuses users no end; it also complicates
> software tools.
> 
> MK
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Tue Sep  1 20:05:30 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:19 2004
Subject: xsl query or transformation language
Message-ID: <3.0.32.19980901125907.00f3f68c@pophost.arbortext.com>

At 17:28 1998 09 01 +0100, Henry S. Thompson wrote:
>Lisa Rein <lisarein@finetuning.com> writes:
>
>> Henry I am a little confused about perceiving XSL as a query language.
>
>> It was my understanding that it aims to be a transformation language,
>> like DSSSL, that works with a query or scripting language to
>> transform/process and format data.
>
>Well, XSL defines its own query syntax for walking the input document
>tree, 
>
>a) for the purposes of deciding which style rules to apply to
>which input document components (in which case you should think of a
>query as returning a 'yes this matches' or a 'no this doesn't match'
>result when applied to a node in the tree;

Since what gets said here might get copied elsewhere, we should be
careful about wording.

XSL currently has no style rules and may never.  Henry is referring
to construction rules here.

I would disagree with Henry that pattern matching is querying in any
useful, usual sense of the word.  If there is a "query" (in the
non-technical sense of the word) at all here, it's "given a node in
the source document tree, what construction rule's pattern best matches
the given node's context?"  I don't see this as "returning" anything
in the usual sense, and I don't think it's helpful to confuse this
with what most people think of as queries even if the syntax of
match patterns is similar to (or even the same as) the syntax of
select patterns (which is what Henry discusses below).

I also disagree that XSL "walks the tree" as Henry mentions above.

I agree that the syntax of XSL patterns (both match patterns and
select patterns, since they use almost the same syntax) is a potentially
useful syntax for an XML-aware query language.
 
>
>b) for the purposes of finding one or more bits of the tree to process
>next, given a starting point (the node we're processing now), in which
>case you should think of a query as returning a set of nodes in the
>tree given a starting point.
>
>A variant of type (b) where you just want the first element of the set
>occurs as well.
>
>Sounds like what I mean by a query language, how about you?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From andrew at epiphanysoftware.com  Tue Sep  1 20:28:06 1998
From: andrew at epiphanysoftware.com (Andrew Cogan)
Date: Mon Jun  7 17:04:19 2004
Subject: Tools to convert Word to XML?
Message-ID: <35EC2EA8.E65CF9C1@epiphanysoftware.com>

Can anyone recommend good tools that can convert Word files to XML? I
don't need tools that claim XML compatibility per se; any utility that
gives me control over what tag to insert at the beginning of a style and
at the end of a style would probably suffice. The ability to work with
Word footnotes is a big plus.
-- 
 Andrew Cogan, Epiphany Software


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From roddey at us.ibm.com  Tue Sep  1 20:33:58 1998
From: roddey at us.ibm.com (Dean Roddey)
Date: Mon Jun  7 17:04:19 2004
Subject: A DTD for DCD
Message-ID: <5030300024689352000002L022*@MHS>


Here is a DTD I've been working on for DCD. It supports only the "strict" (i.e.
attribute'y) version of the syntax, and there are some ordering limitations
that are not implied by the spec, just because of I don't want to do 10
different content versions in some places in order to support all of the
possibilities. I'd appreciate any feedback on why it fits or does not fit, or
what sucks or does not suck about it.

Be forwarned that many of the examples in the spec won't make it through this
DTD. In some cases its because of simple ordering, but mostly its just because
the examples were not compliant with the spec. I've passed these failures back
to Ashok who I assume will update the next version. If anyone wants fixed
versions of all of the examples that will also go through this DTD, let me know
and I'll send them to you.

Anyway, any comments on this DTD or the readability of the DCD spec or on DCD
in general would be appreciated. I'll make sure that they get to Ashok. Also,
any comments on the RDF aspects of DCD would be appreciated.

Here is the DTD:

<?xml encoding="US-ASCII"?>

<!--
*   FILE: DCD.Dtd
*
*   This file provides a DTD for the DCD Schema, or at least for a restricted
*   syntax of it that is controlled by the "syntax=explicit" Processing
*   Instruction explained in section
*
*       "2.1.2 Interchangeability of Elements and Attributes"
*
*   This PI forces a subset of the legal properties of an element or attribute
*   definition to be implemented as attributes, not as nested elements.
-->

<!--
*   Define some parameter entities to simplify things below.
*
*   DataTypeList
*       The enumerated values for the "DataType" attribute.
*-->
<!ENTITY % DataTypeList "(id|idref|idrefs|entity|entities|nmtoken|nmtokens|
                        notation|string|number|int|decimal|boolean|dateTime|
                        dateTime.tz|date|time|time.tz|interval|i1|i2|i4|
                        i8|ui1|ui2|ui4|ui8|r4|r8|uri|char|enumeration)" >


<!--
*   The overall syntax of a DCD Schema is that is a sequence of elements,
*   entities, and global attribute defs. It can have a description and
*   namespace declaration at the top.
-->
<!ELEMENT DCD   (
                    Namespace?,
                    Description?,
                    (
                        ElementDef
                        | AttributeDef
                        | InternalEntityDef
                        | ExternalEntityDef
                    )*
                )>


<!--
*   An ElementDef is used to define the elements that the target XML
*   file can contain, the ordering of them, and their content.
*
*   Each ElementDef has an optional description, one or zero Groups, and
*   an open ended list of Attribute and AttributeDef elements. For the
*   sake of flexibility, the content model allows as many variations as
*   is reasonable, in terms of ordering. But it does not allow all of
*   the possible orderings, as it forces the description to come first.
*
-->
<!ELEMENT ElementDef    (
                            Description?,
                            (
                                (Group?, (AttributeDef|Attribute)*)
                                | (AttributeDef*, Group?, Attribute*)
                                | ((AttributeDef|Attribute)*, Group?)
                            )
                        ) >
<!ATTLIST ElementDef    Type NMTOKEN #REQUIRED
                        Model (Empty|Any|Data|Elements|Mixed) "Data"
                        Content (Open|Closed) "Closed"
                        Root (True|False) "False"
                        Default CDATA #IMPLIED
                        Datatype %DataTypeList; "String"
                        Min CDATA #IMPLIED
                        Max CDATA #IMPLIED
                        MinInclusive CDATA #IMPLIED
                        MaxInclusive CDATA #IMPLIED
                        Fixed (True|False) "False"
                        Length CDATA #IMPLIED
                        Scale CDATA #IMPLIED
                        Precision CDATA #IMPLIED
                        Picture CDATA #IMPLIED>


<!--
*   An AttributeDef element defines an attribute that can be used by
*   ElementDefs to define the attributes that type of element can have.
*   It can have a description and default value subelements, both of
*   which are optional.
*-->
<!ELEMENT AttributeDef  (Description?, Values?) >
<!ATTLIST AttributeDef  Name ID #REQUIRED
                        Default (True|False) "False"
                        Fixed (True|False) "False"
                        Global (True|False) "False"
                        Datatype %DataTypeList; #IMPLIED
                        Occurs (Required|Optional) "Optional"
                        ID-Role (ID|IDREF|IDREFS) #IMPLIED
                        Min CDATA #IMPLIED
                        Max CDATA #IMPLIED
                        MinInclusive CDATA #IMPLIED
                        MaxInclusive CDATA #IMPLIED
                        resource CDATA #IMPLIED>


<!--
*   An internal entity has a name and a value element. It is the same as
*   an internal entity in DTDs.
*-->
<!ELEMENT InternalEntityDef (Value?|Values?) >
<!ATTLIST InternalEntityDef Name ID #REQUIRED>


<!--
*   An external entity has a name and a System Id, and an optional public
*   id. It is the same as an external entity in DTDs.
*-->
<!ELEMENT ExternalEntityDef EMPTY >
<!ATTLIST ExternalEntityDef Name ID #REQUIRED
                            PublicID CDATA #IMPLIED
                            SystemID CDATA #REQUIRED
                            resource CDATA #IMPLIED >


<!--
*   A description is just an open ended content model, which can contain
*   text and desired formatting markup, but it is ignored by the DCD
*   validation mechanism.
*-->
<!ELEMENT Description ANY >


<!--
*   A Group is just a collection of Elements and/or nested Group elements.
*   It defines the ordering of those nested elements and groups. A group
*   must contain at least one element or group or it does not make sense.
*-->
<!ELEMENT Group (Element | Group)+ >
<!ATTLIST Group Occurs (Required|Optional|OneOrMore|ZeroOrMore) "Required"
                RDF:Order (Seq|Alt) "Seq">


<!--
*   Attribute elements just name an attribute used by a particular element.
*   They have no content in the DCD itself, just other attributes.
*-->
<!ELEMENT Attribute EMPTY >
<!ATTLIST Attribute Name IDREF #REQUIRED >


<!--
*   Element elements just name elements within a group, so their content
*   is just the name of the element they refer to. So their content is
*   just PCDATA.
*-->
<!ELEMENT Element (#PCDATA) >


<!--
*   A name element is basically a valid name token, and a value is just
*   open ended markup.
*-->
<!ELEMENT Name (#PCDATA) >
<!ELEMENT Value ANY >
<!ELEMENT Values ANY >


<!--
*   A namespace is just a URL, so we take it as PCDATA. Its validity
*   will be discovered when its used I guess.
*-->
<!ELEMENT Namespace (#PCDATA) >


<!--
*   The Length, Scale, and Precision values are actually numbers but we
*   can only indicate that they are PCDATA here.
*-->
<!ELEMENT Length (#PCDATA) >
<!ELEMENT Scale (#PCDATA) >
<!ELEMENT Precision (#PCDATA) >


----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Tue Sep  1 20:38:03 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:19 2004
Subject: xsl query or transformation language
In-Reply-To: <3.0.32.19980901125907.00f3f68c@pophost.arbortext.com>
Message-ID: <Pine.SUN.3.91.980901141434.23624D-100000@cito.uwaterloo.ca>

On Tue, 1 Sep 1998, Paul Grosso wrote:
> 
> I would disagree with Henry that pattern matching is querying in any
> useful, usual sense of the word.

Querying is about asking "what elements in the grove match this pattern?"
XSL matching is about asking "what patterns does this element match?" 
They are opposite, but they both have the "match" at their heart. The 
syntax of matching, at least, should be the same. Perhaps a generalized 
query language should also have extra stuff for transforming the 
matches.

> I agree that the syntax of XSL patterns (both match patterns and
> select patterns, since they use almost the same syntax) is a potentially
> useful syntax for an XML-aware query language.

I think almost everybody agrees that we should at least attempt this.

 Paul Prescod

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From macherius at darmstadt.gmd.de  Tue Sep  1 20:41:21 1998
From: macherius at darmstadt.gmd.de (Ingo Macherius)
Date: Mon Jun  7 17:04:19 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
In-Reply-To: <199809011655.MAA00993@unready.megginson.com>
References: <199809011442.QAA29400@sonne.darmstadt.gmd.de>
Message-ID: <199809011837.UAA08308@sonne.darmstadt.gmd.de>

David Megginson <david@megginson.com> wrote at 1 Sep 98, 12:55:

> Ingo Macherius writes:
> 
>  > My afterall impression is that most available tools do well with
>  > toy examples, but any input being in the MB range easily blasts
>  > them. At least that's true for what came from MS so far.
> 
> I don't think that that's true in general.  Most of the Java-based XML
> parsers I've tried seem to be able to handle Jon Bosak's XML Old
> Testament (nearly 4MB) just fine

That's right, but as discussed in some xml list few weeks ago that's 
"just" middleware. With few exceptions (e.g. Techno2000) parsers were 
fine.

> The problem comes if the parser tries to build a tree rather than
> simply reporting an event stream.

How many real world applications will be happy with just the event 
stream ? XSL-visualization always needs two trees, the parser tree 
and the resulting Formatting Object Tree (FOT). Double impact ! XML-
querys/DOM need to build a transformed versions. Triple impact !

Each processing stage seems to duplicate data over and over. A 
possible way out is a shared pool which trees may only point to. 
IBM's xml4j goes in that direction with "subtree hashes". And 
(surprise, surprise) DOM-processing with xml4j was feasible.

> Depending on the implementation,
> document trees tend to be very large.  With a naive tree
> implementation, a 10MB document might use 100MB or more of virtual
> memory for the document tree -- that'll bring most current desktop
> systems to a screeching halt.

IE5b1 needs 28MB for the parse tree of an 0.6 MB document and the 
resulting  (very simple) JScript generated FOT. "Game Over" happens 
if I increase the source document size from 0.6MB to 0.8 MB. Little 
change, great effect. I won't even mention the one minute screen 
freeze while JavaScript/CSS processing. OK, my scripts are straight 
forward, but I wouldn't call them plain dumb. I hope MS does uses a 
"naive" implementation in the beta ...

Cruel reality ... XML rules viewed from theoretical point. But I was 
beamed from campus right to heavy-duty database research. I'm the XML-
geek, and I'm given database community tasks. Solving them with 
today's XML-tools turned out harder than expected.

	++im


--
Ingo Macherius//Dolivostrasse 15//D-64293 Darmstadt//+49-6151-869-882
GMD-IPSI German National Research Center for Information Technology
mailto:macherius@gmd.de http://www.darmstadt.gmd.de/~inim/
Information!=Knowledge!=Wisdom!=Truth!=Beauty!=Love!=Music==BEST (Zappa)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Tue Sep  1 21:31:00 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:04:19 2004
Subject: xsl query or transformation language
References: <3.0.32.19980901125907.00f3f68c@pophost.arbortext.com>
Message-ID: <35EC4D58.E9008CDC@mecomnet.de>

Paul Grosso wrote:
> 
> At 17:28 1998 09 01 +0100, Henry S. Thompson wrote:
> ...
> >
> >Well, XSL defines its own query syntax for walking the input document
> >tree,
> >
> ...
> 
> I also disagree that XSL "walks the tree" as Henry mentions above.
> 

While the following is not specified to be the only implementation, it, in
itself, sounds very much like a pre-order, depth-first tree walk:

2.3 Processing Model

       Ed. Note: This needs expanding and polishing.

A node is processed to create a result tree fragment. The result tree is
constructed by processing the root node. A node is processed by finding all
the template rules with patterns that match the node, and choosing the best
amongst them. The chosen rule's template is then instantiated for the node.
During the instantiation of a template, the node for which the template is
being instantiated is called the current node. A template typically contains
instructions that select an additional sequence of source nodes for
processing. A sequence of source nodes is processed by appending the result
tree structure created by processing each of the members of the sequence in
order. The process of matching, instantiation and selection is continued
recursively until no new source nodes are selected for processing.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Tue Sep  1 22:00:17 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:19 2004
Subject: xsl query or transformation language
In-Reply-To: james anderson's message of Tue, 01 Sep 1998 21:39:05 +0200
References: <3.0.32.19980901125907.00f3f68c@pophost.arbortext.com> <35EC4D58.E9008CDC@mecomnet.de>
Message-ID: <f5b90k3wqm2.fsf@cogsci.ed.ac.uk>

Thanks to Paul G. for correcting my sloppy terminology in referring to
construction rules by the wrong name, my mistake.

And Paul G. and Paul P. are both right that my (a) is better described
as asking of a set of construction rules which one should be used for
a given node, a process in which pattern matching plays a part, but
not the only part.

As for walking the tree, we're ALL right (all right!) on this one.
The 90% case for construction rules in simple stylesheets is for them
to contain templates with xsl:process-children in their midst.  If ALL
construction rules in a stylesheet are like that, then the nodes in
the source tree will indeed yield results which are assembled as if by
pre-order.  But note the next line of the draft:

 "Implementations are free to process the source document in any way
   that produces the same result as if it were processed using this
   processing model."

It's important to remember this:  there is no guarantee of sequential
processing even in the simple case.  Because construction rules are
independent of one-another, it doesn't matter what order bits of the
result tree are constructed in.

However there is one case in which a preorder sequencing is implied, in
which case things look pretty much like querying:

 <xsl:process select='.//target'/>

will process ALL the descendants of the current element whose element
type is 'target' in preorder [this isn't said explicitly in the draft,
but should be:  it is implied in the description of xsl:process]

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Tue Sep  1 23:03:51 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:19 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
In-Reply-To: <199809011837.UAA08308@sonne.darmstadt.gmd.de>
References: <199809011442.QAA29400@sonne.darmstadt.gmd.de>
	<199809011655.MAA00993@unready.megginson.com>
	<199809011837.UAA08308@sonne.darmstadt.gmd.de>
Message-ID: <199809012057.QAA01820@unready.megginson.com>

Ingo Macherius writes:

 > > The problem comes if the parser tries to build a tree rather than
 > > simply reporting an event stream.
 > 
 > How many real world applications will be happy with just the event 
 > stream ? XSL-visualization always needs two trees, the parser tree 
 > and the resulting Formatting Object Tree (FOT). Double impact ! XML-
 > querys/DOM need to build a transformed versions. Triple impact !

Yes, but often the trees can be built and discarded at a fairly low
level.  For example, if I have a serialised database table like

  <table>
   <entry>
    <name>David Megginson</name>
    <email>david@megginson.com</email>
   </entry>
   <entry>
    <name>Ingo Macherius</name>
    <email>macherius@darmstadt.gmd.de</email>
   </entry>
   <!-- etc., 2,500,000 times -->
  </table>

I do not need to build a tree for the whole document; instead, I can
cache the information for each entry (or each n entries, for
efficiency), dump it into my SQL database (or whatever), then move on
to the next set.

The second situation is where you are using XML to serialise a data
model that is already well-defined (as for vector graphics).  In this
case, it makes more sense to build the specialised object tree
directly from the event stream rather than building a DOM tree only to
tear it down.  Specialised object trees can be considerably smaller
than a corresponding DOM tree, depending on the format.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep  2 03:12:53 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:19 2004
Subject: xsl query or transformation language
Message-ID: <009f01bdd60f$72e166f0$2c6167cb@ntwork.harvestroad.com.au>

While we are correcting terminology, is it _Formatting_ or _Flow_ Objects?

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep  2 03:38:34 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:19 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on 
 XML file given to MSXSL as input?)
References: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>
		<199809011442.QAA29400@sonne.darmstadt.gmd.de> <199809011655.MAA00993@unready.megginson.com>
Message-ID: <35ECA1CC.FEF3EF57@infinet.com>

David Megginson wrote:

> Ingo Macherius writes:
>
>  > My afterall impression is that most available tools do well with
>  > toy examples, but any input being in the MB range easily blasts
>  > them. At least that's true for what came from MS so far.
>
> I don't think that that's true in general.  Most of the Java-based XML
> parsers I've tried seem to be able to handle Jon Bosak's XML Old
> Testament (nearly 4MB) just fine, if somewhat slowly -- I used ot.xml
> for routine testing and profiling while developing AElfred, and
> AElfred barely kicked up a sweat.
>
> The problem comes if the parser tries to build a tree rather than
> simply reporting an event stream.  Depending on the implementation,
> document trees tend to be very large.  With a naive tree
> implementation, a 10MB document might use 100MB or more of virtual
> memory for the document tree -- that'll bring most current desktop
> systems to a screeching halt.

This is especially true for Java which is very memory hungry.  Most of the memory
problems with objects can be significantly reduced if your nodes only allocate
memory for sub-arrays as needed (most implementations I would assume would use an
array rather than a Vector to store children).  Also, if there is only one child,
do not create an array just to store that one child.

In other words, you have something like this:

class Node {
  Node child;
  Node[] children;
  int nodeLength
}

if child is null, then there are no elements
if the size ever goes above 1, set child to null and copy the contents of child
into children[0] and the parameter node into children[1].

Then when you look up a child by index of name you first test to see if child is
null.  If it is not then return the child if the index requested is 0, otherwise
the index is out of bounds.  If child is null, test to see if children is null.
If children is not null, then just look up the node by index.  If children is
null then there are no elements (nothing has been added or deleted).

For a lot of trees where it is somewhat common for nodes to only have one child,
this can save you a lot of memory.  It can also speed up your tree traversals a
bit since you do not have to look up the children nodes by index in the case
where there is only one child.  Also, for building the tree, you will likely
speed your app up a lot since you will now only have to create a new array object
if the child index is greater than 1.  Otherwise it is just a reference
assignment which is about as fast as an integer assignment.

I have not had a lot of problems building trees.  For the DOM implementation, in
conjunction with the parser I have, I build a DOM tree off of Jon Bosak's ot.xml
in about 12 seconds running a JIT with JDK 1.2 b4, on an old P-120 with 64 megs
of RAM running Windows NT 4.0.  I have not been able to do any reliable memory
benchmarks because the GC seems to be invoked much frequently with SUN's JDK 1.2
VM.

I would suspect that the DOM package provided by Don Park has similiar
performance and memory consumption.  Your best bet would probably be to look at
an XSL package which takes a DOM tree of your XML data, and a DOM tree of an XSL
stylesheet and spits out the content.  That way you are not stuck with an MS,
IBM, Oracle, or whatever implementation that you are not happy with.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep  2 04:38:00 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:20 2004
Subject: Cement Shoes for XML?
Message-ID: <199809020237.WAA12934@hesketh.com>

Some of you may be interested in my latest essay, "Cement Shoes for XML?",
which explores some reasons why client-side XML support, at least in the
dominant browsers, may be slow in coming or perhaps worse.  Hopefully, I
haven't said anything too rude, but it's the result of several weeks of
frustration brought on by reading articles about how slow and/or broken the
XML
development efforts in the current browsers appear to be.

Developers planning on using server-side tools to send XML to browser clients
will probably find it most alarming. (Myself included.)

The essay is posted at:
http://www.simonstl.com/articles/cement.htm


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Wed Sep  2 07:34:53 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:20 2004
Subject: xsl query or transformation language
References: <3.0.32.19980901125907.00f3f68c@pophost.arbortext.com>
Message-ID: <35ECC1AA.23C04E7A@jclark.com>

Paul Grosso wrote:

> I would disagree with Henry that pattern matching is querying in any
> useful, usual sense of the word.  If there is a "query" (in the
> non-technical sense of the word) at all here, it's "given a node in
> the source document tree, what construction rule's pattern best matches
> the given node's context?"  I don't see this as "returning" anything
> in the usual sense, and I don't think it's helpful to confuse this
> with what most people think of as queries even if the syntax of
> match patterns is similar to (or even the same as) the syntax of
> select patterns (which is what Henry discusses below).

I think it's possible and desirable to think of match patterns as
queries, because select patterns are definitely queries and it's highly
desirable to have a unified semantic model for both uses of patterns.

If you say that a node matches a match pattern if the node is in the set
returned by evaluating the match pattern as a query with the input set
as all the nodes in the document, everything works out fine.

For example, "foo" as a query returns the nodes that are children of
nodes in the input set and that are of type "foo". When used as a select
pattern, the input node set is the current node, so it returns the
children of the current node of type foo.  When used as a match patterm,
the input nodes set is all the nodes in the document, so it matches the
all nodes of types "foo" (this works for the document element since that
is a child of the root).

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From john at totten.com  Wed Sep  2 08:30:34 1998
From: john at totten.com (John Totten)
Date: Mon Jun  7 17:04:20 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
References: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>
		<199809011442.QAA29400@sonne.darmstadt.gmd.de>
		<199809011655.MAA00993@unready.megginson.com>
		<35EC43B6.C25052AB@totten.com> <199809012042.QAA01773@unready.megginson.com>
Message-ID: <35ECF542.850D47EB@totten.com>

Take some time to review this little item.

	http://www.equi4.com/metakit/index.html

	I have been playing around with this and would like to link it to a DOM
parser so that the tree was built in a persistence store rather than
memory. Being dynamically configurable makes this an ideal vehicle for
doing this. You could thereafter deal directly with the object store or
even just the view (indexed into the store) and not repeatedly reparse
the document. It also removes any limits on the size of the document
that you could parse in a single pass.
	If anyone succeeds in doing this then I let me know please.

						John Totten


David Megginson wrote:
> 
> John Totten writes:
> 
>  > > With a naive tree implementation, a 10MB document might use 100MB
>  > > or more of virtual memory for the document tree -- that'll bring
>  > > most current desktop systems to a screeching halt.
>  >
>  >      Do you mean 100MB for the stack of parsed treeview objects as
>  > opposed to the GUI toolkit. And if so then why does it take so much
>  > space when all it takes is the addition of the parent ID to the
>  > serialised data item?
> 
> I don't recognise some of your terminology -- perhaps it is
> MS-specific.  In general, the GUI should not add significantly to the
> storage requirements (especially with an MVC architecture, like the
> one used by the Java Swing components) -- what takes up the room is
> the tree of nodes representing elements, attributes, character data,
> etc.
> 
> All the best,
> 
> David
> 
> --
> David Megginson                 david@megginson.com
>            http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep  2 09:20:43 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:20 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on 
 XML file given to MSXSL as input?)
References: <011601bdd5ac$a4a79820$0101a8c0@server.abinfosys.com>
			<199809011442.QAA29400@sonne.darmstadt.gmd.de>
			<199809011655.MAA00993@unready.megginson.com>
			<35EC43B6.C25052AB@totten.com> <199809012042.QAA01773@unready.megginson.com> <35ECF542.850D47EB@totten.com>
Message-ID: <35ECF201.E3EA1392@infinet.com>

John Totten wrote:

> Take some time to review this little item.
>
>         http://www.equi4.com/metakit/index.html
>
>         I have been playing around with this and would like to link it to a DOM
> parser so that the tree was built in a persistence store rather than
> memory. Being dynamically configurable makes this an ideal vehicle for
> doing this. You could thereafter deal directly with the object store or
> even just the view (indexed into the store) and not repeatedly reparse
> the document. It also removes any limits on the size of the document
> that you could parse in a single pass.
>         If anyone succeeds in doing this then I let me know please.
>
>                                                 John Totten

This is an interesting idea that could probably not be too hard to implement in
Java using a read only random access file.  Basically, a stream based parser would
dump the contents of a very large document directly into some DOM format for a
random access file.  You would then have a special DOM implementation that is an
interface to this file.

Nevertheless, this sort of stuff would probably best be handled by some
comprehensive database which presents a DOM interface to the DOM data.  I think
this is probably what companies like Oracle and IBM may be up to, but who really
knows.  If you wanted to go to the extreme you could even represent an entire
directory service like NDS this way.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bmhughes at ozemail.com.au  Wed Sep  2 10:11:57 1998
From: bmhughes at ozemail.com.au (Baden Hughes)
Date: Mon Jun  7 17:04:20 2004
Subject: Tools to convert Word to XML?
In-Reply-To: <35EC2EA8.E65CF9C1@epiphanysoftware.com>
Message-ID: <001101bdd649$1f4871c0$dd3570c2@bmhmobile>


>  Andrew Cogan, Epiphany Software
> Can anyone recommend good tools that can convert Word files
> to XML? I don't need tools that claim XML compatibility per se; any
> utility that gives me control over what tag to insert at the
beginning
> of a style and at the end of a style would probably suffice. The
ability
> to work with Word footnotes is a big plus.

We've worked on this using Word styles and a heap of macros which
allow us to "export" remapping the styles to markup. Basically you
figure out what markup you'd like applied and then what WYSIWYG style
you would like the user to apply (ie work in) then on export the
styles are all remapped to XML tagging.

The beauty of this is that you can tag as much as you want - the
tagging is all done behind 'formatting' which from the user point of
view is just fine. They don't have to know about XML, just about which
styles to apply where in a Word document. And you can also export/hack
the stylesheet to be CSS if you are really going for it then you have
the markup and the formatting instructions separately.

If you want to know more specifics, mail me privately.

Baden


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Wed Sep  2 10:19:18 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:20 2004
Subject: State of browsers/JUMBO (was Re: Cement Shoes for XML?)
In-Reply-To: <199809020237.WAA12934@hesketh.com>
Message-ID: <3.0.1.16.19980901091011.223f28e8@pop3.demon.co.uk>

At 22:39 01/09/98 -0400, Simon St.Laurent wrote:
>Some of you may be interested in my latest essay, "Cement Shoes for XML?",
>which explores some reasons why client-side XML support, at least in the
>dominant browsers, may be slow in coming or perhaps worse.  Hopefully, I
>haven't said anything too rude, but it's the result of several weeks of
>frustration brought on by reading articles about how slow and/or broken the
>XML
>development efforts in the current browsers appear to be.
>
>Developers planning on using server-side tools to send XML to browser clients
>will probably find it most alarming. (Myself included.)

Minor correction: the essay suggests that JUMBO is a specialist browser for
chemistry - in fact it is a general element-oriented browser which can
address any domain where functionality can be provided by adding code on a
per-element basis. For example I have added a simple vector graphics module
which can be expended to support VML/PGML/FOO when those firm up (and when
I feel confident enough to start using JDK1.2/Java2D for the additional
primitives).

I was surprised not to see more browsers at Montreal developers' day - my
current assessment is that the main authors are:
	- Steve Withall (XXX)
	- Scott Parnell (Raven)
	- PeterMR (JUMBO)
All of these use Swing (Java) for their rendering and all suffer from the
bugginess of Swing. [This bugginess was confirmed by several
conversations.]. For example we have found it difficult to provide proper
formatting using the Style/AttributeSet and received wisdom is that the
author has to rewrite parts of Swing to get it to work.

	As a result of the lack of browsers I have spent time investigating how
JUMBO2 might be expanded to meet 'most' needs. It seems that the following
may be valuable:
	- support for simple styles. I am developing an approach where readers can
select per-element behavior for stylesheets [i.e. bold, leading CR, display
start-tag, colour could be selected from a menu of elements.] It would
probably be relatively simple to make it into a subset of CSS. I don't
intend to do this myself because I don't have the time or passion to worry
about rendering on screen at least till Swing is better.
	- simple searching (similar to XPointer).
	- HTML-like forms/CDC/XML-data. I demo'ed this briefly at Montreal - the
forms are created by the XML input and can be written out with edited
values. This seems to me a simple, valuable thing that a browser can do to
enhance the value of XML over HTML. Thus JUMBO can apply algorithms very
easily to client-side data entry to validate before upload. I don't imagine
I'm the only person who thinks this could be a useful function.
	- vector graphics. JUMBO will have the ability to read or edit simple
vector graphics. Not completely finished, but I've done this before. This
would allow simple - if not instantaneous - collaborative graphical
working. Doesn't this excite other people??
	- structural and per-element editing. The only thing I am not going to do
is write a text editor. Editing other elements is easier, both individually
and for structure.

	I have repeatedly suggested that we develop these tools communally and
have offered JUMBO on this basis. I've had a few replies, but not as many
as I would have hoped. Is everyone paralysed by waiting for others to do
it??? It seems inconceivable that we couldn't write a useful simple browser
starting from where we are. 

	I have written a manifesto about this on http://www.xml.com (xml:geek) if
you want further motivation.

	P.

The next snapshot of JUMBO should be out this in day or two. It's not
polished - and some of the things and only part implemented - but most of
the bits are there somewhere.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Wed Sep  2 11:03:10 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:20 2004
Subject: DOCTYPE decl
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036020@WROX3>

Can I just check the following with folks on the list:

I have what I would like to be a well-formed, but not valid, document:

<!DOCTYPE mydocname [
<!ENTITY  entityname  "Some replacement text">
]>

<mydocname>&entityname;</mydocname>

Question:
If I have to include a !DOCTYPE declaration in order to declare
entities, am I also forced to declare the root element with an !ELEMENT
decl. too?

Thanks,

Peter Jones
WebDev Technical Editor
Wrox Press
mailto:peterj@wrox.com
***************
Wrox Press UK Ltd.
http://www.wrox.co.uk
Tel 44 121 706 6826
Fax 44 121 706 2967


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep  2 11:16:04 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:20 2004
Subject: DOCTYPE decl
Message-ID: <001b01bdd652$ec4d1960$2c6167cb@ntwork.harvestroad.com.au>

-----Original Message-----
From: Peter Jones <peterj@wrox.com>

>Can I just check the following with folks on the list
>
>I have what I would like to be a well-formed, but not valid, document:
>
><!DOCTYPE mydocname [
><!ENTITY  entityname  "Some replacement text">
>]>
>
><mydocname>&entityname;</mydocname>
>
>Question:
>If I have to include a !DOCTYPE declaration in order to declare
>entities, am I also forced to declare the root element with an !ELEMENT
>decl. too?


No, not at all. What you have above is well-formed, but not valid, just as
you want.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Wed Sep  2 11:27:59 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:20 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
Message-ID: <002901bdd654$7e92f8c0$1e09e391@mhklaptop.bra01.icl.co.uk>

>... You could thereafter deal directly with the object
store or
>even just the view (indexed into the store) and not
repeatedly reparse
>the document.

The sirens have lured you!

I have a lot of experience with storing parsed document
trees in an object database and I have experimented with
storing the Java serialization of DOM-like models on disk,
and for what it's worth, in both cases retrieving the
document takes a lot longer than reparsing original XML. The
main reason is simply that there are more bytes to read.

The only technique that I find really effective for handling
large documents is to split them up into lots of small ones.
That way you only parse the bits the user actually wants to
see.

MK


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Wed Sep  2 12:44:15 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:20 2004
Subject: PARAMETER entities & WFness
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036025@WROX3>


Since for well-formed but not valid docs I can't rely on external
parameter entities to be read and included, is it true that

parameter entities really only have one use for us in our
well-formed-but-not-valid documents: to alter the order in which entity
declarations are parsed?

If so, can anyone think of a situation where this would need to be
resorted to?
Peter Jones
WebDev Technical Editor
Wrox Press
mailto:peterj@wrox.com
***************
Wrox Press UK Ltd.
http://www.wrox.co.uk
Tel 44 121 706 6826
Fax 44 121 706 2967


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Wed Sep  2 13:20:02 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:20 2004
Subject: State of browsers/JUMBO 
Message-ID: <005501bdd664$28bb9e60$1e09e391@mhklaptop.bra01.icl.co.uk>

> As a result of the lack of browsers I have spent time
investigating how
>JUMBO2 might be expanded to meet 'most' needs...
> I have repeatedly suggested that we develop these tools
communally and
>have offered JUMBO on this basis. I've had a few replies,
but not as many
>as I would have hoped.

Some observations:

- I tried (more than once) downloading Jumbo and exploring
what it could do for me. I didn't make much progress.
There's clearly an enormous amount of functionality there,
but I found it very hard to know where to start. I've had
the same experience with downloads of other XML software
(most recently XML Toolkit), and I dare say others have had
the same experience with my own SAXON library.

- I suspect that as a community the thing we are desperately
lacking is a commonly understood architecture. We're all
writing bits of code that do useful things with XML, but we
don't have a clear vision as to what the total set of
capabilities should look like or how its components should
relate to each other. I think this is why it's hard to take
something like Jumbo and discover quickly what pieces of the
jigsaw it supplies.

- Having all these people produce free software is great,
but the downside is that most of it was written to satisfy
the intellectual creativity and/or parochial application
requirements of the individual author, which means that the
boring parts of software development, like working out who
the users are and writing good task-oriented documentation
to meet their needs, have been sadly neglected. Perhaps this
is why real product developers like Microsoft seem to be
slow. Fred Brooks, I recall, said that writing a one-off
program is one-tenth the effort of producing a software
product that does the same thing.

Regarding Simon's essay, I don't share his pessimism. I
don't personally regard client-side browser support for XML
as particularly urgent, I'm quite happy to do rendition
server-side either on demand, or in many cases at site
generation time. One reason is that the client-side model
(along with XLink and XSL) seems to assume that the web of
XML documents has the same topology as the web presented to
the user, which I think grossly underexploits the ability of
XML to separate the information structure from the user
view. I think a much more valuable development would be the
integration of XML with database technology. (And in
practice, I'm actually using XML mainly for "EDI" style
applications. )

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Wed Sep  2 15:04:57 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:21 2004
Subject: xsl query or transformation language
Message-ID: <009d01bdd670$e8707fd0$2ee044c6@arcot-main>

>I think it's possible and desirable to think of match patterns as
>queries, because select patterns are definitely queries and it's highly
>desirable to have a unified semantic model for both uses of patterns.


Thinking of match patterns as queries would certainly appeal to database and
IR folks but I like to like of match patterns as 'criteria'.

As far as XSL processing model goes, I think of it as a sort of an inference
engine.  What I would have liked to see in XSL is the nested template
feature which is wonderfully interesting and confusing at the same time
because it is like a clever expression looking for a deep meaning.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Wed Sep  2 15:05:16 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:21 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
Message-ID: <009e01bdd670$e935a490$2ee044c6@arcot-main>

>I would suspect that the DOM package provided by Don Park has similiar
>performance and memory consumption.  Your best bet would probably be to
look at
>an XSL package which takes a DOM tree of your XML data, and a DOM tree of
an XSL
>stylesheet and spits out the content.  That way you are not stuck with an
MS,
>IBM, Oracle, or whatever implementation that you are not happy with.


About 10 seconds and 10 meg of memory to convert each meg of XML into DOM
with JIT enabled.

My solution to the problem of DOM building speed is based on that famous
doctor joke about a guy telling his doctor "Doctor, it hurts when I do
this!".  The trick lies in reducing the need to build DOM everytime the XML
document changes.  Unless you are into pain, that is.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Wed Sep  2 15:05:54 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:21 2004
Subject: XML-QL
Message-ID: <009f01bdd670$ea00e3d0$2ee044c6@arcot-main>

>Henry I am a little confused about perceiving XSL as a query language.
>
>It was my understanding that it aims to be a transformation language,
>like DSSSL, that works with a query or scripting language to
>transform/process and format data.
>
>Am I confused again :-)?


No.  XSL is definitely not a query language although it uses ideas which
could be useful for querying.

What I am concerned about is the obsession with thinking about XML as a
'thing'.  I think it is much more interesting to think of XML as a language
of communication.

When we are talking about XML query language, are we talking about
expressing a query in XML or are we talking about an expression, not
necessarily in XML, that will return information from data sources, not
necessarily XML document repositories, in XML format?

What we have is a multitude of needs and I think we need to sort out which
of them address before we unintentionally drop XQL into Wired Hot/Cold list.

BTW, there is another topic related to XML-QL which is XBE or
XML-[query]-By-Example.  I am not sure if anyone is working on it because I
just thought of it <g>.  You just write an example of the XML document you
want by specifying the tags as you want it and fill in the values to match.
XBE engine uses it to return the data the way you want exactly, rejecting
mismatches and weeding out unwanted information.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Wed Sep  2 15:28:41 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:21 2004
Subject: XML-QL
Message-ID: <3.0.32.19980902092901.00a7b3c4@gw1.webmethods.com>

At 05:53 AM 9/2/98 -0700, Don Park wrote:
>No.  XSL is definitely not a query language although it uses ideas which
>could be useful for querying.

I would argue that XSL patterns is definitely a query language, but that
the template language portion of XSL (the big picture) probably is not.

>[...]
>When we are talking about XML query language, are we talking about
>expressing a query in XML or are we talking about an expression, not
>necessarily in XML, that will return information from data sources, not
>necessarily XML document repositories, in XML format?

I can represent SQL in XML, OQL in XML, even XPointers in XML.  XML is a
way to represent data structures.  I can represent C, C++, Java, and Pascal
all in XML if I want to.  I can chose an XML representation that provides
exactly the information found in the non-XML representation.

This suggests to me that the more interesting challenge is to create a
language that queries data structures that are represented in XML, not to
just create a query language that is expressed in XML.

I don't mean to make any statement about whether this language for querying
XML should itself be expressed in XML -- that's an orthogonal issue.

>[...]
>BTW, there is another topic related to XML-QL which is XBE or
>XML-[query]-By-Example.  I am not sure if anyone is working on it because I
>just thought of it <g>.  You just write an example of the XML document you
>want by specifying the tags as you want it and fill in the values to match.
>XBE engine uses it to return the data the way you want exactly, rejecting
>mismatches and weeding out unwanted information.

Interesting approach -- sort of the inverse of XSL -- but I suspect that in
order to be useful it won't be so simple.  In its simplest form, you'd only
return the elements that you provided, serving only as an existence test,
returning no other information.  QBE at a minimum requires wildcards, and
it would be interesting to find an appropriate set of wildcards.
--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Wed Sep  2 15:46:55 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:21 2004
Subject: Another XML query language (was Re: XML-QL)
Message-ID: <3.0.32.19980902094723.00ab57a0@gw1.webmethods.com>


Here's another markup query language that is actually quite mature.  It is
called the webMethods Object Model (WOM), which is not to be confused with
APIs such as DOM (WOM is a string-based query language).

We've been using it in our products for for than a year and a half now.
WIDL depends on it.  You can think of WIDL as an application of a query
language.  WOM was originally developed for HTML, but has proven more
valuable with XML.  We use our latest incarnation of WIDL with XML all the
time; WIDL is a language for crossing the tree/data-structure barrier that
exists between flow object representations and programming languages.
(I'll be giving a speech on WIDL for XML at the XML 98 conference.)

I've put some pages up on our site.  These pages are from our product
documentation.  They probably aren't as accessible in this form, since I
went through and removed the product tutorial pieces.  Hopefully I didn't
leave to many links dangling.  The URL:

  http://www.webMethods.com/technology/wom.html

At 09:41 AM 9/1/98 -0700, Lisa Rein wrote:
>
>So far I've got:
>
>SQL (of course)
>XPointer
>XML-QL
>XQL?
>Appel...
>
>more?
>
>thanks everybody,
>
>lisa
>
>Jonathan Robie wrote:
>> 
>> At 11:19 AM 9/1/98 +0100, Michael Kay wrote:
>> 
>> >My immediate reaction is to compare this not with SQL, but
>> >with the new XSL "tree construction" facilities which
>> >essentially provide an XML transformation language. I don't
>> >have time to do a detailed point-by-point comparison but it
>> >would certainly be a useful exercise. Conceptually they have
>> >many similarities but there are many points of detail where
>> >one is stronger than the other. I would think it is entirely
>> >possible to devise a language that combines the power of
>> >both without a significant loss of usability.
>> 
>> In fact, at Metastructures 98 I presented a language called XQL that uses a
>> syntax very similar to XSL Patterns. This language was developed primarily
>> by Joe Lapp of webMethods and me. Like XML-QL, XQL is declarative.
>> 
>> One of the significant differences between XML-QL and XQL is that XQL can
>> do both hierarchy and sequence. The fundamental structural relationships in
>> XQL are:
>> 
>> o       hierarchy
>> 
>>         o       parent/child
>>         o       ancestor/descendant
>> 
>> o       sequence
>> 
>>         o       precedes
>>         o       immediately precedes
>> 
>> o       position
>> 
>>         o       subscripts
>>         o       ranges
>> 
>> I think sequence is pretty important in documents, though it is not
>> important in many data-oriented systems. XML-QL's heritage in relational
>> theory has caused it to ignore sequence.
>> 
>> Jonathan
>> 
>> jonathan@texcel.no
>> Texcel Research
>> http://www.texcel.no
>> 
>> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>> (un)subscribe xml-dev
>> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>> subscribe xml-dev-digest
>> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Wed Sep  2 15:49:58 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:21 2004
Subject: XBE (was Re: XML-QL)
Message-ID: <000c01bdd677$2e712060$2ee044c6@arcot-main>

>Interesting approach -- sort of the inverse of XSL -- but I suspect that in
>order to be useful it won't be so simple.  In its simplest form, you'd only
>return the elements that you provided, serving only as an existence test,
>returning no other information.  QBE at a minimum requires wildcards, and
>it would be interesting to find an appropriate set of wildcards.


At the simple level, fuzzy match of attribute values and text contents
should work pretty well.  Any missing attribute or elements can be treated
as wildcards.

At the complex level, scripting language for matching should work pretty
well.

Here is an example:

<order department="electronics" salesperson="bob">
    <descr>CD-ROM</descr>
    <quantity>
        <xbe:script>
            quantity &gt; 10
        </xbe:script>
    </quantity>
    <comment/>
</order>

Above XBE should return all order records for sales made at the electronics
department by Bob where product description 'contains' the string "CD-ROM"
and quantity exceeds 10.  The scripting language refers to context elements
and attributes by name (order.department for attribute).

There are obvious rough spots but I think it has promises.  Whether or not I
will invest time and effort into XBE is another question of course.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Wed Sep  2 16:06:40 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:21 2004
Subject: Another XML query language (was Re: XML-QL)
Message-ID: <3.0.32.19980902100707.00ab9c84@gw1.webmethods.com>


Oops, here are some corrections.  Sorry about that.  Working too much:

At 09:47 AM 9/2/98 -0400, Joe Lapp wrote:
>[...]
>We've been using it in our products for for than a year and a half now.
                                         ^^^ more

>[...]WIDL is a language for crossing the tree/data-structure barrier that
>exists between flow object representations and programming languages.
                ^^^^^^^^^^^ grove

--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jgarrett at navix.net  Wed Sep  2 16:13:34 1998
From: jgarrett at navix.net (Jim Garrett (NAVIX))
Date: Mon Jun  7 17:04:21 2004
Subject: Cement Shoes for XML
Message-ID: <000401bdd679$757264e0$14c8c8c8@jgnt40>

Simon:
So what do we(the XML users) do now...?

And how do we keep XML from the getting a new
pair of these shoes...??

This is most alarming....!!!

I had "Hiiiiiigh hopes" for Ubiquitous XML..!!!

JDGarrett

>Some of you may be interested in my latest essay, "Cement Shoes for XML?",
>which explores some reasons why client-side XML support, at least in the
>dominant browsers, may be slow in coming or perhaps worse.  Hopefully, I
>haven't said anything too rude, but it's the result of several weeks of
>frustration brought on by reading articles about how slow and/or broken the
>XML
>development efforts in the current browsers appear to be.
>
>Developers planning on using server-side tools to send XML to browser
clients
>will probably find it most alarming. (Myself included.)
>
>The essay is posted at:
>http://www.simonstl.com/articles/cement.htm
>
>
>
>Simon St.Laurent
>Dynamic HTML: A Primer / XML: A Primer
>Cookies / Sharing Bandwidth (November)
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep  2 16:27:34 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:21 2004
Subject: PARAMETER entities & WFness
Message-ID: <3.0.32.19980902072809.00f6828c@207.34.179.21>

At 11:38 AM 9/2/98 +0100, Peter Jones wrote:
>parameter entities really only have one use for us in our
>well-formed-but-not-valid documents: to alter the order in which entity
>declarations are parsed?

I don't think parameter entities have any use whatsoever in any 
circumstances unless you're validating.  Reason: a non-validating
processor doesn't have to read them. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gcsfred at magma.ca  Wed Sep  2 16:53:14 1998
From: gcsfred at magma.ca (Gustavo Frederico)
Date: Mon Jun  7 17:04:21 2004
Subject: Java IDEs and XML
Message-ID: <199809021305.JAA01216@mag1.magmacom.com>

   I am prototyping a new application, and it will be using WebObjects.
WebObjects has a development environment, a JVM and does a bridge
between Java objects and the database. I was thinking about using XML for
handling the contents of news, event announcements and for storing metadata
about documents.
   Would that be better than having the application to talk directly to
the database, updating directly database entries? 
I think the general answer is: depends on the application. In this case,
I like the idea of using XML because it would easyly handle documents metadata,
and I would easyly handle news stuff with something like CDF.
   On the other hand, the whole development environment
is driven to model objects, map them to the database and generate Java code.
And if I don't use any database at all for those sections of the project,
how am I going to handle concurrent updates, data (or document, in this case)
consistency on blablablaML documents? Is this responsability over my Java
code now and not on the database?
   Do you think XML is a good alternative to this application?
   I think that would be a problem that aplies to other Java tools also,
although I don't know deeply other Java IDE.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gcsfred at magma.ca  Wed Sep  2 16:53:16 1998
From: gcsfred at magma.ca (Gustavo Frederico)
Date: Mon Jun  7 17:04:21 2004
Subject: Java IDEs and XML
Message-ID: <199809021351.JAA16290@mag1.magmacom.com>

   I am prototyping a new application, and it will be using WebObjects.
WebObjects has a development environment, a JVM and does a bridge
between Java objects and the database. I was thinking about using XML for
handling the contents of news, event announcements and for storing metadata
about documents.
   Would that be better than having the application to talk directly to
the database, updating directly database entries? 
I think the general answer is: depends on the application. In this case,
I like the idea of using XML because it would easyly handle documents metadata,
and I would easyly handle news stuff with something like CDF.
   On the other hand, the whole development environment
is driven to model objects, map them to the database and generate Java code.
And if I don't use any database at all for those sections of the project,
how am I going to handle concurrent updates, data (or document, in this case)
consistency on blablablaML documents? Is this responsability over my Java
code now and not on the database?
   Do you think XML is a good alternative to this application?
   I think that would be a problem that aplies to other Java tools also,
although I don't know deeply other Java IDE.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gustavo.frederico at ott.montage.ca  Wed Sep  2 16:56:45 1998
From: gustavo.frederico at ott.montage.ca (Gustavo Frederico)
Date: Mon Jun  7 17:04:21 2004
Subject: Java IDEs and XML (again)
Message-ID: <SIMEON.9809021005.B@Galdikas.ott.montage.ca>

(sorry if you are reading twice. I got a problem with my 
SMTP server)
   I am prototyping a new application, and it will be using WebObjects.
WebObjects has a development environment, a JVM and does a bridge
between Java objects and the database. I was thinking about using XML for
handling the contents of news, event announcements and for storing metadata
about documents.
   Would that be better than having the application to talk directly to
the database, updating directly database entries? 
I think the general answer is: depends on the application. In this case,
I like the idea of using XML because it would easyly handle documents metadata,
and I would easyly handle news stuff with something like CDF.
   On the other hand, the whole development environment
is driven to model objects, map them to the database and generate Java code.
And if I don't use any database at all for those sections of the project,
how am I going to handle concurrent updates, data (or document, in this case)
consistency on blablablaML documents? Is this responsability over my Java
code now and not on the database?
   Do you think XML is a good alternative to this application?
   I think that would be a problem that aplies to other Java tools also,
although I don't know deeply other Java IDE.

----------------------
Gustavo Frederico
gustavo.frederico@ott.montage.ca


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Wed Sep  2 19:16:41 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:21 2004
Subject: State of browsers/JUMBO 
In-Reply-To: <005501bdd664$28bb9e60$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.1.16.19980901181721.289f6b3e@pop3.demon.co.uk>

At 12:23 02/09/98 +0100, Michael Kay wrote:
>
>Some observations:
>
>- I tried (more than once) downloading Jumbo and exploring
>what it could do for me. I didn't make much progress.
>There's clearly an enormous amount of functionality there,
>but I found it very hard to know where to start. I've had

Fully accepted. This is true of most enthusiast software. I had expected
JUMBO to be overtaken.  I need it for my own purposes and offer it to others.
[A new version should be out tonight - but it's still 'alpha'].

>the same experience with downloads of other XML software
>(most recently XML Toolkit), and I dare say others have had
>the same experience with my own SAXON library.
>
>- I suspect that as a community the thing we are desperately
>lacking is a commonly understood architecture. We're all

Certainly. The architecture that JUMBO needs is the ability to
render/process/validate on a per-object basis. This is also what XXX does.
Steve Withal uses verify() [I think] while I use what I think is a similar
function processXML(). Something that runs code on an endElement event to:
	- customise/transform the internals
	- verify (possibly includes DTD-like validation)
	- render in interactive form.
For chemistry I have to have processing for data types (a la DCD) and
specials such as <molecule>

>writing bits of code that do useful things with XML, but we
>don't have a clear vision as to what the total set of
>capabilities should look like or how its components should
>relate to each other. I think this is why it's hard to take
>something like Jumbo and discover quickly what pieces of the
>jigsaw it supplies.

I'd be delighted if there were others who have this need and we can work
out an API. This is the most obvious area for me - and I would assume many
others. Or is everyone quite happy to wait for stylesheets - in which case
I am in a minority as stylesheets and ECMAScript won't do much useful for
chemistry.
>
>- Having all these people produce free software is great,
>but the downside is that most of it was written to satisfy
>the intellectual creativity and/or parochial application
>requirements of the individual author, which means that the
>boring parts of software development, like working out who
>the users are and writing good task-oriented documentation
>to meet their needs, have been sadly neglected. Perhaps this
>is why real product developers like Microsoft seem to be
>slow. Fred Brooks, I recall, said that writing a one-off
>program is one-tenth the effort of producing a software
>product that does the same thing.

No question. But the enthusiast community has - on occasion - shown it can
be done. I don't see why it couldn't be attempted here. 
>
>Regarding Simon's essay, I don't share his pessimism. I
>don't personally regard client-side browser support for XML
>as particularly urgent, I'm quite happy to do rendition
>server-side either on demand, or in many cases at site

If 'rendition' means rendering for humans to read then we can use PDF. The
points of client side functionality for me are that:
	- we can carry out operations without troubling the server. Examples are
interactive graphics (my background is molecular graphics). Holding the
displayTree in the server for processing is untenable for many operations.
And my graphics have to be intelligent enough to be coupled to a data model
	- we may not even *have* a server (some people will interact with XML
documents disconnected from a remote server)
	- we may wish to interact with the local resources

I still feel that XML is suited to many operations other than sending
human-readable non-interactive text over the WWW. For example I could see
it as a way of building a MOO (or - more ambitiously - interactive games)
in a platform-independent manner. You can't do this well in HTML. 

>generation time. One reason is that the client-side model
>(along with XLink and XSL) seems to assume that the web of
>XML documents has the same topology as the web presented to
>the user, which I think grossly underexploits the ability of
>XML to separate the information structure from the user
>view. I think a much more valuable development would be the
>integration of XML with database technology. (And in
>practice, I'm actually using XML mainly for "EDI" style
>applications. )

Surely one thing we need is form-like data entry implemented on the client
side. [Using XML to generate HTML forms surely misses the point.] So I have
been experimenting with client-side data authoring tools. XML has great
potential as an authoring tool for specialist or complex domains - again I
see little evidence that people are addressing this.

I shall keep appealing for enthusiasts and we'll see what turns up. Doesn't
need many. Maybe they don't yet come to XML-DEV...

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep  2 21:01:31 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:21 2004
Subject: PARAMETER entities & WFness
In-Reply-To: <29AA5A0E3A0CD21196F300A0C9D8575C036025@WROX3> from "Peter Jones" at Sep 2, 98 11:38:47 am
Message-ID: <199809021904.PAA15517@locke.ccil.org>

Peter Jones scripsit:

> Since for well-formed but not valid docs I can't rely on external
> parameter entities to be read and included, is it true that
> parameter entities really only have one use for us in our
> well-formed-but-not-valid documents: to alter the order in which entity
> declarations are parsed?

Actually, that won't work either.  Parameter entities have no use
at all in documents that must be processed by minimally conformant
parsers.

The good news is that almost all parsers actually do process external entity
references, both general and parameter.

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Wed Sep  2 22:00:44 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:22 2004
Subject: auto table mapping
Message-ID: <v03102808b21352a3d27b@[203.23.215.116]>

The context is our storing a report-writer output into a standalone
document so the same engine can later reparse the document, and allow user
editing.

We have a tree of report-writer objects, almost all of which link to one or
more database views.

An obvious shortcut is to put a lot of specialist attributes into our DTD
so that our structures are easily rebuilt.

I'm playing with the idea of having our parser able to recreate such a tree
in a fairly generic manner (ie: handle other people's XML :-).

The essential issue is the flattening of the bottom level. Instead of a
general tree, with leaves containing PCDATA (or b64-encoded images) we want
to recognise the situation where a number of sibling leaves are actually
fields in a single record.

Thus, two levels of the tree map to an internal 'table' object.

Nested elements which contain other PCDATA leaves would be modelled as
related tables (as an OORDBMS OOFILE models relationships as well as
tables).

The key assumptions here are
1) we will auto-generate some internal ID to track the relationships (I
haven't time for the complexities of XML-Data, and join keys may not be in
the exported data)

2) some portion of the DTD will fall into a pattern of 'tables' with
possible nested tables

3) inside this pattern, we may require you to explicitly bracket multiple
occurrences of an element inside another element so we can recognise a
nested table (ie: <student> can't contain multiple <grade> elements
directly, but needs <student> <results> <grade>)

Is there interest in discussing this further or should I just carry on and
implement something merely good enough for our immediate load/save
operation?

Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From fabien at girardin.org  Wed Sep  2 23:27:32 1998
From: fabien at girardin.org (Fabien Girardin)
Date: Mon Jun  7 17:04:22 2004
Subject: XML and servlets
Message-ID: <35EDB7FD.8E26C7D4@girardin.org>

Hi all,

I have done some experiments where XML documents are processed on the
client side. But I think, for my project, it would be more interesting
to parse these files on the server side using a servlet.

I am using MSXML, but I can't load the XML file in the servlet (while a
had no problems doing it on my prior experiments in applets and
applications).

Has anybody have tried the same kind of experiment or could point me to
some valuable documentation or code to help me solve this problem.

-- Fabien


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep  2 23:52:09 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:22 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on 
 XML file given to MSXSL as input?)
References: <009e01bdd670$e935a490$2ee044c6@arcot-main>
Message-ID: <35EDBE44.1DA8781B@infinet.com>

Don Park wrote:

> >I would suspect that the DOM package provided by Don Park has similiar
> >performance and memory consumption.  Your best bet would probably be to
> look at
> >an XSL package which takes a DOM tree of your XML data, and a DOM tree of
> an XSL
> >stylesheet and spits out the content.  That way you are not stuck with an
> MS,
> >IBM, Oracle, or whatever implementation that you are not happy with.
>
> About 10 seconds and 10 meg of memory to convert each meg of XML into DOM
> with JIT enabled.

For my implementation, for ot.xml (a 4 meg document) only about 1-2 megs of RAM
is used to store the 4 meg file in RAM due to all Names being cached at the
parser level.  It also takes only 10-12 seconds with a P-120 running Symantec's
JIT for JDK 1.2 b4 to build the entire DOM tree.  For spitting out the DOM tree
(and normalizing all the Text nodes) it takes about 15-20 seconds of which 5
seconds is spent normalizing text nodes and most of the rest of this time is
actually spent in a brute force search and replace method that scans all
character data and attribute values and replaces any occurrences of entity values
with entity names.  This can be very expensive but I know no other way around it.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From kent at trl.ibm.co.jp  Thu Sep  3 02:37:09 1998
From: kent at trl.ibm.co.jp (TAMURA Kent)
Date: Mon Jun  7 17:04:22 2004
Subject: Update: IBM XML for Java version 1.0.9
Message-ID: <199809030035.JAA38798@ns.trl.ibm.com>


http://www.alphaworks.ibm.com/formula/xml

XML for Java, an XML processor written in Java, has been
updated.  It runs on Java 1.1.x and some samples require Swing
1.0.x.

CHANGES:
  o DOM-19980818 Proposed Recommendation support
  o An experimental implementation of attribute-based namespace
    (WD-xml-names-19980802)
    PI-based namespace was removed.
  o All sample programs were moved to "samples." package and
    stored in xml4jSamples_1_0_9.jar.
  o etc.

I'm sorry that a fatal bug is already found:
  o A parser crashes by PIs with empty data like <?foo?>.

-- 
TAMURA, Kent  @ Tokyo Research Laboratory, IBM Japan


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From macherius at darmstadt.gmd.de  Thu Sep  3 02:45:41 1998
From: macherius at darmstadt.gmd.de (Ingo Macherius)
Date: Mon Jun  7 17:04:22 2004
Subject: XML tools and big documents
In-Reply-To: <199809012057.QAA01820@unready.megginson.com>
References: <199809011837.UAA08308@sonne.darmstadt.gmd.de>
Message-ID: <199809030044.CAA05059@sonne.darmstadt.gmd.de>

David Megginson <david@megginson.com> wrote at 1 Sep 98, 16:57:
> I do not need to build a tree for the whole document; instead [...] 
> dump it into my SQL database [...]. 
=> Put it into an RDBMS

> [...] it makes more sense to build the specialised object tree
> directly from the event stream rather than building a DOM tree
=> Put it into an OODBMS

"Michael Kay" <M.H.Kay@eng.icl.co.uk> wrote at Wed, 2 Sep 1998 10:31:41 +0100:
> [...] storing the Java serialization of DOM-like models on disk [...]
> takes a lot longer than reparsing original XML
=> Put it in a file and reparse

So when it gets big, use a database ? Did I get this wrong and XML 
was never ment to be a storage paradigm ? 

Anyway, I can affirm Michael's results.
We implemented an experimental database storage for SGML with jjc's 
SP and Informix's IUS. It generalizes something similar to David's 
second suggestion. Object-aggregation is done by marking the content 
of specified element types (e.g. <act> in a Shakespeare play) to be 
stored unparsed. When it comes to queries it is reparsed on the fly. 
Kind of automatic object generation.
Queries turned out to become slow when granularity gets less coarse. 
Most navigations trigger child/sibling lookups, which trigger object 
ID table lookups. That's at least one SQL statement firing for every 
DOM navigation call. Caching helps, but doesn't really the problem. 
Trees in RDBM are no fun. Michael writes they are no fun in OODB, 
too. IMHO the good timings in in-memory DOM implementations result 
from the fact that looking up children is a cheap operation. In 
current DB systems it's not cheap at all.

Is anybody aware of literature for efficient addressing in trees ? 
This should help both in-memory DOMs and DBs.

A bit disillusioned,
	++im

--
Ingo Macherius//Dolivostrasse 15//D-64293 Darmstadt//+49-6151-869-882
GMD-IPSI German National Research Center for Information Technology
mailto:macherius@gmd.de http://www.darmstadt.gmd.de/~inim/
Information!=Knowledge!=Wisdom!=Truth!=Beauty!=Love!=Music==BEST (Zappa)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Thu Sep  3 02:48:13 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:22 2004
Subject: XML tools and big documents
Message-ID: <005c01bdd6d3$04ed3560$2ee044c6@arcot-main>

>For my implementation, for ot.xml (a 4 meg document) only about 1-2 megs of
RAM
>is used to store the 4 meg file in RAM due to all Names being cached at the
>parser level.  It also takes only 10-12 seconds with a P-120 running
Symantec's

My test results were from running on Atari 800 (just kidding <g>).  My test
machine is Pentium-133 with JDK 1.1.6 with JIT enabled.  Building DOM is a
slow process but there are intermediate forms I am investigating which cuts
down DOM loading drastically.

>JIT for JDK 1.2 b4 to build the entire DOM tree.  For spitting out the DOM
tree
>(and normalizing all the Text nodes) it takes about 15-20 seconds of which
5
>seconds is spent normalizing text nodes and most of the rest of this time
is
>actually spent in a brute force search and replace method that scans all
>character data and attribute values and replaces any occurrences of entity
values
>with entity names.  This can be very expensive but I know no other way
around it.

Why are you normalizing text nodes before writing them out?  Also, blindly
replacing entity values with entity names is error prone.

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep  3 03:01:24 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:22 2004
Subject: XML tools and big documents
In-Reply-To: <199809030044.CAA05059@sonne.darmstadt.gmd.de>
References: <199809011837.UAA08308@sonne.darmstadt.gmd.de>
	<199809012057.QAA01820@unready.megginson.com>
	<199809030044.CAA05059@sonne.darmstadt.gmd.de>
Message-ID: <199809030053.UAA04222@unready.megginson.com>

Ingo Macherius writes:

 > > [...] it makes more sense to build the specialised object tree
 > > directly from the event stream rather than building a DOM tree
 > => Put it into an OODBMS

No, not exactly -- I'm suggesting building an application-specific
object tree, not a generic XML one, and am not particularly concerned
with where it is stored.

 > So when it gets big, use a database ? Did I get this wrong and XML
 > was never ment to be a storage paradigm ?

XML is for interchange -- for simple applications you can use it for
storage (as I do on my notebook), but for larger, multi-user
applications, you probably want to put it into some kind of
specialised storage, if only for the sake of revision and access
control.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eric at hellman.net  Thu Sep  3 05:34:16 1998
From: eric at hellman.net (Eric Hellman)
Date: Mon Jun  7 17:04:22 2004
Subject: Tools to convert Word to XML?
Message-ID: <v04011703b213bced5d71@[192.168.1.1]>

Sounds like what you want is RTF2HTML http://www.sunpack.com/RTF/ (Brand
new version out this week!)
It runs on any platform, and with custom configurations you can do
precisely what you ask for. (You need Word to save as RTF, though.)
Previous version have been very solid.

Eric


>From: Andrew Cogan <andrew@epiphanysoftware.com>
>Date: Tue, 01 Sep 1998 10:28:09 -0700
>Subject: Tools to convert Word to XML?
>
>Can anyone recommend good tools that can convert Word files to XML? I
>don't need tools that claim XML compatibility per se; any utility that
>gives me control over what tag to insert at the beginning of a style and
>at the end of a style would probably suffice. The ability to work with
>Word footnotes is a big plus.
>- --
> Andrew Cogan, Epiphany Software
Eric Hellman
Openly Informatics, Inc.
http://www.openly.com/           Tools for 21st Century Scholarly Publishing

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From yikim at savage.comeng.chungnam.ac.kr  Thu Sep  3 06:17:36 1998
From: yikim at savage.comeng.chungnam.ac.kr (=?euc-kr?B?sei/tcDPKEtpbSBZb3VuZyBJbCk=?=)
Date: Mon Jun  7 17:04:23 2004
Subject: About xml Style tag
Message-ID: <00db01bdd6f0$232a1ee0$bd2cbca8@hero.comeng.chungnam.ac.kr>

Skipped content of type multipart/alternative-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 5665 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980903/bee56526/attachment.gif
From tyler at infinet.com  Thu Sep  3 15:15:46 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
References: <005c01bdd6d3$04ed3560$2ee044c6@arcot-main>
Message-ID: <35EE96CC.B0115079@infinet.com>

Don Park wrote:

> >For my implementation, for ot.xml (a 4 meg document) only about 1-2 megs of
> RAM
> >is used to store the 4 meg file in RAM due to all Names being cached at the
> >parser level.  It also takes only 10-12 seconds with a P-120 running
> Symantec's
>
> My test results were from running on Atari 800 (just kidding <g>).  My test
> machine is Pentium-133 with JDK 1.1.6 with JIT enabled.  Building DOM is a
> slow process but there are intermediate forms I am investigating which cuts
> down DOM loading drastically.

Most of all of the hard work of building the DOM tree is already done by the
parser.  Right now with the current state of Java, the single most important
thing I have found that you can do to speed up building the DOM tree is to only
allocate memory for container structures like Arrays, Vectors, or other utility
Collection classes as needed.  In other words, an leaf-node should not have any
memory allocated for storing children.

As for the memory issue, I have thought about some sort of LZW compression of all
of the text in a document tree.  This would save a lot of memory, but may slow
down building the DOM tree a bit.  Any ideas on this?

> >JIT for JDK 1.2 b4 to build the entire DOM tree.  For spitting out the DOM
> tree
> >(and normalizing all the Text nodes) it takes about 15-20 seconds of which
> 5
> >seconds is spent normalizing text nodes and most of the rest of this time
> is
> >actually spent in a brute force search and replace method that scans all
> >character data and attribute values and replaces any occurrences of entity
> values
> >with entity names.  This can be very expensive but I know no other way
> around it.
>
> Why are you normalizing text nodes before writing them out?  Also, blindly
> replacing entity values with entity names is error prone.

In general I agree with this, and this sort of stuff should be done at the
application level.  Nevertheless, the programmer I feel should have a choice
whether to manually do this or let the formatter do all of the hard work.

As for the text nodes, they do not have to be normalized, just that in my tests I
accounted for this since many people will want to normalize the document tree to
make the output look pretty.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Thu Sep  3 15:45:25 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
Message-ID: <003601bdd73f$b66b51c0$2ee044c6@arcot-main>

>As for the memory issue, I have thought about some sort of LZW compression
of all
>of the text in a document tree.  This would save a lot of memory, but may
slow
>down building the DOM tree a bit.  Any ideas on this?


Your performance will suffer and memory problem still remains.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep  3 16:07:30 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
In-Reply-To: <003601bdd73f$b66b51c0$2ee044c6@arcot-main>
References: <003601bdd73f$b66b51c0$2ee044c6@arcot-main>
Message-ID: <199809031400.KAA06725@unready.megginson.com>

Don Park writes:

 > > As for the memory issue, I have thought about some sort of LZW
 > > compression of all of the text in a document tree.  This would
 > > save a lot of memory, but may slow down building the DOM tree a
 > > bit.  Any ideas on this?
 > 
 > 
 > Your performance will suffer and memory problem still remains.

Agreed.  The overhead comes from the node objects, not from the text.
The biggest hogs can be attributes, especially in the standard SGML
DTDs which often include dozens of defaulted attributes for each
document type.  If you can optimise those (allocating nodes only on
demand and then freeing them as soon as they're not needed), you're
half-way there.

The second biggest hogs are leaf elements which contain only text.  If
you can treat those as special cases and allocate only one object for
each one instead of three (element node, node list, text node), then
you're another quarter of the way there.

PIs , doctype declarations, notations, etc. are rare enough that you
don't gain much by optimising them.  Your mileage on comments, entity
references and CDATA sections may vary, but you're probably best
skipping them or replacing them with their contents when you build the
tree, unless your application has very specialised requirements.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From nigelk at umich.edu  Thu Sep  3 17:01:08 1998
From: nigelk at umich.edu (Nigel Kerr)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
In-Reply-To: David Megginson's message of "Wed, 2 Sep 1998 20:53:57 -0400"
References: <199809011837.UAA08308@sonne.darmstadt.gmd.de> <199809012057.QAA01820@unready.megginson.com> <199809030044.CAA05059@sonne.darmstadt.gmd.de> <199809030053.UAA04222@unready.megginson.com>
Message-ID: <p7gr9xt2qci.fsf@dev.hti.umich.edu>


Quoth David Megginson <david@megginson.com>:

> XML is for interchange -- for simple applications you can use it for
> storage (as I do on my notebook), but for larger, multi-user
> applications, you probably want to put it into some kind of
> specialised storage, if only for the sake of revision and access
> control.

My interest in descriptive markup in general is in describing large,
relatively static (meaning they don't get revised, usually), text
collections: the many fine TEI or EAD encoding projects, for instance,
the various literature collections one can get from Chadwyck-Healey,
and the like (our shop here takes such things and indexes them with
some version or other of OpenText for searching and structured
retrieval).  These are all described by SGML DTD's.

I've seen discussion and work on making an XML DTD to correspond as
closely as possible to the TEI, similarly with EAD.  The kinds of
documents these two DTD's can describe can be arbitrarily large (the
average EAD finding aid is relatively small, but we have a couple
pushing several megabytes).  Are there then folks interested in XML
for things other than interchange?  Authoring, certainly, but also
storage and retrieval of large text collections?

To this end, I have been (in such spare time as i have) tinkering
about with Mr. Clark's XP API (com.jclark.xml.tok, mostly) to write an
application that will allow me to attach the logical element structure
to offsets in the storage entity, so that I can consider the logical
structure's relationship to points in the text without reparsing the
document.  I want to be able to ask questions like:

	"what's the most immediate containing element of offset X in
	file Y?"

	"traverse up the logical structure from offset X until a DIV
	element with a HEAD is found, and return me the offsets of
	that HEAD"

Exact expression language is, uh, gee.  These are the kinds of
questions we could ask with "some XML query language", but if i have a
gigabyte or so of variously-structured English text marked up this
way, i really don't want to have to parse the document entity just to
answer these kinds of simple questions.  This is a weak specification
of what I'm trying to do, i realize.  (this all largely because i am
disatisfied with the limited information of the logical tree that
OpenText's sgmlrgnXX gives... ).

Anyone else here interested in these kinds of problems, and using XML
tools on them?


Nigel Kerr				              nigelk@umich.edu
Digital Library Production Services         http://www.umdl.umich.edu/
University of Michigan

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep  3 17:14:12 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
References: <003601bdd73f$b66b51c0$2ee044c6@arcot-main>
Message-ID: <35EEB289.ADF0AEC8@infinet.com>

Don Park wrote:

> >As for the memory issue, I have thought about some sort of LZW compression
> of all
> >of the text in a document tree.  This would save a lot of memory, but may
> slow
> >down building the DOM tree a bit.  Any ideas on this?
>
> Your performance will suffer and memory problem still remains.
>
> Don

Well the memory problem will remain but it could be reduced significantly for
large redundant documents.  Some people have claimed they get 97% compression of
some XML documents when using popular compression utilities like Winzip.
Reducing memory overhead with Names can be done at the parser level and actually
is implemented in some fashion for every major parser I know of.  As for
character content, the idea centers largely around each text node only allocating
a new String if the application requests it.  The String however is created by
looking up all of the character fragments stored in some sort of symbol table and
then parsing the String.  Then the String would be cached.  Nevertheless if the
text node is mutated in any way, the String reference is then set to null.

On second thought this may not degrade performance too much as you will be
getting the added benefit of only needing to allocate memory to store an integer
array (the sequence of symbols used to parse the string from the symbol table)
instead of a using a String which allocates two objects, the String object
itself, and the character array contained within it.  Of course this optimization
is Java specific and in languages like C++ or Eiffel where heap based objects are
not as expensive to deal with, this may be counter-productive.  Who knows it
might be counter-productive in Java.  I guess there is only one way to find out
unless someone has already tried this and has some insight they can lend.

Most parsers and parser interfaces like SAX present the character data as
characters and not as Strings.  So building the DOM tree without ever needing to
create any new String objects initially is very much doable.

I guess the real question is: should the DOM even be used for multi-megabyte
documents in the first place.  Initially I thought of XML as something that would
be used for two main purposes: EDI like web transactions and as a replacement for
HTML.  It seems like people now are using it for so many other things, many of
which may not be suitable for XML's abilities.  I guess the responsibility of XML
tools developers is to provide the most abstract functionality possible so people
can do many more things with XML than what it was intended for.  Nevertheless, I
think it is also a responsibility not to sell XML as the do-all solution of every
computing problem known to man.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep  3 17:24:56 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
References: <003601bdd73f$b66b51c0$2ee044c6@arcot-main> <199809031400.KAA06725@unready.megginson.com>
Message-ID: <35EEB507.1FE6FF1E@infinet.com>

David Megginson wrote:

> Don Park writes:
>
>  > > As for the memory issue, I have thought about some sort of LZW
>  > > compression of all of the text in a document tree.  This would
>  > > save a lot of memory, but may slow down building the DOM tree a
>  > > bit.  Any ideas on this?
>  >
>  >
>  > Your performance will suffer and memory problem still remains.
>
> Agreed.  The overhead comes from the node objects, not from the text.
> The biggest hogs can be attributes, especially in the standard SGML
> DTDs which often include dozens of defaulted attributes for each
> document type.  If you can optimise those (allocating nodes only on
> demand and then freeing them as soon as they're not needed), you're
> half-way there.
>
> The second biggest hogs are leaf elements which contain only text.  If
> you can treat those as special cases and allocate only one object for
> each one instead of three (element node, node list, text node), then
> you're another quarter of the way there.

Very true.  However, in Java at least you can get around allocating a new object
for the node list by having your Node implementation also implement the NodeList
implementation as well.  Only allocate a buffer to store the children as needed.
You can do the same thing with the Element Node with regard to attributes.  This
saves a lot of memory and heap-based object allocation that you would have to do
otherwise.  Nevertheless, in Java allocating raw Objects is a memory hog to begin
with.

> PIs , doctype declarations, notations, etc. are rare enough that you
> don't gain much by optimising them.  Your mileage on comments, entity
> references and CDATA sections may vary, but you're probably best
> skipping them or replacing them with their contents when you build the
> tree, unless your application has very specialised requirements.

This is very true.  For large documents both heavily document oriented or
transaction oriented I still think that compressing all of the text in the
document tree may have some promise.  I guess before spending any more time
talking about it, I should spend the necessary hours to just do it.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From robin at ACADCOMP.SIL.ORG  Thu Sep  3 17:47:35 1998
From: robin at ACADCOMP.SIL.ORG (Robin Cover)
Date: Mon Jun  7 17:04:23 2004
Subject: XML Website - new URL
Message-ID: <199809031555.KAA25626@ACADCOMP.SIL.ORG>

The SGML/XML Web Page now has a new URL:

          http://www.oasis-open.org/cover/

Readers are asked to change links and bookmarks to the new URL
- especially those links on prominent Web pages.  The Web site
hierarchy remains substantially unchanged, so a simple string
replacement of 'http://www.oasis-open.org/cover' for
'http://www.sil.org/sgml' should suffice in most cases.  The
surface text in links should be edited to remove reference
to the previous host site.

Formerly sponsored by SIL and SoftQuad Inc., this online
database for SGML/XML and related standards is now hosted
by OASIS (Organization for the Advancement of Structured
Information Standards).  OASIS is a non-profit international
consortium dedicated to the promotion of structured information
processing standards, especially the SGML/XML family of languages,
including XLL, XSL, DSSSL, HyTime, HTML, CGM, STEP, and others.

Under OASIS sponsorship, Robin Cover's SGML/XML Web Page
will remain an industry-neutral resource.  It aims to provide a
comprehensive and cumulative online database containing reference
information and software pertaining to SGML/XML and related
standards.

Sincere gratitude is hereby expressed to the readers of the
SGML/XML Web Page for their support over the years - providing
information about updates and new resources.  Your continued
participation will be greatly appreciated.

-------------------------------------------------------------------------
Robin Cover                      Email: robin@acadcomp.sil.org
6634 Sarah Drive           
Dallas, TX  75236  USA          >>> The SGML/XML Web Page <<<
Tel: +1 (972) 296-1783 (h)     http://www.oasis-open.org/cover/
Tel: +1 (972) 708-7346 (w)
FAX: +1 (972) 708-7380
=========================================================================

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Thu Sep  3 18:08:17 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:23 2004
Subject: XML tools and big documents
In-Reply-To: Nigel Kerr's message of 03 Sep 1998 11:00:45 -0400
References: <199809011837.UAA08308@sonne.darmstadt.gmd.de> <199809012057.QAA01820@unready.megginson.com> <199809030044.CAA05059@sonne.darmstadt.gmd.de> <199809030053.UAA04222@unready.megginson.com> <p7gr9xt2qci.fsf@dev.hti.umich.edu>
Message-ID: <f5bg1e9ji1w.fsf@cogsci.ed.ac.uk>

Nigel Kerr <nigelk@umich.edu> writes:

> 	"what's the most immediate containing element of offset X in
> 	file Y?"
> 
> 	"traverse up the logical structure from offset X until a DIV
> 	element with a HEAD is found, and return me the offsets of
> 	that HEAD"
> 
> Exact expression language is, uh, gee.  These are the kinds of
> questions we could ask with "some XML query language", but if i have a
> gigabyte or so of variously-structured English text marked up this
> way, i really don't want to have to parse the document entity just to
> answer these kinds of simple questions.  This is a weak specification
> of what I'm trying to do, i realize.  (this all largely because i am

Our LT XML tool set and API were designed for precisely this sort of
application (we regularly work with >1GB language SGML-encoded corpora
such as the BNC).  We get good performance because

1) Our parser is written in C, our search and retrieval tools use it
   directly via a stream-based API, only custom UI tends to get
   written in a scripting language which looks at whole trees;

2) We only produce tree fragments when we get to the interesting bits:
   our query processor is optimised to avoid building large amounts of
   tree unnecessarily;

3) For REALLY big datasets, we do produce and use offset-based
   indices.

For more information, see http://www.ltg.ed.ac.uk/software/xml/.

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Thu Sep  3 18:22:32 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:23 2004
Subject: Browser support practicalities
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036046@WROX3>

Can anyone tell me where I can get the best info on just what Microsoft
or Netscape's v.4+ web browsers are capable of (or not) w.r.t. XML?

Thanks,

Peter Jones
WebDev Technical Editor
Wrox Press
mailto:peterj@wrox.com
***************
Wrox Press UK Ltd.
http://www.wrox.co.uk
Tel 44 121 706 6826
Fax 44 121 706 2967


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Daniel.Brickley at bristol.ac.uk  Thu Sep  3 18:33:11 1998
From: Daniel.Brickley at bristol.ac.uk (Dan Brickley)
Date: Mon Jun  7 17:04:23 2004
Subject: Browser support practicalities
In-Reply-To: <29AA5A0E3A0CD21196F300A0C9D8575C036046@WROX3>
Message-ID: <Pine.GHP.4.02A.9809031726220.9942-100000@mail.ilrt.bris.ac.uk>


On Thu, 3 Sep 1998, Peter Jones wrote:

> Can anyone tell me where I can get the best info on just what Microsoft
> or Netscape's v.4+ web browsers are capable of (or not) w.r.t. XML?

There's an overview of Netscape's v.5 plans on their Mozilla site,
http://www.mozilla.org/rdf/doc/xml.html
(you can probably find marketing PR blurb on their netscape.com site
too...) I don't think v4.5 has much except the rdf-based smartbrowsing
'related links' system. 

Dan


--
Daniel.Brickley@bristol.ac.uk                           
Institute for Learning and Research Technology   http://www.ilrt.bris.ac.uk/
University of Bristol,  Bristol BS8 1TN, UK.     tel: +44(0)117 9288478


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Thu Sep  3 18:40:08 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:23 2004
Subject: Browser support practicalities
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036047@WROX3>

Thanks, but I've seen that already. I should have been more specific.
I'm after information about just how closely the browser makers have
followed the DOM (now Proposed recommendation) or not.

> -----Original Message-----
> From:	Dan Brickley [SMTP:Daniel.Brickley@bristol.ac.uk]
> Sent:	Thursday, September 03, 1998 5:33 PM
> To:	'XML-DEV'
> Subject:	Re: Browser support practicalities
> 
> 
> On Thu, 3 Sep 1998, Peter Jones wrote:
> 
> > Can anyone tell me where I can get the best info on just what
> Microsoft
> > or Netscape's v.4+ web browsers are capable of (or not) w.r.t. XML?
> 
> There's an overview of Netscape's v.5 plans on their Mozilla site,
> http://www.mozilla.org/rdf/doc/xml.html
> (you can probably find marketing PR blurb on their netscape.com site
> too...) I don't think v4.5 has much except the rdf-based smartbrowsing
> 'related links' system. 
> 
> Dan
> 
> 
> 
> 
> --
> Daniel.Brickley@bristol.ac.uk                           
> Institute for Learning and Research Technology
> http://www.ilrt.bris.ac.uk/
> University of Bristol,  Bristol BS8 1TN, UK.     tel: +44(0)117
> 9288478
> 
> 
> 
> xml-dev: A list for W3C XML Developers. To post,
> mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
> message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Thu Sep  3 19:44:00 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:23 2004
Subject: How to put msxml to immediate use
Message-ID: <199809031743.KAA00453@mail-gw.pacbell.net>


  If you're a web author, you've probably already played with msxsl and
seen some neat possibilities. One thing that prevents msxsl from being a
being a valuable tool is the fact that it doesn't expand ENTITY references.

  It's my belief that anyone with a Java development environment could fix
this in an hour (or less).

  Wouldn't it be handy to do something like this...

jview msxml -d lots_of_entities.xml > temp.xml
msxls -i temp.xml -s style.xsl -o final_output.html

  If anyone with VJ++ (or something like it) wants to do the web authoring
community a favor, change the way msxml behaves to print the entity itself
rather than the reference when it's run with the -d option.

  Very brief example...

-ExternalDTD.xml-
<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE body SYSTEM "ExternalDTD.dtd" >
<body>
	<greeting>This is a &foo; &bar; </greeting> 
</body>

-EternalDTD.dtd-
<!ELEMENT body (greeting)>
<!ENTITY foo 'bar'>
<!ENTITY big SYSTEM 'bar.xml'>
<!ELEMENT greeting	(#PCDATA) >

-bar.xml-
Hello World!

  Would anyone be willing to modify msxml to output...

<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE body SYSTEM "ExternalDTD.dtd" >
<body>
	<greeting>This is a bar Hello World! </greeting> 
</body>

  ...instead of...

<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE body SYSTEM "ExternalDTD.dtd" >
<body>
	<greeting>This is a &foo; &bar; </greeting> 
</body>

  ...which it does now?

  I've done my reading and researched this and I haven't found any set of
tools that will take map (.xml, .xsl) -> .html

  Any help, thoughts or .class files would be greatly appreciated ;)


-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eriblair at mediom.qc.ca  Thu Sep  3 20:00:14 1998
From: eriblair at mediom.qc.ca (Eric Riblair)
Date: Mon Jun  7 17:04:23 2004
Subject: Where can I found Internet Explorer 4.71...
Message-ID: <199809031759.NAA17915@netra.mediom.qc.ca>

Please ... can somebody help me ...

Where can I found Internet Explorer 4.71...

Thanks for any help

?ric Riblair,
Agronome
e-mail: eriblair@mediom.qc.ca

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From wxie at gmswireless.com  Thu Sep  3 20:06:10 1998
From: wxie at gmswireless.com (Weihong Xie)
Date: Mon Jun  7 17:04:23 2004
Subject: parsing XML within HTML files
Message-ID: <002701bdd764$f58dbc10$89fcd8d0@lastexit.gmswireless.com>


hi,

I am developing some servlet application and am looking at XML to see if I
can use it to present dynamic information.

AlL I want to do is in normal HTML files, there will be some customized XML
tags to mark the places where dynamic values will be inserted, so when the
servlet serves those pages, it will provide those values but leave the HTML
text alone. The question is how I can do this, do I need a DTD that defines
HTML and my customized tags or is there any XML parsers understand HTML? I
am new to XML, so any advice is welcome.

Thanks.

Weihong.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From wxie at gmswireless.com  Thu Sep  3 23:22:25 1998
From: wxie at gmswireless.com (Weihong Xie)
Date: Mon Jun  7 17:04:23 2004
Subject: parsing XML within HTML files
Message-ID: <002e01bdd780$62e5b950$89fcd8d0@lastexit.gmswireless.com>

I posted this before and was returned by mail failure. Sorry if duplicated.

----------------------------------------------------------------------------
--
hi,

I am developing some servlet application and am looking at XML to see if I
can use it to present dynamic information.

AlL I want to do is in normal HTML files, there will be some customized XML
tags to mark the places where dynamic values will be inserted, so when the
servlet serves those pages, it will provide those values but leave the HTML
text alone. The question is how I can do this, do I need a DTD that defines
HTML and my customized tags or is there any XML parsers understand HTML? I
am new to XML, so any advice is welcome.

Thanks.

Weihong.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Curt.Arnold at hyprotech.com  Fri Sep  4 01:37:09 1998
From: Curt.Arnold at hyprotech.com (Arnold, Curt)
Date: Mon Jun  7 17:04:24 2004
Subject: parsing XML within HTML files
Message-ID: <E0zEivr-0005fK-00@punch.ic.ac.uk>

"Presenting XML" by Light discusses how to write HTML so that it is also
valid XML.  Basically you have to do things like explicitly close your
paragraphs, i.e.

<p>
This is a paragraph.  The following close paragraph tag is legal HTML
(and required for it to be legal XML) but no HTML authoring tool would
ever add it.
</p> 

If that is tolerable, you could insert you special tags within the
document that you replace.

<p>
This is a <subst src="url of source">replace this</subst>
</p>

You wouldn't need to have a HTML DTD (but you could if you wanted).

However, given all that, if you aren't interested in extracting any
meaning from the XML (like the URL in the same), It would seem easier to
take the approach Microsoft did with VB 6 and just use a DIV tag within
your normal HTML stream and have your servlet scan and replace the
"<DIV></DIV>" blocks it recognizes.

<p>This is a <DIV ID="REPLACEMENT1"></DIV>


-----Original Message-----
From: Weihong Xie [mailto:wxie@gmswireless.com]
Sent: Thursday, September 03, 1998 4:18 PM
To: Xml-Dev
Subject: parsing XML within HTML files


I posted this before and was returned by mail failure. Sorry if
duplicated.

------------------------------------------------------------------------
----
--
hi,

I am developing some servlet application and am looking at XML to see if
I
can use it to present dynamic information.

AlL I want to do is in normal HTML files, there will be some customized
XML
tags to mark the places where dynamic values will be inserted, so when
the
servlet serves those pages, it will provide those values but leave the
HTML
text alone. The question is how I can do this, do I need a DTD that
defines
HTML and my customized tags or is there any XML parsers understand HTML?
I
am new to XML, so any advice is welcome.

Thanks.

Weihong.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep  4 02:36:43 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:24 2004
Subject: ANNOUNCE: JUMBO2a2 and xml-cml.org
Message-ID: <3.0.1.16.19980903013822.45f7e46c@pop3.demon.co.uk>

This is to announce the release of the latest snapshot of JUMBO2 (alpha2)
and also the page it's located at: xml-cml.org.

xml-cml.org is the home page of the nascent Chemical Markup Forum,
metamorphosing from the Open Molecular Foundation. Henry Rzepa, Steve Zara
and I are involved in getting this going - hopefully more info later.

JUMBO2 is an element-oriented XML-browser, in Java/Swing. Its source is
freely available with the normal sort of copyright. The architecture tries
to follow the specs and anticipate the possible XML-related APIs. The
tension between time available and achievement is evident; there are many
bits not fully finished, but I felt there was a sufficient shortage of
'browsers' that you will forgive the buglets.

JUMBO2 is offered to the community as a catalyst to spawn the creation of
high-quality client-side tools ('browsers'). Ideally we converge towards a
set of core APIs and all that remains of my code will be the
elephant-specific stuff. I have already started to get some offers of help.
At present JUMBO2:
	- uses SAX and a range of parsers
	- uses Swing (tree, table, text, and various windows/widgets). Of these
the text is the worst to make work - not just my opinion.
	- has a namespace kludge (<?jumbo:namespace?> to provide per-element
functionality. This allows a variety of client-side processes:
		- validation (e.g. for data values)
		- transformation of complex objects (e.g. molecules)
		- creation of element-specific rendering (forms, etc.)
		- vector graphics (embryonic, but so is Java until we get Java2D - I'm
told JDK1.2beta4 is rather buggy)
		- other authoring/editing functionality
	- has a per-element stylesheet table editable by the reader, and a number
of default styles
	- can analyse the elements/attributes/values in the tree and navigate to them
	- is not well documented

The latest *.jar is mounted and the *.java should be posted soon. Follow
the WWW site for incremental announcements. I haven't distributed much in
the way of ex maples - there are some simple data files including graphics.
Jon Bosak's Shakespeare works very well. I hope to develop the styletable
approach to support things like rec.xml - you are welcome to play.

**Since this is an alpha release I'd be very grateful for bug-reports, but
not beginners' questions.**

[I had expected that JUMBO would have been overtaken by commercial
client-side browsers by now, but get the sad impression that client-side
XML is not being addressed as excitingly as it could. (The idea of using
XML server-side to generate PDF is underwhelming as a global revolution).
There are so many really exciting things we can do with client-side tools -
I would be very grateful to have more offers of help. JUMBO is critical for
some of the things I need to do and I haven't yet seen much alternative. At
the least I hope we can come up with some useful APIs.]

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mikeb at amgen.com  Fri Sep  4 02:54:39 1998
From: mikeb at amgen.com (Michael Brennan)
Date: Mon Jun  7 17:04:24 2004
Subject: Tools to convert Word to XML?
References: <35EC2EA8.E65CF9C1@epiphanysoftware.com>
Message-ID: <35EF3A23.D9B460E5@amgen.com>

Andrew Cogan wrote:
> 
> Can anyone recommend good tools that can convert Word files to XML? I
> don't need tools that claim XML compatibility per se; any utility that
> gives me control over what tag to insert at the beginning of a style and
> at the end of a style would probably suffice. The ability to work with
> Word footnotes is a big plus.

Inso Corporation, I believe, has a tool to convert Word documents to
SGML files. I know nothing of the tool, though. I've simply seen it
mentioned on their web site.
----
Michael Brennan
Sr. Systems Analyst
Amgen Inc.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Fri Sep  4 06:03:15 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:24 2004
Subject: Please.....
Message-ID: <00bd01bdd7b9$8009f620$31aedccf@ix.netcom.com>

>Can any one help he from where I can get
>introduction of xml and other stuff.

Try my tutorial at the URL below. Follow the XML link. I also list other
tutorials.

regards,
Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com
-----Original Message-----
From: ruchig <ruchig@iitk.ac.in>
To: <xml-dev@ic.ac.uk>
Date: Sunday, August 30, 1998 8:05 AM
Subject: Please.....


>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep  4 07:42:15 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:24 2004
Subject: WP conversion issues
Message-ID: <v03102803b2152ccf3e6b@[203.23.215.55]>

Possibly not immediately, but within the next few months, our XML output
from the report writer is going to have to do some conversion of embedded
WP formats. (We are currently planning to use the Led cross-platform WP
toolkit).

I've got a fairly good understanding now of how we can make XSL and XML
work from our overall report layout, but I'm unsure of the idioms in
converting the straight WP formatted notes.

BTW something I'd like to see a lot more of are examples of markup decision
making and the reasoning behind them, like the Design Patterns movement in
programming.

Anyway, given a lump of styled text with font changes, bolding etc, there
seem to be several possible ways to map this.

1) the obvious "HTML-style" of defining tags for <B> etc. and applying them
inline, then adding the XSL definitions to match. This has a huge benefit
of giving us HTML conversion in the same hit.

2) some attempt to infer a document model, and defining the stylesheet on
the basis of location (eg: the word "school" within paragraph 1). This
feels very awkward but delivers cleaner text with separate styles.

3) conversion to RTF and encoding as same, on the basis that this will be
parsable by other tools as an embedded format in future.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Fri Sep  4 08:17:28 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun  7 17:04:24 2004
Subject: WP conversion issues
References: <v03102803b2152ccf3e6b@[203.23.215.55]>
Message-ID: <35EF78CD.8BB4A586@allette.com.au>


Andy Dent wrote:

>  BTW something I'd like to see a lot more of are examples of markup decision
> making and the reasoning behind them, like the Design Patterns movement in
> programming.

Since you asked, my book "The XML & SGML Cookbook: Recipes for Structured
Information", Charles F. Goldfarb Series on Structured Information Management,
Prentice Hall, 1998, 650 pages + CD-ROM, ISBN 0-13-614-223, is the only attempt
I know of to look at markup from the Design Patterns movement viewpoint.  Part 2
of the book is called "Document Patterns".  It has patterns for most basic
structures, with discussions of when one is more appropriate than another and
tips and warnings.

I am not aware of any material on the internet, though there may be some general
discussions, e.g. relating to design of particular structures in HTML, i.e.
tables.

 Another possible source, targetted at explaining particular DTDs, is Dave
Megginsons'  "Structuring XML Documents", which you may also find useful.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Fri Sep  4 08:20:19 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:24 2004
Subject: WP conversion issues
Message-ID: <012301bdd7cc$122d49a0$bc6118cb@caleb>

>BTW something I'd like to see a lot more of are examples of markup decision
>making and the reasoning behind them, like the Design Patterns movement in
>programming.


See Rick Jelliffe's SGML/XML Cookbook or David Megginson's Structuring XML
Documents ( http://www.xmlinfo.com/books/ )

>Anyway, given a lump of styled text with font changes, bolding etc, there
>seem to be several possible ways to map this.
>
>1) the obvious "HTML-style" of defining tags for <B> etc. and applying them
>inline, then adding the XSL definitions to match. This has a huge benefit
>of giving us HTML conversion in the same hit.

Just look at Word97's Save as HTML to see why this isn't trivial.

>2) some attempt to infer a document model, and defining the stylesheet on
>the basis of location (eg: the word "school" within paragraph 1). This
>feels very awkward but delivers cleaner text with separate styles.

Hard to do generically.

>3) conversion to RTF and encoding as same, on the basis that this will be
>parsable by other tools as an embedded format in future.

If you are happy to transport a presentational format around.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mrc at allette.com.au  Fri Sep  4 10:15:55 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun  7 17:04:24 2004
Subject: Tools to convert Word to XML?
References: <35EC2EA8.E65CF9C1@epiphanysoftware.com> <35EF3A23.D9B460E5@amgen.com>
Message-ID: <35EFA185.BB4554E1@allette.com.au>

Michael Brennan wrote:

> Andrew Cogan wrote:
>
> > Can anyone recommend good tools that can convert Word files to XML? I
> > don't need tools that claim XML compatibility per se; any utility that
> > gives me control over what tag to insert at the beginning of a style and
> > at the end of a style would probably suffice. The ability to work with
> > Word footnotes is a big plus.
>
> Inso Corporation, I believe, has a tool to convert Word documents to
> SGML files. I know nothing of the tool, though. I've simply seen it
> mentioned on their web site.

I think that's DynaTag - used to get documents into DynaText quickly. I believe
it works OK, though I've never used it. My pick is Rick Geimer's beerware - it
can be found at http://www.sesha.com/omlette/#rtf2xml. I've tested it in the past
and found it to be very good. It leaves you with valid XML ripe for the
inevitable next stage of manipulation - trying to infer the nesting. It's a great
way out of RTF and into something valid. Rick kindly released this under the GNU
General Public License and only requests payment if you roll it into a paying
project, I think.


--
Regards,

Marcus Carr                 email:  mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email:  info@allette.com.au
Level 10, 91 York Street    www:    http://www.allette.com.au
Sydney 2000 NSW Australia   phone:  +61 2 9262 4777
                            fax:    +61 2 9262 4774
_______________________________________________________________


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep  4 12:37:52 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:24 2004
Subject: XML and servlets
Message-ID: <005c01bdd7f0$7cc43c00$1e09e391@mhklaptop.bra01.icl.co.uk>

>[I would like] to parse these files on the server side
using a servlet.
>Has anybody have tried the same kind of experiment or could
point me to
>some valuable documentation or code to help me solve this
problem.


I have a couple of servlet demo applications in the SAXON
package available from
http://home.iclweb.com/icl2/mhkay/saxon.html They produce an
HTML rendition of the Shakespeare XML documents, which must
first be split into separate scenes using another sample
app.

I have run these servlets under Microsoft IIS using the Live
Software JRUN servlet environment. I don't recall whether or
not MSXML was one of the parsers I tested in this
environment but any SAX parser should work.

(MSXSL is a different matter, it doesn't seem to be
supported server-side and is generally rather fussy about
its environment).

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep  4 12:56:05 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:24 2004
Subject: XML tools and big documents
Message-ID: <009701bdd7f3$24de11c0$1e09e391@mhklaptop.bra01.icl.co.uk>


>To this end, I have been (in such spare time as i have)
tinkering
>about with Mr. Clark's XP API (com.jclark.xml.tok, mostly)
to write an
>application that will allow me to attach the logical
element structure
>to offsets in the storage entity, so that I can consider
the logical
>structure's relationship to points in the text without
reparsing the
>document
I think we're all looking for a solution to the problem that
a >1Mb document is too big, we don't want to parse it every
time we want to look at it, but storing the fine-grained DOM
representation has the opposite problem, it takes too much
space and takes too long to reassemble a reasonable unit
like a page. Indexing the original serial XML (say at
"chapter" level) is one solution; it's essentially
equivalent to my approach, which has been to split the
original XML (say at "chapter" level) and store the
"chapters" as separate linked XML documents.

What I mean by "chapter" is typically 1-10Kb, or
alternatively, a chunk of text such that the user doesn't
mind pressing "Next" when he's got to the end of it.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From elm at arbortext.com  Fri Sep  4 16:35:22 1998
From: elm at arbortext.com (Eve L. Maler)
Date: Mon Jun  7 17:04:25 2004
Subject: WP conversion issues
In-Reply-To: <35EF78CD.8BB4A586@allette.com.au>
References: <v03102803b2152ccf3e6b@[203.23.215.55]>
Message-ID: <199809041435.KAA21285@doctools.com>

I'm not familiar with the Design Patterns movement, but my book Developing
SGML DTDs: From Text to Model to Markup (Prentice-Hall PTR, ISBN
0-13-309881-8) has a whole chapter on markup design considerations.  Most
of the information applies directly to XML, although the book is obviously
written to full SGML.  It also presents an entire methodology for markup
design, again based on full SGML but equally applicable to XML.  In the
course of demonstrating the methodology, it covers a lot of typical design
decisions and their rationales.

	Eve

At 04:21 PM 9/4/98 +1100, Rick Jelliffe wrote:
>
>
>Andy Dent wrote:
>
>>  BTW something I'd like to see a lot more of are examples of markup
decision
>> making and the reasoning behind them, like the Design Patterns movement in
>> programming.
>
>Since you asked, my book "The XML & SGML Cookbook: Recipes for Structured
>Information", Charles F. Goldfarb Series on Structured Information
Management,
>Prentice Hall, 1998, 650 pages + CD-ROM, ISBN 0-13-614-223, is the only
attempt
>I know of to look at markup from the Design Patterns movement viewpoint.
Part 2
>of the book is called "Document Patterns".  It has patterns for most basic
>structures, with discussions of when one is more appropriate than another and
>tips and warnings.
>
>I am not aware of any material on the internet, though there may be some
general
>discussions, e.g. relating to design of particular structures in HTML, i.e.
>tables.
>
> Another possible source, targetted at explaining particular DTDs, is Dave
>Megginsons'  "Structuring XML Documents", which you may also find useful.
>
>Rick Jelliffe

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From MSCALIA at kti.com  Fri Sep  4 17:02:35 1998
From: MSCALIA at kti.com (Michael Scalia)
Date: Mon Jun  7 17:04:25 2004
Subject: Tools to convert Word to XML?
Message-ID: <s5efc745.055@kti.com>

>From: Andrew Cogan <andrew@epiphanysoftware.com>
>Date: Tue, 01 Sep 1998 10:28:09 -0700
>Subject: Tools to convert Word to XML?
>
>Can anyone recommend good tools that can convert Word files to XML? I
>don't need tools that claim XML compatibility per se; any utility that
>gives me control over what tag to insert at the beginning of a style and
>at the end of a style would probably suffice. The ability to work with
>Word footnotes is a big plus.
>- --
> Andrew Cogan, Epiphany Software

Andrew,

Check out "Ace" from RMIT in Melbourne, Australia.  Freely downloadable at http://ace.mds.rmit.edu.au/adl.  Ace can convert RTF and does a lot more.

To convert RTF to SGML in Ace:
  String bufRTF := readFile(rtfFile);
  String bufSGML := bufRTF.rtfToSgml();
Then you can stream through the SGML, to modify and insert tags as you wish.  Can also create an in-memory parse tree.  You can choose to parse as SGML or XML.

Michael
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep  4 18:02:59 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:25 2004
Subject: Notation Declarations?
Message-ID: <35F00F7D.2F6CF07A@infinet.com>

Unlike the specification for Element Declarations, Entity Declarations,
and Attribute List declarations, there is nothing that I can find on
Notations which says what you are supposed to do if a Notation with a
particular Name is declared more than once.

Basically I am just wondering if you should:

(1) Replace the old NotationDecl with the new NotationDecl
(2) Ignore all new NotationDecls after the first encountered
NotationDecl has been declared
(3) Throw an error

One other thing I have been wondering about is how best to present
validity errors to the application.  Many validity errors cannot be
found in a stream-based parser until the end of the document has been
reached, so in a lot of ways it would make sense to batch all validation
errors in a list and present them to the application at the end of the
document.

>From what I have already been told, the spec says nothing about how a
validating processor is supposed to present validity errors, just that
they are to be presented as recoverable errors in some fashion.

Thanx in advance,

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Fri Sep  4 23:06:20 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:25 2004
Subject: parsing XML within HTML files
In-Reply-To: <002701bdd764$f58dbc10$89fcd8d0@lastexit.gmswireless.com>
Message-ID: <3.0.5.32.19980904230525.007bfd00@ifi.uio.no>


* Weihong Xie
>
>AlL I want to do is in normal HTML files, there will be some customized XML
>tags to mark the places where dynamic values will be inserted, so when the
>servlet serves those pages, it will provide those values but leave the HTML
>text alone. 

In that case you could probably use cpp or just write your own tool that
does the substitution. Mark the places with something like $place$ and that's
it.

It sounds like overkill to use XML for this.

>The question is how I can do this, do I need a DTD that defines
>HTML and my customized tags or is there any XML parsers understand HTML? 

HEX does.

<URL:http://www-uk.hpl.hp.com/people/ak/java/hex.html>

--Lars M.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Fri Sep  4 23:39:00 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:25 2004
Subject: A utility to make msxsl more useful
Message-ID: <199809042138.OAA25850@mail-gw.pacbell.net>


  I wrote a small Perl script that can be used to preprocess XML files
before sending them to msxsl. Why might you want to do this? So you can
expand ENTITY references and do something like <INCLUDE
HREF="included_file.xml"/>

  It's very basic and very small so I just attached it to this message for
anyone who's interested.

  Here's the syntax for using it from the DOS command prompt...

C:\<your path to Perl>\Perl.exe expand.pl myfile.xml > temp.xml
msxsl -i myfile.xml -s myfile.xsl -o output.html

  myfile.xml can define entities in its internal and external DTD by saying
<!ENTITY entityname 'VALUE'> or <!ENTITY entityname SYSTEM 'filepath'> You
can use single or double quotes.

  I also made it so you can include a file by saying <INCLUDE
HREF="filetoinclude"/>

  Basically, I'm trying to find ways to make msxsl usable now. I was sort
of hoping some Java programmers would leap to the rescue and turn msxml (or
some equivalent parser) into type of preprocessor for msxsl but, failing
that, I worked up a quick and dirty way to do what I want. Hopefully some
one else will find it useful.
-------------- next part --------------


main();

sub main {
	$xml = (&readFile($ARGV[0]));
    %externalEntities = &parseExternalDTD($xml);
    %internalEntities = &parseInternalDTD($xml);
    my($moreToGo) = (1);
    while ($moreToGo) {
    	$moreToGo = &expandEntities(%externalEntities, %internalEntities) | &expandLinks(%externalEntities, %internalEntities);
	}
    print $xml;
}

# $_[0] = file name or path
# returns full text of file
sub readFile {
	my($contents);
	my(@fileInfo) = stat($_[0]);
	open(F, $_[0]) or die "Couldn't open $_[0]\n";
	read F, $contents, $fileInfo[7];
	close(F);
    return $contents;
}

# $_[0] full text of an XML document
# returns hash of external entities and what they reference
sub parseExternalDTD {
	# Looking for...  <!DOCTYPE foo SYSTEM 'bar.dtd'>
	unless ($_[0] =~ /<!DOCTYPE\s+\w+\s+SYSTEM\s+['"]([^"']+)/) {
    	return {};
    }
    my($dtdPath) = ($1);
    my($dtd) = &readFile($dtdPath);
    my(%entities) = (&extractEntities($dtd));
    return %entities;
}

# $_[0] full text of XML document
# returns hash of internally defined entities and what they reference
sub parseInternalDTD {
	my(%entities) = (&extractEntities($_[0]));
    return %entities;
}

# $_[0] text, possibly containing <!ENTITY> declarations
# returns entity has of names and values
sub extractEntities {
	my($text) = $_[0];
	my(%entities);
    my($entityName, $entityPath);
    # Looking for <!ENTITY foo 'bar'> or <!ENTITY foo SYSTEM 'bar'>
    while ($text =~ /<!ENTITY/) {
    	if ($text =~ s/<!ENTITY\s+(\w+)\s+['"]([^'"]*)['"]>//s) {
        	$entities{$1} = $2;
		} elsif ($text =~ s/<!ENTITY\s+(\w+)\s+SYSTEM\s+['"]([^'"]+)['"]>//s) {
        	($entityName, $entityPath) = ($1, $2);
            $entities{$entityName} = &readFile($entityPath);
		}
	}
    return %entities;
}

# @_ is a hash of entities and what they expand to
# works on global variable $xml searching for &foo; references
# returns true if it was able to make any replacements
sub expandEntities {
	my(%entities) = @_;
    my($gotOne) = (0);
    while ($xml =~ s/\&(\w+);/$entities{$1}/) {
    	$gotOne = 1;
    }
    return $gotOne;
}

sub expandLinks {
	my($gotOne) = (0);
	# We're looking for... <INCLUDE HREF="foo"/>
    # This is not a complete implementation! A real XML processor would
    # look for any type of link that's defined to have SHOW="EMBED" and ACTUATE="AUTO"
    # ...but that's too much work for what I'm after
    while ($xml =~ s/<INCLUDE\s+HREF=["']([^"']+)["']\/>/&readFile($1)/se) {
    	$gotOne = 1;
	}
    return $gotOne;
}
-------------- next part --------------

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 
From larsga at ifi.uio.no  Sat Sep  5 09:21:28 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:25 2004
Subject: A utility to make msxsl more useful
In-Reply-To: <199809042138.OAA25850@mail-gw.pacbell.net>
Message-ID: <3.0.5.32.19980905092036.007a62c0@ifi.uio.no>


* Andrew Bunner
>
>Basically, I'm trying to find ways to make msxsl usable now. I was sort
>of hoping some Java programmers would leap to the rescue and turn msxml (or
>some equivalent parser) into type of preprocessor for msxsl but, failing
>that, I worked up a quick and dirty way to do what I want. Hopefully some
>one else will find it useful.

Andrew, there are a couple of things you should know:

 - MSXSL implements the old XSL proposal, which was obsoleted by the new
   XSL Working Draft that was released on 19980818. This means that developing
   new tools for MSXSL is rather pointless.

 - There are several other XSL tools, some of which implement the 19980818
   Working Draft:

   <URL:http://www.stud.ifi.uio.no/~larsga/linker/xmltools/by-standard.html>

--Lars M.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Sat Sep  5 13:13:15 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:25 2004
Subject: Validating IDREFS...
Message-ID: <3.0.5.32.19980905131219.007be900@ifi.uio.no>


* Michael Kay
| 
| * A dangling IDREF is an error; a dangling XPointer is not

They are both errors, but at different levels, which IMHO makes
perfect sense. To check your XPointers you have to run an XPointer
checker, since XPointer is not part of XML (and IMHO shouldn't be).

| That is what I mean by saying the two facilities are
| incompatible. Or to put it another way, once I have made a design
| choice to use IDREF or to use XPointer for the links in my
| documents, I am stuck with my choice.

Definitely not. As Lisa Rein points out it's easy to convert from
IDREFs to XPointers. Going the other way may not be possible, since
XPointer can do much that IDREFs cannot, in which case I guess that's
not what you want anyway. :)
 
| This is one of several situations in the XML family of standards
| where there is more than one way of doing the same thing, and no
| obvious way to choose between them. 

I've never felt that this was a difficult choice. For links inside the
document where you can count on IDs to be present, use IDREF, for
external links, links to arbitrary elements (or with even finer
granularity if required) use XPointer.

-- 
"These are, as I began, cumbersome ways / to kill a man. Simpler, direct, 
and much more neat / is to see that he is living somewhere in the middle /
of the twentieth century, and leave him there."     -- Edwin Brock

 http://www.stud.ifi.uio.no/~larsga/      http://birk105.studby.uio.no/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Sat Sep  5 13:15:45 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:25 2004
Subject: XML-QL
Message-ID: <3.0.5.32.19980905131429.007a5130@ifi.uio.no>

To: 
Subject: 
References: <00e501bdd591$faef4180$1e09e391@mhklaptop.bra01.icl.co.uk>
<35EBF5E5.1BB74120@technologist.com>
Gcc: nnml+archive:Sendt
--text follows this line--

* Paul Prescod
| 
| It would also be useful to compare XPointer, which is a sort of
| query that returns a single node.

XPointer can return sets of nodes, parts of nodes and a span of nodes.
Whether they can operate on a set of nodes is not stated in the
current WD (it's on the list of things to be clarified), but I would
assume not.

But it's still useful to compare XPointer. The CSS2 selectors are also
a sort of query language, and seem rather similar to XSL patterns.  (I
have just briefly skimmed the XSL WD so far.)

In fact I think it would make very good sense for XSL patterns to
extend the CSS2 selectors instead of starting again from scratch with
XML query language number 3.

-- 
"These are, as I began, cumbersome ways / to kill a man. Simpler, direct, 
and much more neat / is to see that he is living somewhere in the middle /
of the twentieth century, and leave him there."     -- Edwin Brock

 http://www.stud.ifi.uio.no/~larsga/      http://birk105.studby.uio.no/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ruchig at iitk.ac.in  Sat Sep  5 17:17:56 1998
From: ruchig at iitk.ac.in (Prashant)
Date: Mon Jun  7 17:04:25 2004
Subject: This is Java question ..
Message-ID: <Pine.HPP.3.96.980905204510.7773A-100000@apah.cc.iitk.ernet.in>


Hello Everybody:

Can anybody tell me what is the Algorithm used by the Java VM
Garbage-Collector ?.  I am sorry to post this question in xml
discussion group.
Thanks
Rg


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sat Sep  5 23:50:30 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:25 2004
Subject: This is Java question ..
Message-ID: <002401bdd915$d68ff0a0$2ee044c6@arcot-main>

>Can anybody tell me what is the Algorithm used by the Java VM
>Garbage-Collector ?.  I am sorry to post this question in xml
>discussion group.


Every Wednesday morning although they won't guarantee pickup.

I would like to suggest that you search the JavaSoft site at
http://www.javasoft.com or subscribe to one of the Java related mailing
lists at http://www.xcf.berkeley.edu/lists.html

Don Park


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sun Sep  6 04:46:51 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:25 2004
Subject: ANN: Docuverse DOM SDK PR2 Released
Message-ID: <003501bdd93f$3a2ac350$2ee044c6@arcot-main>

Docuverse DOM SDK Preview Release 2 is now available at:

http://www.docuverse.com/domsdk/index.html

PR2 includes W3C DOM HTML API support and minor bug fixes.

Also, as of PR2, DOM SDK can be used for commercial purpose for free.  This
change in licensing policy was made in response to numerous pleas,
complaints, and comparisons to IBM's free commercial license for XML4J.
Although comparing Docuverse to IBM is like comparing Tweety Bird to Dolly
Parton IMHO, we thought the change was necessary to encourage development of
DOM-based XML software.  In another word, we made a mistake when we
restricted commercial use before.  Our appologies.

Best,

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bmhughes at ozemail.com.au  Sun Sep  6 09:50:42 1998
From: bmhughes at ozemail.com.au (Baden Hughes)
Date: Mon Jun  7 17:04:25 2004
Subject: Word to XML : how to's are on their way
Message-ID: <000001bdd96a$bed9fdc0$e83670c2@bmhmobile>

Since I replied about how to take Word documents and export them with
markup, there have been a lot of requests for how to do this
(specifics). I'm going to put up a page or two about it in a couple of
weeks, after I've managed to justify my budget for next year in the
next week in the UK.

Baden


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Patrice.Bonhomme at loria.fr  Sun Sep  6 22:06:30 1998
From: Patrice.Bonhomme at loria.fr (Patrice Bonhomme)
Date: Mon Jun  7 17:04:25 2004
Subject: [ANN] Silfide XML Parser in Java - 0.8
Message-ID: <199809061952.VAA06633@chimay.loria.fr>


<hi/>

The Silfide Working Group is happy to announce the availability of the Silfide 
XML Parser (SXP, v0.8 - Sun Sep  6 1998 ), a release of our validating XML 
Parser writing in Java.

The SXP entirely implements the XML 1.0 recommendation and most of its 
satellite recommendations:

- XML Namespaces (WD 18-05-1998, the old draft)
- Document Object Model Level 1 (DOM Core and XML, PR 08-08-1998)
- XPointer (WD 03-03-1998)
- XLink (WD 03-03-1998)

SXP provides also a driver for the SAX interface (fr.loria.xml.sax.SAXDriver).

Both of the XML and XPointer parsers are developed with the tool JavaCC.


Changes from last revision:

 - Implements the DOM Proposed Recommendation 18 August, 1998
 - DTD parsing has been improved (with PE inclusion)
 - A new NodeFilter interface (to use with, for example, an XMLTreeIterator)
 - some bugs has been fixed (thanks for your bug reports)
 - some new bugs may have been introduced (!)

Java source files, java classes, some samples and documentation are freely 
available here:

	http://www.loria.fr/projets/XSilfide/EN/sxp/


SILFIDE is a project of CNRS and AUPELF-UREF. Server SILFIDE, as an 
interactive server, wants to offer to the whole of the French-speaking 
university community working starting from the language (linguists, teachers, 
data processing specialists...) a tool user-friendly and reasoned for the 
handling of electronic resources.

A more detailed description of the Silfide project is available here:

	http://www.loria.fr/projets/XSilfide/

We are waiting for all of your comments, questions and suggestions.

Pat.


silfide-dev: the Silfide development mailing list, maintained/organized by 
Patrice Bonhomme (bonhomme@loria.fr)
To subscribe, send email to listserv@loria.fr with the single message line 
'SUBscribe silfide-dev Your Name'.
To unsubscribe, send email to listserv@loria.fr with the single message line 
'SIGnoff silfide-dev'.

-- 
  ==============================================================
  bonhomme@loria.fr               |      Office : B.228
  http://www.loria.fr/~bonhomme   |      Phone  : 03 83 59 30 52
  --------------------------------------------------------------
   * Serveur Silfide  : http://www.loria.fr/projets/Silfide
   * Projet Aquarelle : http://aqua.inria.fr
  ==============================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jamesr at steptwo.com.au  Mon Sep  7 02:49:06 1998
From: jamesr at steptwo.com.au (James Robertson)
Date: Mon Jun  7 17:04:25 2004
Subject: Tools to convert Word to XML?
In-Reply-To: <35EC2EA8.E65CF9C1@epiphanysoftware.com>
Message-ID: <199809070048.KAA12555@fep2.mail.ozemail.net>

At 03:28 2/09/1998 , you wrote:

  | Can anyone recommend good tools that can convert Word files to XML? I
  | don't need tools that claim XML compatibility per se; any utility that
  | gives me control over what tag to insert at the beginning of a style and
  | at the end of a style would probably suffice. The ability to work with
  | Word footnotes is a big plus.

Andrew,

This may be a good candidate for a custom-written conversion. Converting
RTF to SGML/XML is not too hard, particularly using a tool such as
Omnimark.

Cheers,

James


-------------------------
James Robertson
Step Two Designs Pty Ltd
SGML, XML & HTML Consultancy
http://www.steptwo.com.au/
jamesr@steptwo.com.au

"Beyond the Idea"
 ACN 081 019 623

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From h.rzepa at ic.ac.uk  Mon Sep  7 09:54:39 1998
From: h.rzepa at ic.ac.uk (Rzepa, Henry)
Date: Mon Jun  7 17:04:25 2004
Subject: Fwd: Re: Announcing PrismEd (a configurable metadata editor)
Message-ID: <v04011702b21941c6df52@[155.198.224.86]>

Apologies for cross posting, but this seems a useful tool!

>To: Andrew Waugh <Andrew.Waugh@cmis.CSIRO.AU>
>cc: meta2@mrrl.lboro.ac.uk, Andrew.Waugh@cmis.CSIRO.AU
>Subject: Re: Announcing PrismEd (a configurable metadata editor) 
>Date: Mon, 07 Sep 1998 10:10:27 +1000
>From: Andrew Waugh <Andrew.Waugh@cmis.CSIRO.AU>
>Sender: owner-meta2@net.lut.ac.uk
>Precedence: bulk
>
>Dear all,
>
>On Saturday I wrote...
>> Some of you may be interested in trying PrismEd. PrismEd is a
>> configurable metadata editor which will cope with structured metadata
>> values. Schema files are provided for Dublin Core (in French and
>> English), and ANZLIC. It produces RDF, and will read the RDF it produces
>> (but I don't claim that it can read arbitrary RDF!).
>>
>> You can view the documentation for PrismEd at
>>	http://www.mel.dit.csiro.au:8080/~ajw/prismEd/prismEd/help.html
>>
>> If you have a reasonably modern web browser (with Java 1.1), you can
>> try PrismEd as an applet at
>>	http://www.mel.dit.csiro.au:8080/~ajw/prismEd/prismEd.html
>>
>> If you prefer, you can download the PrismEd class files and run it
>> locally using anonymous ftp:
>>	ftp://weever.vic.cmis.csiro.au/staff/ajw/prismEd.jar
>>
>> I'd be very interested in bug reports, additional features that people
>> might be interested in, etc.
>
>Our system administrators just turned off anonymous ftp (account
>hackers :-(. To download the class files try the following URLs:
>
>with INTERNET EXPLORER (250K)
>	http://www.mel.dit.csiro.au:8080/~ajw/prismEd/prismEd.jar
>
>with NETSCAPE (590K)
>	http://www.mel.dit.csiro.au:8080/~ajw/prismEd/prismEd.tar
>(Due to unfortunate interactions between our http server and Netscape,
>the jar file doesn't download correctly. The TAR file can be extracted
>using WinZip).
>
>Sorry for the confusion!
>
>andrew waugh
>

Dr Henry Rzepa,  Dept. Chemistry,  Imperial College,  LONDON SW7 2AY;
mailto:rzepa@ic.ac.uk; Tel  (44) 171 594 5774; Fax: (44) 171 594 5804.
URL: http://www.ch.ic.ac.uk/rzepa/ 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From byrnes at prl.research.philips.com  Mon Sep  7 19:40:24 1998
From: byrnes at prl.research.philips.com (Nigel Byrnes)
Date: Mon Jun  7 17:04:26 2004
Subject: Newbie Question
Message-ID: <35F41AD8.7F530CD1@prl.research.philips.com>

<SwallowsPride/>

Hi

I'm just getting started in the XML world and I'm working
through Simon St. Laurent's book "XML: A Primer". The start of
chapter 5 looks at the parsing of a simple xml document. So I
type it into a text editor and parse it with MSXML only to
receive the following error message:

C:\msxml>jview msxml -d1 me\simple.xml
Root element name must match the DOCTYPE name
Location: file:/C:/msxml/me/simple.xml(10,2)
Context: <null>

(Attached as an appendix to this mail is the listing of
simple.xml.) From what i can gather, the error occurs at the
second character in the <DOCUMENT> element. The error message is
telling me that the root element name must match the DOCTYPE
name ["simple"]. However, i haven't being able to solve this
error.

I haven't got many more hairs to pull out, so can someone point
in the the right direction. Many thanks,

Nigel

-=-=-=-=- Listing of simple.xml -=-=-=-=-=-=-=-=

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE simple [
<!ELEMENT DOCUMENT (#PCDATA)>
<!ATTLIST DOCUMENT
  trackNum CDATA #REQUIRED
  secLevel (unclassified|classified)
"unclassified">
<!ENTITY Description "this is a very simple sample document.">
]>
<DOCUMENT trackNum="1234">This is an entity inside an element:
&Description;</DOCUMENT>

--

Nigel Byrnes,                     Software Engineering
Applications Group
Philips Research Laboratories,
Redhill.                          Tel: +44 (0)1293 815578
Surrey,                           Fax: +44 (0)1293 815024
RH1 5HA. UK                       Email:
byrnes@prl.research.philips.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From digitome at iol.ie  Mon Sep  7 20:09:29 1998
From: digitome at iol.ie (Sean Mc grath)
Date: Mon Jun  7 17:04:26 2004
Subject: Newbie Question
Message-ID: <1.5.4.32.19980907175356.0094c8cc@gpo.iol.ie>

[Nigel Byrnes]

>C:\msxml>jview msxml -d1 me\simple.xml
>Root element name must match the DOCTYPE name

A validating XML parser enforces the constraint that the root
element of the document matches the element type name specified
in the doctype. So this snippet is ok:

        <!DOCTYPE foo "bar.dtd">
        <foo>

(foo matches foo)

but this is not:

        <!DOCTYPE foo "bar.dtd">
        <baz>

(foo does not match baz)


Non-validating parsers, on the other hand, don't care.

Sean Mc Grath
http://www.digitome.com/sean.htm
+353 96 47391


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simpson at polaris.net  Mon Sep  7 20:16:39 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun  7 17:04:26 2004
Subject: Newbie Question
In-Reply-To: <35F41AD8.7F530CD1@prl.research.philips.com>
Message-ID: <3.0.3.32.19980907141557.00ba07e0@nexus.polaris.net>

Hi Nigel. Don't be worried about being a newbie -- nearly everyone is yet,
at some level. :)

At 06:41 PM 9/7/98 +0100, Nigel Byrnes wrote:
>... I type it into a text editor and parse it with MSXML only to
>receive the following error message:
>
>C:\msxml>jview msxml -d1 me\simple.xml
>Root element name must match the DOCTYPE name
>Location: file:/C:/msxml/me/simple.xml(10,2)
>Context: <null>
>... the error occurs at the
>second character in the <DOCUMENT> element. The error message is
>telling me that the root element name must match the DOCTYPE
>name ["simple"]. However, i haven't being able to solve this
>error.
	<snip>
><?xml version="1.0" encoding="UTF-8"?>
><!DOCTYPE simple [
><!ELEMENT DOCUMENT (#PCDATA)>
><!ATTLIST DOCUMENT
>  trackNum CDATA #REQUIRED
>  secLevel (unclassified|classified)
>"unclassified">
><!ENTITY Description "this is a very simple sample document.">
>]>
><DOCUMENT trackNum="1234">This is an entity inside an element:
>&Description;</DOCUMENT>

This should be pretty, er, simple. Your DOCTYPE declaration says that the
root element of your document is the <simple> element. However, the actual
document (which follows the close of the internal DTD, that is, the line
containing the ]> characters) contains as its root an element called
<DOCUMENT>. Either change the DTD so that the root element is DOCUMENT
(<!DOCTYPE DOCUMENT...) , or change the actual root of the document to
<simple>. Remember to keep the capitalization consistent, as (for example)
an element called <DOCUMENT> is *not* the same as one called <document>. 

Then you should be all set.

=================================================
John E. Simpson
simpson@flixml.org
http://www.flixml.org
Just XML - coming in September from Prentice-Hall

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Mon Sep  7 22:20:33 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:26 2004
Subject: Validating IDREFS...
In-Reply-To: <3.0.5.32.19980905131219.007be900@ifi.uio.no>
Message-ID: <3.0.1.16.19980907185746.2a0768f6@pop3.demon.co.uk>

At 13:12 05/09/98 +0200, Lars Marius Garshol wrote:
>
>I've never felt that this was a difficult choice. For links inside the
>document where you can count on IDs to be present, use IDREF, for
>external links, links to arbitrary elements (or with even finer
>granularity if required) use XPointer.

I have also struggled gently with this and - at present - tend not to use
IDREF at all. The only - but valuable - benefit of IDREF is that it
requires the parser to check presence of IDs. However I believe that the
XLink approach is not to use IDREF for href so that IDREF cannot be used
with XLink. Since XLink is a cornerstone of much of what I do, I can't see
that I should use IDREF. Is this reasonable?

	P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Mon Sep  7 22:20:39 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
  Released)
In-Reply-To: <003501bdd93f$3a2ac350$2ee044c6@arcot-main>
Message-ID: <3.0.1.16.19980907190338.2a070b12@pop3.demon.co.uk>

At 19:36 05/09/98 -0700, Don Park wrote:
>Also, as of PR2, DOM SDK can be used for commercial purpose for free.  This
>change in licensing policy was made in response to numerous pleas,
>complaints, and comparisons to IBM's free commercial license for XML4J.
>Although comparing Docuverse to IBM is like comparing Tweety Bird to Dolly
>Parton IMHO, we thought the change was necessary to encourage development of
>DOM-based XML software.  In another word, we made a mistake when we
>restricted commercial use before.  Our appologies.

My sincere thanks for this change of policy. It can and will make an
enormous difference to many of us - otherwise we end up rewriting each
other's software. It is a courageous decision - as was IBM's - and should
be applauded.

I also think it will be a fruitful decision. I have been through the same
thoughts when people have asked whether they could distribute JUMBO in
their book/CDROM or whatever. I now have no problem with this, and am
looking at the GPL for this purpose [comments would be much appreciated.] I
think the quid-pro-quo would be to ask people to register their use with
you and you may well benefit from this.

	P.


>
>Best,
>
>Don Park
>Docuverse
>
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep  8 05:36:13 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
In-Reply-To: <3.0.1.16.19980907190338.2a070b12@pop3.demon.co.uk> from "Peter Murray-Rust" at Sep 7, 98 07:03:38 pm
Message-ID: <199809080341.XAA12969@locke.ccil.org>

Peter Murray-Rust scripsit:

> My sincere thanks for this change of policy. It can and will make an
> enormous difference to many of us - otherwise we end up rewriting each
> other's software. It is a courageous decision - as was IBM's - and should
> be applauded.

Indeed! Unfortunately, the license is still very restrictive, disallowing
modification of the DOM SDK.

> I now have no problem with this, and am
> looking at the GPL for this purpose [comments would be much appreciated.] I
> think the quid-pro-quo would be to ask people to register their use with
> you and you may well benefit from this.

I would urge Don and you to look at http://www.opensource.org/intro/free, which
has information on why loosening restrictions can be beneficial for
everyone.

The Artistic License is a good substitute for the GPL and allows you
to keep control of the named product while allowing others to create
differently named variations.  See http://language.perl.com/misc/Artistic.html .

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Usha_R2 at verifone.com  Tue Sep  8 05:44:15 1998
From: Usha_R2 at verifone.com (Usha_R2@verifone.com)
Date: Mon Jun  7 17:04:26 2004
Subject: Newbie Question
Message-ID: <7BA6E16CF180D111944700A0C9979DE51D4F77@blr-nt-mail2.verifone.com>

Just make the Document type to be of "DOCUMENT". It will work.

------------------------------------------------------------------------
--------

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE DOCUMENT [
<!ELEMENT DOCUMENT (#PCDATA)>
<!ATTLIST DOCUMENT
  trackNum CDATA #REQUIRED
  secLevel (unclassified|classified)
"unclassified">
<!ENTITY Description "this is a very simple sample document.">
]>
<DOCUMENT trackNum="1234">This is an entity inside an element:
&Description;</DOCUMENT>

------------------------------------------------------------------------
--------
One more clarification. When the above XML file is compiled using MSXML,
in the output the parser does not substitute the contents of the Entity
&Description;. Why it so? Is it a bug in MSXML parser or is there any
error in ENTITIY declaration

Thank you

K. Usha Rani

> ----------
> From: 	Nigel Byrnes
> Reply To: 	Nigel Byrnes
> Sent: 	Monday, September 07, 1998 11:11 PM
> To: 	xml-dev@ic.ac.uk
> Subject: 	Newbie Question
> 
> <SwallowsPride/>
> 
> Hi
> 
> I'm just getting started in the XML world and I'm working
> through Simon St. Laurent's book "XML: A Primer". The start of
> chapter 5 looks at the parsing of a simple xml document. So I
> type it into a text editor and parse it with MSXML only to
> receive the following error message:
> 
> C:\msxml>jview msxml -d1 me\simple.xml
> Root element name must match the DOCTYPE name
> Location: file:/C:/msxml/me/simple.xml(10,2)
> Context: <null>
> 
> (Attached as an appendix to this mail is the listing of
> simple.xml.) From what i can gather, the error occurs at the
> second character in the <DOCUMENT> element. The error message is
> telling me that the root element name must match the DOCTYPE
> name ["simple"]. However, i haven't being able to solve this
> error.
> 
> I haven't got many more hairs to pull out, so can someone point
> in the the right direction. Many thanks,
> 
> Nigel
> 
> -=-=-=-=- Listing of simple.xml -=-=-=-=-=-=-=-=
> 
> <?xml version="1.0" encoding="UTF-8"?>
> <!DOCTYPE simple [
> <!ELEMENT DOCUMENT (#PCDATA)>
> <!ATTLIST DOCUMENT
>   trackNum CDATA #REQUIRED
>   secLevel (unclassified|classified)
> "unclassified">
> <!ENTITY Description "this is a very simple sample document.">
> ]>
> <DOCUMENT trackNum="1234">This is an entity inside an element:
> &Description;</DOCUMENT>
> 
> --
> 
> Nigel Byrnes,                     Software Engineering
> Applications Group
> Philips Research Laboratories,
> Redhill.                          Tel: +44 (0)1293 815578
> Surrey,                           Fax: +44 (0)1293 815024
> RH1 5HA. UK                       Email:
> byrnes@prl.research.philips.com
> 
> 
> 
> xml-dev: A list for W3C XML Developers. To post,
> mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
> message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
> 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Tue Sep  8 05:59:38 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:26 2004
Subject: Java Script, and it's conflicts with 2.4.4
Message-ID: <199809080359.UAA11275@mail-gw.pacbell.net>


  The XSL Working Draft seems to make it impossible to generate an HTML
file with embedded Java Script.

  "Impossible" might be too strong a word, but I can't find any method to
get the literal character '<' into my generated file. So, if you're
interested in doing Java Script comparisons, you seem to be limited to
equality and inequality.

  I can only see three reasons for this...

1) I've missed something and there is a way to do it, but it's not obvious
2) The XSL Working Group made a gross oversight
3) There's some extremely good reason for this design decision, but it's
not obvious

  If we're looking at case two, then I'd like to suggest a revision to
section 2.4.4. Since we already have a way to generate &lt; I think we
ought to make <![CDATA[<]]> put the literal character '<' in the generated
file.

  If we're looking at case three, then I'd like say that, IMHO, the
well-formedness of the generated document is not so important that we
should prohibit the author from putting in characters that need to be there.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jeremy at omsys.com  Tue Sep  8 06:18:03 1998
From: jeremy at omsys.com (Jeremy H. Griffith)
Date: Mon Jun  7 17:04:26 2004
Subject: Java Script, and it's conflicts with 2.4.4
In-Reply-To: <199809080359.UAA11275@mail-gw.pacbell.net>
References: <199809080359.UAA11275@mail-gw.pacbell.net>
Message-ID: <3652af44.622192074@mail.together.net>

On Mon, 07 Sep 1998 21:05:17 -0700, Andrew Bunner <bunner@massquantities.com>
wrote:

>  "Impossible" might be too strong a word, but I can't find any method to
>get the literal character '<' into my generated file. So, if you're
>interested in doing Java Script comparisons, you seem to be limited to
>equality and inequality.

There's always the hack used for older browsers that suffered from
the same problem... reverse the terms in the comparison and use '>'.
Or is that prohibited too?

-- Jeremy H. Griffith, at Omni Systems Inc.
  (jeremy@omsys.com)  http://www.omsys.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  8 06:46:22 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
Message-ID: <001f01bddae2$3caee1c0$2ee044c6@arcot-main>

>Indeed! Unfortunately, the license is still very restrictive, disallowing
>modification of the DOM SDK.


John,

Docuverse is not a non-profit organization nor is it a purely
consulting-based business.  We build development tools and high performance
servers.  It is our intention to offer free commercial quality DOM
implementation to promote popularity of DOM-based software which will turn
increase the need for high performance DOM implementation which we intend to
introduce soon as a commercial product.  To this end, it is important that
we promote our API and minimize proliferation of variations so that users
can simply 'plugin' the high performance version.  Furthermore, we believe
that extensions are better than variations and we will make sure that our
API is easily extensible.

Best,

Don Park


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Tue Sep  8 07:01:37 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <199809080341.XAA12969@locke.ccil.org>
Message-ID: <35F4BA73.EA707E0C@infinet.com>

John Cowan wrote:

> Peter Murray-Rust scripsit:
>
> > My sincere thanks for this change of policy. It can and will make an
> > enormous difference to many of us - otherwise we end up rewriting each
> > other's software. It is a courageous decision - as was IBM's - and should
> > be applauded.
>
> Indeed! Unfortunately, the license is still very restrictive, disallowing
> modification of the DOM SDK.
>
> > I now have no problem with this, and am
> > looking at the GPL for this purpose [comments would be much appreciated.] I
> > think the quid-pro-quo would be to ask people to register their use with
> > you and you may well benefit from this.
>
> I would urge Don and you to look at http://www.opensource.org/intro/free, which
> has information on why loosening restrictions can be beneficial for
> everyone.
>
> The Artistic License is a good substitute for the GPL and allows you
> to keep control of the named product while allowing others to create
> differently named variations.  See http://language.perl.com/misc/Artistic.html .

If you ever plan on making any money off of a product, never give it away for free.
If you are not planning on ever making money on a product, then it is of the most
benefit to essentially publish the source-code as is and let anyone do what they
want with it.  The only reasons I see for creating free software is either idealism,
enhancing your personal or company reputation in the developer community, or to kill
your up and coming competitors.

Not charging for a product initially, and then later on (when the competition has
subsided) charging for a product is about the same as marking down airline fares
below cost to kill of competition and then raising them later on to insane levels.

The worst way to lose face with other developers is to not be clear about your
long-term plans for a product.  Developers (at least the intelligent ones) will pay
for a superior product that cuts down the development time of their current project
so long as the licensing is clear and consistent over time.  Anything less is
pulling a fast one in my book...

The DOM SDK license is as restrictive as Docuverse wants to make it.  In the real
world there is no such thing as a free lunch so you should not expect Docuverse or
any other small ISV to be the angels of free software.

Even though I don't plan on using the DOM SDK myself anytime soon, I think it would
do the developer community more benefit in the long run if Docuverse were to charge
a fair price for a commercial license so there is incentive in the future for
Docuverse to do bug-fixes and updates and maybe even provide some level of support.

If you look at Netscape, they have basically capitulated on improving the web
browser (no incentive to improve it or add new features) and their future as a
profitable company is suspect.  They were a company that gave everything away for
free to kill off browser competition early on and then tried to charge for it when
the competition died off.  Then of course, Microsoft jumped in and did to Netscape
what Netscape did to everyone else.  In the end, the customers lose because from
this point on web browsers will likely have little innovation applied to them from
this point out.  In other words they will just plain suck.

For those people using the DOM SDK now and who enjoy the product, I would seriously
encourage these people to plea for Docuverse to charge something for a commercial
license, even if it is as low as $99 so that they can have some solace in the fact
that there will be future quality versions of the DOM SDK.  99$ is basically the
same cost as 3 development hours for the average engineer.  If 99$ is too much money
to spend on any commercial product, then your whole business plan for your product
needs some serious reevaluation.  Small ISV's like Docuverse should not feel
pressured to capitulate to the large ISV's like IBM or Microsoft who can afford to
give all their tools away for free in their efforts to squelch the up and coming.

If you look at the best XML tools to date you will find that they are not from the
big names that we know of, rather small guys who are dedicated to quality.  If we
all want quality tools to work with we will all need to put our money where our
mouth is one way or another.

My 2 cents...

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Tue Sep  8 07:50:01 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:26 2004
Subject: Java Script, and it's conflicts with 2.4.4
In-Reply-To: <3652af44.622192074@mail.together.net>
References: <199809080359.UAA11275@mail-gw.pacbell.net>
 <199809080359.UAA11275@mail-gw.pacbell.net>
Message-ID: <199809080549.WAA11710@mail-gw.pacbell.net>


>There's always the hack used for older browsers that suffered from
>the same problem... reverse the terms in the comparison and use '>'.
>Or is that prohibited too?

  The problem runs deeper than that.

  Any occurence of &,',",< or > in the style sheet or in the element
content of the XML document will be escaped to &xxx;

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Tue Sep  8 07:53:04 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
In-Reply-To: <199809080341.XAA12969@locke.ccil.org>
References: <3.0.1.16.19980907190338.2a070b12@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980908065237.2dbfb0d8@pop3.demon.co.uk>

At 23:41 07/09/98 -0400, John Cowan wrote:
>
>The Artistic License is a good substitute for the GPL and allows you
>to keep control of the named product while allowing others to create
>differently named variations.  See
http://language.perl.com/misc/Artistic.html 

I've had a similar [private] suggestion for the AL. I saw it a few years
ago and liked it but hadn't seen it since. 

Since I believe this is a key issue for XML it could be very useful to have
somewhere that people thinking of releasing XML code could go for help. Is
this an OASIS area? i.e. to provide some guidance or material of the sort
'this is what people have already done...'.

IOW my motivation is something like:
	' I want a simple license that allows anyone to use my source code but:
		to respect authors' moral rights
		not to include legal liability
		not to imply author support or responsibility for distributed versions
(i.e. if someone else distributes my code, *I* don't suffer if it something
goes wrong).

I'd probably be quite happy to use and amend James' Clark's license but
there might be small things I want to add. To be able to pick a license off
the shelf would be very useful and also leads to a greater degree of
community support and feeling.

	P.


	P.
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  8 08:01:32 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:26 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
Message-ID: <000c01bddaec$a82c3b50$2ee044c6@arcot-main>

Tyler,

>For those people using the DOM SDK now and who enjoy the product, I would
seriously
>encourage these people to plea for Docuverse to charge something for a
commercial
>license, even if it is as low as $99 so that they can have some solace in
the fact
>that there will be future quality versions of the DOM SDK.  99$ is
basically the
>same cost as 3 development hours for the average engineer.  If 99$ is too
much money
>to spend on any commercial product, then your whole business plan for your
product
>needs some serious reevaluation.  Small ISV's like Docuverse should not
feel
>pressured to capitulate to the large ISV's like IBM or Microsoft who can
afford to
>give all their tools away for free in their efforts to squelch the up and
coming.


Thanks for the thought but even if all of the hundred or so DOM SDK users I
am aware of sent me a check for $99, it will not even begin to cover the
cost of developing and maintaining the DOM SDK. It is also unfair to tax
early developers with financial burden.  They are mostly individual
developers with pioneering spirits working in explorative projects which are
not usually funded well.  I think it makes more sense to 'invest' in
encouraging these pioneers so that XML-based technologies will be widely
accepted in corporations around the world.  At this time, far less than 1%
of data in corporations are in XML format.  When the figure exceeds 10%, we
will begin to see the fruits of our combined efforts.

Best,

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Tue Sep  8 09:03:40 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <000c01bddaec$a82c3b50$2ee044c6@arcot-main>
Message-ID: <35F4D71E.C88CDDD0@infinet.com>

Don Park wrote:

> Tyler,
>
> >For those people using the DOM SDK now and who enjoy the product, I would
> seriously
> >encourage these people to plea for Docuverse to charge something for a
> commercial
> >license, even if it is as low as $99 so that they can have some solace in
> the fact
> >that there will be future quality versions of the DOM SDK.  99$ is
> basically the
> >same cost as 3 development hours for the average engineer.  If 99$ is too
> much money
> >to spend on any commercial product, then your whole business plan for your
> product
> >needs some serious reevaluation.  Small ISV's like Docuverse should not
> feel
> >pressured to capitulate to the large ISV's like IBM or Microsoft who can
> afford to
> >give all their tools away for free in their efforts to squelch the up and
> coming.
>
> Thanks for the thought but even if all of the hundred or so DOM SDK users I
> am aware of sent me a check for $99, it will not even begin to cover the
> cost of developing and maintaining the DOM SDK. It is also unfair to tax
> early developers with financial burden.  They are mostly individual
> developers with pioneering spirits working in explorative projects which are
> not usually funded well.  I think it makes more sense to 'invest' in
> encouraging these pioneers so that XML-based technologies will be widely
> accepted in corporations around the world.  At this time, far less than 1%
> of data in corporations are in XML format.  When the figure exceeds 10%, we
> will begin to see the fruits of our combined efforts.

I agree totally here, but I think this misses the point.  Large ISV's are rarely
innovators, but adapters.  They let all the small guys like Docuverse do all the
hard work to grow the market and come up with useful implementations and
marketing plans and then clone both the implementations and marketing plans and
use anti-competitive business practices like giving away free software to own the
market that the little guys worked so hard to create.

Yes the early adopters should not be punished for developing with your product,
but they should be charged something if they ever use it in a real application.
Whatever you charge can then be discounted at your discretion based upon things
like how active they were in beta-testing, but only give away your software for
free if you never intend upon charging for it.  Likewise, developers who use
free-tools should accept them as is and not expect any kind of support
whatsoever, nor should they expect any degree of product quality.

Though this may bring about a lot of flames, for-profit organizations should not
be allowed to give out free-software.  What once was a great libertarian idea I
feel now has become a tool of large software monopolies to protect their own turf
and promote an anti-competive software market in general.  Large ISV's make most
of their money selling support for their own tools, not on the actual software
itself.  The crappier the software, the more support they sell.  You would think
most IT organizations would catch on, but to date they have not.  They continue
to buy overpriced databases (and support), overpriced computing systems and spend
billions on fly-by night consultants that would be unnecessary if the software
was robust in the first place.  When a large ISV gives away free software, it is
a simple bait and trap.

You should be able to take a company to court and sue for damages if you can
prove that they are willing to lose money on a software product solely to win
market-share.  These sort of practices are bad for the software industry and
discourage entrepreneurial endeavours in general.

Anyways this is an XML-DEV list so I will try and end this thread here as the
politics of the software industry have little to with promoting and developing
XML in the first place...

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep  8 10:29:10 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
Message-ID: <002701bddb02$85580e40$5255fea9@laforge>

I think it is important to distinguish between enabling software and
product.

The W3C DOM is a technology enabler. A high-speed implementation is a
product. The Docuverse SDK is something in between. Call it an expedient?

I have been wrestling with these concepts for a while myself. I'd like to
see the Coins api widly adopted. So I've tried to make use of common api
(SAX and DOM), rather than provide a propriatary one. But I am driven by a
need to make a profit. I hope to do this by charging a reasonable price for
related development tools.

The bet that I am making is that Coins will become widespread and that I can
make a reasonable profit selling those tools for $99.

Bill la Forge
http://www.jxml.com/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep  8 10:32:07 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:27 2004
Subject: Coins 1 release
Message-ID: <002d01bddb02$e34e02c0$5255fea9@laforge>

Just a quick note to say that version 1 of Coins has been completed and
released:  http://www.jxml.com/coins/index.html

Coins is an XML-based alternative to Java Beans. Coins version 1 is
available for commercial use without charge to all registered
developers--see the download page for details:
http://www.jxml.com/coins/download.html

Coins uses both SAX and the Docuverse SDK 1.0 pr 2.

Bill la Forge


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Tue Sep  8 10:53:27 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <002701bddb02$85580e40$5255fea9@laforge>
Message-ID: <35F4F0D9.EB9B6991@infinet.com>

Bill la Forge wrote:

> I think it is important to distinguish between enabling software and
> product.
>
> The W3C DOM is a technology enabler. A high-speed implementation is a
> product. The Docuverse SDK is something in between. Call it an expedient?
>
> I have been wrestling with these concepts for a while myself. I'd like to
> see the Coins api widly adopted. So I've tried to make use of common api
> (SAX and DOM), rather than provide a propriatary one. But I am driven by a
> need to make a profit. I hope to do this by charging a reasonable price for
> related development tools.
>
> The bet that I am making is that Coins will become widespread and that I can
> make a reasonable profit selling those tools for $99.

I haven't looked at coins too much in the last 5 months myself, but it seems
like a useful tool for building parameter-driven applications that if put behind
a big name software label could easily sell for $999 a seat.  The biggest
failure of a lot of software companies is being indecisive with pricing.  Either
you go for high-volume, low-margins in a general market, or else you go for
low-value, high-margins in a niche market.  Low-level tools like parsers
generally sell in the high-volume market, while high-level software like Coins I
would think is more of a niche application that some organizations would pay
top-dollar for.  On the other end of the spectrum, if you could prove to people
that parameter-driven application development is far superior to traditional
application development, then Coins may become much more pervasive allowing you
to lower-the price of coins and go for volume.

On a sidenote to coins, an individual developer named Jack Harich who hangs out
on the Advanced-Java mailing list (I am sure you are familiar with it) has done
a lot of personal research on stuff that is right up the alley of what I
perceive coins to be.  Maybe you two should correspond.  I remember someone
referring Jack to your work before, so maybe you two have already corresponded.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  8 12:23:52 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <199809080341.XAA12969@locke.ccil.org> <35F4BA73.EA707E0C@infinet.com>
Message-ID: <35F50BAC.DA69F331@finetuning.com>

I think it is very important that we all get used to the idea of giving
everything away for now.  

As a writer and developer that has chosen the path of working harder
more often for less money, this last year of working more for less has
to have been the most fulfilling year of my life.

I would literally not be able to even consider trying affording
something like a $99 license for something like DOM SDK.  That would be
enough of a financial imposition to knock me right out of the game.

Now, of course, being out of the game is not an option.  So that means I
would have to (gasp) use it without a license, pirate it, steal it,
never pay my little reg fee, whatever.  

So be it, if necessary, to be honest. But I much prefer an open, honest,
atmosphere of open source-based generosity:  where everybody does the
best that they can with whatever community-based tools are available --
which in our case are the best available anyway (as far as tools that
"let" you think for yourself go -- don't get me started on the black box
again :-)

So what am I  (admittedly) taking so long to say?  I'm saying:
don't sweat the small shit man!  There are bigger fish to fry in the
long run if we all stick together now and just relish in sharing
knowledge for the sake of itself -- mmmmm, mmmmm, mmmmm, look at all
that free knowledge and understanding -- it has a snowball effect once
it gets going -- it's raising the bar in people's minds of what are
acceptable software practices -- and you know as well as I do that THAT
was a bar that REALLY needed to be raised -- and I for one am willing to
eat a little more top ramen now if that means that we have a superior
foundation for universal data interchange in the future.

And the other thing to consider, for those of you who (understandably)
have maybe already been biting the bullet financially for a lot longer
than I've even known what an abstract data model was, is that when this
stuff takes off, and it will ;-)...there are really only going to be a
couple hundred people (max) that are going to really understand it well
enough to implement it on the grand scale that is going to be required. 
And THAT is then when the capitalistic principles of supply and demand
will be in our favor in a big way -- plus we'll be able to offer more
practically-feasible, intellectually-fulfilling solutions, to the world
and each other, precisely because we did not sacrifice quality and
integrity for a quick buck early on.

By taking this kind of idealistic pride in our work, we are all making a
very serious investment in each others' future that is valuable in the
long run.  We also provide a good example  to the rest of the world that
it does pay to "give it away".

We must foster and cherish these kinds of community-based, open
development environments that may be only written about in the future --
we have been fortunate enough to have been here when much of this was
"just beginning", and now it is our responsibility to do everything in
our power to see that it never ends.

So anyway, I'm just taking a long time to say that this is not time to
get pessimistic about anything being "all for nothing," but rather it's
more important than ever to continue to lead by example, and seize the
day!

If unlimited informational exchange is the free-love of the 90's  
-- I say "love is all you need".

lisa


Tyler Baker wrote:
> 
> John Cowan wrote:
> 
> > Peter Murray-Rust scripsit:
> >
> > > My sincere thanks for this change of policy. It can and will make an
> > > enormous difference to many of us - otherwise we end up rewriting each
> > > other's software. It is a courageous decision - as was IBM's - and should
> > > be applauded.
> >
> > Indeed! Unfortunately, the license is still very restrictive, disallowing
> > modification of the DOM SDK.
> >
> > > I now have no problem with this, and am
> > > looking at the GPL for this purpose [comments would be much appreciated.] I
> > > think the quid-pro-quo would be to ask people to register their use with
> > > you and you may well benefit from this.
> >
> > I would urge Don and you to look at http://www.opensource.org/intro/free, which
> > has information on why loosening restrictions can be beneficial for
> > everyone.
> >
> > The Artistic License is a good substitute for the GPL and allows you
> > to keep control of the named product while allowing others to create
> > differently named variations.  See http://language.perl.com/misc/Artistic.html .
> 
> If you ever plan on making any money off of a product, never give it away for free.
> If you are not planning on ever making money on a product, then it is of the most
> benefit to essentially publish the source-code as is and let anyone do what they
> want with it.  The only reasons I see for creating free software is either idealism,
> enhancing your personal or company reputation in the developer community, or to kill
> your up and coming competitors.
> 
> Not charging for a product initially, and then later on (when the competition has
> subsided) charging for a product is about the same as marking down airline fares
> below cost to kill of competition and then raising them later on to insane levels.
> 
> The worst way to lose face with other developers is to not be clear about your
> long-term plans for a product.  Developers (at least the intelligent ones) will pay
> for a superior product that cuts down the development time of their current project
> so long as the licensing is clear and consistent over time.  Anything less is
> pulling a fast one in my book...
> 
> The DOM SDK license is as restrictive as Docuverse wants to make it.  In the real
> world there is no such thing as a free lunch so you should not expect Docuverse or
> any other small ISV to be the angels of free software.
> 
> Even though I don't plan on using the DOM SDK myself anytime soon, I think it would
> do the developer community more benefit in the long run if Docuverse were to charge
> a fair price for a commercial license so there is incentive in the future for
> Docuverse to do bug-fixes and updates and maybe even provide some level of support.
> 
> If you look at Netscape, they have basically capitulated on improving the web
> browser (no incentive to improve it or add new features) and their future as a
> profitable company is suspect.  They were a company that gave everything away for
> free to kill off browser competition early on and then tried to charge for it when
> the competition died off.  Then of course, Microsoft jumped in and did to Netscape
> what Netscape did to everyone else.  In the end, the customers lose because from
> this point on web browsers will likely have little innovation applied to them from
> this point out.  In other words they will just plain suck.
> 
> For those people using the DOM SDK now and who enjoy the product, I would seriously
> encourage these people to plea for Docuverse to charge something for a commercial
> license, even if it is as low as $99 so that they can have some solace in the fact
> that there will be future quality versions of the DOM SDK.  99$ is basically the
> same cost as 3 development hours for the average engineer.  If 99$ is too much money
> to spend on any commercial product, then your whole business plan for your product
> needs some serious reevaluation.  Small ISV's like Docuverse should not feel
> pressured to capitulate to the large ISV's like IBM or Microsoft who can afford to
> give all their tools away for free in their efforts to squelch the up and coming.
> 
> If you look at the best XML tools to date you will find that they are not from the
> big names that we know of, rather small guys who are dedicated to quality.  If we
> all want quality tools to work with we will all need to put our money where our
> mouth is one way or another.
> 
> My 2 cents...
> 
> Tyler
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  8 12:28:53 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <000c01bddaec$a82c3b50$2ee044c6@arcot-main> <35F4D71E.C88CDDD0@infinet.com>
Message-ID: <35F51037.2753FF72@finetuning.com>

Tyler wrote:

> Anyways this is an XML-DEV list so I will try and end this thread here as the
> politics of the software industry have little to with promoting and developing
> XML in the first place...

exactly.....we can not let the fear-inspired patterns of the past
infiltrate the purity of our vision of the future.

lisa

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From arcdev at mail.matav.hu  Tue Sep  8 13:02:15 1998
From: arcdev at mail.matav.hu (Attila Torcsvari)
Date: Mon Jun  7 17:04:27 2004
Subject: XML tools and big documents (was: Re: Is there a size limitation on XML file given to MSXSL as input?)
Message-ID: <01BDDB28.D9EAF230@p2>

Fellow DOMers,
have anybody tested/compared (consequently) the mem. requirements and speed of different DOM implementations?

Attila Torcsvari
Arcanum Development


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  8 14:01:24 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:27 2004
Subject: ANN: Docuverse HTML SDK 0.1
Message-ID: <001a01bddb1e$eeae3ce0$2ee044c6@arcot-main>

Docuverse HTML SDK 0.1 is available at:

http://www.docuverse.com/htmlsdk/index.html

It is currently very small right now (about 10K ZIP file) but it contains
something I am quite sure all the SAX users will want: a HTML parser with
SAX driver.  Actually, it does not contain a HTML parser, instead the HTML
parser in the latest Swing release (1.1 Beta 2) is used.  Docuverse's own
HTML parser is being written but it is a painful process so this will have
to do for now.

A DOMReader implementation is also included.  Note that with HTML SDK and
DOM SDK together, you can now create DOM out of any HTML files.

Best,

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  8 14:01:25 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
Message-ID: <001901bddb1e$ee01a930$2ee044c6@arcot-main>

Lisa,

I agree with everything you said except for two things:

1. Not all of us can adopt open-source policy.
2. Top Ramen sucks.  You should try Shin Ramen (available in Korean grocery
stores) which is sinfully spicy and cheaper.

Best,

Don Park
Docuverse

PS: I think you just earned yourself one of those LISTRIVIA from Peter <g>.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From h.rzepa at ic.ac.uk  Tue Sep  8 16:42:14 1998
From: h.rzepa at ic.ac.uk (Rzepa, Henry)
Date: Mon Jun  7 17:04:27 2004
Subject: Fwd: Proposal to XML-DEV: Collaborative XML
Message-ID: <v0401171ab21af3579c6f@[155.198.224.86]>

>X-Sender: pazpmr@unix.ccc.nottingham.ac.uk
>Date: Tue, 08 Sep 1998 14:40:11 +0100
>To: h.rzepa@ic.ac.uk
>From: Peter Murray-Rust <Peter.Murray-rust@nottingham.ac.uk>
>Subject: Proposal to XML-DEV: Collaborative XML
>Mime-Version: 1.0
>
>Henry,
>	Please could you forward this to XML-DEV - ta.
>------------------------------------------------------
>
>I am continuing to miss 'real' applications of XML that I can enthuse about
>to people - and I suspect others share this feeling (cf. SimonStL's posting
>on cement shoes). In this message I propose that XML-DEV - a Bazaar as Eric
>Raymond calls it [1] - develop a small rapid communal application to
>demonstrate that XML can do new things. If you are excited by this, read on.
>
>IMO many killer apps will come from using XML client-side. The following
>proposal requires  an almost completely dumb server capable only of
>re-routing XML documents. 
>
>I propose that we develop an XML-based system for games. 
>
>Chess is chosen as somewhere to start. It is not a killer app but the
>methodology is easy to extend. It would be as easy to do it for go. Henry
>and I will use it for molecular whiteboards - discussing molecules over the
>WWW - for example. If you have better ideas than 2-player games, suggest
>them (but be prepared to make some contribution...)
>
>We assume a simple XML representation of the state of a chess game (a
>simple 8x8 table with characters would suffice.) A player makes a (legal)
>alteration to this state and sends it as an XML document to the other
>player, either through the dumb server or possibly through e-mail (I am
>ignorant of whether this is a good idea). The updated state is then
>recreated for the receiving player and so on.
>
>It is straightforward to display this with a per-element browser such as
>JUMBO or (I assume) XXX from Steve Withall. The programmer has to be able to:
>	- display the state
>	- allow moves to be made (mouse, typed text, etc.)
>	- verify their legality (the main point of client-side code)
>	- send the new state to the other player.
>
>In JUMBO there are only about 3 modules that need to be written for such a
>component:
>	- constructor // resets the board
>	- processXML() // having reached endElement() in SAX, process the subtree
>	- getDisplayComponent() // provide a JComponent for embedding in the browser
>The rest of the code is unrelated to XML and might include verifying the
>legality of moves.
>
>This should be fairly easy to do - it should also provide a simple
>demonstrator which can be used anywhere. The palyer would have to:
>
>	- download a browser
>	- download the chess.jar file
>	- know how to play chess.
>
>It would be really fun to do it with more than one browser, showing that
>XML was not browser-dependent. i.e. player 1 could have JUMBO2 and player2
>could have XXX. [Of course until we get a consistent per-element API there
>would be different chess classes for each browser - or each class might
>have to to have multiple hooks.]
>
>There are several reasons why this is a useful thing to do:
>	- could bring in new people
>	- gives an easily understandable demo of XML that does something that HTML
>can't do (yes, I know that you can do *anything* server-side and send the
>results to an HTML form, but that misses the point of XML)
>	- gives us experience in per-element programming
>	- gives us experience of developing collaborative environments using XML.
>Obviously for games with >2 players the server may have to make some
>decisions but this should be a valuable area to explore.
>
>	P.
>
>[1]http://sagan.earthspace.net/~esr/writings/cathedral-bazaar/ - well worth
>reading as are some of the links.
>-----------------------------------------------------
>

Dr Henry Rzepa,  Dept. Chemistry,  Imperial College,  LONDON SW7 2AY;
mailto:rzepa@ic.ac.uk; Tel  (44) 171 594 5774; Fax: (44) 171 594 5804.
URL: http://www.ch.ic.ac.uk/rzepa/ 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gcsfred at magma.ca  Tue Sep  8 16:54:44 1998
From: gcsfred at magma.ca (Gustavo Frederico)
Date: Mon Jun  7 17:04:27 2004
Subject: Fwd: Proposal to XML-DEV: Collaborative XML
Message-ID: <199809081453.KAA25180@mag1.magmacom.com>

I have some other questions:
If you want to build this chess game for 2 players ove the net, is
XML a good way to do it? Is it the best way? Why? What about a java servlet?
  I would like to hear more about that from you list members.

On Tue, 8 Sep 1998 15:44:44 +0100, "Rzepa, Henry" <h.rzepa@ic.ac.uk> wrote:
> >X-Sender: pazpmr@unix.ccc.nottingham.ac.uk
> >Date: Tue, 08 Sep 1998 14:40:11 +0100
> >To: h.rzepa@ic.ac.uk
> >From: Peter Murray-Rust <Peter.Murray-rust@nottingham.ac.uk>
> >Subject: Proposal to XML-DEV: Collaborative XML
> >Mime-Version: 1.0
> >
> >Henry,
> >	Please could you forward this to XML-DEV - ta.
> >------------------------------------------------------
> >
> >I am continuing to miss 'real' applications of XML that I can enthuse about
> >to people - and I suspect others share this feeling (cf. SimonStL's posting
> >on cement shoes). In this message I propose that XML-DEV - a Bazaar as Eric
> >Raymond calls it [1] - develop a small rapid communal application to
> >demonstrate that XML can do new things. If you are excited by this, read on.
> >
> >IMO many killer apps will come from using XML client-side. The following
> >proposal requires  an almost completely dumb server capable only of
> >re-routing XML documents. 
> >
> >I propose that we develop an XML-based system for games. 
> >
> >Chess is chosen as somewhere to start. It is not a killer app but the
> >methodology is easy to extend. It would be as easy to do it for go. Henry
> >and I will use it for molecular whiteboards - discussing molecules over the
> >WWW - for example. If you have better ideas than 2-player games, suggest
> >them (but be prepared to make some contribution...)
> >
[snip]
> >
> >There are several reasons why this is a useful thing to do:
> >	- could bring in new people
> >	- gives an easily understandable demo of XML that does something that 
HTML
> >can't do (yes, I know that you can do *anything* server-side and send the
> >results to an HTML form, but that misses the point of XML)
> >	- gives us experience in per-element programming
> >	- gives us experience of developing collaborative environments using 
XML.
> >Obviously for games with >2 players the server may have to make some
> >decisions but this should be a valuable area to explore.
> >
> 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep  8 17:16:32 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:27 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
Message-ID: <001a01bddb3b$69663020$1a10fea9@laforge>

>And the other thing to consider, for those of you who (understandably)
>have maybe already been biting the bullet financially for a lot longer
>than I've even known what an abstract data model was, is that when this
>stuff takes off, and it will ;-)...there are really only going to be a
>couple hundred people (max) that are going to really understand it well
>enough to implement it on the grand scale that is going to be required.
>And THAT is then when the capitalistic principles of supply and demand
>will be in our favor in a big way -- plus we'll be able to offer more
>practically-feasible, intellectually-fulfilling solutions, to the world
>and each other, precisely because we did not sacrifice quality and
>integrity for a quick buck early on.


Lisa,

Any product in the $99 class has got to be looking at a broad base of
sales--which is what I want to do with some of the Coins add-on's like a
revised mint capability. What seems to make sense to me is to price it at
$99.00 for the final commercial version, but to keep it in "free beta" mode
until the market is ready.

(Not to mislead, JXML is also looking at other development tools that would
be at a much higher price and consequently require a much smaller market.)

Frankly, I'd rather be working on this stuff (only) full time and get my
life back. I think it is important to set up a reasonable business model,
while taking care to develop that market.

There will always be a place for enabling technology. I believe in open
source, too--makes for better software. And coming from The Open Group, I've
seen how commercial constraints on software lead to all kinds of
complications for researchers.

So lets admit that it is complicated, and be sensitive in our policies. But
to realize our dreams, we need the commercial side too. I for one would like
to see a market develop which encouraged independent developers and small
independent companies. There's got to be a better business model than trying
to be bought up by Microsoft.

XML is a simplifier. That makes it both pro-freeware and pro-small business.
We don't need the large products to have something useable. Can we see our
way as a community to a business model that truely serves all our needs?

Bill


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  8 17:36:13 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:27 2004
Subject: Fwd: Proposal to XML-DEV: Collaborative XML
References: <199809081453.KAA25180@mag1.magmacom.com>
Message-ID: <35F5583C.BD9070D4@finetuning.com>

yes an xml-enabled javaservlet is the way to go!  (in general :-)

lisa


Gustavo Frederico wrote:
> 
> I have some other questions:
> If you want to build this chess game for 2 players ove the net, is
> XML a good way to do it? Is it the best way? Why? What about a java servlet?
>   I would like to hear more about that from you list members.
> 
> On Tue, 8 Sep 1998 15:44:44 +0100, "Rzepa, Henry" <h.rzepa@ic.ac.uk> wrote:
> > >X-Sender: pazpmr@unix.ccc.nottingham.ac.uk
> > >Date: Tue, 08 Sep 1998 14:40:11 +0100
> > >To: h.rzepa@ic.ac.uk
> > >From: Peter Murray-Rust <Peter.Murray-rust@nottingham.ac.uk>
> > >Subject: Proposal to XML-DEV: Collaborative XML
> > >Mime-Version: 1.0
> > >
> > >Henry,
> > >     Please could you forward this to XML-DEV - ta.
> > >------------------------------------------------------
> > >
> > >I am continuing to miss 'real' applications of XML that I can enthuse about
> > >to people - and I suspect others share this feeling (cf. SimonStL's posting
> > >on cement shoes). In this message I propose that XML-DEV - a Bazaar as Eric
> > >Raymond calls it [1] - develop a small rapid communal application to
> > >demonstrate that XML can do new things. If you are excited by this, read on.
> > >
> > >IMO many killer apps will come from using XML client-side. The following
> > >proposal requires  an almost completely dumb server capable only of
> > >re-routing XML documents.
> > >
> > >I propose that we develop an XML-based system for games.
> > >
> > >Chess is chosen as somewhere to start. It is not a killer app but the
> > >methodology is easy to extend. It would be as easy to do it for go. Henry
> > >and I will use it for molecular whiteboards - discussing molecules over the
> > >WWW - for example. If you have better ideas than 2-player games, suggest
> > >them (but be prepared to make some contribution...)
> > >
> [snip]
> > >
> > >There are several reasons why this is a useful thing to do:
> > >     - could bring in new people
> > >     - gives an easily understandable demo of XML that does something that
> HTML
> > >can't do (yes, I know that you can do *anything* server-side and send the
> > >results to an HTML form, but that misses the point of XML)
> > >     - gives us experience in per-element programming
> > >     - gives us experience of developing collaborative environments using
> XML.
> > >Obviously for games with >2 players the server may have to make some
> > >decisions but this should be a valuable area to explore.
> > >
> >
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep  8 17:44:54 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:28 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <3.0.1.16.19980907190338.2a070b12@pop3.demon.co.uk> <3.0.1.16.19980908065237.2dbfb0d8@pop3.demon.co.uk>
Message-ID: <35F55071.7204B28C@locke.ccil.org>

Peter Murray-Rust wrote:

> IOW my motivation is something like:
>         ' I want a simple license that allows anyone to use my source code but:
>                 to respect authors' moral rights
>                 not to include legal liability
>                 not to imply author support or responsibility for distributed versions
> (i.e. if someone else distributes my code, *I* don't suffer if it something
> goes wrong).
> 
> I'd probably be quite happy to use and amend James' Clark's license but
> there might be small things I want to add. To be able to pick a license off
> the shelf would be very useful and also leads to a greater degree of
> community support and feeling.

James Clark's license is what is called a "BSD" license, because
the BSD version of Unix is distributed under it.  It's the least
restrictive license: basically, anyone can do anything they want to
with the code, except hike off the author's name or use his name
for advertising purposes.  In addition, the author must be
acknowledged in uses made of the code.  (That's just simple courtesy.)

The Artistic License allows people to hack on the code for their
own use, and distribute modified versions if they give them new
names.  (This guarantees, e.g., that "perl" is always Larry Wall's
version, but anyone may distribute something called "emereld"
that contains modified code.)

The heavyweight license is the GPL, which is designed to keep code
under it "forever free" by preventing anyone from distributing
modified versions except under the GPL.  (It does not affect the
*output* of GPLed programs, nor is it forbidden for a program
to be distributed both under the GPL and otherwise.)

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Tue Sep  8 18:20:28 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:28 2004
Subject: Clever ideas to do toc...
Message-ID: <199809081620.JAA07371@mail-gw.pacbell.net>


  Let's say I'm describing the hiearchy of our web site in an XML document
like
so...

<site-map>
         <page title="Section 1" url="1.html">
                  <page title="Page 1.1" url="1.1.html"/>
               <page title="Page 1.2" url="1.2.html"/>
          </page>
  <page title="Section 2" url="2.html">
                  <page title="Page 2.1" url="2.1.html"/>
               <page title="Page 2.2" url="2.2.html"/>
       </page>
  <page title="Section 3" url="3.html">
                  <page title="Page 3.1" url="3.1.html"/>
       </page>
</site-map>

  In a given section, the user has links to all the other pages in that
section
as well as top-level links to the other sections. In other words, I only want
to "expand" the section that the user is currently in.

  An XML document that's a part of this hiearchy has some tag that says where
it fits in. Maye 1.1.xml will have <page title="Page 1.1"/> or something...
(e.g. <document-root>&entity-that-expands-to-site-map;<page title="Page
2.1"/></document-root>

  In the XSL proposal, I'd have a field day writing a nifty Java Script
function to do exactly this... in the new world, I'm not sure it can be
done at
all.

  The crux of the problem is that I don't know how to make the <xsl:if
test="..."> compare the contents or attribute values of one tag to the content
or attribute values of another tag. It appears as though the only thing you
can
do is with the test attribute is test the position of a tag relative to other
tags and what attributes it has.

  I'd like to do something like this...

1: <xsl:for-each select="site-map/page">
2:    <A HREF="{attribute(url)}">
3:        <xsl:value-of expr="attribute(title)"/>
4:    </A>
5:    <xsl:if test=".[attribute(title)]='value of page title for this
document'">
6:              <xsl:for-each select="page">
.                          <A HREF="{attribute(url)}">
.                           <xsl:value-of expr="attribute(title)"/>
.                        </A>
              </xsl:for-each>
   </xsl:if>
</xsl:for-each>

  Obviously, line 5 needs some work. If anyone has any ideas or suggestions on
how to reach the goal of spitting out something like the table below, I would
be very grateful...

<A HREF="1.html">Section 1</A>
<A HREF=2.html">Section 2</A>
     <A HREF=2.1.html">Page 2.1</A>
       <A HREF=2.2.html">Page 2.2</A>
<A HREF="3.html">Section 3</A>


-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980908/2514a08a/attachment.htm
From Kenneth.J.Meltsner at jci.com  Tue Sep  8 19:00:54 1998
From: Kenneth.J.Meltsner at jci.com (Meltsner, Kenneth J)
Date: Mon Jun  7 17:04:28 2004
Subject: Clever ideas to do toc...
Message-ID: <86256679.005D502D.00@Corpnotes.JCI.Com>


You could swipe a good idea from the relational database folks, and number
the tree nodes sequentially, and then select a sub-tree based on a range of
node numbers.  I've lost the original reference (from DBMS Magazine) but
here's a quick example from a similar problem:

Hierarchical organizational trees:

1.    Company (1,6)
2.        Department A (2,4)
3.            Group A1 (3,3)
4.            Group A2 (4,4)
5.        Department B (5,6)
6.            Group B1 (6,6)

This allows you to represent the whole company, hierarchical info, etc. in
one table.  A node that contains additional nodes can be expanded by
showing the range of nodes listed for that parent node .  It doesn't handle
more complicated relationships, such as multiple parents, though.

As I'm not enough of an XML type to be sure, should the node number and
range info be shoved into an attribute (during authoring) or should it be
generated as the document is parsed?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Tue Sep  8 20:19:54 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:28 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <001901bddb1e$ee01a930$2ee044c6@arcot-main>
Message-ID: <35F575A3.F4B6B762@infinet.com>

Don Park wrote:

> Lisa,
>
> I agree with everything you said except for two things:
>
> 1. Not all of us can adopt open-source policy.
> 2. Top Ramen sucks.  You should try Shin Ramen (available in Korean grocery
> stores) which is sinfully spicy and cheaper.

#2 is very true.  But it is nice to have the cash floating around when you want
to order a pizza.  Either way all of these food products are bad for your health
and if eaten in great concentrations will give you a heart attack by age 32.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep  8 20:30:45 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:28 2004
Subject: Licensing policies (was Re: ANN: Docuverse DOM SDK PR2
References: <001901bddb1e$ee01a930$2ee044c6@arcot-main> <35F575A3.F4B6B762@infinet.com>
Message-ID: <35F5812C.AE8C2833@finetuning.com>

don't worry, i add vegetables

...i don't eat the salt packet ;-)

lisa

Tyler Baker wrote:
> 
> Don Park wrote:
> 
> > Lisa,
> >
> > I agree with everything you said except for two things:
> >
> > 1. Not all of us can adopt open-source policy.
> > 2. Top Ramen sucks.  You should try Shin Ramen (available in Korean grocery
> > stores) which is sinfully spicy and cheaper.
> 
> #2 is very true.  But it is nice to have the cash floating around when you want
> to order a pizza.  Either way all of these food products are bad for your health
> and if eaten in great concentrations will give you a heart attack by age 32.
> 
> Tyler

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Tue Sep  8 21:14:07 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:28 2004
Subject: More SAX Parsers and Applications?
Message-ID: <199809081913.PAA01323@unready.megginson.com>

I've just added Silfide's SXP, Docuverse's HTML SDK, and JXML's Coins
to the SAX 1.0 applications page:

  http://www.megginson.com/SAX/applications.html

I'm certain that I'm missing more -- could everyone who has released
SAX-based software take a glance at this page, and let me know if I
have missed you or if my information is out of date?  (I'm not
including software that has been announced but not released.)

I have a feeling that this particular HTML page will soon become a
victim of SAX's success -- SAX 1.0 support is becoming so common that
it's hardly worth trying to list every piece of Java or Python
software that includes it.


Thanks, and all the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Tue Sep  8 22:07:25 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:28 2004
Subject: More SAX Parsers and Applications?
Message-ID: <003001bddb62$eb8100e0$2ee044c6@arcot-main>

David,

It might be a Good Thing (tm by Tyler) to setup a SAX Service Directory
Server.  This way, any SAX client can find the latest and the greatest SAX
parser over the Net.

Just a thought.

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tgt at lanl.gov  Tue Sep  8 22:20:13 1998
From: tgt at lanl.gov (Thierry Thelliez)
Date: Mon Jun  7 17:04:28 2004
Subject: XML and behavior description
References: <199809081913.PAA01323@unready.megginson.com>
Message-ID: <35F58F1C.B7B49233@lanl.gov>

Hello,

I need to find a solution to exchange complex definitions.

I have the following difficulties:

1- Dynamic behavior

These 'Entities' can have behavior as value of their attributes.
I guess I could do something like
<!ENTITY % script "CDATA"> and use %script where needed
but:
1-a- How can I define the language used ?
something like <script LANGUAGE="JavaScript"> ?

1-b- These functions/methods/procedures should respect formatted
parameters in input and in output. How can we document that ?

1-c- I guess this could be very similar as documenting in XML an
OO code.

2- Composition

The Entities will depend on several others. Is that a 'link' in XML ?


Maybe it will be easier with an example :-) Let's assume that we
write a code to simulate a car engine.

The final XML could be an ENTITY engine including all the parts number.
But these parts are experimental. A simple pipe is described in a very
generic way by it's author. There will be pipe's parameters like
length (assuming that it is a straight pipe), physics description (behavior)
like flow dynamic (simulation code).

Using OO words, an instance Engine will contain instances of the
class Pipe (and other more complex parts !). The difficult part is that
the consortium want to exchange instances of the class Engine as well
as versions of the class Pipe (or any new 'generic' part created).

Do you see XML fitting these requirements ? Already done somewhere ?

Fom my OO eyes, I see that as a problem of exchanging not only instances
but also class (version) definitions. Of course there is a big configuration
management problem but that's not an XML issue (description language).


Thanks
Thierry


--

.....................................................................
. Thierry Thelliez                   Los Alamos National Laboratory .
.   Email: tgt@lanl.gov                                      CIC-15 .
.   Voice: (505) 665 8631                                   MS M310 .
.     Fax: (505) 665 5725                       Los Alamos NM 87545 .
.     URL: http://www.lanl.gov/cgi-bin/phone/113845             USA .
.....................................................................


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980908/dcd3b25b/attachment.htm
From david at megginson.com  Tue Sep  8 22:39:45 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:28 2004
Subject: SAX Service Directory (was Re: More SAX Parsers and Applications?)
In-Reply-To: <003001bddb62$eb8100e0$2ee044c6@arcot-main>
References: <003001bddb62$eb8100e0$2ee044c6@arcot-main>
Message-ID: <199809082038.QAA01605@unready.megginson.com>

Don Park writes:

 > It might be a Good Thing (tm by Tyler) to setup a SAX Service
 > Directory Server.  This way, any SAX client can find the latest and
 > the greatest SAX parser over the Net.

It's a great idea, though it's not one that I have the time to take
on.  Anyone interested in giving it a shot?


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tgt at lanl.gov  Tue Sep  8 23:07:32 1998
From: tgt at lanl.gov (Thierry Thelliez)
Date: Mon Jun  7 17:04:28 2004
Subject: XML and behavior description
Message-ID: <35F59A34.B40E1256@lanl.gov>

Hello,

I need to find a solution to exchange complex definitions.

I have the following difficulties:

1- Dynamic behavior

These 'Entities' can have behavior as value of their attributes.
I guess I could do something like
<!ENTITY % script "CDATA"> and use %script where needed
but:
1-a- How can I define the language used ?
something like <script LANGUAGE="JavaScript"> ?

1-b- These functions/methods/procedures should respect formatted
parameters in input and in output. How can we document that ?

1-c- I guess this could be very similar as documenting in XML an
OO code.

2- Composition

The Entities will depend on several others. Is that a 'link' in XML ?


Maybe it will be easier with an example :-) Let's assume that we
write a code to simulate a car engine.

The final XML could be an ENTITY engine including all the parts number.
But these parts are experimental. A simple pipe is described in a very
generic way by it's author. There will be pipe's parameters like
length (assuming that it is a straight pipe), physics description
(behavior)
like flow dynamic (simulation code).

Using OO words, an instance Engine will contain instances of the
class Pipe (and other more complex parts !). The difficult part is that
the consortium want to exchange instances of the class Engine as well
as versions of the class Pipe (or any new 'generic' part created).

Do you see XML fitting these requirements ? Already done somewhere ?

Fom my OO eyes, I see that as a problem of exchanging not only instances

but also class (version) definitions. Of course there is a big
configuration
management problem but that's not an XML issue (description language).


Thanks
Thierry


--

.....................................................................
. Thierry Thelliez                   Los Alamos National Laboratory .
.   Email: tgt@lanl.gov                                      CIC-15 .
.   Voice: (505) 665 8631                                   MS M310 .
.     Fax: (505) 665 5725                       Los Alamos NM 87545 .
.     URL: http://www.lanl.gov/cgi-bin/phone/113845             USA .
.....................................................................


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980908/453b8398/attachment.htm
From tgt at lanl.gov  Wed Sep  9 00:13:59 1998
From: tgt at lanl.gov (Thierry Thelliez)
Date: Mon Jun  7 17:04:29 2004
Subject: Attributes or not Attributes ?
Message-ID: <35F5A9CE.79F0761A@lanl.gov>

>From the Microsoft site
http://www.microsoft.com/xml/tutorial/author_doc.asp

there is an example:

<books>
   <book isbn="0345374827">
     <title>The Great Shark Hunt</title>
     <author>Hunter S. Thompson</author>
   </book>
 </books>

why isn't it:

<books>
   <book isbn="0345374827" title ="The Great Shark Hunt" author="Hunter
S. Thompson"
   </book>
 </books>

or

<books>
   <book
    <isbn>0345374827</isbn>
     <title>The Great Shark Hunt</title>
     <author>Hunter S. Thompson</author>
   </book>
 </books>

So far I have understood that the 3 notations are legal. But are they
identical ?


Thierry


--

.....................................................................
. Thierry Thelliez                   Los Alamos National Laboratory .
.   Email: tgt@lanl.gov                                      CIC-15 .
.   Voice: (505) 665 8631                                   MS M310 .
.     Fax: (505) 665 5725                       Los Alamos NM 87545 .
.     URL: http://www.lanl.gov/cgi-bin/phone/113845             USA .
.....................................................................


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980908/acfc0479/attachment.htm
From bunner at massquantities.com  Wed Sep  9 00:24:48 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:29 2004
Subject: Clever ideas to do toc...
In-Reply-To: <86256679.005D502D.00@Corpnotes.JCI.Com>
Message-ID: <4.0.1.19980908110118.0116a8c0@postoffice.pacbell.net>


>You could swipe a good idea from the relational database folks, and number
>the tree nodes sequentially, and then select a sub-tree based on a range of
>node numbers.  I've lost the original reference (from DBMS Magazine) but
>here's a quick example from a similar problem:
>
>Hierarchical organizational trees:
>
>1.    Company (1,6)
>2.        Department A (2,4)
>3.            Group A1 (3,3)
>4.            Group A2 (4,4)
>5.        Department B (5,6)
>6.            Group B1 (6,6)
>
>This allows you to represent the whole company, hierarchical info, etc. in
>one table.  A node that contains additional nodes can be expanded by
>showing the range of nodes listed for that parent node .  It doesn't handle
>more complicated relationships, such as multiple parents, though.

  This sounds like a good idea, and the way one might do this in XML is by
using the id attribute and then doing some kind of comparison on it.
Unforunately, I can't really explore this possibility because this is one
of only two features missing from xt (the XSL processor I'm using).
id(name) in patterns is specifically mentioned in the xt release notes as
not working. I don't know of a better XSL processor (that implements the
working draft)... if anyone else does, please share.

  Thanks for your input! If it looks like I'm on the wrong page re:
implementation, don't hesitate to shove me in the correct direction.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Wed Sep  9 00:26:43 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:29 2004
Subject: A utility to make msxsl^H^H^H^H^H xt more useful
In-Reply-To: <01BDDAFC.022F9B10@p2>
Message-ID: <199809082224.PAA03568@mail-gw.pacbell.net>


>Is &lt; not working?! (found in point 2.4 of the XML standard). Hmmm.

  Not working the way one would expect it to. If I want to put &lt; in the
generated file, I can just include it the XML source file as element
content. If I want to put < in the generated file, I'm out of luck.

  It looks like the only way to include Java Script in an XML document and
have it be processed in a way that preserves ',",&,< and > is to write your
own little utility. "Utility" is a friendlier way to say "ugly hack" in
this case.

>I believe that the standards stand on week legs, or at least the
standardization process of XML-related formats is not well-organized.

  My way of saying it is, "There is still much work to be done."

  I believe that msxsl does handle Java Script in an intelligent way.
However, I'd like to take this opportunity to learn the XSL Working Draft
now that I've found a processor that (almost) fully implements it.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From fabien at girardin.org  Wed Sep  9 00:28:54 1998
From: fabien at girardin.org (Fabien Girardin)
Date: Mon Jun  7 17:04:29 2004
Subject: Attributes or not Attributes ?
References: <35F5A9CE.79F0761A@lanl.gov>
Message-ID: <35F5AEA5.8CC699E6@girardin.org>

Thierry,

Have a look at:
http://www.sil.org/sgml/elementAttr9804.html

-- Fabien

Thierry Thelliez wrote:

> <books>
>    <book isbn="0345374827">
>      <title>The Great Shark Hunt</title>
>      <author>Hunter S. Thompson</author>
>    </book>
>  </books>
>
> why isn't it:
>
> <books>
>    <book isbn="0345374827" title ="The Great Shark Hunt"
> author="Hunter S. Thompson"
>    </book>
>  </books>
>
> or
>
> <books>
>    <book
>     <isbn>0345374827</isbn>
>      <title>The Great Shark Hunt</title>
>      <author>Hunter S. Thompson</author>
>    </book>
>  </books>
>
> So far I have understood that the 3 notations are legal. But are they
> identical ?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Wed Sep  9 00:37:06 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:29 2004
Subject: Attributes or not Attributes ?
In-Reply-To: <35F5A9CE.79F0761A@lanl.gov>
Message-ID: <199809082236.PAA08065@mail-gw.pacbell.net>

At 04:03 PM 9/8/98 -0600, you wrote: 
>
> >From the Microsoft site 
>
> <http://www.microsoft.com/xml/tutorial/author_doc.asp>http://www.microsoft
> .com/xml/tutorial/author_doc.asp 
>
> there is an example: 
>
> <books> 
>    <book isbn="0345374827"> 
>      <title>The Great Shark Hunt</title> 
>      <author>Hunter S. Thompson</author> 
>    </book> 
>  </books> 
>
> why isn't it: 
>
> <books> 
>    <book isbn="0345374827" title ="The Great Shark Hunt" author="Hunter S.
> Thompson" 
>    </book> 
>  </books> 
>
> or 
>
> <books> 
>    <book 
>     <isbn>0345374827</isbn> 
>      <title>The Great Shark Hunt</title> 
>      <author>Hunter S. Thompson</author> 
>    </book> 
>  </books> 
>
> So far I have understood that the 3 notations are legal. But are they
> identical ? 


  They are definitely not identical, but they are all equally valid. The only
difference is in how you access the information in your XML application. If
you
were using xt (a freeware XML processor), you could say attribute(isbn) or you
could say {isbn} to get at that value.

  I tend to use attributes in cases where the attribute value is likely to be
short and all tags of with the same name are likely to have that attribute
associated with them. Does anyone else have another (better?) rule of thumb?

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980908/2e8b791b/attachment.htm
From tyler at infinet.com  Wed Sep  9 02:48:49 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:29 2004
Subject: More SAX Parsers and Applications?
References: <003001bddb62$eb8100e0$2ee044c6@arcot-main>
Message-ID: <35F5D0CE.883A6273@infinet.com>

Don Park wrote:

> David,
>
> It might be a Good Thing (tm by Tyler) to setup a SAX Service Directory

Hehe...  I suppose all you would really need here is to have a web server that
indexes the JAR files (or other binary container format for other languages) of
every parser.


> Server.  This way, any SAX client can find the latest and the greatest SAX
> parser over the Net.
>

Another idea that has come to mind is moving the SAX directory structure from:

org.xml.sax

to:

org.xml.sax.parser

Things like standard output APIs may have some claim to being put in a standard
namespace like org.xml.sax  For the sake of flexibility, perhaps moving SAX to a
sub-package may be best in the long run.  SAX stands for Simple API for XML.
Right now this seems to only apply for parsers, but I think the next step may be
for formatters to be standardized as well.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Wed Sep  9 03:21:23 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:29 2004
Subject: [ANN] Kludgey workarounds for xt
Message-ID: <199809090121.SAA15093@mail-gw.pacbell.net>


  Some of you may have already found that you can't include <,>,&,' or " in
the element content of an XSL or XML document and expect it to make it to a
generated HTML file without getting escaped to &lt;, etc.

  Since I want my generated HTML files to have Java Script, I needed a way
around this.

  I made a little Perl utility that makes it so that if you put...

<SCRIPT><![CDATA[
	function test (x) {
		return (x < 1);
	}
]]></SCRIPT>

  ...in an XML or XSL file, you can post-process the generated HTML file to
get back...

<SCRIPT>
	function test (x) {
		return (x < 1);
	}
</SCRIPT>

  For many of us, this is more useful than having "x < 1" get transformed
to "x &lt; 1".

  Those that are interested can go to
http://www.massquantities.com/xml-kludges/ to download the scripts and get
instructions on how to use them.

  If anyone knows of a better (read: not-so-hacked-up) way to do this, I'd
really like to hear about it.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Wed Sep  9 07:24:43 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:29 2004
Subject: [ANN] Kludgey workarounds for xt
In-Reply-To: <199809090121.SAA15093@mail-gw.pacbell.net> (message from Andrew
	Bunner on Tue, 08 Sep 1998 18:27:03 -0700)
Message-ID: <199809090522.BAA11695@ruby.ora.com>

[Andrew Bunner]
>   Some of you may have already found that you can't include <,>,&,'
> or " in the element content of an XSL or XML document and expect it
> to make it to a generated HTML file without getting escaped to &lt;,
> etc.

The transformation part of XSL is intended to produce well-formed XML.

>   If anyone knows of a better (read: not-so-hacked-up) way to do
> this, I'd really like to hear about it.

There won't be one.  In XML, &quot; and " and equivalent.  This is
also true in HTML; if your browser doesn't accept x < 5 and x &lt; 5
as equivalent, then the browser is broken.  I appreciate that this is
not your fault and I sympathize, but if XSL attempts to include a
workaround for every existing HTML browser implementation, it will do
no one any good.  Please stop referring to this as an XSL hack-up
instead of a broken-browser workaround, and suggesting that XSL has
grossly overlooked something because of this problem.  Support for
pre-XML HTML was explicitly considered and rejected by the Working
Group.

-Chris, not speaking for the WG in any way
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Wed Sep  9 08:14:03 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:29 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <199809090522.BAA11695@ruby.ora.com>
References: <199809090121.SAA15093@mail-gw.pacbell.net>
Message-ID: <199809090613.XAA28064@mail-gw6.pacbell.net>


>>   Some of you may have already found that you can't include <,>,&,'
>> or ...

>The transformation part of XSL is intended to produce well-formed XML.

  If that was the design then it's working exactly as planned. The
difference is, I'm approaching this from the stand point of how can I use
XSL now to simplify web design and you've probably got a longer term view.

  By your (perfectly valid) definition, the major 4th generation browsers
are broken. I think we can both agree that they'll be in use for quite some
time, though. My goal is to produce files that are broken-browser-readable :)

>Please stop referring to this as an XSL hack-up
>instead of a broken-browser workaround, and suggesting that XSL has
>grossly overlooked something because of this problem.

  I apologize if I sounded insulting--I was probably frustrated when I
wrote my last message. I meant to leave open the possibility that there
exists a good reason for this design decision.

  I question the relative importance of insisting that generated documents
be well-formed. Perhaps I don't have a full understanding of what good
things come from this.


-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ahaeckel at io-software.com  Wed Sep  9 09:45:49 1998
From: ahaeckel at io-software.com (Arne Haeckel)
Date: Mon Jun  7 17:04:29 2004
Subject: Fwd: Proposal to XML-DEV: Collaborative XML
In-Reply-To: <35F5583C.BD9070D4@finetuning.com>
Message-ID: <199809090750.JAA25178@miles.io-software.com>

Lisa Rein (Lisa Rein <lisarein@finetuning.com>) wrote:

> yes an xml-enabled javaservlet is the way to go!  (in general :-)
> 
> lisa
Why server centric? Peer to Peer would do for a 2 players game. 
And a peer to peer XML based communicateion would show every 
one, that XML could be light weight and does not necessarily need 
expensive servers ;-)

My design would be: Java applet for a nice User Interface and input 
validation, encoding of the new state in XML, transfering this XML 
document to the other player, decoding the state and presenting 
this again to the user with java applet. (There is still the question of 
wire protocol: http, RMI, CORBA, socket, ...)

Arne
-----< iO >--------------------------------------------------------------
Interactive Objects Software GmbH 
mailto:Arne.Haeckel@io-software.com
http://www.io-software.com
Basler Strasse. 63, D-79100 Freiburg, Germany
Tel: [+49]-761-40073-0, Fax: [+49]-761-40073-73

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep  9 09:48:07 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:29 2004
Subject: More SAX Parsers and Applications?
Message-ID: <00d001bddbc6$28bfade0$e96118cb@caleb>

-----Original Message-----
From: Don Park <donpark@quake.net>
>It might be a Good Thing (tm by Tyler) to setup a SAX Service Directory
>Server.  This way, any SAX client can find the latest and the greatest SAX
>parser over the Net.

This sort of thing is certainly the kind of thing I've been planning for
xmlsoftware.com and Lars might have been thinking of it too for his site.

James

--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep  9 09:48:06 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:29 2004
Subject: SAX Service Directory (was Re: More SAX Parsers and Applications?)
Message-ID: <00d101bddbc6$29c31420$e96118cb@caleb>

-----Original Message-----
From: David Megginson <david@megginson.com>


>Don Park writes:
>
> > It might be a Good Thing (tm by Tyler) to setup a SAX Service
> > Directory Server.  This way, any SAX client can find the latest and
> > the greatest SAX parser over the Net.
>
>It's a great idea, though it's not one that I have the time to take
>on.  Anyone interested in giving it a shot?

Give me until the end of the week and there'll be something at:

    http://www.xmlsoftware.com/sax/

(note that there's nothing there at the time of writing this)

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/ 
Lecturer and Associate Researcher       
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/ 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ahaeckel at io-software.com  Wed Sep  9 11:03:24 1998
From: ahaeckel at io-software.com (Arne Haeckel)
Date: Mon Jun  7 17:04:29 2004
Subject: (Fwd) Re: Proposal to XML-DEV: Collaborative XML
Message-ID: <199809090907.LAA25780@miles.io-software.com>


------- Forwarded Message Follows -------
Date sent:      	Wed, 09 Sep 1998 09:56:43 +0100
To:             	"Arne Haeckel" <ahaeckel@io-software.com>
From:           	Peter Murray-Rust <Peter.Murray-rust@nottingham.ac.uk>
Subject:        	Re: Proposal to XML-DEV: Collaborative XML

Thanks very much Arne - you have picked up exactly what I was getting at.
[Since I can't post to XML-DEV from where I am could you please forward
this reply?]

At 09:53 AM 9/9/98 +0200, Arne Haeckel wrote:
ARNE>Why server centric? Peer to Peer would do for a 2 players game. 
>And a peer to peer XML based communicateion would show every 
>one, that XML could be light weight and does not necessarily need 
>expensive servers ;-)

This is the whole point. If we have a generic Peer to Peer system then the
work to be done on the server is minimal and one-off. [The example of chess
was simply to something that most people can relate to, but the more
exciting areas are domain-specific collaborative working.]
>
ARNE>My design would be: Java applet for a nice User Interface and input
validation, 

Agreed.

ARNE>encoding of the new state in XML, 

agreed.

ARNE>transfering this XML document to the other player,

agreed.

ARNE> decoding the state and presenting 
>this again to the user with java applet. (There is still the question of 
>wire protocol: http, RMI, CORBA, socket, ...)

This is my main point - I'm not experienced enough to know what the best
approach is. I am only looking for simple ones for proof of concept. I
expect http would do fine for a simple demo. What would be required?

[Personally I'd prefer to use Java applications simply because I haven't
yet got JUMBO2/Swing running in a browser in less than exponential time.] I
imagine they could be browser helper applications?

	P.


-----< iO >--------------------------------------------------------------
Interactive Objects Software GmbH 
mailto:Arne.Haeckel@io-software.com
http://www.io-software.com
Basler Strasse. 63, D-79100 Freiburg, Germany
Tel: [+49]-761-40073-0, Fax: [+49]-761-40073-73

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep  9 12:18:06 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:30 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
In-Reply-To: <199809090522.BAA11695@ruby.ora.com>
References: <199809090121.SAA15093@mail-gw.pacbell.net>
	<199809090522.BAA11695@ruby.ora.com>
Message-ID: <199809091017.GAA00201@unready.megginson.com>

Chris Maden writes:

 > Support for pre-XML HTML was explicitly considered and rejected by
 > the Working Group.

Absolutely correct.

Since HTML <= 4.0 is *not* XML, it is best to treat it as an output
format, like PDF, TeX, RDF, Postscript, etc. -- in other words, first
produce your XML, then run it through a filter (such as a SAX-based
app) that does a down-translation to HTML syntax.  If the XML document
contains the same element types as the HTML, the translation will be
very simple.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Wed Sep  9 15:28:19 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:30 2004
Subject: Shocking News: Namespaces and Non-Validation
Message-ID: <199809091312.IAA06775@foyt.indyrad.iupui.edu>


I was shocked to hear that namespaces invalidate validation.

The problem seems to be that DTD validation does not expand 
prefixes, nor does it apply namespace defaulting.

Can you all set me straight?


(Apologies in advance to two knowledgable people who gave me advice on
this subject in private.  They unfortunately disagreed with each
other, and now I am at a loss.

I hope you'll tell me that namespaces (esp. namespace defaulting) can
live peacefully with DTD validation.
)

The problem arises if a document uses <DATE>, with two different
content models.  Suppose that in the "alpha" namespace, DATE contains
DAY and MONTH, while in the "beta" namespace, DATE has an attribute v.
Without namespaces, <DATE> would be ambiguous. It would need to
satisfy two different content models.

====================: Validation works with consistent PREFIXES

With prefixes, you could say (with appropriate definitions of the ALPHA
and BETA prefixes)

<TOP>
<SITE1>
    <ALPHA:DATE>	-- This 
	<DAY>10</DAY>
	<MONTH>Sept</MONTH>
    </ALPHA:DATE>
</SITE1>
<ELT2>
  <BETA:DATE v="tuesday"/>    
</ELT2>
</TOP>

The above would be valid, if only because a DTD processor could just
ignore the namespace, and treat the element name's as ALPHA:DATE and
BETA:DATE.

================: Validation fails with locally chosen prefixes
Now, suppose the DTD defines

	xmlns:KAPPA="uri:alpha"
<!ELEMENT KAPPA:DATE (KAPPA:DAY KAPPA:MONTH) >
...

and the document that uses the "uri:alpha" dtd uses the prefix ALPHA

In this case the document would mention

	<ALPHA:DATE>

MY QUESTION: Would a DTD processor figure out that KAPPA:DATE
and ALPHA:DATE are the same element, (since the expansions of KAPPA
and ALPHA are the same?

================:  Validation dies when namespace defaults are used

And finally, DTD's seem to die completely if a document uses
namespace defaulting.  The DTD validator will not even attempt
to think that the first <DATE> refers to "uri:alpha"+DATE.


But with namespace defaulting
<TOP>
<SITE1 xmlns="uri:alpha">
    <DATE>	-- This is just DATE
	<DAY>10</DAY>
	<MONTH>Sept</MONTH>
    </DATE>
</SITE1>
<ELT2 xmlns="uri:beta">
  <DATE v="tuesday"/>     -- This is also just DATE
</ELT2>
</TOP>

a DTD processor would not figure out that 
	<DATE v="tuesday"/>    should be from the "beta" DTD,
and 
    <DATE>	-- This 
	<DAY>10</DAY>
	<MONTH>Sept</MONTH>
    </DATE>
should be checked against the "alpha" DTD.


MY QUESTION: Is there any hope that namespaces and DTD's
can get along?

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Wed Sep  9 15:56:27 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:30 2004
Subject: Shocking News: Namespaces and Non-Validation
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036070@WROX3>

Here's how I see it (but I'm no expert).
Namespaces and DTDs do get along, but only if the declarations of
element types in the DTD contain qualified names (prefix:elementname)
that match the names of namespaced elements in the doc entity.
e.g. (and I'm not sure of the syntax here...)

<!ELEMENT     qual:name (content | model | #PCDATA) >

<qual:name> blah blah </qual:name>

But if you use two versions of qual:name in the doc then, even if you
define the namespace prefix to refer to a different URI in each case,
the process of validation will force both versions of qual:name to
conform to the same content model.
So basically, avoid duplicate prefixes for validation.

[For consideration of XML-DEV
I don't see why the URI for the namespace prefix could not refer to the
address of the DTD for the element concerned, for validation.]


> -----Original Message-----
> From:	Mark Tucker [SMTP:mct@foyt.indyrad.iupui.edu]
> Sent:	Wednesday, September 09, 1998 2:13 PM
> To:	xml-dev@ic.ac.uk
> Subject:	Shocking News: Namespaces and Non-Validation
> 
> 
> I was shocked to hear that namespaces invalidate validation.
> 
> The problem seems to be that DTD validation does not expand 
> prefixes, nor does it apply namespace defaulting.
> 
> Can you all set me straight?
> 
> 
> 
> (Apologies in advance to two knowledgable people who gave me advice on
> this subject in private.  They unfortunately disagreed with each
> other, and now I am at a loss.
> 
> I hope you'll tell me that namespaces (esp. namespace defaulting) can
> live peacefully with DTD validation.
> )
> 
> The problem arises if a document uses <DATE>, with two different
> content models.  Suppose that in the "alpha" namespace, DATE contains
> DAY and MONTH, while in the "beta" namespace, DATE has an attribute v.
> Without namespaces, <DATE> would be ambiguous. It would need to
> satisfy two different content models.
> 
> ====================: Validation works with consistent PREFIXES
> 
> With prefixes, you could say (with appropriate definitions of the
> ALPHA
> and BETA prefixes)
> 
> <TOP>
> <SITE1>
>     <ALPHA:DATE>	-- This 
> 	<DAY>10</DAY>
> 	<MONTH>Sept</MONTH>
>     </ALPHA:DATE>
> </SITE1>
> <ELT2>
>   <BETA:DATE v="tuesday"/>    
> </ELT2>
> </TOP>
> 
> The above would be valid, if only because a DTD processor could just
> ignore the namespace, and treat the element name's as ALPHA:DATE and
> BETA:DATE.
> 
> ================: Validation fails with locally chosen prefixes
> Now, suppose the DTD defines
> 
> 	xmlns:KAPPA="uri:alpha"
> <!ELEMENT KAPPA:DATE (KAPPA:DAY KAPPA:MONTH) >
> ...
> 
> and the document that uses the "uri:alpha" dtd uses the prefix ALPHA
> 
> In this case the document would mention
> 
> 	<ALPHA:DATE>
> 
> MY QUESTION: Would a DTD processor figure out that KAPPA:DATE
> and ALPHA:DATE are the same element, (since the expansions of KAPPA
> and ALPHA are the same?
> 
> ================:  Validation dies when namespace defaults are used
> 
> And finally, DTD's seem to die completely if a document uses
> namespace defaulting.  The DTD validator will not even attempt
> to think that the first <DATE> refers to "uri:alpha"+DATE.
> 
> 
> But with namespace defaulting
> <TOP>
> <SITE1 xmlns="uri:alpha">
>     <DATE>	-- This is just DATE
> 	<DAY>10</DAY>
> 	<MONTH>Sept</MONTH>
>     </DATE>
> </SITE1>
> <ELT2 xmlns="uri:beta">
>   <DATE v="tuesday"/>     -- This is also just DATE
> </ELT2>
> </TOP>
> 
> a DTD processor would not figure out that 
> 	<DATE v="tuesday"/>    should be from the "beta" DTD,
> and 
>     <DATE>	-- This 
> 	<DAY>10</DAY>
> 	<MONTH>Sept</MONTH>
>     </DATE>
> should be checked against the "alpha" DTD.
> 
> 
> MY QUESTION: Is there any hope that namespaces and DTD's
> can get along?
> 
> xml-dev: A list for W3C XML Developers. To post,
> mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
> message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Wed Sep  9 16:17:30 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:04:30 2004
Subject: Shocking News: Namespaces and Non-Validation
In-Reply-To: <199809091312.IAA06775@foyt.indyrad.iupui.edu>
Message-ID: <3.0.3.32.19980909101703.02e11b30@pop.mindspring.com>

At 08:12 AM 9/9/98 -0500, Mark Tucker wrote:
>
>I was shocked to hear that namespaces invalidate validation.

Well, not quite. As long as you specify the prefix in both the DTD and the
document, you can still validate, since the prefix is basically treated as
part of the element name or attribute name.

>The problem seems to be that DTD validation does not expand 
>prefixes, nor does it apply namespace defaulting.

I think we're hoping for the schema group to fix this for us. The DCD
proposal, for instance, provides namespace support. For now, though, I
think we're stuck with specifying the prefix in both the DTD and the document.

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eddie.sheffield at enterworks.com  Wed Sep  9 16:44:59 1998
From: eddie.sheffield at enterworks.com (Eddie Sheffield)
Date: Mon Jun  7 17:04:30 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
References: <199809090121.SAA15093@mail-gw.pacbell.net>
		<199809090522.BAA11695@ruby.ora.com> <199809091017.GAA00201@unready.megginson.com>
Message-ID: <35F693BC.7089E4AA@enterworks.com>

But it seems that the problem isn't the HTML, but rather with SCRIPTS that might
be included in the HTML. I believe that HTML defines the <SCRIPT
LANGUAGE="whatever">...</SCRIPT> tags, but NOT the actual script that lies within
the tags. This is where the problem is. That script might be one of many
languages (javascript, jscript, vbscript, ecmascript, etc.) and knowing exactly
how to properly post-process the fine would be VERY non-trivial, especially if
the script itself has to generate HTML on the fly. For example:

What I want:

document.write("She said &quot;Run away!&quot;");

but the generated code is:

document.write(&quot;She said &quot;Run away!&quot;&quot;);

Obviously a post-processor can't simply replace EVERY &quot; in the line, or the
script becomes invalid. But how do you know which to replace and which not? I
suppose you could parse the script and try replacing the ones that are necessary
for the script to be valid, but then you would need separate processors/parsers
for each type of script language that might be in the script.

As much as possible, a workaround would be to use external scripts that are never
processed at all, but are pointed to with the optional SRC attribute on the
SCRIPT tag. This only works for scripts that don't have to be dynamically
generated, though.

It does seem odd that with the advent of the DOM which really eases scripting and
makes it much more powerful that almost simultaneously problems occur that make
generating those scripts more difficult.

Eddie


David Megginson wrote:

> Chris Maden writes:
>
>  > Support for pre-XML HTML was explicitly considered and rejected by
>  > the Working Group.
>
> Absolutely correct.
>
> Since HTML <= 4.0 is *not* XML, it is best to treat it as an output
> format, like PDF, TeX, RDF, Postscript, etc. -- in other words, first
> produce your XML, then run it through a filter (such as a SAX-based
> app) that does a down-translation to HTML syntax.  If the XML document
> contains the same element types as the HTML, the translation will be
> very simple.
>
> All the best,
>
> David
>
> --
> David Megginson                 david@megginson.com
>            http://www.megginson.com/
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jamsden at us.ibm.com  Wed Sep  9 17:28:49 1998
From: jamsden at us.ibm.com (Jim Amsden)
Date: Mon Jun  7 17:04:30 2004
Subject: Shocking News: Namespaces and Non-Validation
Message-ID: <5040100022254857000002L072*@MHS>

The problem is that the namespace spec does not define the semantics of a name.

<D:multistatus xmlns:D="DAV:">

specifies that prefix D corresponds to namespace DAV:, but it does not say what
this means. Applications are free to treat the prefix and the namespace name in
any way they desire. So the above is NOT equivalent to

<DA:multistatus xmlns:DA="AV:">

in any way. This isn't too surprising as DAV: and AV: are two different URIs
possibly representing two different namespaces But its not so obvious that this
one isn't equivalent either:

<DA:multistatus xmlns:DA="DAV:">

In this case, the local name for the elements are the same, as are their
namespaces. But the namespace spec says these are different elements having
different tag names. I think this is putting misplaced implied semantics on tag
name prefixes. The semantics should be on the local name and the namespace name
with the prefix being a notional convenience used to indicate the namespace
name. Then the prefix used in the DTD is independent of the ones used in any
document that is validated by it. This allows users to use whatever prefix they
need in order to avoid name collisions. If we're forced to use the same prefix
as defined in the DTD in order to validate documents, the namespace spec hasn't
changed anything. Names collisions will now happen because we can't use the
same prefix for more than one namespace.

Note that I'm not implying that tag names are the concatenation of the
namespace name and the local part of the tag name as specified by WebDAV
semantics, only that two elements with the same namespace name and local name
are treated as the same element type.


owner-xml-dev@ic.ac.uk on 09/09/98 09:27:29 AM
Please respond to mct@foyt.indyrad.iupui.edu
To: xml-dev@ic.ac.uk
cc:
Subject: Shocking News: Namespaces and Non-Validation


I was shocked to hear that namespaces invalidate validation.

The problem seems to be that DTD validation does not expand
prefixes, nor does it apply namespace defaulting.

Can you all set me straight?


(Apologies in advance to two knowledgable people who gave me advice on
this subject in private.  They unfortunately disagreed with each
other, and now I am at a loss.

I hope you'll tell me that namespaces (esp. namespace defaulting) can
live peacefully with DTD validation.
)

The problem arises if a document uses <DATE>, with two different
content models.  Suppose that in the "alpha" namespace, DATE contains
DAY and MONTH, while in the "beta" namespace, DATE has an attribute v.
Without namespaces, <DATE> would be ambiguous. It would need to
satisfy two different content models.

====================: Validation works with consistent PREFIXES

With prefixes, you could say (with appropriate definitions of the ALPHA
and BETA prefixes)

<TOP>
<SITE1>
    <ALPHA:DATE> -- This
 <DAY>10</DAY>
 <MONTH>Sept</MONTH>
    </ALPHA:DATE>
</SITE1>
<ELT2>
  <BETA:DATE v="tuesday"/>
</ELT2>
</TOP>

The above would be valid, if only because a DTD processor could just
ignore the namespace, and treat the element name's as ALPHA:DATE and
BETA:DATE.

================: Validation fails with locally chosen prefixes
Now, suppose the DTD defines

 xmlns:KAPPA="uri:alpha"
<!ELEMENT KAPPA:DATE (KAPPA:DAY KAPPA:MONTH) >
...

and the document that uses the "uri:alpha" dtd uses the prefix ALPHA

In this case the document would mention

 <ALPHA:DATE>

MY QUESTION: Would a DTD processor figure out that KAPPA:DATE
and ALPHA:DATE are the same element, (since the expansions of KAPPA
and ALPHA are the same?

================:  Validation dies when namespace defaults are used

And finally, DTD's seem to die completely if a document uses
namespace defaulting.  The DTD validator will not even attempt
to think that the first <DATE> refers to "uri:alpha"+DATE.


But with namespace defaulting
<TOP>
<SITE1 xmlns="uri:alpha">
    <DATE> -- This is just DATE
 <DAY>10</DAY>
 <MONTH>Sept</MONTH>
    </DATE>
</SITE1>
<ELT2 xmlns="uri:beta">
  <DATE v="tuesday"/>     -- This is also just DATE
</ELT2>
</TOP>

a DTD processor would not figure out that
 <DATE v="tuesday"/>    should be from the "beta" DTD,
and
    <DATE> -- This
 <DAY>10</DAY>
 <MONTH>Sept</MONTH>
    </DATE>
should be checked against the "alpha" DTD.


MY QUESTION: Is there any hope that namespaces and DTD's
can get along?

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tgt at lanl.gov  Wed Sep  9 18:38:07 1998
From: tgt at lanl.gov (Thierry Thelliez)
Date: Mon Jun  7 17:04:30 2004
Subject: XML graphical viewer
Message-ID: <35F6AC90.31270562@lanl.gov>

What is the best XML viewer available today ?

I am looking for a graphical view of the XML.


Thanks
Thierry

--

.....................................................................
. Thierry Thelliez                   Los Alamos National Laboratory .
.   Email: tgt@lanl.gov                                      CIC-15 .
.   Voice: (505) 665 8631                                   MS M310 .
.     Fax: (505) 665 5725                       Los Alamos NM 87545 .
.     URL: http://www.lanl.gov/cgi-bin/phone/113845             USA .
.....................................................................


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980909/0e21410b/attachment.htm
From cowan at locke.ccil.org  Wed Sep  9 18:51:32 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:30 2004
Subject: External subset/external PE equivalence?
Message-ID: <35F6B194.1DF3DEB@locke.ccil.org>

Are these guaranteed to be the same?

<!DOCTYPE foo SYSTEM "foo.dtd">

and

<!DOCTYPE foo [
	<!ENTITY % hitherto-unheard-of-param-entity SYSTEM "foo.dtd">
	%hitherto-unheard-of-param-entity;
	]>

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Wed Sep  9 20:04:23 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:30 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
In-Reply-To: <35F693BC.7089E4AA@enterworks.com>
References: <199809090121.SAA15093@mail-gw.pacbell.net>
 <199809090522.BAA11695@ruby.ora.com>
 <199809091017.GAA00201@unready.megginson.com>
Message-ID: <199809091804.LAA05055@mail-gw6.pacbell.net>


>It does seem odd that with the advent of the DOM which really eases
scripting and
>makes it much more powerful that almost simultaneously problems occur that
make
>generating those scripts more difficult.

  Odd is on way to describe it, I would say "frustrating".

>> Since HTML <= 4.0 is *not* XML, it is best to treat it as an output
>> format, like PDF, TeX, RDF, Postscript, etc. -- in other words, first
>> produce your XML, then run it through a filter (such as a SAX-based
>> app) that does a down-translation to HTML syntax.

  This doesn't have to be a very complicated animal. If I wanted to, I
could generate something that is ~almost~ HTML by doing things like <BR/>
and <IMG .../>. From there, a one line regular expression would make my
pages be readable by the major browsers. (Interesting aside, IE will
display the right thing if you close a stand-along tag with />, but
Netscape will not)

  It sounds like the smart thing to do is write an XML to HTML converter as
you suggest.

  The scripting workaround isn't as hard as it seems. You're example of...

document.write("She said &quot;run&quot;")

  ...would actually get turned into...

document.write(&quot;She said &amp;quot;run&amp;quot;&quot;)

  So, if we just go through and replace all the predefined entities with
the literal characters they represent, the workaround works.

  The fact that including a script in the generated file requires a
seperate utility will, I expect, become a non-trivial barrier to the
broader acceptance of XSL. I hope the working group chooses to address this
in their second draft.

  <stating-the-obvious>Java Script engines are not easy things to
write.</stating-the-obvious> I think it's unlikely that developers are
going to redefine the Java Script language to interpret &lt; as < ... my
opinion (hope) is that the standard should accomodate this.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Wed Sep  9 20:07:34 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:30 2004
Subject: Namespace Prefix should be Purely Convienience
Message-ID: <199809091752.MAA14874@foyt.indyrad.iupui.edu>


Jim Amsden <jamsden@us.ibm.com>


jm> The semantics should be on the local name and the namespace name
jm> with the prefix being a notional convenience 

I agree.  The prefix should be "purely notational convenience."
Which makes me say: "Give us a way to write element tags with namespaces
but bypassing the prefix." 

So, instead of 

	xmlns:FOO="someUniqueString"
<FOO:DATE>

we could just write
<"someUniqueString":FOO>
....

and be done with the problem of prefix collisions!

I wish Namespaces didn't try to be so helpful!

jm> Note that I'm not implying that tag names are the concatenation of the
jm> namespace name and the local part of the tag name as specified by
jm> WebDAV semantics, only that two elements with the same namespace name
jm> and local name are treated as the same element type.

If you say 
	"two elements with the same namespace name
        and local name are treated as the same element type.",

isn't this saying that, operationally, the "effective ELEMENT tag name"
is the concatenation of the namespace string and the local tag name?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From andrewl at microsoft.com  Wed Sep  9 20:16:16 1998
From: andrewl at microsoft.com (Andrew Layman)
Date: Mon Jun  7 17:04:30 2004
Subject: Namespace Prefix should be Purely Convienience
Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA71C@RED-MSG-08>

The "effective element name" is illustrated in section 6.3 of the proposed
spec.

-----Original Message-----
From: Mark Tucker [mailto:mct@foyt.indyrad.iupui.edu]
Sent: Wednesday, September 09, 1998 10:52 AM
To: xml-dev@ic.ac.uk
Subject: Namespace Prefix should be Purely Convienience


Jim Amsden <jamsden@us.ibm.com>


jm> The semantics should be on the local name and the namespace name
jm> with the prefix being a notional convenience 

I agree.  The prefix should be "purely notational convenience."
Which makes me say: "Give us a way to write element tags with namespaces
but bypassing the prefix." 

So, instead of 

	xmlns:FOO="someUniqueString"
<FOO:DATE>

we could just write
<"someUniqueString":FOO>
....

and be done with the problem of prefix collisions!

I wish Namespaces didn't try to be so helpful!

jm> Note that I'm not implying that tag names are the concatenation of the
jm> namespace name and the local part of the tag name as specified by
jm> WebDAV semantics, only that two elements with the same namespace name
jm> and local name are treated as the same element type.

If you say 
	"two elements with the same namespace name
        and local name are treated as the same element type.",

isn't this saying that, operationally, the "effective ELEMENT tag name"
is the concatenation of the namespace string and the local tag name?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep  9 20:28:55 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:30 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
In-Reply-To: <199809091804.LAA05055@mail-gw6.pacbell.net>
References: <199809090121.SAA15093@mail-gw.pacbell.net>
	<199809090522.BAA11695@ruby.ora.com>
	<199809091017.GAA00201@unready.megginson.com>
	<35F693BC.7089E4AA@enterworks.com>
	<199809091804.LAA05055@mail-gw6.pacbell.net>
Message-ID: <199809091828.OAA02266@unready.megginson.com>

Andrew Bunner writes:

 >   <stating-the-obvious>Java Script engines are not easy things to
 > write.</stating-the-obvious> I think it's unlikely that developers
 > are going to redefine the Java Script language to interpret &lt; as
 > < ... my opinion (hope) is that the standard should accomodate
 > this.

The problem is that the HTML 4.0 DTD defines the <SCRIPT> element as
follows:

  <!ELEMENT SCRIPT - - CDATA>

This is perfectly legal SGML, and HTML 4.0 is based on SGML.  It would
actually be *wrong* to use &lt; and &amp; in a <SCRIPT> element in
HTML -- the browsers, probably by accident, have it right (at least
this far).

Here's the crux, though: HTML 4.0 is based on a non-XML subset of
SGML.  That means that XML cannot represent (and was never intended to
represent) an HTML <= 4.0 document.  It's just wrong.  If you need to
do that, why bother with XML when there are perfectly good HTML/SGML
tools out there?  

XML is *not* an extension of HTML, and there is no safe way to include
XML in an HTML <= 4.0 page (except by reference, using a <LINK>
element or something similar).


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jamsden at us.ibm.com  Wed Sep  9 20:35:39 1998
From: jamsden at us.ibm.com (Jim Amsden)
Date: Mon Jun  7 17:04:30 2004
Subject: Namespace Prefix should be Purely Convienience
Message-ID: <5040100022264364000002L042*@MHS>

Response below in <jra> elements.


owner-xml-dev@ic.ac.uk on 09/09/98 02:05:30 PM
Please respond to mct@foyt.indyrad.iupui.edu
To: xml-dev@ic.ac.uk
cc:
Subject: Namespace Prefix should be Purely Convienience


Jim Amsden <jamsden@us.ibm.com>


jm> The semantics should be on the local name and the namespace name
jm> with the prefix being a notional convenience

I agree.  The prefix should be "purely notational convenience."
Which makes me say: "Give us a way to write element tags with namespaces
but bypassing the prefix."

So, instead of

 xmlns:FOO="someUniqueString"
<FOO:DATE>

we could just write
<"someUniqueString":FOO>
....
<jra>
This generally wouldn't be convenient as the namespace name might be long as
well as contain characters that are not valid in a tag name. The prefix is OK
as long as it doesn't mean anything. Note that it's a little strange for an
attribute to define something about the element tag name. That is, the prefix
is always used before it is defined. Perhaps the attribute should specify the
namespaces for the content of the element, not the element itself. The document
root doesn't need a namespace because when used in that context, there can be
only one instance.
</jra>

and be done with the problem of prefix collisions!

I wish Namespaces didn't try to be so helpful!

jm> Note that I'm not implying that tag names are the concatenation of the
jm> namespace name and the local part of the tag name as specified by
jm> WebDAV semantics, only that two elements with the same namespace name
jm> and local name are treated as the same element type.

If you say
 "two elements with the same namespace name
        and local name are treated as the same element type.",

isn't this saying that, operationally, the "effective ELEMENT tag name"
is the concatenation of the namespace string and the local tag name?
<jra>
Yes, that's probably true in most cases, and is the WebDAV convention (an
application dependent interpretation of namesapces in order to eliminate
ambiguity). However, a client application could use the constituent parts of
the name in any way it wanted and apply some additional semantics. Like
checking to see if the local name is in the namespace, verifying the namespace
exists in some context, etc. Some of these semantics would be handled by the
DTD anyway though. For example, checking that the namespace scopes a local name
is the same thing as ensuring the element is defined in a DTD and is redundant.
</jra>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep  9 20:39:15 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:30 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
References: <199809090121.SAA15093@mail-gw.pacbell.net>
	 <199809090522.BAA11695@ruby.ora.com>
	 <199809091017.GAA00201@unready.megginson.com> <199809091804.LAA05055@mail-gw6.pacbell.net>
Message-ID: <35F6CA98.9EA06C78@locke.ccil.org>

Andrew Bunner scripsit:

> (Interesting aside, IE will
> display the right thing if you close a stand-along tag with />, but
> Netscape will not)

Both IE and Netscape will cope if you add a space before the
"/>".  This is legal XML.

>   <stating-the-obvious>Java Script engines are not easy things to
> write.</stating-the-obvious> I think it's unlikely that developers are
> going to redefine the Java Script language to interpret &lt; as < ... my
> opinion (hope) is that the standard should accomodate this.

The problem arrived when Netscape decided to treat SCRIPT elements
specially.  This behavior then got standardized by giving SCRIPT
a CDATA content model, which is Evil.  The Right Thing would have
been for Netscape to wrap SCRIPT content in a CDATA section.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Wed Sep  9 20:49:44 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:30 2004
Subject: Effective Element Name == "Expanded Name"
Message-ID: <199809091834.NAA15959@foyt.indyrad.iupui.edu>


Andrew,
	
You wrote

al> The "effective element name" is illustrated in section 6.3 of the proposed
al> spec.

Unfortunately, what is lacking in section 6.3 of the proposal,
is a clear statement of the semantics of an "Expanded Name."

>From your remark, I take it that you agree that the "meaning" of an element
is governed by the expanded name and that the particular prefix used
in a document is irrelevant.  That is good news.

To define the semantics of namespaces, if you said something like:

	Namespace conformant applications treat elements
	as if their ELEMENT TAG were the Expanded Name

then you would have defined how DTD validators are to treat
documents with namespaces.
	

As it stands, the consensus on XMLDEV seems to be that documents with
namespaces are validated by "ignoring the meaning of the xmlns
prefix."  So, <PN:NAME> is validated without regard to what the PN prefix
is actually defined to represent.  

The statements
	xmlns:PN="someMeaning"
seem to be ignored, 
and namespace defaulting cannot be used.

-- Mark


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cvonsee at onramp.net  Wed Sep  9 21:51:49 1998
From: cvonsee at onramp.net (Chris von See)
Date: Mon Jun  7 17:04:31 2004
Subject: Are elements allowed to nest within themselves?
Message-ID: <199809091951.OAA27548@mailhost.onramp.net>

The XML 1.0 spec is unclear (unless I missed the reference) on whether or
not an element can nest within itself.  For example, let's say I define a
DTD with:

<!ELEMENT foo ( foo*, bar* )>
<!ATTLIST foo answer CDATA #IMPLIED>

<!ELEMENT bar (#PCDATA)>

and a document with: 

<!-- top-level "foo" element -->
<foo>
     <!-- first nested "foo" element -->
     <foo answer="yes">
          <bar>this one</bar>
          <bar>that one</bar>
     </foo>
     <!-- second nested "foo" element -->
     <foo answer="no">
          <bar>the other one</bar>
     </foo>
     <!-- third nested "foo" element (empty) -->
     <foo/>
</foo>

Is this legal?


Chris


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Wed Sep  9 22:43:39 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:31 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <199809090613.XAA28064@mail-gw6.pacbell.net>
References: <199809090522.BAA11695@ruby.ora.com>
 <199809090121.SAA15093@mail-gw.pacbell.net>
Message-ID: <v03102801b21c945aab3e@[203.23.215.128]>

At 14:19 +0800 9/9/98, Andrew Bunner wrote:
>>The transformation part of XSL is intended to produce well-formed XML.
My perspective may be a little different from others because my initial use
of XML/XSL is for a report writer which renders to screen/print and only
secondarily to RTF and HTML.

The contrast with most of the tools I've looked at (and books I've read) is
to an XSL community that appear concerned with transforming one set of XML
to another. To be honest, I keep feeling there's something I've missed here
- if the transformation is just generating more XML I don't see that
containing enough information for a renderer.

Is there anything in the standard which says that, if you are rendering to
HTML, that you *have* to produce well-formed XML? If you have a server-side
processor of XML/XSL that is producing HTML I don't see why there's a
problem. Similarly, an embedded processor in a browser is surely free to
make its own interpretation.


Andy Dent, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 10 00:18:48 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:31 2004
Subject: HTML != XML (was Re: [ANN] Kludgey workarounds for xt)
References: <199809090121.SAA15093@mail-gw.pacbell.net>
			<199809090522.BAA11695@ruby.ora.com> <199809091017.GAA00201@unready.megginson.com> <35F693BC.7089E4AA@enterworks.com>
Message-ID: <35F6FEF5.B0D1FCEE@infinet.com>

Eddie Sheffield wrote:

> But it seems that the problem isn't the HTML, but rather with SCRIPTS that might
> be included in the HTML. I believe that HTML defines the <SCRIPT
> LANGUAGE="whatever">...</SCRIPT> tags, but NOT the actual script that lies within
> the tags. This is where the problem is. That script might be one of many
> languages (javascript, jscript, vbscript, ecmascript, etc.) and knowing exactly
> how to properly post-process the fine would be VERY non-trivial, especially if
> the script itself has to generate HTML on the fly. For example:
>
> What I want:
>
> document.write("She said &quot;Run away!&quot;");
>
> but the generated code is:
>
> document.write(&quot;She said &quot;Run away!&quot;&quot;);
>
> Obviously a post-processor can't simply replace EVERY &quot; in the line, or the
> script becomes invalid. But how do you know which to replace and which not? I
> suppose you could parse the script and try replacing the ones that are necessary
> for the script to be valid, but then you would need separate processors/parsers
> for each type of script language that might be in the script.
>
> As much as possible, a workaround would be to use external scripts that are never
> processed at all, but are pointed to with the optional SRC attribute on the
> SCRIPT tag. This only works for scripts that don't have to be dynamically
> generated, though.
>
> It does seem odd that with the advent of the DOM which really eases scripting and
> makes it much more powerful that almost simultaneously problems occur that make
> generating those scripts more difficult.
>
> Eddie

The approach I use for the XML Formatter I have is to have a boolean setting that can
be optionally set which will either auto-replace occurrences of entity values in
character data and attribute values with entity names (this includes character
entities) or else do none of this.  Another alternative is to wrap any character data
that includes processed text that is read for output which includes entity references
in some special object that is essentially a flag saying do not process this stuff or
even normalize it.  This is what I do now for CDATA Sections and this same technique
is pretty much what is used for the DOM so you can distinguish between text that can
be normalized and text that should not be normalized.

Maybe XT should have something like:

document.writeAsIs("");

which does not auto-replace instances of <, >, &, ", '.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Thu Sep 10 02:27:15 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:31 2004
Subject: Are elements allowed to nest within themselves?
Message-ID: <199809100027.UAA24286@goon.stg.brown.edu>

Just for future reference, this sort of question is easily
answered by a validator.  E.g., go to

  http://www.stg.brown.edu/service/xmlvalid/

and paste in a brief document like the one you posted be-
fore (with a few bits of sugar to make it taste good to
the validator):

<!DOCTYPE foo [
<!ELEMENT foo ( foo*, bar* )>
<!ATTLIST foo answer CDATA #IMPLIED>
<!ELEMENT bar (#PCDATA)>
]>

<!-- top-level "foo" element -->
<foo>
     <!-- first nested "foo" element -->
     <foo answer="yes">
          <bar>this one</bar>
          <bar>that one</bar>
     </foo>
     <!-- second nested "foo" element -->
     <foo answer="no">
          <bar>the other one</bar>
     </foo>
     <!-- third nested "foo" element (empty) -->
     <foo/>
</foo>

Richard Goerwitz
Brown University


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Thu Sep 10 06:01:56 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:31 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
Message-ID: <018801bddc6f$fc2055a0$e16118cb@caleb>

-----Original Message-----
From: Andy Dent <dent@highway1.com.au>
>The contrast with most of the tools I've looked at (and books I've read) is
>to an XSL community that appear concerned with transforming one set of XML
>to another. To be honest, I keep feeling there's something I've missed here
>- if the transformation is just generating more XML I don't see that
>containing enough information for a renderer.

That's what the formatting object vocabulary is all about. You transform
your XML into XML that expresses formatting objects and their properties. My
software FOP, is an example that takes formatting objects and generates a
PDF file. I am planning on also doing the same with Tk widgets for screen
display.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Thu Sep 10 08:25:38 1998
From: ricko at allette.com.au (Rick Jellife)
Date: Mon Jun  7 17:04:31 2004
Subject: Shocking News: Namespaces and Non-Validation
References: <199809091312.IAA06775@foyt.indyrad.iupui.edu>
Message-ID: <35F7708D.72E0E594@allette.com.au>

Mark Tucker wrote:

> I was shocked to hear that namespaces invalidate validation.

This is only sort-of true.

> The problem seems to be that DTD validation does not expand
> prefixes, nor does it apply namespace defaulting.

DTD validation is completely namespace unaware. So no prefix expansion takes
place.Namespaces are a layer.

Namespace defaulting is superficially more disruptive to DTDs. It means that you
may indeed have two element types from different schemas which would (if you
used a single DTD to directly model them) have the same GI and content model.
Under these circumstances, the content model would tend to become ANY, of
course.  But it is not really more disruptive: if you do not want to use
defaulting,
dont use it!  Put a note in your products or documents saying "NO DEFAULTING"
and encourage people not to ue it.

At the moment, when you combine 2 DTDs, you have to rename element
types or combine content models. The namespace procedure does not alter this, so
even though it is superficially alarming, it is not much different from what
happens now.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Thu Sep 10 08:39:03 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:31 2004
Subject: More SAX Parsers and Applications?
In-Reply-To: <00d001bddbc6$28bfade0$e96118cb@caleb>
Message-ID: <3.0.1.32.19980910083559.00738398@ifi.uio.no>


* Don Park
>
>It might be a Good Thing (tm by Tyler) to setup a SAX Service Directory
>Server.  This way, any SAX client can find the latest and the greatest SAX
>parser over the Net.

* James Tauber
>
>This sort of thing is certainly the kind of thing I've been planning for
>xmlsoftware.com and Lars might have been thinking of it too for his site.

I didn't, but it's certainly a good idea. I will be evaluating the OMG COS
Trader service shortly, and this looks like a possible way to make a real
evaluation of it. If an ORB becomes part of JDK 1.2 we may not even need
a lightweight alternative.

BTW: David, if you have problems keeping track of which products support
     SAX you can look here:


<URL:http://www.stud.ifi.uio.no/~larsga/linker/xmltools/by-standard.html#SAX>

     I can make an index of the products that use it as well, if you think
     that will be useful. I've got the information in my XML docs, I just
     don't use it yet.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Thu Sep 10 12:13:47 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:31 2004
Subject: HTML != XML
In-Reply-To: David Megginson's message of "Wed, 9 Sep 1998 14:28:05 -0400"
References: <199809090121.SAA15093@mail-gw.pacbell.net> 	<199809090522.BAA11695@ruby.ora.com> 	<199809091017.GAA00201@unready.megginson.com> 	<35F693BC.7089E4AA@enterworks.com> 	<199809091804.LAA05055@mail-gw6.pacbell.net> <199809091828.OAA02266@unready.megginson.com>
Message-ID: <u90jsff3a.fsf_-_@delivery.ansa.co.uk>

David> David Megginson <URL:mailto:david@megginson.com>

0> In article <199809091828.OAA02266@unready.megginson.com>, David
0> wrote:

David> The problem is that the HTML 4.0 DTD defines the <SCRIPT>
David> element as follows:
David>
David>   <!ELEMENT SCRIPT - - CDATA>
David>
David> This is perfectly legal SGML, and HTML 4.0 is based on SGML.  It
David> would actually be *wrong* to use &lt; and &amp; in a <SCRIPT>
David> element in HTML -- the browsers, probably by accident, have it
David> right (at least this far).

I think it should be pointed out that HTML was *changed* in order to
accommodate the browsers' behaviour.  Earlier drafts (HTML 3.0, IIRC)
had  <!ELEMENT SCRIPT - - (#PCDATA)>, but the browsers didn't deal
with it correctly.

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Nidal.AMER at hdmp.com  Thu Sep 10 13:33:15 1998
From: Nidal.AMER at hdmp.com (AMER, Nidal)
Date: Mon Jun  7 17:04:31 2004
Subject: DTD vs Schema
Message-ID: <A789DF8BDE02D211B63C00805FC78E9702434A@BES40ENT000>

Dear all,

Some beginner's questions, just don't laugh: 
1.	What is the right way to describe XML message syntax: DTD or
schema. I started by looking at Microsoft site. The publish
documentation on schema but nothing about DTD. They also say schema is a
better way to describe XML as it is XML. 
2.	Is there anywhere a clear documentation on DTD without diving
into the whole SGML DTD stuff? I am lost between HTML4 DTD, SGML DTD and
XML DTD.
3.	The database I am working on is highly hierarchical. MS schema
documentation describes uses SUPERTYPE to declare inheritance. Is there
any equivalent DTD declaration?

Could anybody give me a hand with this?
Many thanks in advance.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Nidal AMER
R & D Deputy Manager
HDMP ( Health Data Management Partners)
A SmithKline-Beecham Company
6 Rue de Gen?ve,
1140 Brussels, Belgium
Tel: + 32 (2) 724 00 93
Fax: + 32 (2) 726 91 59
E-mail: nidal.amer@hdmp.com <mailto:nidal.amer@hdmp.com> 
Visit our web site: http://www.hdmp.com <http://www.hdmp.com> 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Thu Sep 10 13:50:43 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:31 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>
	 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]>
Message-ID: <35F7B89F.2E779FDB@jclark.com>

Andy Dent wrote:
> 
> At 14:19 +0800 9/9/98, Andrew Bunner wrote:
> >>The transformation part of XSL is intended to produce well-formed XML.
> My perspective may be a little different from others because my initial use
> of XML/XSL is for a report writer which renders to screen/print and only
> secondarily to RTF and HTML.
> 
> The contrast with most of the tools I've looked at (and books I've read) is
> to an XSL community that appear concerned with transforming one set of XML
> to another. To be honest, I keep feeling there's something I've missed here
> - if the transformation is just generating more XML I don't see that
> containing enough information for a renderer.
> 
> Is there anything in the standard which says that, if you are rendering to
> HTML, that you *have* to produce well-formed XML? If you have a server-side
> processor of XML/XSL that is producing HTML I don't see why there's a
> problem. Similarly, an embedded processor in a browser is surely free to
> make its own interpretation.

An XSL processor can do other things with the result tree than just
write it out as XML.

If you want to use XSL to produce some non-XML format, first you need to
devise an XML representation of it.  For example, in the case of HTML,
this would be "well-formed HTML", that is XML using the element types
and attributes of XML.  Now write some code that turns this XML
representation into the real thing.  Now you've just got to arrange for
this code to get run instead of the usual code that writes the result
tree out as XML.  You've got two possibilities. One possibility is to
make this a run-time option for your XSL processor.  Another is to use
the result-ns attribute.  To use this you need to define a namespace for
your HTML representation, make all result elements be from this
namespace, and specify this using result-ns.  Then your XSL processor
can recognize the namespace and do the right thing.  For example,

<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/TR/WD-xsl"
  xmlns:h="http://www.w3.org/TR/REC-html40"
  result-ns="h">

<xsl:template match="/">
  <h:html>
  ...
  </h:html>
</xsl:template>

</xsl:stylesheet>

This gives an XSL processor everything it needs to recognize that you're
generating HTML and do the right thing so you get real HTML.

The XML representation for another format can be very simple.  For
example, if you wanted to generate RTF, you could probably get away with
two element types: one element type for the root, and one element type
to contain RTF control information (outside elements of this type you
would escape {, } and \ as \{, \} and \\}, inside it you would not). 
You might want an element type for \bin too.  

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Thu Sep 10 13:56:13 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:31 2004
Subject: DTD vs Schema
Message-ID: <007401bddcb2$41f1d8a0$de6118cb@caleb>

-----Original Message-----
From: AMER, Nidal <Nidal.AMER@hdmp.com>
>1. What is the right way to describe XML message syntax: DTD or
>schema. I started by looking at Microsoft site. The publish
>documentation on schema but nothing about DTD. They also say schema is a
>better way to describe XML as it is XML.

DTDs are the only finalised schema language for XML at present. Alternatives
such as DCD, XML-Data and XSchema are proposals still being developed
(although it looks like DCD replaces XML-Data).

(see http://www.schema.net/otherschemata/ for more information about these)

>2. Is there anywhere a clear documentation on DTD without diving
>into the whole SGML DTD stuff? I am lost between HTML4 DTD, SGML DTD and
>XML DTD.

A DTD tutorial will be available soon at schema.net

>3. The database I am working on is highly hierarchical. MS schema
>documentation describes uses SUPERTYPE to declare inheritance. Is there
>any equivalent DTD declaration?

You could achieve something like this by using a FIXED attribute on the
subtype. For example:

<!ELEMENT Person (#PCDATA)>

<!ELEMENT Employee (#PCDATA)>
<!ATTLIST Employee
    SuperType NMTOKEN #FIXED "Person">

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Thu Sep 10 14:25:47 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz)
Date: Mon Jun  7 17:04:31 2004
Subject: Shocking News: Namespaces and Non-Validation
References: <3.0.3.32.19980909101703.02e11b30@pop.mindspring.com>
Message-ID: <35F7C540.4EFFE88A@goon.stg.brown.edu>

> I was shocked to hear that namespaces invalidate validation.

It is a bit ironic that, the way namespaces have been designed, it looks
as if the much-maligned schema group is going to be performing a central
role after all.

Namespaces don't so much invalidate validation as do nothing productive
with it (so you have to edit your DTDs to cover all the new:elements
anyway).  To many people's way of thinking, this is broken.

Interesting statement by a recent poster:

> I started by looking at Microsoft site. The publish documentation on
> schema but nothing about DTD. They also say schema is a better way to
> describe XML as it is XML

One has to wonder, if DTDs are abandoned (or become legacy items) whe-
ther, practically speaking, SGML compatibility will end up abandoned
as well.

If that happens, one has to wonder whether a fundamental redesign is in
order.  I.e., if DTDs, SGML, etc. go out the window, why not just go
back to the drawing board?  There's a lot of cruft that only made it
into the XML standard because of SGML compatibility requirements.

I'm as interested in SGML compatibility as the next guy (here I am in a
shop supporting several projects involving SGML; and I just finished
writing a DTD-based XML validator geared especially for people trying
to convert SGML to XML).

But it's too early (for me at least) to tell whether namespaces, XSL,
etc. will keep DTDs and validation in the forefront, or whether they
are actually just laying the groundwork for their replacement and/or
demise.

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep 10 15:34:17 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:31 2004
Subject: DTD vs Schema
In-Reply-To: <007401bddcb2$41f1d8a0$de6118cb@caleb>
References: <007401bddcb2$41f1d8a0$de6118cb@caleb>
Message-ID: <199809101333.JAA00210@unready.megginson.com>

James Tauber writes:

 > DTDs are the only finalised schema language for XML at
 > present. Alternatives such as DCD, XML-Data and XSchema are
 > proposals still being developed (although it looks like DCD
 > replaces XML-Data).

Since XML-Data was never a work item for a W3C working group, there is
no one who can official declare its passing, but yes, it's dead.

I still haven't had time to give XSchema a proper going-over, but from
what I've seen, XSchema and DCD are not competing for the same
territory:

- XSchema defines an instance-based representation of DTDs
- DCD defines an alternative to DTDs

Does this make sense?


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Thu Sep 10 16:11:28 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:31 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <35F7B89F.2E779FDB@jclark.com>
References: <199809090522.BAA11695@ruby.ora.com>	
 <199809090121.SAA15093@mail-gw.pacbell.net>
 <v03102801b21c945aab3e@[203.23.215.128]>
Message-ID: <v04011701b21d8ea34317@[203.23.215.86]>

At 7:31 PM +0800 10/9/98, James Clark wrote:
>An XSL processor can do other things with the result tree than just
>write it out as XML.
>
>If you want to use XSL to produce some non-XML format, first you need to
>devise an XML representation of it.
Why?

Why can't a product like our report-writer take
- XML describing content
- XSL specifying layout
and produce, for example, a report preview window on a Mac?
After all, if you regard a browser, it's doing something very similar.

I don't see the need for the intermediate translation to another set of XML
data, but there may be something I've missed in the XSL processing standard.

I agree that a clean design mandates some separate structured collections
of objects between XML and output, but I don't see how they are necessarily
either XML or anything closely related. For one thing, they are 'highly
decorated' by comparison with the original XML.
Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 16:31:11 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:31 2004
Subject: Where are discussions about Schemas taking place?
Message-ID: <199809101415.JAA25656@foyt.indyrad.iupui.edu>


Where is the proper forum to discuss Schemas?

	DCD		-- ?
	XSchema		-- here
	RDF Schemas	-- ?

	RDF in general	-- ?	


Replacing DTD's sure looks like it would a Big Deal.

And I wonder along with "Richard L. Goerwitz" <richard@goon.stg.brown.edu>

rg> If that happens, one has to wonder whether a fundamental redesign is
rg> in order.  I.e., if DTDs, SGML, etc. go out the window, why not just
rg> go back to the drawing board?  There's a lot of cruft that only made
rg> it into the XML standard because of SGML compatibility requirements.

==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962
	

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Thu Sep 10 17:07:00 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:32 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>	
	 <199809090121.SAA15093@mail-gw.pacbell.net>
	 <v03102801b21c945aab3e@[203.23.215.128]> <v04011701b21d8ea34317@[203.23.215.86]>
Message-ID: <35F7E9C7.EA6B8BA8@technologist.com>

Andy Dent wrote:
> 
> Why can't a product like our report-writer take
> - XML describing content
> - XSL specifying layout
> and produce, for example, a report preview window on a Mac?
> After all, if you regard a browser, it's doing something very similar.

The result of an XSL process must be well-defined, right? So the most
logical thing to create as the result of the process is an XML document.
To me, your question is equivalent to "Why can't my car producing product
take an XML document describing content, and an XSL describing the
automobile to produce and generate the car?" Well, if XSL were an
automobile producing language, that would make sense, but it isn't, it is
an XML producing language.

> After all, if you regard a browser, it's doing something very similar.

The browser takes XML, pumps it through an XSL engine, receives an XML
result (according to a known DTD with formatting semantics) and renders
*that*. You can do the same with your report writer.

Of course, you can optimize the heck out of this process by not actually
linearizing the XML result, or even creating an XML tree, as long as it
looks the same to the XSL stylesheet author. This is what James was
getting at in his message.

> I agree that a clean design mandates some separate structured collections
> of objects between XML and output, but I don't see how they are necessarily
> either XML or anything closely related. For one thing, they are 'highly
> decorated' by comparison with the original XML.

Well, we could require the output to be PDF or PostScript or something,
but XML seems the most logical choice. The important thing is to recognize
that we do have to choose *something*.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

The past is inaccurate. Whoever lives long enough knows how much what he
had seen with his own eyes becomes overgrown with rumor, legend a
magnifying or belittling hearsay. "It was not like that at all!" -- 
he would like to exclaim, but will not, for they would have seen only 
his moving lips without hearing his voice. - Czeslaw Milosz (translated)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Thu Sep 10 17:10:14 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:32 2004
Subject: DTD vs Schema
In-Reply-To: <199809101333.JAA00210@unready.megginson.com>
References: <007401bddcb2$41f1d8a0$de6118cb@caleb>
 <007401bddcb2$41f1d8a0$de6118cb@caleb>
Message-ID: <3.0.1.16.19980910160508.3f1f4ed0@pop3.demon.co.uk>

At 09:33 10/09/98 -0400, David Megginson wrote:
>I still haven't had time to give XSchema a proper going-over, but from
>what I've seen, XSchema and DCD are not competing for the same
>territory:
>
>- XSchema defines an instance-based representation of DTDs

This is certainly my understanding - in the virtual group we have been very
careful to restrict ourselves to current DTD functionality. The intention
is that dtd2xsc is essentially lossless *after* normalisation of PEs,
inclusion of entity files, etc. xsc2dtd would lose the non-DTD information.
XSchema will allow DTD information to be managed with XML technology (e.g.
edited, printed nicely, etc.) I - and I suspect others - will see it as a
way to create DTD-aware authoring tools.
 However, XSchema is in XML and therefore eXtensible :-) What value people
put on that is up to them :-)  

>- DCD defines an alternative to DTDs

Yes. Exciting new territory. In my mind XSchema may be a transition towards
this.

	P.

>
>Does this make sense?
>
>
>All the best,
>
>
>David
>
>-- 
>David Megginson                 david@megginson.com
>           http://www.megginson.com/
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Thu Sep 10 17:26:07 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:32 2004
Subject: DTD vs Schema
Message-ID: <005a01bddccf$ceafa340$1aaddccf@ix.netcom.com>

>2. Is there anywhere a clear documentation on DTD without diving
>into the whole SGML DTD stuff? I am lost between HTML4 DTD, SGML DTD and
>XML DTD.

I give a very '101' tutorial on XML DTD's at my web site at
Http://www.hypermedic.com/style/index.htm
Follow the XML link and open the text file.


>1. What is the right way to describe XML message syntax: DTD or
>schema. I started by looking at Microsoft site. The publish
>documentation on schema but nothing about DTD. They also say schema is a
>better way to describe XML as it is XML.


At present Schemas are no more than Notes. The DT is the only official way
to describe your XML document.

regards,

Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com
-----Original Message-----
From: AMER, Nidal <Nidal.AMER@hdmp.com>
To: <xml-dev@ic.ac.uk>
Date: Thursday, September 10, 1998 7:36 AM
Subject: DTD vs Schema


>Dear all,
>
>Some beginner's questions, just don't laugh:
>1. What is the right way to describe XML message syntax: DTD or
>schema. I started by looking at Microsoft site. The publish
>documentation on schema but nothing about DTD. They also say schema is a
>better way to describe XML as it is XML.
>2. Is there anywhere a clear documentation on DTD without diving
>into the whole SGML DTD stuff? I am lost between HTML4 DTD, SGML DTD and
>XML DTD.
>3. The database I am working on is highly hierarchical. MS schema
>documentation describes uses SUPERTYPE to declare inheritance. Is there
>any equivalent DTD declaration?
>
>Could anybody give me a hand with this?
>Many thanks in advance.
>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>Nidal AMER
>R & D Deputy Manager
>HDMP ( Health Data Management Partners)
>A SmithKline-Beecham Company
>6 Rue de Gen?ve,
>1140 Brussels, Belgium
>Tel: + 32 (2) 724 00 93
>Fax: + 32 (2) 726 91 59
>E-mail: nidal.amer@hdmp.com <mailto:nidal.amer@hdmp.com>
>Visit our web site: http://www.hdmp.com <http://www.hdmp.com>
>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From april at syclo.com  Thu Sep 10 17:51:21 1998
From: april at syclo.com (David S. April)
Date: Mon Jun  7 17:04:32 2004
Subject: DTD vs Schema
In-Reply-To: <005a01bddccf$ceafa340$1aaddccf@ix.netcom.com>
Message-ID: <3.0.1.32.19980910104620.00e42d50@quake.xnet.com>

At 11:29 AM 9/10/98 -0400, you wrote:
>>2. Is there anywhere a clear documentation on DTD without diving
>>into the whole SGML DTD stuff? I am lost between HTML4 DTD, SGML DTD and
>>XML DTD.

An excellent book on the subject is: 

	Xml : A Primer
	by Simon St. Laurent
	ISBN: 155828592X

Dave
--
David S. April                  Syclo LLC
april@syclo.com                 101 Lions Dr - Suite 118
(847) 842-0320                  Barrington, IL 60010
http://www.soasoas.com/april/

"Paradise is exactly like where you are right now, only much, much *better*."
    - Laurie Anderson


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Thu Sep 10 18:36:36 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:32 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <35F7B89F.2E779FDB@jclark.com>
References: <199809090522.BAA11695@ruby.ora.com>
 <199809090121.SAA15093@mail-gw.pacbell.net>
 <v03102801b21c945aab3e@[203.23.215.128]>
Message-ID: <199809101633.JAA23541@mail-gw6.pacbell.net>


[James Clark]
>An XSL processor can do other things with the result tree than just
>write it out as XML.
>
>If you want to use XSL to produce some non-XML format, first you need to
>devise an XML representation of it.  For example, in the case of HTML,
>this would be "well-formed HTML", that is XML using the element types
>and attributes of XML.  Now write some code that turns this XML
>representation into the real thing.  Now you've just got to arrange for
>this code to get run instead of the usual code that writes the result
>tree out as XML.

  The moral of the story is that if your target language is not XML, then
you have to write your own tool to take it from XML to, let's say, HTML.
One way is to get into the XSL processor and add your own code, another
(less clean) way is to write something that post-processes the XML
representation of the target language.

  Unless, of course, we change the standard.

[Paul Prescod]
>Well, we could require the output to be PDF or PostScript or something,
>but XML seems the most logical choice. The important thing is to recognize
>that we do have to choose *something*.

  XSL seems perfectly well equipped to handle any text-based target
language. So why not let it?

  I guess I don't see the same need to "choose something" or restrict it in
any way other than to say "you must produce text". There must be something
very important that we gain by insisting the target language be one thing
or another. Help me understand what this important thing is.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 18:45:50 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:32 2004
Subject: Summary of Namespaces and Validation
Message-ID: <199809101630.LAA27616@foyt.indyrad.iupui.edu>


Well, 

	There seem to be two camps on the issue of how Namespaces
interact with DTD.


*****************************************
	Camp1 : Most of this Newsgroup 
*****************************************

		Documents with Namespaces can be validated provided
		the prefix abbreviations are used consistently.
		
		DTD processors are un-aware of Namespaces.

	<PN:NAME>  coexists with <BK:NAME>, because they
	use different prefixes.

	DTD processors never convert XML Element names
	into their "Expanded Form" (See 6.3 of the Namespa

	This camp seems to be equivalent to saying: 
		"Treat <PN:NAME> as an XML ELEMENT tag with a colon in it."


*****************************************
	Camp 2 : Tim Bray, Andrew Layman
*****************************************

		To validate a document that uses namespaces, do  all
		the ELEMENT and ATTRIBUTE handling using the Expanded
		Names.
		
	This group thinks namespaces coexist with DTD's, provided
	both the DTD and the document instance specify consistent
	URI's for the namespaces. The actual prefix abbreviations
	used in the DTD or the document instance don't matter.

	For example, the DTD could define
	    xmlns:BOOK="uri:book_defn"
        and then define <!ELEMENT BOOK:NAME ...>
	    
	while a document instance could define
	    xmlns:BK="uri:book_defn"
	and use <BK:NAME>

	Proper Namespace aware code will use the expanded name
		"uri:book_defn" // "NAME"
	in handling the element and its definition.

	
*****************************************
	Camp 3 : Implemented Namespace Aware Code
*****************************************

			What does code out there do?
	
	       
-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 18:56:54 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:32 2004
Subject: HTML != XML
References: <199809090121.SAA15093@mail-gw.pacbell.net> 	<199809090522.BAA11695@ruby.ora.com> 	<199809091017.GAA00201@unready.megginson.com> 	<35F693BC.7089E4AA@enterworks.com> 	<199809091804.LAA05055@mail-gw6.pacbell.net> <199809091828.OAA02266@unready.megginson.com> <u90jsff3a.fsf_-_@delivery.ansa.co.uk>
Message-ID: <35F80426.6DF96EFE@locke.ccil.org>

Toby Speight wrote:

> I think it should be pointed out that HTML was *changed* in order to
> accommodate the browsers' behaviour.  Earlier drafts (HTML 3.0, IIRC)
> had  <!ELEMENT SCRIPT - - (#PCDATA)>, but the browsers didn't deal
> with it correctly.

If this is true, there's no evidence of it.  HTML 2.0 had no SCRIPT
element, and neither did HTML+ (1993) nor HTML 3.0 (1995).  It appears
full-blown in HTML 3.2 (1996-7) as a CDATA element.

Perhaps you are thinking of STYLE, which did appear in HTML 3.0 only
as (#PCDATA).

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 19:00:02 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:32 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>
		 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]> <35F7B89F.2E779FDB@jclark.com>
Message-ID: <35F804CD.47CAAB14@locke.ccil.org>

James Clark scripsit:

> If you want to use XSL to produce some non-XML format, first you need to
> devise an XML representation of it.  For example, in the case of HTML,
> this would be "well-formed HTML", that is XML using the element types
> and attributes of XML [sic; HTML].

Almost, almost, but not quite!

Well-formed HTML is not quite well-formed XML, because of the possible
presence of "&" and "<" in the CDATA elements SCRIPT and STYLE.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 19:10:58 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:32 2004
Subject: DTD vs Schema
References: <007401bddcb2$41f1d8a0$de6118cb@caleb> <199809101333.JAA00210@unready.megginson.com>
Message-ID: <35F8077B.74A31C15@locke.ccil.org>

David Megginson scripsit:

> I still haven't had time to give XSchema a proper going-over, but from
> what I've seen, XSchema and DCD are not competing for the same
> territory:
> 
> - XSchema defines an instance-based representation of DTDs
> - DCD defines an alternative to DTDs

Just the opposite: it is a stated requirement of DCD that it represent
everything a DTD can, whereas XSchema long ago abandoned that
requirement.  In particular, XSchema does not handle entities (someday,
information about unparsed entities may be added).

DCD instances are by intention RDF metadata, though the current version
has both intentional and unintentional deviations from the current
RDF Model & Syntax draft.  DCD also has a concept of "datatypes"
that identify the internal (non-XML) syntax of #PCDATA.

XSchema has an emphasis on reusability, although the actual mechanism
of reuse hasn't been designed, pending the next XLink draft.

Personally, I would like to see a convergence between the two, as
I think they *are* competing for the same territory.  The DCD concept
of RDF-compliance is IMHO the Right Thing, whereas XSchema has a bit
better thought out view of what to include.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 19:17:46 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:32 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>	
		 <199809090121.SAA15093@mail-gw.pacbell.net>
		 <v03102801b21c945aab3e@[203.23.215.128]> <v04011701b21d8ea34317@[203.23.215.86]> <35F7E9C7.EA6B8BA8@technologist.com>
Message-ID: <35F80910.2192F35F@locke.ccil.org>

Paul Prescod wrote:

> Well, we could require the output to be PDF or PostScript or something,
> but XML seems the most logical choice. The important thing is to recognize
> that we do have to choose *something*.

Granted.  But is it so much to ask, to be able to produce well-formed
HTML as well?  After all, the XSL draft is speckled with references
to doing so, but well-formed and valid HTML just isn't XML -- even though
with one little allowance, it can become so.

Given the continuing importance of HTML or HTML+CSS as an output
format, this doesn't seem like such a large change.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From john at spinosa.com  Thu Sep 10 19:22:02 1998
From: john at spinosa.com (John C. Spinosa, MD, PhD)
Date: Mon Jun  7 17:04:32 2004
Subject: Summary of Namespaces and Validation
Message-ID: <3.0.32.19980910102357.0086d100@pop.mindspring.com>

Mark,

It hasn't been clear to me from your posts in regards to DTD validation and
namespaces whether you are considering a single DTD that uses multiple
namespaces or several DTDs (?DTD fragments?) which each use its own
namespace(s). Are you considering the former situation, the latter or both?

John
john@spinosa.com

Small part of previous post included for reference.

At 11:30 AM 9/10/98 -0500, Mark Tucker wrote:
>
>
>Well, 
>
>	There seem to be two camps on the issue of how Namespaces
>interact with DTD.
>
>
>*****************************************
>	Camp1 : Most of this Newsgroup 
>*****************************************


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Thu Sep 10 19:25:26 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:32 2004
Subject: Summary of Namespaces and Validation
Message-ID: <3.0.32.19980910102336.00a02100@pop.intergate.bc.ca>

At 11:30 AM 9/10/98 -0500, Mark Tucker wrote:
>*****************************************
>	Camp1 : Most of this Newsgroup 
>*****************************************
>		Documents with Namespaces can be validated provided
>		the prefix abbreviations are used consistently.

I don't think that too many people feel this way.  I hope not, because
the namespace draft makes it 100% clear that the prefix in and of itself
has no meaning, and is just a short-form stand-in to a URI.

>		DTD processors are un-aware of Namespaces.

True.

>	This camp seems to be equivalent to saying: 
>		"Treat <PN:NAME> as an XML ELEMENT tag with a colon in it."

This is all you can expect from an 8879-based DTD processor.

>*****************************************
>	Camp 2 : Tim Bray, Andrew Layman
>*****************************************
>		To validate a document that uses namespaces, do  all
>		the ELEMENT and ATTRIBUTE handling using the Expanded
>		Names.
>	This group thinks namespaces coexist with DTD's, provided
>	both the DTD and the document instance specify consistent
>	URI's for the namespaces. The actual prefix abbreviations
>	used in the DTD or the document instance don't matter.

Not quite right.  The prefixes are stand-ins for the URIs.  However,
the prefixes are all the DTD processor can see.  Thus, validating 
namespaced documents *has* to be a 3-step process.

1. Build a compound DTD that has prefixed declarations for all your
   elements and attributes.  This is the hard part.
2. Go through the instance and rewrite all the namespace declarations onto
   the root element and undo any defaulting, so that anything that's in
   a namespace has a prefix.  If necessary, rewrite the DTD so that the
   same URIs have the same prefixes in DTD and instance.  This is tedious
   but straightforward, there are dozens of programmers in this group
   who could sort it out in a day, given a decent XML processor.
3. Validate (or not, if the doc is broken).

What really bothers me is that discussion here keeps obsessing over the
tedious but straightforward problem of matching up prefixes, and nobody's
thinking about the interesting and difficult problem of compounding DTDs.
 -Tim


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 19:25:27 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:32 2004
Subject: DTD vs Schema
References: <007401bddcb2$41f1d8a0$de6118cb@caleb>
	 <007401bddcb2$41f1d8a0$de6118cb@caleb> <3.0.1.16.19980910160508.3f1f4ed0@pop3.demon.co.uk>
Message-ID: <35F80AAB.8AEAD15C@locke.ccil.org>

Peter Murray-Rust wrote:

> This is certainly my understanding - in the virtual group we have been very
> careful to restrict ourselves to current DTD functionality.

Except for adding full namespace support.  However, we have removed
entities, which are in DCD.  As my other posting says, DCD is
required to be a superset of DTD, whereas XSchema is not.

> >- DCD defines an alternative to DTDs
> 
> Yes. Exciting new territory. In my mind XSchema may be a transition towards
> this.

It does add new information, but not all that much:  namespace
support, datatypes, value ranges.  In addition, DCD is silent about
notations.

There is a large common semantic core between DCD and XSchema.
The efforts ought to be merged, providing something that is
compliant RDF, supports the common element/attribute core,
the value ranges, at least some of the datatypes, unparsed
entities, and notations.

...et iterum censeo Carthago delenda est...

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Thu Sep 10 19:29:16 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:32 2004
Subject: HTML != XML
In-Reply-To: John Cowan's message of "Thu, 10 Sep 1998 12:53:58 -0400"
References: <199809090121.SAA15093@mail-gw.pacbell.net> 	<199809090522.BAA11695@ruby.ora.com> 	<199809091017.GAA00201@unready.megginson.com> 	<35F693BC.7089E4AA@enterworks.com> 	<199809091804.LAA05055@mail-gw6.pacbell.net> <199809091828.OAA02266@unready.megginson.com> <u90jsff3a.fsf_-_@delivery.ansa.co.uk> <35F80426.6DF96EFE@locke.ccil.org>
Message-ID: <ur9xjev23.fsf@delivery.ansa.co.uk>

John> John Cowan <URL:mailto:cowan@locke.ccil.org>

0> In article <35F80426.6DF96EFE@locke.ccil.org>, John wrote:
John> Perhaps you are thinking of STYLE, which did appear in HTML 3.0
John> only as (#PCDATA).

Yes.  Thanks for the correction, John.

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep 10 19:35:24 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:32 2004
Subject: XSL and Device-Independent Formatting
In-Reply-To: <199809101633.JAA23541@mail-gw6.pacbell.net>
References: <199809090522.BAA11695@ruby.ora.com>
	<199809090121.SAA15093@mail-gw.pacbell.net>
	<v03102801b21c945aab3e@[203.23.215.128]>
	<35F7B89F.2E779FDB@jclark.com>
	<199809101633.JAA23541@mail-gw6.pacbell.net>
Message-ID: <199809101733.NAA00403@unready.megginson.com>

Andrew Bunner writes:

 >   The moral of the story is that if your target language is not
 > XML, then you have to write your own tool to take it from XML to,
 > let's say, HTML.  One way is to get into the XSL processor and add
 > your own code, another (less clean) way is to write something that
 > post-processes the XML representation of the target language.

<warning>I have not had the opportunity to spend much time with the
 new XSL WD yet, so my answer is based on general practice rather than
 the specific process defined in the XSL WD.</warning>

I think that people are over-thinking the problem.  Try this on for
size: an XSL formatter produces a device-independent formatting tree,
then can render the same tree in different concrete formats (PDF, PS,
DVI, or what have you).  As a happy co-incidence, it happens that the
intermediate formatting tree -- like most structured information --
can be serialised as an XML document.

That means that, if you wish, the two parts of the process (building
the device-independent formatting tree and rendering the tree) can be
handled by separate programs, since the XML provides a common
interchange standard.  If you plan to do the whole thing with a single
process, however, then there is no need actually to produce the XML
representation of the formatting tree -- just keep it as an internal
object tree.

Now, HTML is a special case, because it does not fit in well with
normal formatting semantics (it is considered bad practice to specify
font size, etc., in the document, though you can attach a CSS
stylesheet).

I hope this helps.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Thu Sep 10 19:44:13 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036082@WROX3>

>and
>nobody's
>thinking about the interesting and difficult problem of compounding
>DTDs.
> -Tim

What do you mean by compounding DTDs? I don't know whether any of my
postings to the list have been getting through, ...but why can't the
notion of a DTD be an utterly nebulous concept in the abstract, elements
themselves having a namespace URIs which addresses a DTD entity for that
particular element. Different elements validated against different
declarations lying in dispersed DTD entities.
Why isn't this idea getting through to anyone? (am v. frustrated!)

Peter Jones
WebDev Technical Editor
Wrox Press
mailto:peterj@wrox.com
***************
Wrox Press UK Ltd.
http://www.wrox.co.uk
Tel 44 121 706 6826
Fax 44 121 706 2967


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Thu Sep 10 19:48:12 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036083@WROX3>

I'm posting this again as I'm not sure the first went through
>and
>nobody's
>thinking about the interesting and difficult problem of compounding
>DTDs.
> -Tim

What do you mean by compounding DTDs? I don't know whether any of my
postings to the list have been getting through, ...but why can't the
notion of a DTD be an utterly nebulous concept in the abstract, elements
themselves having a namespace URIs which addresses a DTD entity for that
particular element. Different elements validated against different
declarations lying in dispersed DTD entities.
Why isn't this idea getting through to anyone? (am v. frustrated!)


Peter Jones
WebDev Technical Editor
Wrox Press
mailto:peterj@wrox.com
***************
Wrox Press UK Ltd.
http://www.wrox.co.uk
Tel 44 121 706 6826
Fax 44 121 706 2967


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep 10 19:55:21 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:33 2004
Subject: Summary of Namespaces and Validation
In-Reply-To: <3.0.32.19980910102336.00a02100@pop.intergate.bc.ca>
References: <3.0.32.19980910102336.00a02100@pop.intergate.bc.ca>
Message-ID: <199809101754.NAA00465@unready.megginson.com>

Tim Bray writes:

 > At 11:30 AM 9/10/98 -0500, Mark Tucker wrote:

 > >*****************************************
 > >	Camp1 : Most of this Newsgroup 
 > >*****************************************
 > >		Documents with Namespaces can be validated provided
 > >		the prefix abbreviations are used consistently.
 > 
 > I don't think that too many people feel this way.  I hope not,
 > because the namespace draft makes it 100% clear that the prefix in
 > and of itself has no meaning, and is just a short-form stand-in to
 > a URI.

People are fumbling around because they're mixing up validation and
processing.

If we're talking about DTD validation (as the original message was),
then this camp is correct: as defined in the XML 1.0 REC, DTD
validation is concerned entirely with surface names and can have no
knowledge of the underlying namespace URI.

Namespace-aware XML *processing*, on the other hand, must ignore the
prefix and work with the namespace URI, as explained in the Namespaces
WD.

When the XML Schema WG finishes its work, we should have a standard
way to do namespace-aware validation; in the mean time, XML 1.0 DTDs
is all we have.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 19:58:04 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
References: <29AA5A0E3A0CD21196F300A0C9D8575C036082@WROX3>
Message-ID: <35F81237.CE3F9709@locke.ccil.org>

Peter Jones wrote:

> What do you mean by compounding DTDs? I don't know whether any of my
> postings to the list have been getting through, ...but why can't the
> notion of a DTD be an utterly nebulous concept in the abstract, elements
> themselves having a namespace URIs which addresses a DTD entity for that
> particular element. Different elements validated against different
> declarations lying in dispersed DTD entities.
> Why isn't this idea getting through to anyone? (am v. frustrated!)

This may be feasible for some yet-to-be-standardized schema language,
but not for DTDs as such.  "DTD" is a fixed, narrow, SGML-compatible
notion that can't be changed.

In addition, the namespace draft has laid down that the URI is
used solely for comparison, and needn't represent an existent
resource, much less a specific schema.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Thu Sep 10 20:01:02 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
In-Reply-To: <29AA5A0E3A0CD21196F300A0C9D8575C036083@WROX3>
References: <29AA5A0E3A0CD21196F300A0C9D8575C036083@WROX3>
Message-ID: <199809101758.NAA00482@unready.megginson.com>

Peter Jones writes:

 > What do you mean by compounding DTDs? I don't know whether any of my
 > postings to the list have been getting through, ...but why can't the
 > notion of a DTD be an utterly nebulous concept in the abstract, elements
 > themselves having a namespace URIs which addresses a DTD entity for that
 > particular element. Different elements validated against different
 > declarations lying in dispersed DTD entities.

It could, but it's not, because XML 1.0 didn't define DTDs that way,
and the namespaces WD made no attempt to modify XML 1.0.

 > Why isn't this idea getting through to anyone? (am v. frustrated!)

I think that this idea is very much in the minds of the W3C group
working on XML Schemas.  It may be a while, however, before they have
anything ready for public view.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 20:10:12 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:33 2004
Subject: Expanded Name vs Prefix Name  + Compound DTD: Whats the difficulty?
Message-ID: <199809101754.MAA29349@foyt.indyrad.iupui.edu>


Mr Bray gives us an algorithm for using DTD's and Namespaces.  

He suggests "fully prefixing" all the elements with unique
prefixes during a rewriting pass.  

I had the suggestion "In your symbol table, handle XML Element and
Attribute names using the combination of the URI and the local name,
instead of just local name."

>From my point of view, Bray's "Full prefixing" technique
is fine.

mct>*****************************************
mct>	Camp 2 : Tim Bray, Andrew Layman
mct>*****************************************
mct>		To validate a document that uses namespaces, do  all
mct>		the ELEMENT and ATTRIBUTE handling using the Expanded
mct>		Names.
mct>	This group thinks namespaces coexist with DTD's, provided
mct>	both the DTD and the document instance specify consistent
mct>	URI's for the namespaces. The actual prefix abbreviations
mct>	used in the DTD or the document instance don't matter.
tb> Not quite right.  The prefixes are stand-ins for the URIs.  However,
tb> the prefixes are all the DTD processor can see.  Thus, validating 
tb> namespaced documents *has* to be a 3-step process.
tb> 
tb> 1. Build a compound DTD that has prefixed declarations for all your
tb>    elements and attributes.  This is the hard part.

{See Below)

tb> 2. Go through the instance and rewrite all the namespace declarations onto
tb>    the root element and undo any defaulting, so that anything that's in
tb>    a namespace has a prefix.  If necessary, rewrite the DTD so that the
tb>    same URIs have the same prefixes in DTD and instance.  This is tedious
tb>    but straightforward, there are dozens of programmers in this group
tb>    who could sort it out in a day, given a decent XML processor.

Presumably, during this re-writing, we could give the prefixes new
names PF1,PF2,... without effecting the result.  The particular
prefixes chosen do not effect semantics.


tb> 3. Validate (or not, if the doc is broken).
tb> 

tb> What really bothers me is that discussion here keeps obsessing over
tb> the tedious but straightforward problem of matching up prefixes, and
tb> nobody's thinking about the interesting and difficult problem of
tb> compounding DTDs.

I must confess that I don't understand the problem, but it doesn't
look hard.  To the naive eye, I would just read in the relevant DTD's,
and perform the same "Fully Prefixing" operation.  Since the DTD's
would all be prefixed consistenly, can't we just toss all the
productions into the pot, and turn the crank?  How do they interact,
if they all have differently named elements and interactions?


Hot off the press we have...........

Peter Jones wrote:

pj> What do you mean by compounding DTDs? I don't know whether any of my
pj> postings to the list have been getting through, ...but why can't the
pj> notion of a DTD be an utterly nebulous concept in the abstract, elements
pj> themselves having a namespace URIs which addresses a DTD entity for that
pj> particular element. Different elements validated against different
pj> declarations lying in dispersed DTD entities.
pj> Why isn't this idea getting through to anyone? (am v. frustrated!)


I think I agree with you.  I'd just as soon see a DTD as a pile of
ELEMENT and ATTRLIST definitions.

John Cowan replied:

jc> This may be feasible for some yet-to-be-standardized schema language,
jc> but not for DTDs as such.  "DTD" is a fixed, narrow, SGML-compatible
jc> notion that can't be changed.
jc> 
jc> In addition, the namespace draft has laid down that the URI is
jc> used solely for comparison, and needn't represent an existent
jc> resource, much less a specific schema.


Ah, the fact that the URI doesn't represent an existant resource
doesn't matter here!  You can't use the URI to retrieve the DTD,
but, suppose you already know the DTD which corresponds to the URI?
If you magically know the DTD, then you go ahead an apply
and validate the DTD (after either using Tim's re-prefixing, or by
using my symbol table handling mechanism.)

The URI is used only as a unique string that uniquely identifies
a "Resource" (in the RDF sense of abstract thing.)


-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Thu Sep 10 20:48:58 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
References: <29AA5A0E3A0CD21196F300A0C9D8575C036082@WROX3>
Message-ID: <35F81F05.AAF601BC@goon.stg.brown.edu>

Peter Jones wrote:

> [W]hy can't...elements themselves [have] a namespace URI which addresses
> a DTD entity for that particular element?

This is exactly what Tim is alluding to.  If the idea can be formally
defined, and implemented - and incorporated into the namespace spec -
we could distribute not only processing, but also definition, of XML
document types.

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Thu Sep 10 21:01:48 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
Message-ID: <3.0.32.19980910120152.00aa2ab0@pop.intergate.bc.ca>

At 02:48 PM 9/10/98 -0400, Richard L. Goerwitz wrote:
>Peter Jones wrote:
>> [W]hy can't...elements themselves [have] a namespace URI which addresses
>> a DTD entity for that particular element?
>This is exactly what Tim is alluding to.  If the idea can be formally
>defined, and implemented - and incorporated into the namespace spec -
>we could distribute not only processing, but also definition, of XML
>document types.

This will never happen.  The namespace URI is just a name.  It would
be really wrong to assume you can go there and get a DTD, because:

 1. some namespaces (like HTML, MathML, a few others) are going to be
    very widely used, and it would be silly to force one poor server
    somewhere to guarantee a DTD-on-demand for them
 2. some namespaces aren't going to have DTDs or any other kind of schema
 3. some schemas aren't going to be DTDs

For all of these reasons, the namespace name has to remain just that:
a name.  

We are obviously going to need ways to find schemas associated with
namespaces.  SGML/XML have a way of doing this, the <!DOCTYPE declaration,
but it can only really point to one DTD at a time.  Presumably future
schema proposals will have ways of pointing to multiple parallel schemas.
 -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 21:17:18 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:33 2004
Subject: Expanded Name vs Prefix Name  + Compound DTD: Whats the difficulty?
References: <199809101754.MAA29349@foyt.indyrad.iupui.edu>
Message-ID: <35F8251D.44A76937@locke.ccil.org>

Mark Tucker wrote:

> I must confess that I don't understand the problem, but it doesn't
> look hard.  To the naive eye, I would just read in the relevant DTD's,
> and perform the same "Fully Prefixing" operation.  Since the DTD's
> would all be prefixed consistenly, can't we just toss all the
> productions into the pot, and turn the crank?  How do they interact,
> if they all have differently named elements and interactions?

Not at all, and that's the point.  You won't be able to place
elements from one DTD inside elements from another, unless the
enclosing element has an ANY content model.  What takes work and
thought is figuring out how the content models of the various elements
should be extended (or not) to incorporate elements from other DTDs.
I don't see how *any* tools whatever can help you much with this,
except in the mechanical parts.

> Ah, the fact that the URI doesn't represent an existant resource
> doesn't matter here!  You can't use the URI to retrieve the DTD,
> but, suppose you already know the DTD which corresponds to the URI?

In that case, full speed ahead.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Thu Sep 10 21:40:48 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
References: <3.0.32.19980910120152.00aa2ab0@pop.intergate.bc.ca>
Message-ID: <35F82B2A.834B434C@goon.stg.brown.edu>

Tim Bray wrote:

> >> [W]hy can't...elements themselves [have] a namespace URI which addresses
> >> a DTD entity for that particular element?

> This will never happen.  The namespace URI is just a name.  It would
> be really wrong to assume you can go there and get a DTD
> 
>  1. some namespaces (like HTML, MathML, a few others) are going to be
>     very widely used, and it would be silly to force one poor server
>     somewhere to guarantee a DTD-on-demand for them

Some DTDs will be widely used whether or not one uses them for name-
spaces.  (Deadpan look follows.)

As for namespace URIs being just names:  We're in this deep.  Why not
associate namespaces optionally with DTDs (not necessarily via the name-
space URI)?

>  2. some namespaces aren't going to have DTDs or any other kind of schema

Then they won't validate.  That's all.  Or (lacking an associated DTD),
they could be validated in the kludgy fashion that the current namespace
spec seems to require (i.e., you have to make your DTD aware of all the
new:elements your use of namespaces requires - with allowances for de-
faulting :-( ).

>  3. some schemas aren't going to be DTDs

That's another issue that we will just have to see played out.  To me
it looks as if the people who insist on strict SGML conformance are, in
fact, the ones who are driving the wild proliferation of extensions 
before XML has barely made it out of the gate.

In an ironic twist of fate, they may be the ones most responsible for
DTD and such quickly becoming legacy features.

This may not be a bad thing.  Either way, though, the effort spent on
SGML conformance is starting to look noble but in vain.

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 21:45:24 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
Message-ID: <199809101929.OAA01520@foyt.indyrad.iupui.edu>


John Cowan <cowan@locke.ccil.org> wrote:

mct> I must confess that I don't understand the problem, but it doesn't
mct> look hard.  To the naive eye, I would just read in the relevant DTD's,
mct> and perform the same "Fully Prefixing" operation.  Since the DTD's
mct> would all be prefixed consistenly, can't we just toss all the
mct> productions into the pot, and turn the crank?  How do they interact,
mct> if they all have differently named elements and interactions?

jc> Not at all, and that's the point.  You won't be able to place elements
jc> from one DTD inside elements from another, unless the enclosing
jc> element has an ANY content model.  What takes work and thought is
jc> figuring out how the content models of the various elements should be
jc> extended (or not) to incorporate elements from other DTDs.  I don't
jc> see how *any* tools whatever can help you much with this, except in
jc> the mechanical parts.


First a question:

			Can I have a DTD that mentions elements
			defined elsewhere?

If so, then I can easily nest elements from one dtd inside another.

In the example below, the N1:BOOK element will contain N1:NAME elements,
and elements from a different namespace N2:ADDRESS

If DTD1, which is denoted by URI "uri:dtd1" says

<I_m_not_sure_what_to_say_here
	xmlns:N1="uri:biblotheque",
	xmlns:N2="uri:locatie">

<!ELEMENT N1:BOOK  (N1:NAME N2:ADDRESS) >
<!ELEMENT N1:NAME >
<!ATTRLIST N1:NAME
	v #pcdata>


and DTD2 says (notice that the prefix K doesn't matter, only the definition
as "uri:locatie")			

<I_m_not_sure_what_to_say_here
	xmlns:K="uri:locatie">

<!ELEMENT K:ADDRESS >
<!ATTRLIST K:ADDRESS
	v #pcdata>

then an valid instance document could be

<DOCTYPE
   xmlns:J1="uri:bibliotheque"
   xmlns:J2="uri:locatie">

				xm
<J1:BOOK>
    <J1:NAME V="FRED"/>
    <J2:ADDRESS V="Holland"/>
</J1:BOOK>
    
which would conform to DTD1.

    
What's wrong with this picture?.....

For my needs, DTD1 is a perfectly fine DTD. It defines
local stuff (NAME,BOOK), and references other DTD's to define the sub pieces.
(ADDRESS).


-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 22:09:25 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:33 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation Algorithm?
Message-ID: <199809101953.OAA02029@foyt.indyrad.iupui.edu>


Wait, People, 

	DO YOU OR DONT YOU BUY TIM BRAY'S ALGORITHM for
	DTD validation in the face of Namespaces:
	
	    "Re-write the document instance with consistent Prefixes
	     then do a normal DTD validatation."

	I don't see anything kludgy in it. (modulo my preference to
        use expanded names directly in the processor's symbol table.)

	It just says: 

		1. Determine the Expanded the names the way the
		Namespace Proposal says. (Section 6.3)
		
		2. Define a unique prefix for each namespace definition URI .

		3. Rewrite the element or attribute, prepending the
		(possibly generated) unique prefix for the namespace
		of the element/attributes Expanded Name.
		
		   This give you  P1:BOOK, P1:NAME, P3:ADDRESS
		   if BOOK and NAME come from the same namespace URI.
		
		4. Do the same to the DTD's that you read in.
		
		5. Do normal DTD validation of the rewritten instance 
		   document against the rewritten DTD.
		

It seems to me that DTD's with namespaces CAN be validated.


"Richard L. Goerwitz" <richard@goon.stg.brown.edu> wrote

rg> Then they won't validate.  That's all.  Or (lacking an associated
rg> DTD), they could be validated in the kludgy fashion that the current
rg> namespace spec seems to require (i.e., you have to make your DTD aware
rg> of all the new:elements your use of namespaces requires - with
rg> allowances for de- faulting :-( ).

	You do not have to rewrite your DTD at all.  You  do not have
	to "merge content models", and "introduce ANY".

	The whole validation process is mechanical, and gives the
	expected results.
		
-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From andrewl at microsoft.com  Thu Sep 10 22:09:56 1998
From: andrewl at microsoft.com (Andrew Layman)
Date: Mon Jun  7 17:04:33 2004
Subject: namespaces discussion
Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA72D@RED-MSG-08>

Q:  "Why not associate namespaces optionally with DTDs (not necessarily via
the name-space URI)?"

A:	This was discussed extensively during the design, and rejected for
two reasons: First, it was not the minimal necessary to enable namespaces.
Second, there are many possible resources that could be associated with a
namespace.  DTDs are one, but various forms of schemas, style sheets,
documentation, etc. are also likely.

I hope this is helpful.

-----Original Message-----
From: Richard L. Goerwitz [mailto:richard@goon.stg.brown.edu]
Sent: Thursday, September 10, 1998 12:40 PM
To: xml-dev@ic.ac.uk
Subject: Re: namespaces discussion


Tim Bray wrote:

> >> [W]hy can't...elements themselves [have] a namespace URI which
addresses
> >> a DTD entity for that particular element?

> This will never happen.  The namespace URI is just a name.  It would
> be really wrong to assume you can go there and get a DTD
> 
>  1. some namespaces (like HTML, MathML, a few others) are going to be
>     very widely used, and it would be silly to force one poor server
>     somewhere to guarantee a DTD-on-demand for them

Some DTDs will be widely used whether or not one uses them for name-
spaces.  (Deadpan look follows.)

As for namespace URIs being just names:  We're in this deep.  Why not
associate namespaces optionally with DTDs (not necessarily via the name-
space URI)?

>  2. some namespaces aren't going to have DTDs or any other kind of schema

Then they won't validate.  That's all.  Or (lacking an associated DTD),
they could be validated in the kludgy fashion that the current namespace
spec seems to require (i.e., you have to make your DTD aware of all the
new:elements your use of namespaces requires - with allowances for de-
faulting :-( ).

>  3. some schemas aren't going to be DTDs

That's another issue that we will just have to see played out.  To me
it looks as if the people who insist on strict SGML conformance are, in
fact, the ones who are driving the wild proliferation of extensions 
before XML has barely made it out of the gate.

In an ironic twist of fate, they may be the ones most responsible for
DTD and such quickly becoming legacy features.

This may not be a bad thing.  Either way, though, the effort spent on
SGML conformance is starting to look noble but in vain.

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Thu Sep 10 22:38:30 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:04:34 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation
  Algorithm?
In-Reply-To: <199809101953.OAA02029@foyt.indyrad.iupui.edu>
Message-ID: <3.0.3.32.19980910163755.02e83480@pop.mindspring.com>

At 02:53 PM 9/10/98 -0500, Mark Tucker wrote:
>
>Wait, People, 
>
>	DO YOU OR DONT YOU BUY TIM BRAY'S ALGORITHM for
>	DTD validation in the face of Namespaces:

Sure. As long as you handle the prefixes in both the document and the dtd
before validation, it works. In other words, the five step algorithm works,
but it implies a little more than this short summary statement:
	
>	    "Re-write the document instance with consistent Prefixes
>	     then do a normal DTD validatation."

I also disagree somewhat with the following:

>	I don't see anything kludgy in it. (modulo my preference to
>        use expanded names directly in the processor's symbol table.)

Well, it's the least kludgy hack we have available to us until schemas can
handle namespace mapping. It's the way I would handle it in my applications.

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 10 22:45:44 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:34 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>	
	 <199809090121.SAA15093@mail-gw.pacbell.net>
	 <v03102801b21c945aab3e@[203.23.215.128]> <v04011701b21d8ea34317@[203.23.215.86]>
Message-ID: <35F83A92.867EC3D9@infinet.com>

Andy Dent wrote:

> At 7:31 PM +0800 10/9/98, James Clark wrote:
> >An XSL processor can do other things with the result tree than just
> >write it out as XML.
> >
> >If you want to use XSL to produce some non-XML format, first you need to
> >devise an XML representation of it.
> Why?
>
> Why can't a product like our report-writer take
> - XML describing content
> - XSL specifying layout
> and produce, for example, a report preview window on a Mac?
> After all, if you regard a browser, it's doing something very similar.
>
> I don't see the need for the intermediate translation to another set of XML
> data, but there may be something I've missed in the XSL processing standard.

You might have an XML representation of Word files.  If you wanted to convert
content form an XML representation of Acrobat files, all you would need is a
stylesheet to convert the XML representation of Acrobat files to the XML
representation of the Word file.  This process would require an XSL processor and
a stylesheet and someone with no programming experience could actually execute
this process (they would write the stylesheet).  I think the idea of XSL is to
take the programming out of the conversion process from one data-model to
another.

Ty;er


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From matt at veosystems.com  Thu Sep 10 22:48:15 1998
From: matt at veosystems.com (matt@veosystems.com)
Date: Mon Jun  7 17:04:34 2004
Subject: Summary of Namespaces and Validation
In-Reply-To: <3.0.32.19980910102336.00a02100@pop.intergate.bc.ca> from "Tim Bray" at Sep 10, 98 10:24:20 am
Message-ID: <19980910204728.27334.qmail@veosystems.com>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 2963 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980910/097f9236/attachment.bat
From cowan at locke.ccil.org  Thu Sep 10 22:51:17 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:34 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation Algorithm?
References: <199809101953.OAA02029@foyt.indyrad.iupui.edu>
Message-ID: <35F83ACB.119F7E1A@locke.ccil.org>

Mark Tucker wrote:

>         DO YOU OR DONT YOU BUY TIM BRAY'S ALGORITHM for
>         DTD validation in the face of Namespaces:

On reflection, I don't --- quite.
 
>                 4. Do the same to the DTD's that you read in.

This is the sticky bit.  Tim Bray phrases it as "If necessary, rewrite
the DTD so that the same URIs have the same prefixes in DTD and
instance."

However, there is no formal way to express what prefixes in the DTD
refer to what namespace URIs.  In essence, you must know that in
advance.  For validation to be a mechanical process, you must have
some way of recording the frozen namespace-URI map for the DTD.

(Please don't mutter "PI".  That is what the old draft had, and a good
thing it was too, but the new draft abandoned it.)

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Thu Sep 10 23:09:04 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:34 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation Algorithm?
References: <35F83ACB.119F7E1A@locke.ccil.org>
Message-ID: <199809102053.PAA03915@foyt.indyrad.iupui.edu>


John Cowan cowan@ccil.org wrote:

mct>                 4. Do the same to the DTD's that you read in.

jc>This is the sticky bit.  Tim Bray phrases it as "If necessary, rewrite
jc>the DTD so that the same URIs have the same prefixes in DTD and
jc>instance."
jc>
jc>However, there is no formal way to express what prefixes in the DTD
jc>refer to what namespace URIs.  In essence, you must know that in
jc>advance.  

Hmm, isn't there some place at the top of the Document defining
the DTD to write
		xmlns:BK=uri:books
and in so doing define the meaning of the prefixes this DTD uses?


jc> For validation to be a mechanical process, you must have
jc>some way of recording the frozen namespace-URI map for the DTD.
jc>

The namespace_prefix<->URI map for the DTD would be embedded in the DTD

The per-validation normalized_prefix<->namespace mapping
is maintained by the document validator process, and only needs
to live for the validation of a single document.

If you mean the mapping that tells validator processes that
	'The DTD known by "uri:FrenchLibrary"  is the following ....'
then that looks like a resource management issue that can be handled
by a higher level.


-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 10 23:32:41 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:34 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation Algorithm?
References: <35F83ACB.119F7E1A@locke.ccil.org> <199809102053.PAA03915@foyt.indyrad.iupui.edu>
Message-ID: <35F844DB.8B1E94D6@locke.ccil.org>

Mark Tucker wrote:

> Hmm, isn't there some place at the top of the Document defining
> the DTD to write
>                 xmlns:BK=uri:books
> and in so doing define the meaning of the prefixes this DTD uses?

Not according to the draft.
 
> The namespace_prefix<->URI map for the DTD would be embedded in the DTD

But there's no way to do that according to the draft.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Fri Sep 11 00:32:23 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:34 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
In-Reply-To: <5BF896CAFE8DD111812400805F1991F7038CA72D@RED-MSG-08> from Andrew Layman at "Sep 10, 98 01:08:10 pm"
Message-ID: <199809102232.SAA10024@goon.stg.brown.edu>

> Q:  "Why not associate namespaces optionally with DTDs (not necessarily
> via the name-space URI)?"
> 
> A:	This was discussed extensively during the design, and rejected for
> two reasons: First, it was not the minimal necessary to enable namespaces.

You pay now or you pay later.  A minimal namespace spec _now_ means that
DTDs won't know about namespaces - and that, in order to use DTDs, we will
have to follow a silly, kludgy process of expanding all our namespaces,
and then editing the expanded elements back into our DTDs.  In other words,
we pay later.

Note that in this scheme, the SGML compatibility crowd pays more than the
rest of us.  The more namespaces are actually used (and the more they make
the process of writing DTDs that much less direct and useful), the more
these silly kludges become necessary, and the more XML loses its connec-
tion with SGML.

It's unbelievable, but people are already talking about legacy XML and
about the abandonment of SGML.  Is it any wonder that vendors like Micro-
soft seem to be sitting on the fence?  We have some people who declare
firmly that DTDs are the only accepted schema/validation mechanism.  And
we have other louder, stronger voices who are already declaring them dead.
For a while it seemed the loud voices were simply disenfranchised.  But
namespaces (among other things) have shown them to be legitimate.  They're
dealing with the problem.

Maybe it's time to admit to the poor SGML crowd what's really going on,
by the way.

> Second, there are many possible resources that could be associated with a
> namespace.  DTDs are one, but various forms of schemas, style sheets,
> documentation, etc. are also likely.

I don't get your point.  If there's no DTD associated with a namespace,
then we're back where we started.  You either kludge the DTD by tipping in
the namespace elements, or you give up validation.  It's exactly where we
are with the current spec.

So make the DTD optional, and invent a way to tell the processor that it
is a DTD.  There's no reason that we couldn't associate other schemas with
namespaces too.  Each may use its own validation method.  Leaves room for
growth.  And if there is no DTD associated with a given namespace, then you
either kludge your main DTD or live without validation.

Yes, this breaks SGML compatibility in the sense that an SGML-based val-
idator won't pick up the namespace DTDs.  But SGML compatibility is be-
coming more and more superficial in XML anyway - namespaces and their in-
teraction with DTDs being a case in point.

I hear chanting:  "Reverse course; where are architectural forms?"  I hear
raving:  "The whole thing is a mess; log live HTML."  I hear musing:  "Ar-
gue about this long enough and nobody will be using old browsers, and we
won't have any more reason to shun the old PI solution for namespaces."


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Daniel.Brickley at bristol.ac.uk  Fri Sep 11 00:36:21 1998
From: Daniel.Brickley at bristol.ac.uk (Dan Brickley)
Date: Mon Jun  7 17:04:34 2004
Subject: Where are discussions about Schemas taking place?
In-Reply-To: <199809101415.JAA25656@foyt.indyrad.iupui.edu>
Message-ID: <Pine.GHP.4.02A.9809102319440.8771-100000@mail.ilrt.bris.ac.uk>


On Thu, 10 Sep 1998, Mark Tucker wrote:
> 
> 
> Where is the proper forum to discuss Schemas?
> 
> 	DCD		-- ?
> 	XSchema		-- here
> 	RDF Schemas	-- ?
> 
> 	RDF in general	-- ?	

For RDF Schemas and RDF in general, the only general public forum I'm
aware of is XML-DEV's sibling, RDF-DEV. This reminds me that a proper
charter, FAQ and website for that list is long overdue. Will try to get
this done within the fortnight. <blush/>

RDF-DEV signup info and archives are at:
	http://www.mailbase.ac.uk/lists/rdf-dev/ 

Quite what we talk about where will doubtless evolve over time. RDF's
schema mechanism is defined in terms of the RDF abstract data model 
layered above XML, and doesn't talk in terms of attributes and
elements, so might be better discussed on RDF-DEV. But the application
of the RDF data model to creating RDF vocabularies for describing
structured document types (DCD schemas, XSchema2?) would I imagine be
more at home here. Exploring the feasibility of mapping instances of
those doc types (rather than the schema) into RDF might be another
interesting thread to have sometime, somewhere... 

Dan


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 11 01:28:49 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:34 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
Message-ID: <3.0.32.19980910162847.00a013d0@pop.intergate.bc.ca>

At 06:32 PM 9/10/98 -0400, Richard L. Goerwitz III wrote:

>It's unbelievable, but people are already talking about legacy XML and
>about the abandonment of SGML.  Is it any wonder that vendors like Micro-
>soft seem to be sitting on the fence?  We have some people who declare
>firmly that DTDs are the only accepted schema/validation mechanism.  And
>we have other louder, stronger voices who are already declaring them dead.

I'd like to disagree with the perception that people are not in favor of
validation.  Early users of XML, it seems, typically want *more* validation
than SGML offers, rather than less.  That's why there's so much energy going 
into the early stages of the new-schema design.  Yes, we need DTDs, but also, 
we need more, and we need it PDQ.  -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From matt at veosystems.com  Fri Sep 11 02:24:07 1998
From: matt at veosystems.com (matt@veosystems.com)
Date: Mon Jun  7 17:04:34 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
In-Reply-To: <3.0.32.19980910162847.00a013d0@pop.intergate.bc.ca> from "Tim Bray" at Sep 10, 98 04:28:50 pm
Message-ID: <19980911002330.1250.qmail@veosystems.com>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 2060 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980911/5f874127/attachment.bat
From papresco at technologist.com  Fri Sep 11 04:01:41 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:34 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
References: <199809102232.SAA10024@goon.stg.brown.edu>
Message-ID: <35F88004.70A7F204@technologist.com>

Richard L. Goerwitz III wrote:
> 
> Note that in this scheme, the SGML compatibility crowd pays more than the
> rest of us.  The more namespaces are actually used (and the more they make
> the process of writing DTDs that much less direct and useful), the more
> these silly kludges become necessary, and the more XML loses its connec-
> tion with SGML.

That is not true. SGML is not a static thing. It can change. The XML
community has "volunteered" to test out namespaces. If they prove to be
useful, I expect SGML would adopt them. If they are just useless syntactic
sugar (as opposed to useful syntactic sugar), that will become evident in
the next few months or years and they will fall into disuse.

The same goes for schemas: if the XML world comes up with a truly better
way to do DTDs, then SGML can incorporate it.
 
 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

The past is inaccurate. Whoever lives long enough knows how much what he
had seen with his own eyes becomes overgrown with rumor, legend a
magnifying or belittling hearsay. "It was not like that at all!" -- 
he would like to exclaim, but will not, for they would have seen only 
his moving lips without hearing his voice. - Czeslaw Milosz (translated)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 11 04:33:04 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:35 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
Message-ID: <3.0.32.19980910193250.00aa08c0@pop.intergate.bc.ca>

At 05:23 PM 9/10/98 -0700, matt@veosystems.com wrote:
>Given how you suggested people do validation with the current NS
>proposal, it will be difficult to convince anyone that much thought
>was given to validation in preparing it.  I, of course, know better,
>but an outsider looking at your recommended validation algorithm would
>not be totally without justification in interpreting it as saying we
>consider validation so important we will make it as difficult as
>possible without making it outright IMpossible. :-)

I repeat: all this noise about the difficulty of validation is
completely missing the real point, namely we have neither a theoretical
basis nor industry experience to guide us in constructing and using
compound schemas (DTDs or any other sort), and doing partial validation.  
Maybe I'm missing something, but the syntactic difficulties in doing 
DTD validation just look infinitesimal to me compared to the compounding 
issue. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Fri Sep 11 05:25:19 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:35 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
In-Reply-To: <35F88004.70A7F204@technologist.com> from Paul Prescod at "Sep 10, 98 08:42:28 pm"
Message-ID: <199809110325.XAA15202@goon.stg.brown.edu>

> > ...the more XML loses its connection with SGML.
> 
> That is not true. SGML is not a static thing. It can change.

XML has not even made it out of the gate yet; yet SGML has been off and run-
ning since the 80s.  The notion of huge government text bases, corporate docu-
mentation libraries, and long-term academic research projects suddenly tearing
off after XML seems a bit unrealistic.

XML right now is, well, only slightly more than a dream.  I have a hard time
even finding valid XML document instances on the net.  Its APIs are almost all done
in Java (itself a moving target which, despite the hype, has precious few major
applications to its credit).  And its supporters have fallen to squabbling.

The main reason we need XML, as we all know, is that HTML has become a presenta-
tion language.  Despite its nominal connections with SGML, HTML betrays many of
the shortcomings of the typesetting languages the SGML community railed against
during the 80s.

The trouble with SGML, though, was that it was an embarrassment to anyone who
has taken a moment to examine a book on parsing or automaton theory.  And it was
far too big to be useful to anyone who didn't have major financial and human
resources at his or her disposal.  (Let's count out loud the number of SGML pro-
cessors out there that actually do anything useful; it'll only take a few sec-
onds.)

XML was conceived as a way to steer a middle course, i.e., a course away from
HTML's obsession with presentation markup and away from SGML's computational na-
ivete and bloat.  XML was a way to capture the purity of SGML's vision without
sacrificing elegance or simplicity.

I believe that everyone on this list understands these goals, and that most
are trying to achieve them.  We are not working together, though.  Factions
are splitting off willy nilly, and core WG members (many of whom probably don't
even bother to read this group) often seem to take annoyingly paternalistic, if
not outrightly dismissive and arrogant, stances - further fanning the flames of
discontent and further splintering the nascent XML community.

I don't really know what to develop for right now, what with confusion over our
design goals, about namespaces, about XSL, SGML compatibility, and virtually
everything else.  Perhaps if we could all agree on a few basics, we could set-
tle things down.

Here are two questions for discussion:

  1) Do we have an easy pathway for integrating alternative (non-DTD) schemas?
  2) Can we secure a namespace spec that is either harmless or capable of
     being integrated with any of several schema mechanisms?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Fri Sep 11 05:29:15 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:35 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
In-Reply-To: <3.0.32.19980910193250.00aa08c0@pop.intergate.bc.ca> from Tim Bray at "Sep 10, 98 07:33:15 pm"
Message-ID: <199809110329.XAA15286@goon.stg.brown.edu>

> we have neither a theoretical basis nor industry experience to guide us
> in constructing and using compound schemas (DTDs or any other sort)

Why the rush, then, on namespaces?  If the spec doesn't go through for a
few more months, even a year, the earth won't freeze over.

And maybe we'll know what we are talking about by then.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Fri Sep 11 05:41:11 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:04:35 2004
Subject: MSXML Parser
Message-ID: <3.0.3.32.19980910234010.02f61990@pop.mindspring.com>

Can anybody share their experiences with the MSXML parser? How does this
compare to any other validating XML parser? Are there significant
differences that I should be aware of?

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Fri Sep 11 06:27:29 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:35 2004
Subject: XSL and Device-Independent Formatting
Message-ID: <008101bddd3c$c6016b80$c86118cb@caleb>

-----Original Message-----
From: David Megginson <david@megginson.com>
>I think that people are over-thinking the problem.

I agree.

>Try this on for size: an XSL formatter produces a device-independent
>formatting tree, then can render the same tree in different concrete
formats >(PDF, PS, DVI, or what have you).  As a happy co-incidence, it
happens that
> the intermediate formatting tree -- like most structured information --
>can be serialised as an XML document.

>That means that, if you wish, the two parts of the process (building
>the device-independent formatting tree and rendering the tree) can be
>handled by separate programs, since the XML provides a common
>interchange standard.

This is exactly right and FOP takes advantage of it. At present it reads in
an XML representation of the "formatting tree" via SAX and spits out PDF.
This way it can work with XT and any other XSL processors that may emerge. I
can, at a latter stage, tie it in directly to XT, and it'll access the
formatting tree directly. Having the XML serialisation is great.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Fri Sep 11 06:42:42 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:35 2004
Subject: namespaces discussion
Message-ID: <00f601bddd3e$e4d64380$c86118cb@caleb>

-----Original Message-----
From: Mark Tucker <mct@foyt.indyrad.iupui.edu>
>First a question:
>
> Can I have a DTD that mentions elements
> defined elsewhere?

Very easily and you don't even need namespaces (except to avoid possible
clashes which is all I thought namespaces were for anyway).

This is particularly possible using the common technique of content model
extension via parameter entities.

The following example doesn't use namespaces.

Say my book DTD has:

<!ELEMENT div (%base.div.content;|%extended.div.content;)>

Then I can have (either in the internal subset of a document or some
external DTD entity that gets read before the book DTD):

<!ENTITY % mathml.dtd SYSTEM "mathml.dtd">
<!ENTITY % pgml.dtd SYSTEM "pgml.dtd">
%mathml.dtd
%pgml.dtd

<!ENTITY % extended.div.content "math|pgml">

And bingo!, I can use MathML and PGML in my Book.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep 11 07:08:47 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:35 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <35F7E9C7.EA6B8BA8@technologist.com>
References: <199809090522.BAA11695@ruby.ora.com>		
 <199809090121.SAA15093@mail-gw.pacbell.net>	
 <v03102801b21c945aab3e@[203.23.215.128]>
 <v04011701b21d8ea34317@[203.23.215.86]>
Message-ID: <v03102802b21e5d4514e1@[203.7.224.126]>

At 23:01 +0800 10/9/98, Paul Prescod wrote:
>The browser takes XML, pumps it through an XSL engine, receives an XML
>result (according to a known DTD with formatting semantics) and renders
>*that*. You can do the same with your report writer.

THANK YOU

I don't think anyone in any of the many  messages I've read or the books
I've bought has explained the point of the XSL translation quite so
clearly. I *knew* there was a (stupidly obvious) step I was missing.

I think the thing that confused me is that the generated XML from the XSL
processor will inherently be marked up for rendering and no longer just
structured content, correct? (ie:  glorified HTML).

I kept concentrating on XML as remaining free from rendering-oriented
content, so could not see how the gap was closed other than in the final
renderer, like our product.

Mind you, it still may be vastly easier and more efficient for us to skip
this intermediate step as an XML rendition and keep our intermediate
version purely in-memory as a set of c++ formatting objects, applied to a
database.

Andy Dent, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 11 07:14:09 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:35 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
References: <3.0.32.19980910162847.00a013d0@pop.intergate.bc.ca>
Message-ID: <35F8B1D2.BA6968C@infinet.com>

Tim Bray wrote:

> At 06:32 PM 9/10/98 -0400, Richard L. Goerwitz III wrote:
>
> >It's unbelievable, but people are already talking about legacy XML and
> >about the abandonment of SGML.  Is it any wonder that vendors like Micro-
> >soft seem to be sitting on the fence?  We have some people who declare
> >firmly that DTDs are the only accepted schema/validation mechanism.  And
> >we have other louder, stronger voices who are already declaring them dead.
>
> I'd like to disagree with the perception that people are not in favor of
> validation.  Early users of XML, it seems, typically want *more* validation
> than SGML offers, rather than less.  That's why there's so much energy going
> into the early stages of the new-schema design.  Yes, we need DTDs, but also,
> we need more, and we need it PDQ.  -Tim

Much to my surprise, validation in XML can be implemented a lot cheaper than I
had originally thought.  My initial guess was that validation would slow things
down 2x-3x, but actually I am only finding that validation incurs about a 20%-30%
penalty and much of that is actually spent on opening and closing the file with
the DTD.  In this case, it really does not make sense why anyone would not want
validation since it is relatively cheap.  I guess there are other reasons why
people would not want validation, but from a performance perspective my small
experience with this issue makes it a non-issue.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Fri Sep 11 07:47:31 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:35 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
References: <3.0.32.19980910193250.00aa08c0@pop.intergate.bc.ca>
Message-ID: <35F8B861.131FAE4@technologist.com>

Tim Bray wrote:
> 
> I repeat: all this noise about the difficulty of validation is
> completely missing the real point, namely we have neither a theoretical
> basis nor industry experience to guide us in constructing and using
> compound schemas (DTDs or any other sort), and doing partial validation.

I believe that we have both industry experience AND a formal model.

The industry experience is that people can build compound schemas by
combining schema parts through *parameter* entities. I highlight the word
*parameter* because it is the hint to how a more robust solution must
work: it must parameterize content models and schema fragments. The
problem with parameter entities is that they are way too flexible because
they work at a textual level. We have the right model, but the wrong
mechanism. So to move beyond that, we must implement parameterization in a
more structured sense. 
--
The so-called "module proposal" did the same thing with SGML DTD syntax.
(it uses a not-sufficient-constrained variety of parameter entities,
however).

http://www.ornl.gov/sgml/wg8/document/1987.htm
http://www.ornl.gov/sgml/wg8/document/1982.htm
--

The formal model is the forest automata theory. See:

http://lists.w3.org/Archives/Public/www-html/1998Mar/0017.html

This post shows how to do parameterization and composition. 

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

The past is inaccurate. Whoever lives long enough knows how much what he
had seen with his own eyes becomes overgrown with rumor, legend a
magnifying or belittling hearsay. "It was not like that at all!" -- 
he would like to exclaim, but will not, for they would have seen only 
his moving lips without hearing his voice. - Czeslaw Milosz (translated)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From liamquin at interlog.com  Fri Sep 11 09:22:00 1998
From: liamquin at interlog.com (Liam R. E. Quin)
Date: Mon Jun  7 17:04:35 2004
Subject: Summary of Namespaces and Validation
In-Reply-To: <3.0.32.19980910102336.00a02100@pop.intergate.bc.ca>
Message-ID: <Pine.BSI.3.96r.980911030511.28033B-100000@shell1.interlog.com>

On Thu, 10 Sep 1998, Tim Bray wrote:

> 1. Build a compound DTD that has prefixed declarations for all your
>    elements and attributes.  This is the hard part.
Yes, I agree.
MurataSahn's Forest Automata work may help a little -- see Paul's paper
for some examples.  Bt it does not solve the problem itself.

> What really bothers me is that discussion here keeps obsessing over the
> tedious but straightforward problem of matching up prefixes, and nobody's
> thinking about the interesting and difficult problem of compounding DTDs.
I think it's because that part doesn't sound hard.

The main difficulty I see is that we don't have a way of associating DTDs
(or fragments) with prefixes,  I'll start by assuming that such a way
exists and then see if that helps...

File 1, using prefix A:
<!ELEMENT Sup
    (#PCDATA|Sup)*
>

File 2, using prefix B:
<!ELEMENT Sup
    (beverage|food)+
>
<!--* supper is my bestest meal! *-->

Now <Sup> in an instance may have either mixed or element content,
depending on the context in which it occurs.

But we can transform it into A:Sup (mixed) or B:Sup (element) in
every case.

In fact, every element mentioned in a DTD can be algorithmically
normalized by prepending the prefix associated with it.

And if a DTD refers to another fragment with a prefix, the
same rules can be used.

If a DTD refers to multiple external definition sets, with multiple
prefixes, it is necessary to determine the source of each referenced
element type, which may require a two-pass algorithm to collect all
of the element definitions and then associate them with the right
prefixes.  The fact that an element type can be referred to before
it is declared makes this slightly more complex, but is necessary...

Tim, what am I missing?  Apart from the face that we lack a good
way to associate a DTD or DTD fragment with a prefix, I mean?

Lee

-- 
Liam Quin, GroveWare Inc., Toronto;  The barefoot agitator
l i a m q u i n     at    i n t e r l o g    dot   c o m


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 09:40:52 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:35 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <199809110325.XAA15202@goon.stg.brown.edu>
References: <35F88004.70A7F204@technologist.com>
Message-ID: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>

My retitling of this message is deliberately provocative, and I do NOT wish
a flame war to ensue. I want the title disproved, but by *actions and
evidence*, not statements of faith.

At 23:25 10/09/98 -0400, Richard L. Goerwitz III wrote:
>
>XML right now is, well, only slightly more than a dream.  I have a hard time
>even finding valid XML document instances on the net.  

I agree with this. Given that XML was designed for use over the Web
(right?) and it has been in gestation for 2 years I find it incredible that
XML has not done anything useful in public view. Lots of hype in the
magazines, etc. but nothing tangible to show for it. Tangible in the sense
that I can show a non-XML person something that will interest them.

XML was effectively launched in Spring 1997 at WWW6, Santa Clara. It's 15
months since then and over a year since the first draft of the XLink spec
was released.  And as far as I can see there are virtually no useful
applications that have been created. I now find it difficult to convince
people that XML is useful, other than by repeatedly stating it as an act of
faith.

There are lots of valuable *tools* developed (many announced on this list)
but is anyone actually using them publicly? [I do not get excited by
statements like "Corporation X can achieve x% reduction in costs by using
XML for its workflow". "We are using XML to store our configuration files,
etc." It may be true, and it may be good business, but it's hardly a turnon.] 

XML has limitless applications. I continue to suggest them on this list -
the response is underwhelming. By an application I mean "something that a
non-XML expert can do something useful with". That "something" might only
be to play minesweeper or whatever. At present I count the following:
	- MathML - IMO this is the one that has most chance of achieving critical
mass.
	- The XSL slide processor announced on XSL lists just now. Haven't looked
at it, but it sounds like an obvious and useful thing to do
	
and my own - which I am actively building up critical mass for:
	- Chemical Markup Language. (http://www.xml-cml.org). It is now
distributable. Henry and I are proseletising in the community, I think with
some success. But the lack of other visible XML makes it very difficult.
	- the Virtual HyperGlossary (http://www.vhg.org.uk). This is something
that couldn't be done with HTML, as it uses the hierarchical nature of XML
and the additional addressing of XLink. AFAIK the only application of XML
that actively uses XLink. Is no-one else interested in the power of Xlink -
my own view is that it's revolutionary.

The criterion for inclusion as a useful XML application is:
	- it must be usable over the WWW AND/OR
	- it must be downloadable and useful
	- it must do something that cannot be easily done with HTML OR
	- it must do something in an *immediately obviously* better way than HTML.
	- it must catch the imagination of someone who is not an XML expert.

I have been confidently predicting that XML would take the WWW by storm
during 1998. I am amazed and saddened that it hasn't done so, but there are
**three months left**. I have thrown out lots of ideas on this list with
virtually zero take up. Is no one else interested in:
	- an XML spreadsheet?
	- an XML drawing tool?
	- a collaborative XML environment?
	- XML games

As far as I can see, most readers of this list are:
	- waiting for XSL because all they are interested in is rendering text
with infinite precision. Worthy, but surely that's not the main point of
XML. Also it's a year away.
	- waiting for MS/NS to come up with 'XML browsers'. Doesn't look very
promising, does it?
	- only really interested in using XML to manage their current client business
	- interested in doing some in-house re-engineering.
	- have some medium/long-term strategy for developing products. No doubt
some of these will be very exciting but I doubt they will spread a flame
across the WWW.
	- just waiting

By contrast, when Mosaic hit the WWW, within *months* I was able to:
	- use search engines (a completely new idea)
	- post requests by interactive forms (again a stunning idea)
	- send and display complex objects (e.g. molecules) painlessly over the
WWW (again stunning).
	- get servers to do incredible calculations (painlessly).

Much the same dynamics were seen for Java.

Where are the excited XML hackers? You don't need a *browser* to do fun
things. Where are the grad students (Paul Prescod doesn't count any longer
:-). 

Isn't OASIS interested in creating a fun demo disk/CDROM to promote XML?  

Or am I right that XML is fundamentally about as boring as the introduction
of  TTL or 3-phase electricity - worthy, but manufacturer-level only?

I am hoping to be inundated with mail that shows I am wrong - that I have
taken a narrow view - that I don't read the newsgroups. But not mere
statements of faith, please. I want something I can *show* people. 

In addition I would like volunteers or code to help with the collaborative
project I suggested two days ago. So far the response has been
underwhelming. All I need is some dumb server-side software that can keep
open channels (or re-open them) between two 'players'. [Of course the
application need not be just games.] This is probably so trivial to some of
you that you think it's not worth offering - but I happen to be ignorant
about it.

	P.

	
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Robert.WAKELING at DG15.cec.be  Fri Sep 11 10:01:41 1998
From: Robert.WAKELING at DG15.cec.be (Robert.WAKELING@DG15.cec.be)
Date: Mon Jun  7 17:04:35 2004
Subject: DTD's and namespaces
Message-ID: <WIN93d-980911075743-4166*/G=Robert/S=WAKELING/O=DG15/PRMD=CEC/ADMD=RTT/C=BE/@MHS>

I am looking for ways to make it easy for people to find and understand public
tender notices (or solicitation synopses). SGML DTD's are routinely used to
format at least 150,000 of these per year. Since they can be in any of eleven
languages we also have a coded multilingual vocabulary of over 9000 product and
service definitions, as well as city names and subject headings. I thought that
XML might provide a neat way to standardise these notices across many sites and
allow selective searching, retrieval and display of these notices in any
language. But I can't find enough guidance on designing DTD's or using
namespaces to see clearly how to do this, or if it can be done.

I have two specific problems: when to use elements or attributes, and how to
make reference to external documents or language vocabularies. The namespace
debate has further confused me here. My first humble attempt looks like this:

<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE TenderNotice [
<!ELEMENT TenderNotice (ContractingAuthority, (ContractInformation,
ContactDetails, Reference)*) >

<!ATTLIST TenderNotice
        PublicationDate         CDATA   #REQUIRED 
        Xml:lang        NMTOKEN         #IMPLIED >

<!-- I don't understand what should be treated as elements and what as
attributes so elements are used below. -->

<!ELEMENT ContractingAuthority (OrganisationName, Address)>
<!ELEMENT OrganisationName (#PCDATA) >
<!ELEMENT Address (Addressline+, Postcode?, Country?) >
<!ELEMENT Addressline (#PCDATA) >
<!ELEMENT Postcode (#PCDATA) >
<!ELEMENT Country (#PCDATA) >

<!-- Should be an ISO3166 Code -->

<!ELEMENT ContractInformation (ObjectOfContract, EstimatedValue?, DeadlineDate?)
>

<!ELEMENT EstimatedValue (#PCDATA) >

<!-- will need currency attributes -->

<!ELEMENT DeadlineDate (date) >
<!ATTLIST DeadlineDate 
        DeadlineForReceiptof    (Tenders | RequestToParticipate) #REQUIRED
<!ELEMENT date (#PCDATA) >

<! --in accordance with ISO 2014-1976 (YYYYMMDD) for example. There are other
dates that may be equally or more relevant -->

<!ELEMENT ObjectOfContract (WorkDescription, CPVCode*) >
<!ELEMENT WorkDescription (#PCDATA) >
<!ELEMENT CPVCode (#PCDATA) >

<! -- this is the code for Work descriptions with eleven corresponding language
versions so it could automatically generate the WorkDescription in a chosen
language -->


<!ELEMENT ContactDetails (CommonName, Address, (TelephoneNumber,
FacsimileNumber, EmailAddress,)? Other*) >
<!ELEMENT CommonName (Honorific?, (Initials|GivenName)?, SurName) >
<!ELEMENT Honorific (#PCDATA) >
<!ELEMENT Initials (#PCDATA) >
<!ELEMENT GivenName (#PCDATA) >
<!ELEMENT SurName (#PCDATA) >
<!ELEMENT TelephoneNumber (#PCDATA) >
<!ELEMENT FacsimileNumber (#PCDATA) >
<!ELEMENT EmailAddress (#PCDATA) >
<!ELEMENT Other (#PCDATA) >

<!ELEMENT Reference (#PCDATA)>

<! -- this is where I want to provide for references to full tender documents,
standard conditions and so on either through URL's, URIs, external entity
references or some other mechanism perhaps with Processing instructions. -->


]>

I don't know whether this is the right place to ask for answers- but can anyone
help or suggest where else could I look?

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Fri Sep 11 10:17:28 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:35 2004
Subject: namespaces discussion
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036084@WROX3>


> -----Original Message-----
> From:	Murray Altheim [SMTP:altheim@mehitabel.eng.Sun.COM]
> Sent:	Thursday, September 10, 1998 7:00 PM
> To:	Peter Jones; xml-dev@ic.ac.uk
> Subject:	Re: namespaces discussion
> 
> Peter Jones <peterj@wrox.com> writes:
> [...]
> > What do you mean by compounding DTDs? I don't know whether any of my
> > postings to the list have been getting through, ...but why can't the
> > notion of a DTD be an utterly nebulous concept in the abstract,
> elements
> > themselves having a namespace URIs which addresses a DTD entity for
> that
> > particular element. Different elements validated against different
> > declarations lying in dispersed DTD entities.
> > Why isn't this idea getting through to anyone? (am v. frustrated!)
> 
> Well, maybe nobody understands you, or maybe it's not an idea with
> much
> fluency. I do DTD work for a living, and spreading one's declarations
> amongst multiple entities doesn't solve anything except spreading
> one's
> declarations amongst multiple entities. Some people call it
> modularization.
	[Peter Jones]  The idea I'm driving at is that DTDs should not
be tied down to namespace prefixes, and should be maximally re-useable.
The namespace prefix should be used only as a shorthand within the
document. THe URI of the namespace can (for user option) be made to have
significance (beyond avoiding name collisions) by denoting the address
of a document entity where declarations lie. Validation would then be
against a declaration which only concerns the name part of the qualified
name.

	DTD entity contains:
	<!ELEMENT   number   (content1 | content2| content3) >

	Document contains
	(ignoring the fact that I can't remember the exact syntax)

	<foo:number xmlns:foo="http://...[whatever]">

	Where the URI refers to the file containing the declaration
above, and validation takes place only on the name "number" NOT the
qualified name "foo:number".

	You can then have old style DTDs or compound docs or whatever.


> It doesn't address the real issues the arise when one is attempting to
> 
> create a 'compound' document type from multiple sources. The namespace
> draft solves only one problem (name collisions), but it introduces a
> few other (what are IMO profound) problems.
> 
> As for nebulosity, we don't need nebulosity, we need a DTD with both
> of
> its feet on solid ground.
> 
> Murray
> 
> ......................................................................
> ..
> ...
> Murray Altheim, SGML Grease Monkey
> <mailto:altheim&#64;eng.sun.com>
> Member of Technical Staff, Tools Development & Support
> Sun Microsystems, 901 San Antonio Rd., UMPK17-102, Palo Alto, CA
> 94303-4900
> 
>        Ernst Martin comments in 1949, "A certain degree of noise in 
>        writing is required for confidence. Without such noise, the 
>        writer would not know whether the type was actually printing 
>        or not, so he would lose control."

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From north at Synopsys.COM  Fri Sep 11 10:19:50 1998
From: north at Synopsys.COM (Simon North)
Date: Mon Jun  7 17:04:35 2004
Subject: XML is boring (long --- sorry)
In-Reply-To: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
References: <199809110325.XAA15202@goon.stg.brown.edu>
Message-ID: <199809110817.KAA26872@goofy.gr05.synopsys.com>

Peter Murray-Rust says that XML is boring because there are few 
(public) applications ... I think it's even worse than that. Let me 
explain ...

When I was trying to demonstrate the potential of an Intranet back in 
1992, using Netscape 1.1 and CERN's httpd I was able to put together 
a quite impressive webette quite quickly. Thus far I agree with 
Peter, we have nothing 'sexy' to show people and, worse, precious 
little for them to be able to try for themselves. Yes, we have a few 
editors but without some kind of rendition there's nothing visible.

I recently gave a presentation about XML to one of the leading 
technical documentation companies in The Netherlands: 

- I showed them XML in Mozilla (the August build is quite stable 
  under NT now) and sketched some of the possibilities opened up by 
  transclusions (single-source online and paper documentation is 
  still the philosopher's stone of the tech writing world, believe 
  me). 

- I demonstrated 'islands of data' in IE 5 (I'm still worried that on 
  my three-day visit to Redmond for the XML Summit --- a fascinating 
  event --- not a single thing was said about XLL support), important 
  because these people do a lot of catalog publishing and a web 
  browser is a perfect solution for multi-platform delivery (why 
  worry about Mac/Unix/PC/resolution monitor problems when a 
  Microsoft and Netscape have off-the-shelf answers?). 

- I talked about vendor support (yes, I'm *still* waiting for Adobe 
  to finally fulfill their promise to release an upgrade for 
  Frame+SGML to support XML). 

- I showed them IE4's support for structured graphics and discussed 
  Microsoft's committment to VML (vital for interactive docs where 
  hotspot maintenance is an even bigger problem than link 
  maintenance).

- I described Office 2000 and all the features that Microsoft say 
  they will implement (HTML round-tripping, HTTP server's as 
 folders, XML metadata)

- I showed them Chrome ('scuse me but for interactive demos, etc. it 
  is still *cool* even if it isn't exactly leading edge).

- I even (as far as I am able since it's still in R&D) discussed my 
  own implementation of XML as a data format for algorithm synthesis
  model definition files in a new Synopsys product ... and how I hope 
  to be able to single source online presentation and printed 
  documentated from the same source code.

Where this is all leading to? ... a conclusion from one of the 
attendees that "XML is a programming language". Now, maybe I'm 
over-reacting, but I've never thought of SGML as a programming 
language. I find it very hard to keep a straight face at even the 
suggestion that HTML might be a programming language. Remembering 
what John Bosak said at SGML Europe '98 (it's been repeated 
several times since) about how we musn't be trapped into letting XML 
become just a data format, the view of XML as a programming language 
made me wonder if that danger might be a lot closer than we all 
realize. 

So. Please, I give resounding support to Peter's plea. Have a look at 
XML chess or some other "sexy" application. Don't let XML become a 
delivery format for (D)HTML ... 


My 25 cents,

Simon North

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peterj at wrox.com  Fri Sep 11 10:22:48 1998
From: peterj at wrox.com (Peter Jones)
Date: Mon Jun  7 17:04:35 2004
Subject: namespaces discussion
Message-ID: <29AA5A0E3A0CD21196F300A0C9D8575C036085@WROX3>


> -----Original Message-----
> From:	John Cowan [SMTP:cowan@locke.ccil.org]
> Sent:	Thursday, September 10, 1998 6:54 PM
> To:	XML Dev
> Subject:	Re: namespaces discussion
> 
> Peter Jones wrote:
> 
> > What do you mean by compounding DTDs? I don't know whether any of my
> > postings to the list have been getting through, ...but why can't the
> > notion of a DTD be an utterly nebulous concept in the abstract,
> elements
> > themselves having a namespace URIs which addresses a DTD entity for
> that
> > particular element. Different elements validated against different
> > declarations lying in dispersed DTD entities.
> > Why isn't this idea getting through to anyone? (am v. frustrated!)
> 
> This may be feasible for some yet-to-be-standardized schema language,
> but not for DTDs as such.  "DTD" is a fixed, narrow, SGML-compatible
> notion that can't be changed.
> 
> In addition, the namespace draft has laid down that the URI is
> used solely for comparison, and needn't represent an existent
> resource, much less a specific schema.
	[Peter Jones]  
	Doing my best to be tactful here (although I admit I'm not very
good at it :)
	Isn't XML-dev about critiquing specs and pushing the whole issue
of XML as far as it needs to go? Even if that means dropping ideas on to
the list that mean going back to the drawing board as far as specs are
concerned.
> -- 
> John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
> 	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
> 	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
> 		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)
> 
> xml-dev: A list for W3C XML Developers. To post,
> mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
> message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Fri Sep 11 12:07:50 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:36 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>	
	 <199809090121.SAA15093@mail-gw.pacbell.net>
	 <v03102801b21c945aab3e@[203.23.215.128]> <v04011701b21d8ea34317@[203.23.215.86]>
Message-ID: <35F8F277.633E7C9A@jclark.com>

Andy Dent wrote:
> 
> At 7:31 PM +0800 10/9/98, James Clark wrote:
> >An XSL processor can do other things with the result tree than just
> >write it out as XML.
> >
> >If you want to use XSL to produce some non-XML format, first you need to
> >devise an XML representation of it.
> Why?
> 
> Why can't a product like our report-writer take
> - XML describing content
> - XSL specifying layout
> and produce, for example, a report preview window on a Mac?

It can, as I explained in the rest of my message. Your product doesn't
have to physically create the XML representation.  There needs to be a
XML representation specified because an XSL stylesheet specifies its
result as XML. XML is the interface language between XSL and and the
outside world.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Fri Sep 11 12:19:47 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:36 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>
			 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]> <35F7B89F.2E779FDB@jclark.com> <35F804CD.47CAAB14@locke.ccil.org>
Message-ID: <35F8F3B3.4149A79A@jclark.com>

John Cowan wrote:
> 
> James Clark scripsit:
> 
> > If you want to use XSL to produce some non-XML format, first you need to
> > devise an XML representation of it.  For example, in the case of HTML,
> > this would be "well-formed HTML", that is XML using the element types
> > and attributes of XML [sic; HTML].
> 
> Almost, almost, but not quite!
> 
> Well-formed HTML is not quite well-formed XML, because of the possible
> presence of "&" and "<" in the CDATA elements SCRIPT and STYLE.

The term "well-formed HTML" as used in section 1 of the XSL WD does not
mean SGML that conforms to HTML 4.0. It means well-formed XML that uses
element types and attributes from HTML.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sblackbu at erols.com  Fri Sep 11 12:28:41 1998
From: sblackbu at erols.com (Samuel R. Blackburn)
Date: Mon Jun  7 17:04:36 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <00ed01bddd6e$e7e382b0$432e0318@cc221812-a.hwrd1.md.home.com>

You bring up some interesting points. I first heard about XML about a
year ago at the Microsoft Professional Developer's Conference in
San Diego. At first, I was very excited about the technology. I
came home and added XML to my freeware C++ class library.

What excited me about XML was it's ability to pass data in
a form that anyone could parse. Universal data transfer. Sounded
like a good idea to me.  The syntax of XML is wonderful. However,
IMHO XML is saddled with design goal #3 "XML shall be compatible
with SGML." I thought, "Oh great, yet another way to show pretty
text." I don't need another way of showing pretty text. HTML has
solved that problem well enough.

What I need is a way to pass data around so anyone can use
any part of it they wish. Looking at XML from a data centric
perspective, there are things in it that are worthless, DTD's
for example.

When asked why I use XML in my programs, I tell folks
"what HTML is to text, XML is to data." I've found XML to
be a wonderful solution to exporting data in an easily
consumable format. I could care less if a browser knows
how to consume XML.

Just my $.02,

Sam Blackburn

-----Original Message-----
From: Peter Murray-Rust <peter@ursus.demon.co.uk>
To: xml-dev@ic.ac.uk <xml-dev@ic.ac.uk>
Date: Friday, September 11, 1998 3:45 AM
Subject: XML is boring (was Re: coming clean with the SGML crowd)


>My retitling of this message is deliberately provocative, and I do NOT wish
>a flame war to ensue. I want the title disproved, but by *actions and
>evidence*, not statements of faith.
[snip]
>Where are the excited XML hackers? You don't need a *browser* to do fun
>things. Where are the grad students (Paul Prescod doesn't count any longer
>:-).

[snip]


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep 11 12:32:42 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:36 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
References: <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <v04011704b21eaa82b76d@[203.23.215.128]>

At 8:40 AM +0800 11/9/98, Peter Murray-Rust wrote:

>	- an XML spreadsheet?
our initial rewrite of our report writer to use XML is close to the above,
but without the calculation logic :-). As the report writer preview window
allows editing of content in the cells, it effectively becomes a
spreadsheet editor. We are a LOOONNNGGG way from a full spreadsheet and
frankly am not interested in pursuing this direction without a client
driving it. Without disclosing too much I can say that one of our clients
will be using our toolkit to create their own spreadsheet-like behaviours
but it is very early days for their use.

However, I'm always open to collaboration - the dBase and graphing engines
in OOFILE were developed on a royalty basis by a local developer, and I'd
love the chance to drop in a calculation engine. OOFILE is a commercial
(c++) developer tool, but I'd be willing to release a free Mac/Win compiled
spreadsheet product particularly if it helped boost the XML community.

>Where are the excited XML hackers?
Not sure if I count on any of 3 bases, I'm as much baffled as excited,
can't claim to be an XML person until we've actually shipped the betas in a
few weeks and I'm not really a hacker any more as I'm trying very hard to
lead-by-engineering-example :-)


Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep 11 12:34:58 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:36 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <35F8F277.633E7C9A@jclark.com>
References: <199809090522.BAA11695@ruby.ora.com>		
 <199809090121.SAA15093@mail-gw.pacbell.net>	
 <v03102801b21c945aab3e@[203.23.215.128]>
 <v04011701b21d8ea34317@[203.23.215.86]>
Message-ID: <v04011705b21eac802f1c@[203.23.215.128]>

At 5:50 PM +0800 11/9/98, James Clark wrote:
>There needs to be a
>XML representation specified because an XSL stylesheet specifies its
>result as XML.
Umm, I'm not sure I can agree.

The XSL parsers that exist, and the standard, specify the result as XML.

As far as I can see there's nothing saying the output of XSL+XML can't be
an invisible set of c++ objects.

I'm not being wilfull, although possibly a little dense. I think part of my
problem comes from the fact that XSL as currently specified is not
sufficiently powerful for layout descriptions of complex nature of the
reports people are already producing with our product.

Therefore, I'm acutely aware that we can't just drop an existing XSL parser
into our app (assuming there was a portable C or C++ one) as we have to do
a lot more processing with the styles, and extend the XSL standard right
now.

The other thing that I haven't probably made clear to anyone following this
argument is that we are not starting from scratch. The report writer exists
at present.

A typical scenario is an application that creates a bunch of c++ objects to
output a report. This logic is all there - what we are doing with XML is
building an engine to re-create the same report objects currently
constructed with application code. Our use of XML is as a saved document
format, more akin to using XML for data interchange. Therefore, using
XSL+XML to output 'styled XML' then parsing the latter for visual
presentation or printing, is a lot more work than necessary.
Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep 11 12:52:35 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:36 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <00ed01bddd6e$e7e382b0$432e0318@cc221812-a.hwrd1.md.home.com>
Message-ID: <v04011701b21eb24a8b84@[203.23.215.128]>

At 6:28 PM +0800 11/9/98, Samuel R. Blackburn wrote:
>Looking at XML from a data centric
>perspective, there are things in it that are worthless, DTD's
>for example.
I disagree with this - DTD's for example will help us build a database
schema to contain the XML. Otherwise we'd have to scan a large XML file to
deduce the structure then rescan to store into the database.

This is a wholly data-centric viewpoint.
Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Fri Sep 11 12:56:05 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:36 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>		
	 <199809090121.SAA15093@mail-gw.pacbell.net>	
	 <v03102801b21c945aab3e@[203.23.215.128]>
	 <v04011701b21d8ea34317@[203.23.215.86]> <v04011705b21eac802f1c@[203.23.215.128]>
Message-ID: <35F8FE21.181ED5C9@jclark.com>

Andy Dent wrote:
> 
> At 5:50 PM +0800 11/9/98, James Clark wrote:
> >There needs to be a
> >XML representation specified because an XSL stylesheet specifies its
> >result as XML.
> Umm, I'm not sure I can agree.
> 
> The XSL parsers that exist, and the standard, specify the result as XML.
> 
> As far as I can see there's nothing saying the output of XSL+XML can't be
> an invisible set of c++ objects.

How are we disagreeing?  I said "specifies its result as XML" not
"specifies that its result is XML".  In other words it describes its
result in terms of an XML document.  That XML document doesn't have to
be created.  An XSL processor is perfectly entitled to create C++
rendering objects that are described by the XML instead of objects
representing XML elements.

James

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sblackbu at erols.com  Fri Sep 11 13:20:14 1998
From: sblackbu at erols.com (Samuel R. Blackburn)
Date: Mon Jun  7 17:04:36 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <017701bddd75$fc3907b0$432e0318@cc221812-a.hwrd1.md.home.com>

It all depends on how you look at it. If you agree on reserving
a tag name, then you can express type information in the same
syntax as XML. For example:

<typeinfo>
<EVENT>
<NAME>
<type>string</type>
<description>User supplied name of the event</description>
</NAME>
<TIME>
<type>ui4</type>
<description>Number of seconds since 1970-01-01</description>
</TIME>
<CHECKSUM>
<type>ui8</type>
<description>MD5 checksum of the NAME and TIME fields</description>
</CHECKSUM>
</EVENT>
</typeinfo>

Granted, this is a simplistic method but all it is meant to do is
supply a human reading the XML enough information to write
a validation routine for the data. IMHO you cannot express
enough information in a generic fashion (i.e. DTD or the above
scheme) that will allow data validation in a generic fashion.
It is easy to validate the simple things like "this is a number"
or "this is a date" but validating things like "this is an MD5
checksum of the previous two fields" is impossible.

Sam

-----Original Message-----
From: Andy Dent <dent@highway1.com.au>
To: Samuel R. Blackburn <sblackbu@erols.com>; Peter Murray-Rust
<peter@ursus.demon.co.uk>; xml-dev@ic.ac.uk <xml-dev@ic.ac.uk>
Date: Friday, September 11, 1998 6:51 AM
Subject: Re: XML is boring (was Re: coming clean with the SGML crowd)


>At 6:28 PM +0800 11/9/98, Samuel R. Blackburn wrote:
>>Looking at XML from a data centric
>>perspective, there are things in it that are worthless, DTD's
>>for example.
>I disagree with this - DTD's for example will help us build a database
>schema to contain the XML. Otherwise we'd have to scan a large XML file to
>deduce the structure then rescan to store into the database.
>
>This is a wholly data-centric viewpoint.
>Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western
Australia
>OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
>PP2MFC - PowerPlant->MFC portability
>http://www.highway1.com.au/adsoftware/crossplatform.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Fri Sep 11 13:49:43 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:36 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
In-Reply-To: <199809110325.XAA15202@goon.stg.brown.edu>
References: <35F88004.70A7F204@technologist.com>
	<199809110325.XAA15202@goon.stg.brown.edu>
Message-ID: <199809111148.HAA00267@unready.megginson.com>

Richard L. Goerwitz III writes:

 > XML right now is, well, only slightly more than a dream.  I have a
 > hard time even finding valid XML document instances on the net.
 > Its APIs are almost all done in Java (itself a moving target which,
 > despite the hype, has precious few major applications to its
 > credit).  And its supporters have fallen to squabbling.
                 ^^^

PRONOUN REFERENCE ALERT!  Who's squabbling -- Java's supporters or
XML's supporters?  Not that we ever squabbled about SGML...

Richard is certainly right about the lack of valid (or even just
well-formed) XML document instances on the net.  You'll find the XML
spec itself, Jon's religious and Shakespeare texts (which need
fixing), the XML Heart of Darkness on my personal (Sprynet) web site,
a few converted Sun docs, etc. -- certainly nothing to get excited
about.  If it weren't for the TEI, however, SGML wouldn't be doing
much better than XML on the Web, outside of HTML itself.  Only a tiny
fraction of user systems have the technology to view rendered versions
of XML *or* general SGML documents, so it is not surprising that
people aren't bothering to publish directly in SGML or XML.

As for Java, the comparison might not be fair -- Java is very heavily
used as middleware and on the server side (the most likely place for
XML to be deployed), and the core non-graphical APIs for Java are
relatively stable since 1.1 (modulo a few new classes in java.util).
Furthermore, nearly any user who bothers to go on the Internet
encounters several Java applets/session.

When the Java hype started, people could think of Java only in
conventional terms: vendors would write giant applications that users
would install and run on their systems, just as they did using C++.
Of course, you can still do that if you really want to, but it turns
out that instead of competing directly with C++ (etc.), Java helped
bring along along a new, distributed information model, where users
see small applets on web pages, and servers plug in small servlets to
do the work.

When the XML hype started, people could think of XML only in
conventional terms: authors would write XML documents that users would
view rendered in browsers, just as they did using HTML.  Of course,
you can still work towards that if you really want to...

 > I don't really know what to develop for right now, what with
 > confusion over our design goals, about namespaces, about XSL, SGML
 > compatibility, and virtually everything else.  Perhaps if we could
 > all agree on a few basics, we could set- tle things down.

Develop for XML 1.0.  If you don't need namespaces, don't bother with
them, or at least wait until the bigger parts of the market decide
whether they'll use them.

XML found an unoccupied niche and moved into it.  Now, XML is breeding
and producing dozens of derived and related specs, like tadpoles --
since food is scarce and predators are fierce, most of the offspring
will die immediately, and most of the remainder will not make it to
adulthood.  This is a standard situation, both in nature and in
standards bodies.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 14:35:58 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:36 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <00ed01bddd6e$e7e382b0$432e0318@cc221812-a.hwrd1.md.home.co
 m>
Message-ID: <3.0.1.16.19980911122836.9757307c@pop3.demon.co.uk>

At 06:28 11/09/98 -0400, Samuel R. Blackburn wrote:
[...]
>
>When asked why I use XML in my programs, I tell folks
>"what HTML is to text, XML is to data." I've found XML to
>be a wonderful solution to exporting data in an easily
>consumable format. I could care less if a browser knows
>how to consume XML.

I share some of this viewpoint, but I'd go further and say that XML is the
only way forward if we wish to *integrate documents and data*. Properly, I
mean. So you could drop a financial report into an XML tool and it would
give you a spread sheet of all the figures and calculate predictions, etc.
[I have more ideas ...].

	I have had a volunteer!  We need more...

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Fri Sep 11 14:38:28 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:36 2004
Subject: namespaces discussion
In-Reply-To: <00f601bddd3e$e4d64380$c86118cb@caleb> from James Tauber at "Sep 11, 98 12:40:33 pm"
Message-ID: <199809111237.IAA23107@goon.stg.brown.edu>

> Say my book DTD has:
> 
> <!ELEMENT div (%base.div.content;|%extended.div.content;)>
> ... 
> <!ENTITY % mathml.dtd SYSTEM "mathml.dtd">
> <!ENTITY % pgml.dtd SYSTEM "pgml.dtd">
> %mathml.dtd
> %pgml.dtd
> 
> <!ENTITY % extended.div.content "math|pgml">
>
> And bingo!, I can use MathML and PGML in my Book.

For XML, of course, you need to add semicolons to the parameter
entity references, and make sure that this is all in the external
DTD subset (because of the PEs being used in an ELEMENT decl).
You then also have to put your % extended.div.content decl before
the ELEMENT div decl.

But assuming all this was done, you'd still have to have full
control over the mathml and pgml DTDs, to make sure they didn't
define any of the same elements as each other - or your Book
DTD.

It's all one namespace, and XML doesn't allow element redefini-
tion.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Fri Sep 11 14:40:31 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:36 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <v03102802b21e5d4514e1@[203.7.224.126]>
References: <199809090522.BAA11695@ruby.ora.com>
	<199809090121.SAA15093@mail-gw.pacbell.net>
	<v03102801b21c945aab3e@[203.23.215.128]>
	<v04011701b21d8ea34317@[203.23.215.86]>
	<35F7E9C7.EA6B8BA8@technologist.com>
	<v03102802b21e5d4514e1@[203.7.224.126]>
Message-ID: <199809111238.IAA00425@unready.megginson.com>

Andy Dent writes:

 > Mind you, it still may be vastly easier and more efficient for us
 > to skip this intermediate step as an XML rendition and keep our
 > intermediate version purely in-memory as a set of c++ formatting
 > objects, applied to a database.

There's no need to actually generate the XML rendition unless you plan
to exchange.  XML is an interchange standard -- it doesn't limit what
you can do internally.  You don't have to actually write an XML
document and then parse it back in, as long as the results are the
same.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mika.kekkonen at vtt.fi  Fri Sep 11 15:16:40 1998
From: mika.kekkonen at vtt.fi (mika.kekkonen@vtt.fi)
Date: Mon Jun  7 17:04:36 2004
Subject: xml diff program
Message-ID: <3.0.32.19980911161349.00983eb0@vttmail.vtt.fi>

Do you know any sgml/xml diff programs?

I have used sgmldiff 1.2 which I got with Information Manager -program, but
I have some difficulties. For example if I move some element program
reports this move as delete and insert operations and not as move. This
problem arises if element to be moved is situated enough deep (level 3) at
xml tree.   

Thanks for your time.

Mika 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Nidal.AMER at hdmp.com  Fri Sep 11 15:20:19 1998
From: Nidal.AMER at hdmp.com (AMER, Nidal)
Date: Mon Jun  7 17:04:36 2004
Subject: DTD vs Schema
Message-ID: <A789DF8BDE02D211B63C00805FC78E97024358@BES40ENT000>

In fact I am trying to implement in XML an object model I have designed
with UML in which I clearly define generalization and specialization
relationships. In instance, I am treating a generic type called
transaction from which I derive different types of transactions all of
which share a common behavior. 
The XML Data schema uses SUPERTYPE to declare inheritance, even if this
is not very useful unless the DOM model, built by the parser, could
recognize that an element inherits another one and lets the application
treat the subtype the same way as the supertype ( polymorphism ). Also,
when referenced, the subtype should be checked against applying the
validity constraints of the supertype plus its own constraints.

Unfortunately, as this seems not to be implemented, I have to build this
logic in the code.

Regards,
Nidal.

	-----Original Message-----
	From:	Peter Murray-Rust [SMTP:peter@ursus.demon.co.uk]
	Sent:	Friday, September 11, 1998 2:21 PM
	To:	AMER, Nidal
	Subject:	Re: DTD vs Schema

	At 12:44 11/09/98 +0200, you wrote:
	>Thank you all a lot for your answers. I am a step further now.
	>Just I still can't express inheritance with DTD in an elegant
way.

	No - it's not easily built in to XML. I think that we shall
develop some
	general means for doing this. It depends *what* your want to
inherit.

	I have developed this in my Chemical Markup Language,
	http://www.xml-cml.org (BTW I used to work in the pharma
industry). I have
	an array element which can be subclassed to atomArray and
bondArray (and
	all of these are mapped to Java). But there is no way of
*enforcing* the
	parallel between XML and Java here - I have to remember it when
I write the
	code.

	I think DCD and other schemas will start addressing this.

		P.

	>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
	>Nidal AMER
	>Deputy R & D Director
	>Health Data Management Partners (HDMP)
	>A SmithKline-Beecham Company
	>6 Rue de Gen?ve,
	>1140 Brussels, Belgium
	>Tel: + 32 (2) 724 00 93
	>Fax: + 32 (2) 726 91 59
	>E-mail: nidal.amer@hdmp.com <mailto:nidal.amer@hdmp.com> 
	>Visit our web site: http://www.hdmp.com <http://www.hdmp.com> 
	>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
	>
	Peter Murray-Rust, Director Virtual School of Molecular
Sciences, domestic
	net connection
	VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
	http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Fri Sep 11 16:03:44 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:37 2004
Subject: namespaces discussion
Message-ID: <007201bddd8d$2c516160$cb6118cb@caleb>

-----Original Message-----
From: Richard L. Goerwitz III <richard@goon.stg.brown.edu>
>For XML, of course, you need to add semicolons to the parameter
>entity references

whoops. yes, of course.

>and make sure that this is all in the external
>DTD subset (because of the PEs being used in an ELEMENT decl).
>You then also have to put your % extended.div.content decl before
>the ELEMENT div decl.

My example assumed this. Sorry if I didn't make that clearer.

>But assuming all this was done, you'd still have to have full
>control over the mathml and pgml DTDs, to make sure they didn't
>define any of the same elements as each other - or your Book
>DTD.

No, you'd just have to look at mathml and pgml and make sure that they
didn't clash with each other and then write your book dtd to not use any of
their element type names.

But my point was not that you don't need namespaces. In fact, my point was
the exact opposite. You need namespaces to avoid clashes. My point was that
that is what namespaces do: they avoid name clashes. Mixing DTDs is achieved
through a different mechanism, namely parameter entities. Paul Prescod has
since said as much.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cvonsee at onramp.net  Fri Sep 11 16:12:27 1998
From: cvonsee at onramp.net (Chris von See)
Date: Mon Jun  7 17:04:37 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
References: <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <199809111411.JAA02716@mailhost.onramp.net>

At 08:40 AM 9/11/98 +0000, Peter Murray-Rust wrote:
>I agree with this. Given that XML was designed for use over the Web
>(right?) and it has been in gestation for 2 years I find it incredible that
>XML has not done anything useful in public view. Lots of hype in the
>magazines, etc. but nothing tangible to show for it. Tangible in the sense
>that I can show a non-XML person something that will interest them.
>

<snip>

>The criterion for inclusion as a useful XML application is:
>	- it must be usable over the WWW AND/OR
>	- it must be downloadable and useful
>	- it must do something that cannot be easily done with HTML OR
>	- it must do something in an *immediately obviously* better way than HTML.
>	- it must catch the imagination of someone who is not an XML expert.

A newbie's perspective...

I am a commercial developer that became interested in XML about a month ago
as a result of the media hype that Peter refers to.  Rather than coming
into this with an SGML- or publishing-centric viewpoint (which seems to be
the view of the vast majority of the members of this list), I came into
this from a database/data communications perspective - one that saw XML as
a potential tool for a whole new suite of distributed, data-based
(knowledge-based?) applications.  In many instances, this was the way that
XML was portrayed by the media; the introduction of sexy products from
webMethods and DataChannel, the start of the XML/EDI initiative, and other
industry announcements just reinforced this viewpoint.  The key media point
(at least in the trade rags I read) was this: XML was relevant in A)
business-to-business and business-to-consumer electronic commerce
applications, and B) applications that required the linking of data from
disparate sources in a common format for consumption by automata.  There
was also some talk about ease in searching the Web and other
automata-enabled processes, but there was really very little mention of
XML as a replacement for HTML, XML in electronic publishing, XSL, etc. 

Personally (even after reading all the squabbl-uh-uh-discussion on this
list ;-), I still believe that data-based applications are where XML has
the greatest potential to achieve visibility and commercial success.  Even
though XML itself is not particularly sexy or exciting, combining the
concepts it embodies with the power of XLL, RDF, DOM and namespaces (and
even XSL) gives developers that are focused on the applications above the
opportunity to do some things that *are* sexy and exciting.  That's what
Mosaic had - sex-appeal and *immediate* applicability.

>As far as I can see, most readers of this list are:
>	- waiting for XSL because all they are interested in is rendering text
>with infinite precision. Worthy, but surely that's not the main point of
>XML. Also it's a year away.
>	- waiting for MS/NS to come up with 'XML browsers'. Doesn't look very
>promising, does it?
>	- only really interested in using XML to manage their current client
business
>	- interested in doing some in-house re-engineering.
>	- have some medium/long-term strategy for developing products. No doubt
>some of these will be very exciting but I doubt they will spread a flame
>across the WWW.
>	- just waiting
>

I don't really fit into any of Peter's catagories... I'm trying to decide
whether it makes financial/commercial sense to invest my (limited)
development funds in this newfangled XML stuff, or if XML is going to find
a niche somewhere but never get anchored in the general market conscience.

If visibility and commercial success if what you want, I believe that what
XML and its related technologies need is a commercial champion - someone
like IBM, Microsoft, or Netscape that can come forward and put a flag up
that developers can rally around.  No XML spreadsheet, drawing tool or
chess set is going to generate success for XML like an evangelical company.  


Chris


------------------------------------------------------
"Don't *say* things.  What you *are* stands over you the while, and
thunders so that I cannot hear what you say to the contrary."
                  

--- Emerson, "Social Aims"

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 11 16:23:04 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:37 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>

At 08:40 AM 9/11/98, Peter Murray-Rust wrote:
>Or am I right that XML is fundamentally about as boring as the introduction
>of  TTL or 3-phase electricity - worthy, but manufacturer-level only?

That might very well be the case. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 11 16:23:07 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:37 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
Message-ID: <3.0.32.19980911071651.00aa7670@pop.intergate.bc.ca>

At 01:03 PM 9/11/98 +0800, Andy Dent wrote:
>At 23:01 +0800 10/9/98, Paul Prescod wrote:
>>The browser takes XML, pumps it through an XSL engine, receives an XML
>>result (according to a known DTD with formatting semantics) and renders
>>*that*. You can do the same with your report writer.
>
>THANK YOU

Ouch.  Should have been watching more carefully.  This is not quite right;
most important, DTDs have no formatting semantics.  CSS and XSL stylesheets
do. -Tim


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Fri Sep 11 16:27:56 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:37 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
References: <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <3.0.5.32.19980911072823.00c60d50@scripting.com>

Peter, thanks for a great message! I am new to the xml-dev list and hadn't
seen your earlier messages. What you're saying sounds like what I've been
saying on DaveNet. Maybe now we can get focused on using XML, instead of
talking about how great it's going to be when we do use it.

We have two projects that generate XML files every day. 

One is a siteChanges.xml file for our server, www.scripting.com. It's very
useful, if the search engines, or one search engine would read the damned
file. If every webmaster produced one, the time to re-index the web would
be dramatically shortened, and the search engines that used this would kick
their competition's butts.

More information on this application is at:

http://www.scripting.com/fatpages/suites/siteChanges.html

The second project is Scripting News in XML. We produce an HTML version of
the content flow at www.scripting.com, but we also produce an
always-current XML version of the content flow. There's an XML file
produced every day. This one is being used. Josh Lucas, a Java programmer
in Boston reads the file every night at midnight, and sends an
email-formatted version of the text to subscribers. So the Scripting News
flow goes out thru email. Also, one of our competitors, Vignette, is
running an experimental server that syndicates the Scripting News flow,
again based on the XML version of the content.

http://www.scripting.com/frontier5/xml/experiments/scriptingNews.html

Of the two, there's no question in my mind that the siteChanges app is a
killer. Now comes the problem of evangelizing the search engine guys to
support it. Once they do, the chicken and egg problem is solved. Then we
can evangelize other webmasters to write a simple script (less than 100
lines for sure) that updates this file every night. We do it in Frontier,
but it would be trivially simple to do it in Visual Basic, Perl, Tcl,
AppleScript, whatever.

Re your other comments, and XML-based spreadsheet and draw program would be
awesome! I agree that XML in web browsers is B O R I N G (beyond belief).
The interesting applications are connections between apps that are not web
browsers.

I have my hands full with Frontier or I would be jumping on those ideas
right now. We would be very supportive if anyone is working on such
programs, the compatibility with Frontier would be incredible. To me that's
the point of XML, it's the idea of open file formats allowing exchange of
info between all kinds of apps, not just web browsers.

Again, thanks for starting this thread. It's a very positive step.

Dave Winer


At 08:40 AM 9/11/98, you wrote:
>My retitling of this message is deliberately provocative, and I do NOT wish
>a flame war to ensue. I want the title disproved, but by *actions and
>evidence*, not statements of faith.
>
>At 23:25 10/09/98 -0400, Richard L. Goerwitz III wrote:
>>
>>XML right now is, well, only slightly more than a dream.  I have a hard time
>>even finding valid XML document instances on the net.  
>
>I agree with this. Given that XML was designed for use over the Web
>(right?) and it has been in gestation for 2 years I find it incredible that
>XML has not done anything useful in public view. Lots of hype in the
>magazines, etc. but nothing tangible to show for it. Tangible in the sense
>that I can show a non-XML person something that will interest them.
>
>XML was effectively launched in Spring 1997 at WWW6, Santa Clara. It's 15
>months since then and over a year since the first draft of the XLink spec
>was released.  And as far as I can see there are virtually no useful
>applications that have been created. I now find it difficult to convince
>people that XML is useful, other than by repeatedly stating it as an act of
>faith.
>
>There are lots of valuable *tools* developed (many announced on this list)
>but is anyone actually using them publicly? [I do not get excited by
>statements like "Corporation X can achieve x% reduction in costs by using
>XML for its workflow". "We are using XML to store our configuration files,
>etc." It may be true, and it may be good business, but it's hardly a
turnon.] 
>
>XML has limitless applications. I continue to suggest them on this list -
>the response is underwhelming. By an application I mean "something that a
>non-XML expert can do something useful with". That "something" might only
>be to play minesweeper or whatever. At present I count the following:
>	- MathML - IMO this is the one that has most chance of achieving critical
>mass.
>	- The XSL slide processor announced on XSL lists just now. Haven't looked
>at it, but it sounds like an obvious and useful thing to do
>	
>and my own - which I am actively building up critical mass for:
>	- Chemical Markup Language. (http://www.xml-cml.org). It is now
>distributable. Henry and I are proseletising in the community, I think with
>some success. But the lack of other visible XML makes it very difficult.
>	- the Virtual HyperGlossary (http://www.vhg.org.uk). This is something
>that couldn't be done with HTML, as it uses the hierarchical nature of XML
>and the additional addressing of XLink. AFAIK the only application of XML
>that actively uses XLink. Is no-one else interested in the power of Xlink -
>my own view is that it's revolutionary.
>
>The criterion for inclusion as a useful XML application is:
>	- it must be usable over the WWW AND/OR
>	- it must be downloadable and useful
>	- it must do something that cannot be easily done with HTML OR
>	- it must do something in an *immediately obviously* better way than HTML.
>	- it must catch the imagination of someone who is not an XML expert.
>
>I have been confidently predicting that XML would take the WWW by storm
>during 1998. I am amazed and saddened that it hasn't done so, but there are
>**three months left**. I have thrown out lots of ideas on this list with
>virtually zero take up. Is no one else interested in:
>	- an XML spreadsheet?
>	- an XML drawing tool?
>	- a collaborative XML environment?
>	- XML games
>
>As far as I can see, most readers of this list are:
>	- waiting for XSL because all they are interested in is rendering text
>with infinite precision. Worthy, but surely that's not the main point of
>XML. Also it's a year away.
>	- waiting for MS/NS to come up with 'XML browsers'. Doesn't look very
>promising, does it?
>	- only really interested in using XML to manage their current client
business
>	- interested in doing some in-house re-engineering.
>	- have some medium/long-term strategy for developing products. No doubt
>some of these will be very exciting but I doubt they will spread a flame
>across the WWW.
>	- just waiting
>
>By contrast, when Mosaic hit the WWW, within *months* I was able to:
>	- use search engines (a completely new idea)
>	- post requests by interactive forms (again a stunning idea)
>	- send and display complex objects (e.g. molecules) painlessly over the
>WWW (again stunning).
>	- get servers to do incredible calculations (painlessly).
>
>Much the same dynamics were seen for Java.
>
>Where are the excited XML hackers? You don't need a *browser* to do fun
>things. Where are the grad students (Paul Prescod doesn't count any longer
>:-). 
>
>Isn't OASIS interested in creating a fun demo disk/CDROM to promote XML?  
>
>Or am I right that XML is fundamentally about as boring as the introduction
>of  TTL or 3-phase electricity - worthy, but manufacturer-level only?
>
>I am hoping to be inundated with mail that shows I am wrong - that I have
>taken a narrow view - that I don't read the newsgroups. But not mere
>statements of faith, please. I want something I can *show* people. 
>
>In addition I would like volunteers or code to help with the collaborative
>project I suggested two days ago. So far the response has been
>underwhelming. All I need is some dumb server-side software that can keep
>open channels (or re-open them) between two 'players'. [Of course the
>application need not be just games.] This is probably so trivial to some of
>you that you think it's not worth offering - but I happen to be ignorant
>about it.
>
>	P.
>
>	
>Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
>net connection
>VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
>http://www.venus.co.uk/vhg
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 11 16:40:16 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:37 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>
Message-ID: <199809111439.KAA32501@hesketh.com>

At 07:22 AM 9/11/98 -0700, Tim Bray wrote:
>At 08:40 AM 9/11/98, Peter Murray-Rust wrote:
>>Or am I right that XML is fundamentally about as boring as the introduction
>>of  TTL or 3-phase electricity - worthy, but manufacturer-level only?
>
>That might very well be the case. -Tim

If that's the case, we've all lost out, and should tell the magazines to
cool the hype and settle down to more important stories on exciting issues
like stock prices and IPOs.

Oh well.  I guess the revolution's over before it got started.  Could have
reached a much wider audience, but somehow snuffed itself out. Sort of like
SGML, perhaps.

I strongly hope that's _not_ the case, because the stuff just isn't so damn
difficult that you need a CS degree to figure it out.  XML is approachable,
even easy.  There's no reason to lock it in a back room for only the
supposed 'experts' to tinker with it.

Now back to writing a chapter on validation...


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 11 16:47:01 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:37 2004
Subject: coming clean with the SGML crowd (was re: namespaces)
References: <3.0.32.19980910162847.00a013d0@pop.intergate.bc.ca> <35F8B1D2.BA6968C@infinet.com>
Message-ID: <35F9379F.AF0B786C@locke.ccil.org>

Tyler Baker wrote:

> I guess there are other reasons why
> people would not want validation, but from a performance perspective my small
> experience with this issue makes it a non-issue.

But then you have things like RDF, where it is impossible to write
a consistent DTD that covers all the variations).

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Philippe.Le_Hegaret at sophia.inria.fr  Fri Sep 11 16:49:59 1998
From: Philippe.Le_Hegaret at sophia.inria.fr (Philippe Le H�garet)
Date: Mon Jun  7 17:04:37 2004
Subject: Deterministic Content Models ?
Message-ID: <35F937F8.9409C43@sophia.inria.fr>

I have a strong question about deterministic content models ...

 Is (paragraph*)* a determinist content model ?

 If yes, so I think (a+ | b)* is a deterministic content model too.

Philippe.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From fernando at pix.com.br  Fri Sep 11 16:52:13 1998
From: fernando at pix.com.br (Fernando Cabral)
Date: Mon Jun  7 17:04:37 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
References: <199809110325.XAA15202@goon.stg.brown.edu>
	 <35F88004.70A7F204@technologist.com> <3.0.5.32.19980911072823.00c60d50@scripting.com>
Message-ID: <35F94396.AA5C6DC8@pix.com.br>


Dave Winer wrote:

> One is a siteChanges.xml file for our server, www.scripting.com. It's very
> useful, if the search engines, or one search engine would read the damned
> file. If every webmaster produced one, the time to re-index the web would
> be dramatically shortened, and the search engines that used this would kick
> their competition's butts.

In fact Dataware II (publisher) from Dataware (www.dataware.com) does an excelent
job. I've using it myself with great results. What's better: it may automatically
generate databases for web access or for CD distribution. Web is great
for on-line access; CD is wonderful for long-term backup. It's worth a try.

- fernando

--
mailto:fernando@pix.com.br                    http://www.pix.com.br
Fernando Cabral                               Padrao iX Sistemas Abertos
Fernando@Pix.com.br                           Pix@Pix.com.br
Fone: +55 61 321-2433                         Fax: +55 61 225-3082


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Fri Sep 11 16:56:40 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:37 2004
Subject: XML *should* be boring
In-Reply-To: <199809111439.KAA32501@hesketh.com>
References: <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>
	<199809111439.KAA32501@hesketh.com>
Message-ID: <199809111454.KAA00983@unready.megginson.com>

Simon St.Laurent writes:

 > If that's the case, we've all lost out, and should tell the
 > magazines to cool the hype and settle down to more important
 > stories on exciting issues like stock prices and IPOs.

I wish that we could have prevented the hype in the first place, but
that's all spilled milk now.  XML is a very important standard -- I
think that it is roughly to information exchange what TCP and IP are
to networking -- but it's still just a standard, not a product.

Normal people don't get excited about TCP/IP; instead, they get
excited about the applications that happen to use TCP/IP, and they
soon take it for granted that different applications can communicate
so easily under different circumstances (a noisy copper phone wire,
ethernet, fibre optic, wireless, ATM, all across different OS's and
architectures).

If we do our job as well as the TCP/IP people did, users should hardly
notice that XML exists -- after all, we're supposed to help them do
their work, not draw attention to our own.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 11 17:02:30 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:37 2004
Subject: Summary of Namespaces and Validation
References: <Pine.BSI.3.96r.980911030511.28033B-100000@shell1.interlog.com>
Message-ID: <35F93B43.50E1FBC1@locke.ccil.org>

Liam R. E. Quin wrote:

> If a DTD refers to multiple external definition sets, with multiple
> prefixes, it is necessary to determine the source of each referenced
> element type, which may require a two-pass algorithm to collect all
> of the element definitions and then associate them with the right
> prefixes.  The fact that an element type can be referred to before
> it is declared makes this slightly more complex, but is necessary...

One pass is sufficient, even if prefixes are not defined before
they are referenced, provided that the default prefix is declared
somehow.
 
> Tim, what am I missing?  Apart from the face that we lack a good
> way to associate a DTD or DTD fragment with a prefix, I mean?

Nothing as far as I can see.  Of course the result is not SGML-compliant
DTD-based validation, but it is validation of XML using DTDs.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 11 17:12:58 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:38 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com>
				 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]> <35F7B89F.2E779FDB@jclark.com> <35F804CD.47CAAB14@locke.ccil.org> <35F8F3B3.4149A79A@jclark.com>
Message-ID: <35F93D78.B8325124@locke.ccil.org>

James Clark wrote:

> The term "well-formed HTML" as used in section 1 of the XSL WD does not
> mean SGML that conforms to HTML 4.0. It means well-formed XML that uses
> element types and attributes from HTML.

Well and good.  But "uses element types" etc. is vague: all element
types, or only some of them?  It can't (straightforwardly) be all
of them, because SCRIPT and STYLE are CDATA elements, and so
have no XML equivalents.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 17:14:21 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:38 2004
Subject: DTD vs Schema
In-Reply-To: <A789DF8BDE02D211B63C00805FC78E97024358@BES40ENT000>
Message-ID: <3.0.1.16.19980911161342.899f46ae@pop3.demon.co.uk>

At 15:19 11/09/98 +0200, AMER, Nidal wrote:
>
>Unfortunately, as this seems not to be implemented, I have to build this
>logic in the code.
>
Remember that XML *has no code*. It mentions a processor (== parser). It
occasionally hints at other software (link processor). The DOM defines a
data model and interface. But in general the XML specs define the data, its
structure and very occasionally its semantics (i.e. what it should 'do').

I had expected that real software would start to be developed alongside the
specs. This is true for XML1.0 where we have a very large number of parsers
but very much less true for namespaces, XLink (which is over a year old),
schemas, etc. So yes, the message is that if you actually want XML to do
something you have to hack it yourself - don't sit around waiting for
others. In particular beware of NOTEs to the W3C where there is no evidence
of anyone doing any hacking. 

I think we will get there eventually - but I'm an optimist

	P.


From dave at userland.com  Fri Sep 11 17:19:20 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:38 2004
Subject: XML *should* be boring
In-Reply-To: <199809111454.KAA00983@unready.megginson.com>
References: <199809111439.KAA32501@hesketh.com>
 <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>
 <199809111439.KAA32501@hesketh.com>
Message-ID: <3.0.5.32.19980911081706.00c375b0@scripting.com>

>>If we do our job as well as the TCP/IP people did, users should hardly
notice that XML exists -- after all, we're supposed to help them do
their work, not draw attention to our own.

Amen.

Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From msuzio at anecdote.com  Fri Sep 11 17:19:37 1998
From: msuzio at anecdote.com (Michael J. Suzio)
Date: Mon Jun  7 17:04:38 2004
Subject: Applying XML
Message-ID: <35F93EAD.CFEE640A@anecdote.com>

I think one of the biggest wins and selling points for XML is that
is truely can be a universal data format, with no assumptions
made for how the data is to be parsed or used.
To drive this, we clearly are going to have to develop patterns and
examples of using XML schemas to describe data.  For instance, what is
the optimal way to represent spreadsheet data in XML?  Why re-invent
the wheel for every project like this?

So, just another good reason to start working publicly on DTDs/schemas
and archiving good document description patterns.

-- 
Michael J. Suzio
Interconnect of Ann Arbor
msuzio@anecdote.com / 1-734-665-5342

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 11 17:19:42 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:38 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
References: <00ed01bddd6e$e7e382b0$432e0318@cc221812-a.hwrd1.md.home.com>
Message-ID: <35F93EA7.2BC8AEC6@locke.ccil.org>

Samuel R. Blackburn wrote:

> What excited me about XML was it's ability to pass data in
> a form that anyone could parse. Universal data transfer. Sounded
> like a good idea to me.  The syntax of XML is wonderful.

I agree.

> However,
> IMHO XML is saddled with design goal #3 "XML shall be compatible
> with SGML." I thought, "Oh great, yet another way to show pretty
> text." I don't need another way of showing pretty text. HTML has
> solved that problem well enough.

That is based on a mistaken view of SGML --- a widely believed
mistake, to be sure.

> What I need is a way to pass data around so anyone can use
> any part of it they wish. Looking at XML from a data centric
> perspective, there are things in it that are worthless, DTD's
> for example.

Surely you don't suppose that you can have maintainable, reusable
data without machine-interpretable schemas?  DTDs aren't the best
possible schema language for a lot of reasons, but right now
they are all we have.
 
> When asked why I use XML in my programs, I tell folks
> "what HTML is to text, XML is to data." I've found XML to
> be a wonderful solution to exporting data in an easily
> consumable format. I could care less if a browser knows
> how to consume XML.

I agree with that too.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 11 17:28:07 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:38 2004
Subject: XML *should* be boring
In-Reply-To: <199809111454.KAA00983@unready.megginson.com>
References: <199809111439.KAA32501@hesketh.com>
 <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>
 <199809111439.KAA32501@hesketh.com>
Message-ID: <199809111526.LAA00509@hesketh.com>

>I wish that we could have prevented the hype in the first place, but
>that's all spilled milk now.  XML is a very important standard -- I
>think that it is roughly to information exchange what TCP and IP are
>to networking -- but it's still just a standard, not a product.

Yes, but TCP/IP had the advantage of being first on the ground, a widely
implemented standard that was able to stand up to the OSI model largely on
the basis that people already used it, and it worked well enough.  XML is
moving into a field that already has many contenders, without many
supporting products, relying on the good will of a large number of people
and organizations to get any place.  XML may be a better idea than the
current mess (HTML, delimited text, etc.), but that's not going to take it
very far is OSI's being a 'better idea' is any indication.

(Yes, I know there are lots of folks who don't think OSI was a better idea,
and I tend to sympathize.  Nonetheless, it's the classic example of a
carefully thought out standard that went pretty much nowhere.)

>If we do our job as well as the TCP/IP people did, users should hardly
>notice that XML exists -- after all, we're supposed to help them do
>their work, not draw attention to our own.

I think this is a huge part of the problem - this idea that 'we' are
supposed to help people do their work.  XML isn't rocket science, and it
doesn't need a core of rocket scientists to make it work for many
situations.  XML being invisible, woven into other standards by a devoted
cadre of experts may do a lot to improve those standards - but it does very
little to reduce the cost of the implementations.  

Support for generic XML, with tools widely available for editing /
authoring / viewing / exchanging / storing / searching / objectizing /
developing XML might stand a chance of making sure that XML doesn't stay in
the high-end expensive small-community world that SGML inhabits today.
Enlarging the community of developers is a critical step toward making XML
cheap and ubiquitous.

In the long run, of course, XML should become invisible, like ASCII, to
cite Tim Bray's favorite example.  In the short run, though, it needs to
become visible enough to achieve ubiquity.  Otherwise, it'll be invisible
for good reason - no one will be using it except a few programmers.


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Fri Sep 11 17:33:43 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:38 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <199809111528.RAA03208@berlin.dvs1.tu-darmstadt.de>

Simon St. Laurent wrote:
> At 07:22 AM 9/11/98 -0700, Tim Bray wrote:
> >At 08:40 AM 9/11/98, Peter Murray-Rust wrote:
> >>Or am I right that XML is fundamentally about as boring as the introduction
> >>of  TTL or 3-phase electricity - worthy, but manufacturer-level only?
> >
> >That might very well be the case. -Tim
> 
> If that's the case, we've all lost out, and should tell the magazines to
> cool the hype and settle down to more important stories on exciting issues
> like stock prices and IPOs.
> 
> Oh well.  I guess the revolution's over before it got started.  Could have
> reached a much wider audience, but somehow snuffed itself out. Sort of like
> SGML, perhaps.

It strikes me that the application that is going to make the public sit up and 
notice XML is the one that lets me ask: "Make a table of all hotels in New York 
that cost less than $100 and are within walking distance of Central Park."

The good news is that this application is completely possible (and almost, but 
not quite, inevitable).  The bad news is that it's still a ways off.  What it 
requires is enough people writing their Web documents in XML (with widely 
accepted element tag names) to make it worthwhile for the search engines to 
offer this kind of functionality.

There are probably a number of ways to jump-start this process, but the most 
obvious is a browser that supports XML+XSL so that Web masters are willing to 
write in XML.  (Sorry to those of you who find browser support of XML boring.) 
We also need namespaces, one or more Yahoo-like repositories for semi-standard 
DTDs/schemas (see www.schema.net for a start), and a solution to Tim's 
ought-to-be-famous "interesting and difficult problem of compounding DTDs".

I suggest that a short-term solution for the latter is to simply combine 
elements from different DTDs as one sees fit.  Although the resulting documents 
are not valid wrt their original DTDs and cannot be used by DTD-specific 
applications, XML does not require valid documents and the use of standard tags 
facilitates the search process.  I am advocating a certain degree of anarchy 
here, but the Web is inherently anarchic and if we wait until we find a way to 
combine DTDs without breaking DTD-specific applications, we're missing the 
chance to build some extremely useful applications right now.

(By the way, a nice feature of XML editors that would help this along would be 
to read DTDs/schemas from said Yahoo-like repositories, let users insert 
elements whereever they want from whatever DTDs/schemas they want, and generate 
new DTDs as requested.)

In the mean time, XML is still extremely useful as a data transport and I agree 
with Chris von See, who said that XML's greatest potential is in data-based, not 
document-based, applications.  Just because the public won't see it doesn't mean 
they won't (indirectly) appreciate it.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 17:54:16 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:38 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <199809111411.JAA02716@mailhost.onramp.net>
References: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
 <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <3.0.1.16.19980911165323.8faf5914@pop3.demon.co.uk>

At 09:07 11/09/98 -0500, Chris von See wrote:
[...]
>
>I don't really fit into any of Peter's catagories... I'm trying to decide
>whether it makes financial/commercial sense to invest my (limited)
>development funds in this newfangled XML stuff, or if XML is going to find
>a niche somewhere but never get anchored in the general market conscience.

Fully understood and appreciated. And I am facing up to this challenge as
far as possible. I've agreed to give a seminar in London (see
http://www.netproject.com/netproject_pages/new_schedule.htm
for the program) to businesses to show the value of XML. Any examples of
reality - and thanks to the positive replies - are most welcome as
ammunition. Otherwise they might get the wrong impression about the extent
of interest. And if anyone is interested in coming along :-).

>
>If visibility and commercial success if what you want, I believe that what
>XML and its related technologies need is a commercial champion - someone
>like IBM, Microsoft, or Netscape that can come forward and put a flag up
>that developers can rally around.  No XML spreadsheet, drawing tool or
>chess set is going to generate success for XML like an evangelical company.  
>
I also fully accept this. (And really I congratulate the progenitors of XML
in getting so much commercial interest). But we can have more than one
strategy, can't we.

HTML did NOT take off because of a major company - exactly the reverse. It
took off because of a single product (and of course the concept and
language itself). And it released an enormous flood of innovation. Have we
used all that up? I'd hate to think so. Is there nothing left to do over
the web that's exciting? So - yes - chess is a game. But chemistry is for
real. It's not the ideal place to start because it's a conservative domain,
but the applications are enormous. Surely there are many other verticals
where that is true. And a drawing package *will* be a killer when it's
coupled to the right applications. I've got mine - just have to hack the
software first.

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 17:54:21 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:38 2004
Subject: XML *should* be boring
In-Reply-To: <199809111454.KAA00983@unready.megginson.com>
References: <199809111439.KAA32501@hesketh.com>
 <3.0.32.19980911071839.00ac9b70@pop.intergate.bc.ca>
 <199809111439.KAA32501@hesketh.com>
Message-ID: <3.0.1.16.19980911165226.8faf5cfe@pop3.demon.co.uk>

At 10:54 11/09/98 -0400, David Megginson wrote:
[...]
>
>I wish that we could have prevented the hype in the first place, but
>that's all spilled milk now.  XML is a very important standard -- I
>think that it is roughly to information exchange what TCP and IP are
>to networking -- but it's still just a standard, not a product.

I respect this view - and I suggest we don't try to convert each other :-)
And keep posting contrary view to mine. But it could also be argued that
HTML was just a communication language for text, graphics and hyperlinks -
and not necessarily the best one. But the applications of HTML were
dramatic - and without them there wouldn't be an XML.

So if - at the technical level - XML is simply another protocol, what's it
for?

My own feeling is that structured documents and precise markup have an
immense amount to offer, in the same way as hypermedia did. They challenge
the way that people think and organise their information. I don't believe
that these ideas can and should be kept below the surface - I think they
should start moving into the general curriculum. So maybe XML is simply the
messenger, but the message is much more important.

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Fri Sep 11 17:59:30 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:38 2004
Subject: XML like JAVA, not HTML...  was: XML *should* be boring
Message-ID: <002e01bddd9c$4d5e3c20$2149fea9@laforge>

>Support for generic XML, with tools widely available for editing /
>authoring / viewing / exchanging / storing / searching / objectizing /
>developing XML might stand a chance of making sure that XML doesn't stay in
>the high-end expensive small-community world that SGML inhabits today.
>Enlarging the community of developers is a critical step toward making XML
>cheap and ubiquitous.


I've been thinking that Java may be a better model than HTML for XML.

XML is first a tool for the programming community and later an information
transport like HTTP.

The XML market will be for tools. Later we will need the supporting products
to deal with it as a transport.

I see DOM and SAX as important first steps. The tools are things like DOM
diff (not XML diff), DOM to SAX, Visible Elements (like Visible Java Beans),
code generators for self-validating elements, etc.

I suspect that the reason for XML's current slump in momentum is because of
linear thinking--we need to think around the curve on this one and move past
syntax to semantics.

Yes, I'm speaking here from my coins bias. But I believe it is the direction
we need to develop if we want to realize the XML vision.

If we can support a standard framework for services, we can develop a
component-based approach to products. Then see how fast this whole thing
takes off!

Bill
http://www.jxml.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Fri Sep 11 18:01:09 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:38 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
References: <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <199809111558.IAA17790@mail-gw.pacbell.net>


>As far as I can see, most readers of this list are:
>	- waiting for XSL because all they are interested in is rendering text
>with infinite precision. Worthy, but surely that's not the main point of
>XML. Also it's a year away.

  XML/XSL is already making me a much happy web site manager. Different
views of the same data, data lives in one place... no pages that are
out-of-synch with each other, vastly improved human-readable code... 

  This application may not be exciting, but it sure is useful.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mda at discerning.com  Fri Sep 11 18:18:02 1998
From: mda at discerning.com (Mark D. Anderson)
Date: Mon Jun  7 17:04:38 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <00c401bddd9e$8e19f900$0200a8c0@mdaxke.mediacity.com>

>It strikes me that the application that is going to make the public sit up and 
>notice XML is the one that lets me ask: "Make a table of all hotels in New York 
>that cost less than $100 and are within walking distance of Central Park."
>
>The good news is that this application is completely possible (and almost, but 
>not quite, inevitable).  The bad news is that it's still a ways off.  What it 
>requires is enough people writing their Web documents in XML (with widely 
>accepted element tag names) to make it worthwhile for the search engines to 
>offer this kind of functionality.

I would think it also requires a query protocol, for use directly by
a UA, or proxy'd by a search engine (or network of engines,
using referals). Current web search is based on document retrieval,
which is increasingly unrealistic as content becomes dynamic: 
no search engine can index Microsoft's knowledge base, can they?
(And there are many other cases where there isn't even an underlying
set of documents.)

The only proposal I've seen for such a protocol is DASL, which is specific
to WEBDAV. Is there anything in the works for RDF or XML? Of course, such a spec
presumes consolidation of a metadata format (XSchema or whatever).

-mda


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 18:22:14 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:38 2004
Subject: Combining DTDs
In-Reply-To: <199809111528.RAA03208@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3.0.1.16.19980911172124.975734ba@pop3.demon.co.uk>

At 17:28 11/09/98 +0200, Ron Bourret wrote:
[...]
>
>I suggest that a short-term solution for the latter is to simply combine 
>elements from different DTDs as one sees fit.  Although the resulting
documents 
>are not valid wrt their original DTDs and cannot be used by DTD-specific 
>applications, XML does not require valid documents and the use of standard
tags 
>facilitates the search process.  I am advocating a certain degree of anarchy 
>here, but the Web is inherently anarchic and if we wait until we find a
way to 
>combine DTDs without breaking DTD-specific applications, we're missing the 
>chance to build some extremely useful applications right now.
>
>(By the way, a nice feature of XML editors that would help this along
would be 
>to read DTDs/schemas from said Yahoo-like repositories, let users insert 
>elements whereever they want from whatever DTDs/schemas they want, and
generate 
>new DTDs as requested.)
>
I agree with this. I have written two DTDs in XML (CML and VHG) both of
which have to interoperate with other *unknown* DTDs. As a simple example,
a paper in chemical physics requires (at least) xHTML, MathML, CML, RDF and
DC. It is inconceivable that a generic DTD can be created that has valid
content models for all conceivable applications in this domain. [It *is*
conceivable that the J.Chem Phys produces a DTD and it's also highly
probable that if J. Phys Chem also does it would use a different one.] I
cannot see how, except in very carefully regulated domains (such as legal,
patent, regulatory) it will be possible to combine generic DTDs to provide
a useful mixture. For example, if someone wishes to embed a <price> in a
<molecule> this is a perfectly possible and reasonable thing to do. Why
should I say they can't?

Example:
<molecule>
  <price currency="USD" unit="litre">1.0</price>
  <atomArray builtin="element">O H H</atomArray>
</molecule>

This does NOT break my software because it simply scans for things it knows
about in content (e.g. <atomArray>). Similarly it's perfectly possible to
scan the document with XLink/Xpointer (whenever they get finalised) to find
a <molecule> with a descendant of  type <price> with attributes of
currency. <price> could easily come from a well defined DTD, as will
<molecule>. This is - and has to be - the approach that CML takes. So
almost all XML-elements will have to have ANY content. This is a pity,
because I'd like to be able to insist that <molecule> contained
(atomArray)* - yes a molecule without atoms is conceivable. I think that
schemas must allow for this - and I believe that XSchema does.

The other approach is to allow links - and I really wish that we could see
some work going on here. There are two ways - one is to have a link on the
molecule, e.g.:
<molecule id="H2O" href="price.xml#water">...

and the other is to have a link database (I have missed out the other XLink
attributes for brevity and because I can't remember the current version of
the spec):

<extendedLink title="chemical catalogue">
  <locator href="molecules.xml#H2O"/>
  <locator href="prices.xml#H2O"/>
</extendedLink>

This is perhaps cleaner, but it's a lot more complicated and not many
people (with 2-3 honourable exceptions) seem to be interested in developing
XLink applications or software.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 11 18:31:32 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:38 2004
Subject: Combining DTDs
References: <3.0.1.16.19980911172124.975734ba@pop3.demon.co.uk>
Message-ID: <35F9501D.C53E2BE4@locke.ccil.org>

Peter Murray-Rust wrote:

> Example:
> <molecule>
>   <price currency="USD" unit="litre">1.0</price>
>   <atomArray builtin="element">O H H</atomArray>
> </molecule>
> 
> This does NOT break my software because it simply scans for things it knows
> about in content (e.g. <atomArray>).

This is the perfect application of namespaces, since your module that
understands molecules wants to process only "C:molecule" elements, where
C is any prefix that is proxied to its namespace URI.  Other random
"molecule" elements created by other people for other reasons should
not get processed.

> This is perhaps cleaner, but it's a lot more complicated and not many
> people (with 2-3 honourable exceptions) seem to be interested in developing
> XLink applications or software.

Part of the trouble is that the XLink WDs are old, and we keep hearing
rumors of massive changes in the next draft --- which has not yet
been forthcoming.  Until that happens, I for one will shy away from
XLink.

(I'm also piqued because I sent 8 or 9 well-drafted defect reports
to the XLink/XPointer editors and heard nothing except "Thanks for
sending, we'll reply soon."  That was in June.  A later ping got
no reply.)

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Fri Sep 11 19:10:13 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:39 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <00c401bddd9e$8e19f900$0200a8c0@mdaxke.mediacity.com>
Message-ID: <199809111709.KAA14131@mail-gw.pacbell.net>

At 09:09 AM 9/11/98 -0700, Mark D. Anderson wrote:
>>It strikes me that the application that is going to make the public sit
up and 
>>notice XML is the one that lets me ask: "Make a table of all hotels in
New York 
>>that cost less than $100 and are within walking distance of Central Park."

>The only proposal I've seen for such a protocol is DASL, which is specific
>to WEBDAV. Is there anything in the works for RDF or XML? Of course, such
a spec
>presumes consolidation of a metadata format (XSchema or whatever).

  Such a beast has been proposed! See http://www.w3.org/TR/NOTE-xml-ql/ for
the XML-QL note.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simpson at polaris.net  Fri Sep 11 19:22:38 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun  7 17:04:39 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <3.0.32.19980911132133.006a5500@polaris.net>

Re: various postings on the subject (all worthy and none, remarkably, angry
or even disagreeable)....

I believe the main obstacle to XML's mass acceptance -- to its being
understood as non-boring, at least for the nonce -- is that what it seems
(to me) to be best at is not something the masses are clamoring for.

**** XML as a tool for presenting structured documents on the Web: "Gee,
that's great. How can I *see* the documents thereby presented?" Although
the absence of a general-purpose XML-aware Web browser can be derided as
not worthy of XML's promise, it's also the case that for [insert huge
fraction here] of Web users, the Web *IS* their browser. Almost nobody
opens up a book and exclaims, "Hot damn! I'm so happy this was printed
using offset technology instead of hot metal!" (even though the importance
of the printing technology is profound in many "invisible" ways); for them,
the content and the means of delivery are indistinguishable. If XML is
never capable of being rendered through a mass-market browser -- either
directly, or via CSS/XSL post-processing of the XML itself -- it just plain
ain't gonna fly as a "structured document presenter for the Web."

**** XML as a tool for data interchange and/or presenting structured data
on the Web: My sympathies as a writer and Web developer lie with the "Why
use XML for structured documents? I've got HTML for that" crowd, alluded to
in the preceding paragraph. As a *developer*, however, I think the
possibilities for XML as a data storage/interchange/mediation tool are
indeed exciting. The problem here is that almost no one in the real world
-- except other developers and related technoid types -- gives a hoot about
data storage/interchange/mediation. Give someone a complete desktop office
suite and I'd be willing to bet that the DBMS is the very last component
they'll ever run (if they ever run it at all). Instead, they use what they
already know (or can hack their way to from what they know) -- from
mail-merge word processing files through kludged-up spreadsheets that they
(or worse, the suite vendors) mislabel "databases."

OTOH, if you put a nifty database-based product in someone's hands and
don't tell them it's database-based, they'll swoon over it... as long as it
solves a problem that THEY care about.

HTML is a general solution to a general problem, and hence the masses' (and
the media's) excitement about it. XML is (potentially) a device for solving
50,000 specific problems. What makes it boring is that any given one of
those specific problems is of interest to only one of a given 50,000
individual users. I guess you could come up with a general-purpose XML
application that would be one interesting thing to all potential consumers,
but it would probably be so lightweight as to make everybody wonder what
all the fuss was about.

I'm an optimist, too. But I think it's important to focus on the things
worth being optimistic about, and not be distracted by the natural human
tendency to want to share everything that excites *us* with everyone we
meet. (I had an uncle who used to talk like that; his value as a dinner
companion consisted entirely of his helping us all appreciate how late it
was, how late it had been since he last paused for breath in fact, and we
really needed to be running now thankyouverymuch. Not that we didn't love
him all the same, but still....)

So that's another two cents to add to the gleaming (and still growing)
mound of copper <g>.

=============================================================
John E. Simpson          | It's no disgrace t'be poor, 
simpson@polaris.net      | but it might as well be.
                         |            -- "Kin" Hubbard

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From roddey at us.ibm.com  Fri Sep 11 19:26:37 1998
From: roddey at us.ibm.com (Dean Roddey)
Date: Mon Jun  7 17:04:39 2004
Subject: New DTD for DCD
Message-ID: <5030300025056538000002L082*@MHS>


Here is an updated DTD for DCD. As mentioned before it only handles the
attribute oriented style of the DCD syntax. If you want them, drop me a line
and I'll send you a zip file of fixed samples from the spec that work with this
DTD.


Fell free to drop me a line with any comments, good or bad, about the DTD, or
DCD in general or the XML4J parser in general for that matter. We are currently
doing some up front strategizing about the next step in the parser's
architecture and any comments will certainly be logged and taken seriously.

Thanks.

----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/octet-stream
Size: 6010 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980911/4bc1e699/attachment.obj
From epalma at fsaa.ulaval.ca  Fri Sep 11 19:34:41 1998
From: epalma at fsaa.ulaval.ca (Eduardo Palma)
Date: Mon Jun  7 17:04:39 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
In-Reply-To: <3.0.32.19980911132133.006a5500@polaris.net>
Message-ID: <199809111756.NAA25166@cerberus.ulaval.ca>


For now...
Due to the fact that xml still unknown.
Is the same retoric when SQL shows up...

Eddie


[John E. Simpson]
The problem here is that almost no one in the real world
>-- except other developers and related technoid types -- gives a hoot about
>data storage/interchange/mediation. 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From roddey at us.ibm.com  Fri Sep 11 19:40:00 1998
From: roddey at us.ibm.com (Dean Roddey)
Date: Mon Jun  7 17:04:39 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
Message-ID: <5030300025056979000002L092*@MHS>


Andy Dent wrote:
>>
>> Why can't a product like our report-writer take
>> - XML describing content
>> - XSL specifying layout
>> and produce, for example, a report preview window on a Mac?
>> After all, if you regard a browser, it's doing something very similar.
>
> The result of an XSL process must be well-defined, right? So the most
> logical thing to create as the result of the process is an XML document.
> To me, your question is equivalent to "Why can't my car producing product
> take an XML document describing content, and an XSL describing the
> automobile to produce and generate the car?" Well, if XSL were an
> automobile producing language, that would make sense, but it isn't, it is
> an XML producing language.


That's not to say of course that, if I am writing an XSL engine, I cannot
provide a programatic object
interface that normally plugs in a "create XML output" widget, but which can if
desired plug in a "spit
out any danged thing you want" widget. In fact, it would be kind of crazy to
write an XSL engine that
did not provide that kind of flexiblity, since it would provide as much "future
proofness" for the
maintainer of the product as it would for his/her customers.

Of course you have to write some code to take advantage of it, but it also
opens the door for pre-fab
(even pre-fab third party) widgets that spit out particular types of output
given a few hints about the
desired attributes of the target output.

----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 19:52:53 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:39 2004
Subject: LISTRIVIA
In-Reply-To: <5030300025056538000002L082*@MHS>
Message-ID: <3.0.1.16.19980911185135.20d7f0c2@pop3.demon.co.uk>

A few simple rules for this list:
	- please do NOT attach any documents to your message. It can foul up the
archive
	- please do NOT reply to both the list and the sender. I have got several
duplicate mails today.
	- please DO quote only the MINIMUM required. I, and many others, have to
pay for our mail. The original mail is available on the archive, so please
only quotes those sentences and paragraphs that apply. Never quote the
XML-DEV signature.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Fri Sep 11 19:55:43 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:39 2004
Subject: XML *should* be boring
Message-ID: <009a01bdddad$2ffd44d0$2ee044c6@arcot-main>

>If we do our job as well as the TCP/IP people did, users should hardly
>notice that XML exists -- after all, we're supposed to help them do
>their work, not draw attention to our own.


I agree entirely from engineering point of view.  However, there are times
when absurd notions are needed to get things going.  XML must serve as a
rallying point, a waving flag, national anthem, a cure-all, an extra sticker
on the box, a true-blue hype of the decade so that the bull#$@# will be
spread evenly across the horizon and fertilize the soil for things to come.

Most things we take for granted now have gone through a great deal of hyping
as if going through a rites of passage.  It happened with TCP/IP and it is
happening with XML.

Best,

Don Park


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Fri Sep 11 20:05:24 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:39 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <3.0.32.19980911071651.00aa7670@pop.intergate.bc.ca>
Message-ID: <35F960F4.4C14E210@technologist.com>

Tim Bray wrote:
> 
> At 01:03 PM 9/11/98 +0800, Andy Dent wrote:
> >At 23:01 +0800 10/9/98, Paul Prescod wrote:
> >>The browser takes XML, pumps it through an XSL engine, receives an XML
> >>result (according to a known DTD with formatting semantics) and renders
> >>*that*. You can do the same with your report writer.
> >
> >THANK YOU
> 
> Ouch.  Should have been watching more carefully.  This is not quite right;
> most important, DTDs have no formatting semantics.  CSS and XSL stylesheets
> do. -Tim

I'm not sure what you mean. Would you prefer "(according to a known
NAMESPACE with formatting semantics)"? Your short message would seem to
imply that one can only format an XML document if it has a CSS or XSL
stylesheet, which is, of course, not true. DTDs like HTML, SPDL and the
new "fo" namespace DO have formatting semantics.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

The past is inaccurate. Whoever lives long enough knows how much what he
had seen with his own eyes becomes overgrown with rumor, legend a
magnifying or belittling hearsay. "It was not like that at all!" -- 
he would like to exclaim, but will not, for they would have seen only 
his moving lips without hearing his voice. - Czeslaw Milosz (translated)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Fri Sep 11 20:07:23 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:39 2004
Subject: XML is boring (long --- sorry)
References: <199809110325.XAA15202@goon.stg.brown.edu> <199809110817.KAA26872@goofy.gr05.synopsys.com>
Message-ID: <35F96326.AA6D357D@technologist.com>

I find all of this worrying about XML's place in the universe to be
misdirected. XML is ASCII. You can ignore ASCII for a little while, but
eventually you adopt it because it is so fundamentally simple, right and
useful that it doesn't make any sense to continue with EBCDIC (or whatever
else).

Furthermore, XML has no competitors. PDF cannot do what XML can do. RTF
cannot do it. PostScript cannot do it. HTML cannot do it. People are hyped
about XML because they have been waiting for it without knowing that they
were doing so.

The first wave of XML users will be programmers. So? What's wrong with
that. The second wave will be all of the corporations that should have
adopted SGML before, but didn't. Nothing wrong with that either. Users
will find XML invading their day-to-day computing just as they do Unicode
and ASCII before it. That strikes me as fine.

Now I know that Peter is in a little bit of a hurry. He wants his chemist
peers to adopt XML as quickly as possible. Maybe hype would help them to
do that. Maybe not. But with or without hype, they will eventually adopt
XML (or some other SGML variant, if one ever arises) because there is no
reasonable alternative. You can't encode molecules in PDF, HTML or even
VRML. You could reinvent something LIKE XML, but why would you bother?

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

The past is inaccurate. Whoever lives long enough knows how much what he
had seen with his own eyes becomes overgrown with rumor, legend a
magnifying or belittling hearsay. "It was not like that at all!" -- 
he would like to exclaim, but will not, for they would have seen only 
his moving lips without hearing his voice. - Czeslaw Milosz (translated)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cvonsee at onramp.net  Fri Sep 11 20:28:08 1998
From: cvonsee at onramp.net (Chris von See)
Date: Mon Jun  7 17:04:39 2004
Subject: Combining DTDs
In-Reply-To: <3.0.1.16.19980911172124.975734ba@pop3.demon.co.uk>
References: <199809111528.RAA03208@berlin.dvs1.tu-darmstadt.de>
Message-ID: <199809111827.NAA20682@mailhost.onramp.net>

At 05:21 PM 9/11/98 +0000, Peter Murray-Rust wrote:
>I agree with this. I have written two DTDs in XML (CML and VHG) both of
>which have to interoperate with other *unknown* DTDs. As a simple example,
>a paper in chemical physics requires (at least) xHTML, MathML, CML, RDF and
>DC. It is inconceivable that a generic DTD can be created that has valid
>content models for all conceivable applications in this domain. [It *is*
>conceivable that the J.Chem Phys produces a DTD and it's also highly
>probable that if J. Phys Chem also does it would use a different one.] I
>cannot see how, except in very carefully regulated domains (such as legal,
>patent, regulatory) it will be possible to combine generic DTDs to provide
>a useful mixture. For example, if someone wishes to embed a <price> in a
><molecule> this is a perfectly possible and reasonable thing to do. Why
>should I say they can't?
>

I wonder if a mechanism similar to Java's "import" statement would help
address some of the problems in combining DTDs?  If such a mechanism
existed, you could address a DTD and import only the element definitions
you needed (which, hopefully, would either solve the problem of different
definitions for a given element or make you more aware of name collisions
across multiple DTDs).

For example, you could feed your parser something like:

import DTDname.elementname;
or
import DTDname.m*;
or 
import DTDname.*;

to tell it that it should only recognize and use certain elements from the
named DTD.


Chris

------------------------------------------------------
"Don't *say* things.  What you *are* stands over you the while, and
thunders so that I cannot hear what you say to the contrary."
                  

--- Emerson, "Social Aims"

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Fri Sep 11 20:44:44 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:40 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: <35F8FE21.181ED5C9@jclark.com>
References: <199809090522.BAA11695@ruby.ora.com>			
 <199809090121.SAA15093@mail-gw.pacbell.net>		
 <v03102801b21c945aab3e@[203.23.215.128]>	
 <v04011701b21d8ea34317@[203.23.215.86]>
 <v04011705b21eac802f1c@[203.23.215.128]>
Message-ID: <v04011703b21f1fd180b9@[203.23.215.128]>

At 6:40 PM +0800 11/9/98, James Clark wrote:
>How are we disagreeing?  I said "specifies its result as XML" not
>"specifies that its result is XML".  In other words it describes its
>result in terms of an XML document.  That XML document doesn't have to
>be created.
Ah.

In that case you are making more precise use of English than I'm used to
parsing (too much dealing with Americans :-) and we are not disagreeing.
I'm also relieved to see that my understanding and intentions are not as
far off the mainstream as I thought.

This has been an interesting discussion. It seems to me that there are 2,
possiby 3 variations on the processing models, as perceived by different
people on this list.

content-only XML + XSL =
1) XML structure as internal representation (eg: c++ objects)
2) XML output formatted for markup, like non-styled HTML
3) structurally transformed XML plus a stylesheet (which may be CSS)

Andy
(Australianised ex-pat)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Fri Sep 11 21:35:37 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:40 2004
Subject: Another Summary of Validation in the face of Namespaces
Message-ID: <199809111919.OAA16900@foyt.indyrad.iupui.edu>


Before the question of Validation of documents that use
namespaces gets lost, I think we are near closure.

We have a 5 step procedure for making unique prefixes, then performing
normal validation.
	       
The open issues are:
	       
	[Step 4] How do you actually define the URI's for namespace
	prefixes used in a DTD?

	[Setp 4a] How do we actually __find__ the DTD's we need?

		   The document instance specifies URI's, but
		   the URI's are not "de-referencable".
	       
	[Question] Several people mentioned 
	       "If Namespaces still used the old PI method
	       of defining prefixes, there would be no problem."
	       
	       Can anyone explain the old PI method?
	       

**********************
The algorithm	
**********************	
	 There is a (5) step algorithm for validating
	   a document against a set of DTDS
	       
		1. Determine the Expanded the names the way the
		Namespace Proposal says. (Section 6.3)
		
		2. Define a unique prefix for each namespace definition URI .

		3. Rewrite the element or attribute, prepending the
		(possibly generated) unique prefix for the namespace
		of the element/attributes Expanded Name.
		
		   This give you  P1:BOOK, P1:NAME, P3:ADDRESS
		   if BOOK and NAME come from the same namespace URI.
		
		4. Do the same to the DTD's that you read in.
		    4.a Find the relevant DTD's
		
		5. Do normal DTD validation of the rewritten instance 
		   document against the rewritten DTD.
		

The following document instance is legal:

-----------------------------------------------------
	A Document Instance
-----------------------------------------------------
<DocumentRoot
   xmlns:J1="uri:bibliotheque"
   xmlns:J2="uri:locatie">

<J1:BOOK>
    <J1:NAME V="Xml Made Easy"/>
    <J2:ADDRESS V="Holland"/>
</J1:BOOK>

------------------------------------------------
NOTE:
   The J1 and J2 are fine, because the prefix doesn't matter.
   Only


This is where there is still a problem.
How do we define the URI's for the prefixes that the DTD uses?

------------------------------------------------
	A DTD for defining Address 
------------------------------------------------
<!DOCTYPE location_dtd [

	xmlns:K="uri:locatie"		<<< -- How can we stick this in there?

<!ELEMENT K:ADDRESS >
<!ATTRLIST K:ADDRESS
	v #pcdata>

	       
]>	       
	-- 
------------------------------------------------
	A DTD for defining Address
------------------------------------------------
<!DOCTYPE library_dtd [

	xmlns:K="uri:bibliotheque"	 <<< -- How can we stick this in there?

<!ELEMENT K:NAME >
<!ATTRLIST K:NAME
	v #pcdata>

	       
]>	       
	-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sgmlsh at CAM.ORG  Fri Sep 11 22:30:04 1998
From: sgmlsh at CAM.ORG (Sam Hunting)
Date: Mon Jun  7 17:04:40 2004
Subject: Another Summary of Validation in the face of Namespaces
In-Reply-To: <199809111919.OAA16900@foyt.indyrad.iupui.edu>
Message-ID: <Pine.GSO.3.94.980911162025.20463A-100000@Ocean.CAM.ORG>

> 		3. Rewrite the element or attribute, prepending the
> 		(possibly generated) unique prefix for the namespace
> 		of the element/attributes Expanded Name.

Suppose I have a requirement that my XML content cannot be changed in any
way. For example, the content is on a CD, yet I still wish to be able to
associate namespaces with the GIs on that CD. 

So how do I prepend the prefixes to content found on a read-only medium?

I suppose I could copy the data off the CD and validate the copy, but
wouldn't that get old pretty fast?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 11 22:33:58 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:40 2004
Subject: XML is boring 
In-Reply-To: <3.0.5.32.19980911072823.00c60d50@scripting.com>
References: <3.0.1.16.19980911084036.9757b832@pop3.demon.co.uk>
 <199809110325.XAA15202@goon.stg.brown.edu>
 <35F88004.70A7F204@technologist.com>
Message-ID: <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>

At 07:28 11/09/98 -0700, Dave Winer wrote:
[...]
>We have two projects that generate XML files every day. 
>
>One is a siteChanges.xml file for our server, www.scripting.com. It's very
>useful, if the search engines, or one search engine would read the damned
>file. If every webmaster produced one, the time to re-index the web would
>be dramatically shortened, and the search engines that used this would kick
>their competition's butts.

I think this is a great demonstration of XML-over-the-wire - exactly what I
was looking for. I pointed JUMBO2 at it and the default display shows the
siteChanges file beautifully - without any customisation. If I knew the
date convention I could even sort on dates...

This is the first 'real' non-textual XML file I have downloaded from an
'unknown' site and I'm delighted with how it all works.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From daniela at cnet.com  Fri Sep 11 23:15:04 1998
From: daniela at cnet.com (Daniel B. Austin)
Date: Mon Jun  7 17:04:40 2004
Subject: XML is boring/RDF maps available
In-Reply-To: <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>
Message-ID: <000201bdddc7$cd2f4720$7f53a2cc@cnet.com>

Hi fellow XML-ers,

> This is the first 'real' non-textual XML file I have
> downloaded from an
> 'unknown' site and I'm delighted with how it all works.


	I'd like to point out that there are some other XML files online. In fact,
CNET has RDF site maps for all of its sites available at:
http://www.cnet.com/RDF

These maps were created (by me) several months ago to demonstrate the
feasability of
using RDF for site maps for large websites. These maps are more than just
the top
level URLs avialable on CNET.COM, they clearly and visually represent the
information
architecture of the sites. These are 'real' XML files, and include
(rudimentary)
namespace prefixes, etc.

Thanks are due also to R.V. Guha for his help in preparing these files for
Mozilla (NS5).

Just so no one thinks that XML is not in use on the web, even for clients...

Regards,

D-
************************************************************************
Daniel Austin, Director of Development, Creative Services, CNET
daniela@cnet.com <mailto:daniela@cnet.com> 415-395-7800 x1438
"To change the old into the new, and the shapes of things to come..."


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Fri Sep 11 23:40:34 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:40 2004
Subject: Proposition: "SGML is Gumming Up the Works"
Message-ID: <199809112124.QAA19935@foyt.indyrad.iupui.edu>


******************************************************

SGML is Gumming Up the Works, or
	Down with DTD's.  Long Live Schemas!

******************************************************
<flame>

I wish I had a clean concscience. That I could say "I'm an SGML expert,
and I still believe ...." But I'm a newbie, and definitely NOT
an SGML expert.

Like "Samuel R. Blackburn" <sblackbu@erols.com>, I got interested
in XML not because it is a good document formatting language,
but on the promise that with XML, I can interchange Data!

I'm a computer scientist. I see XML-Data, and think: "Hey, there are
type definitions.  And hey, this data file clearly contains instances
of those types."  

But, stepping into the XML community, I'm overwhelmed by the SGML
history of XML.  I'm told: "No, conforming to type definitions isn't
good enough. That is not Real Validation. You must be valid according
to a DTD."  (Perhaps XML-DATA seems to have died because it wasn't
DTD-ish enough; I don't know.)


And then, looking at DTD's, I find that they aren't even as good as
BNF context free grammars. And BNF is much weaker than type systems,
which we need and want.


So, we end up jumping through hoops to write DTD's to express DATA
which is very, very, very easily described in terms of modern
programming language type systems.  All the while, hearing a low chant:
"What kind of cretin are you? You don't want to *validate* your data! (shock)
You only want well-formed documents." -- NO and YES.  I don't care
if my document can be validated by a pitiful DTD.  I do care that 
it conform to a real type schema!

</flame>

....


I'm not really mad at XML but, I think "Richard L. Goerwitz III"
<richard@goon.stg.brown.edu> is on to something in wondering if SGML
compatibility is going to bring down the XML effort.


If you have to be an SGML wizard to express easy things,
then we're in trouble.  

Much of the initial selling of XML was:
		You don't need DTD's to be a good citizen.

I hope we can honor that promise.


-- Mark

P.S. I'm optimistic about RDF, and am afraid that DCD sold out a bit towards
documents.  I want DATA schemas!

**************************************************


[Why DTD's aren't as good as BNF]

It's easier to have name clashes with
ELEMENT NAMES than it is to create ambigouous BNF grammars. Several
BNF productions can start with open paren 
		function f (a:int) return (a + 3);
All ELEMENT names must be unique.


-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mct at foyt.indyrad.iupui.edu  Fri Sep 11 23:56:24 1998
From: mct at foyt.indyrad.iupui.edu (Mark Tucker)
Date: Mon Jun  7 17:04:40 2004
Subject: Tiny Point: Rewriting and Documents on readonly CD's
Message-ID: <199809112140.QAA20165@foyt.indyrad.iupui.edu>


Sam Hunting <sgmlsh@CAM.ORG> wrote:

mct> 		3. Rewrite the element or attribute, prepending the
mct> 		(possibly generated) unique prefix for the namespace
mct> 		of the element/attributes Expanded Name.

sh> Suppose I have a requirement that my XML content cannot be changed in any
sh> way. For example, the content is on a CD, yet I still wish to be able to
sh> associate namespaces with the GIs on that CD. 
sh> 
sh> So how do I prepend the prefixes to content found on a read-only medium?
sh> 
sh> I suppose I could copy the data off the CD and validate the copy, but
sh> wouldn't that get old pretty fast?

The validation takes place in the RAM of the validation process.

If your validator builds a DOM internally, then the re-writing
is actually done by assigning new values to the DOM tree.

If you want to do it "streaming", can't a namespace-aware SAX
processor spit out [LocalName, applicableNamespace] pairs as it parses
the document? 

It seems that you don't even have to use two passes to find unique
prefixes:
	Just use the namespace URI as the definition, with bad
	characters escaped, as the normalized prefix.

For example,

<book	xmlns:Q="uri:/alpha">
  <name/>
</book>

is rewritten as

<book>
  <uri_2falpha:name>
</book>


(where bad characters in the uri are escaped with '_' hex hex.)


-- 
==============================================================
Mark Tucker			tucker_m@regenstrief.iupui.edu
Regenstrief Institute		phone: (317) 630-2606
1001 W. 10'th St; Indianapolis, IN; 46202-2859;	fax: (317) 630-6962

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 12 00:17:21 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:40 2004
Subject: XML is boring/RDF maps available
In-Reply-To: <000201bdddc7$cd2f4720$7f53a2cc@cnet.com>
References: <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980911231721.36efa9c6@pop3.demon.co.uk>

At 14:04 11/09/98 -0700, Daniel B. Austin wrote:
>Hi fellow XML-ers,
>
>> This is the first 'real' non-textual XML file I have
>> downloaded from an
>> 'unknown' site and I'm delighted with how it all works.
>
>
>	I'd like to point out that there are some other XML files online. In fact,
>CNET has RDF site maps for all of its sites available at:
>http://www.cnet.com/RDF

I got 'this page does not exist' - any revised address?

BTW the reasons I'm keen on finding these files is that I want to try to
create a 'browser' that does as much as possible to process the 'average
XML file on the WWW'. Yes - I know that's stupid - but apart from MathML
and CML it's working so far.

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From roddey at us.ibm.com  Sat Sep 12 01:49:35 1998
From: roddey at us.ibm.com (Dean Roddey)
Date: Mon Jun  7 17:04:40 2004
Subject: A nice propeller head XML question :-)
Message-ID: <5030300025071433000002L032*@MHS>


If the expansion of an entity reference is required to have a space on either
side of the expanded text, correct? If so, then something like this must be
invalid?

<!ENTITY % Foo 'ENTITY'>
<!%Foo; Bar 'Baz'>

right? Because the expansion of Foo requires that spaces be placed on either
side, creating this:

<! ENTITY Bar 'Baz'>

and the production for a PE declaration is:

PEDecl ::=  '<!ENTITY' S '%' S Name S PEDef S? '>'

which does not allow any space between the ! and ENTITY.

Is this correct? Basically what I'm looking for here is a more general rule
which says that I can consider any %blah; occurance at some place in the
grammar that does not allow spaces at that point to be in error without even
checking any further?

----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Sat Sep 12 03:23:50 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:41 2004
Subject: page header footer resolution
Message-ID: <v03102800b219ed18c1a9@[203.23.215.11]>

I think I've come to an understanding of how we should be handling page
headers/footers, helped largely by reading the XSL spec.

In the following discussion header will stand for header and/or footer.

I'm assuming an embedded XSL stylesheet to create a single standalone
document, but that's probably irrelevant.

The two issues I've struggled to come to terms with are:
1) headers contain some flow objects (page number, count, current date) as
well as data. Some of that data may not appear anywhere else in the report.
Thus, the items in a header are a mixture of data content (should appear in
the plain XML) and flow objects (appear in the XSL). It has taken me a
while to resolve exactly WHERE in the XML the data content should appear.

2) our report-writer model allows you to specify headers with content that
will vary depending on where they appear (eg: using a database field that
varies as you move down the report) and which appear at effectively random
locations, being triggered by page breaks. Note: this creates problems for
us with our current RTF rendering. The solution to both is a "section"
model which defines the header/footer anew but without forcing a page
break. That way the new headers are available when needed.

What I'm a little puzzled about at the moment is the action of queues and
the different page areas.

I've seen slightly conflicting suggestions. In some postings it's been
suggested that there is not currently a standard way to HIDE content.

However, if you are routing parts of your XML to a pageHeaderQueue that's
one page area and parts to bodyQueue, then they are effectively hidden from
the other queues.

If pageHeaderQueue was not then linked to a page master object wouldn't
this have the effect of hiding the elements in that queue?

Is there some exclusion rule on formatting objects that says, once an
element has been consumed by one object, it is effectively obscured from
the others?

Is this an example of the transformation actions of formatting objects vs
the cascading effect of rendering styles?

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Sat Sep 12 04:53:54 1998
From: richard at goon.stg.brown.edu (Richard Goerwitz)
Date: Mon Jun  7 17:04:41 2004
Subject: idea about how to make everyone happy
References: <199809102232.SAA10024@goon.stg.brown.edu>
Message-ID: <35F9E11D.1ECD7948@goon.stg.brown.edu>

Richard L. Goerwitz III wrote:
> 
> > Q:  "Why not associate namespaces optionally with DTDs (not necessarily
> > via the name-space URI)?"

Expanding a bit, for those who wish SGML and DTDs would die, here's
an idea that would keep you guys from totally pissing the SGMLers
off, and vice versa:

  1) associate schemas optionally with namespaces
  2) make sure there is a mechanism for identifying the schema type
     (e.g., a DTD) that's being associated with a given namespace
  3) bring back the PI syntax - or find some other way to tell the
     processor about what schemas it'll need to process in conjunction
     with what prefixes _before_ it starts parsing the document

Upshot: Those of you who wish SGML would just die (and along with it,
DTDs), you can have your cake and eat it, too.  Just put your top-
level element into a namespace, associate it with your schema, and
skip the DTD (or put in an empty one).

Note that this in no way prevents anyone from using the SGML DTD
mechanism - or from distributing validation across several different
DTDs (both the main one and ones associated with namespaces).  So
nobody is going to get lynched for killing SGML or DTDs.

It might even be possible to use the DTD, but to validate portions
of the document according to other schema types.

What I'm really trying to say here is that if we put aside the ani-
mosities for a second and think, we can figure out a way to work
everything out.  There are too many smart people running around
here for me to be willing to give up just yet.

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From daniela at cnet.com  Sat Sep 12 05:01:52 1998
From: daniela at cnet.com (Daniel B. Austin)
Date: Mon Jun  7 17:04:41 2004
Subject: RDF site maps back up
In-Reply-To: <3.0.1.16.19980911231721.36efa9c6@pop3.demon.co.uk>
References: <000201bdddc7$cd2f4720$7f53a2cc@cnet.com>
 <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>
Message-ID: <3.0.5.32.19980911195349.0094b820@central.cnet.com>

Hi all,


	The CNET RDF maps are now available for the RDF-unfortified
at http://www.cnet.com/RDF/index.html. You can look at the files
even if they don't display directly in your browser.
	The files can be viewed directly in Mozilla 
(Netscape Navigator 5.0), providing a useful and visually intuitive
site map for CNET sites. 

You can try it yourself if you dare:
http://www.wynholds.com/mike/mozilla/


Thanks to all who noted the error!

Regards,

D-


At 11:17 PM 9/11/98, you wrote:
>At 14:04 11/09/98 -0700, Daniel B. Austin wrote:
>>Hi fellow XML-ers,
>>
>>> This is the first 'real' non-textual XML file I have
>>> downloaded from an
>>> 'unknown' site and I'm delighted with how it all works.
>>
>>
>>	I'd like to point out that there are some other XML files online. In fact,
>>CNET has RDF site maps for all of its sites available at:
>>http://www.cnet.com/RDF
>
>I got 'this page does not exist' - any revised address?
>
>BTW the reasons I'm keen on finding these files is that I want to try to
>create a 'browser' that does as much as possible to process the 'average
>XML file on the WWW'. Yes - I know that's stupid - but apart from MathML
>and CML it's working so far.
>
>	P.
>
>Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
>net connection
>VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
>http://www.venus.co.uk/vhg
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
>
**************************************
Daniel B. Austin
CNET: The Computer Network
daniela@cnet.com (415) 395-7800 x1438

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Sat Sep 12 05:06:16 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:41 2004
Subject: page header footer resolution
Message-ID: <003101bdddfa$a94e85e0$b3acdccf@ix.netcom.com>

>I've seen slightly conflicting suggestions. In some postings it's been
>suggested that there is not currently a standard way to HIDE content.
>

my understanding was that if you didn't want to display an object you just
ommited to process the children.

Thus
<greeting>Hello XSL!</greeting>

<xsl:stlesheet>
    <xsl:template match="greeting">
        <fo:block font-size="16pt">
            <process-children/>
        </fo:block>
    </xsl:template>
</xsl:stlesheet>

 would result in a styled text flow object, "Hello XSL!"

whereas:

<xsl:template match="greeting">
    <fo:block font-size="16pt">
        <!--<process-children/>-->
    </fo:block>
</xsl:template>

would not.

>Is there some exclusion rule on formatting objects that says, once an
>element has been consumed by one object, it is effectively obscured from
>the others?

Again it was my understanding that the XSL processor looked for the best
match for any source element, and just made one template for each source
node.

Once it had found it it went on to the next.

you can however use the <xsl:process select....> to create  different views
of the same source element.

What is not clear to me at the moment is how one orders the result tree in
an order that is different from the source tree.

>1) headers contain some flow objects (page number, count, current date) as
>well as data. Some of that data may not appear anywhere else in the report.
>Thus, the items in a header are a mixture of data content (should appear in
>the plain XML) and flow objects (appear in the XSL). It has taken me a
>while to resolve exactly WHERE in the XML the data content should appear.
>

Actually the processor should use the source XML document + the XSL style
sheet to construct a brand new tree. What is rendered as flow objects are
the nodes of the brand new tree, and it doesnt matter whether these nodes
were derived from the source document, the style sheet, or were infact
generated as page numbers would be.


Frank


-----Original Message-----
From: Andy Dent <dent@highway1.com.au>
To: <xml-dev@ic.ac.uk>
Date: Friday, September 11, 1998 9:28 PM
Subject: page header footer resolution


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From liamquin at interlog.com  Sat Sep 12 06:31:58 1998
From: liamquin at interlog.com (Liam R. E. Quin)
Date: Mon Jun  7 17:04:41 2004
Subject: Deterministic Content Models ?
In-Reply-To: <35F937F8.9409C43@sophia.inria.fr>
Message-ID: <Pine.BSI.3.96r.980912000954.25689C-100000@shell1.interlog.com>

On Fri, 11 Sep 1998, Philippe Le H?garet wrote:

>  Is (paragraph*)* a determinist content model ?
>  If yes, so I think (a+ | b)* is a deterministic content model too.

Yes, they both are.

The constraint in section 3.3.1 of the XML spec is
    it is an error if an element in the document can match more
    than one occurrence of an element type in the content model.

Hence, a content model can only be non-deterministic in the XML sense
if it has a name that repeats.  For example,
    (a*, a*)
is non-deterministic, because the input <a> could match either "a*" in
the content model.  In the same way,
    (a+, b?, a+)
is bad because <a><a> could match the first "a+" or the first a+,
a missing b between the two elements, and the second <a/> could match
the second a+ in the content model.

Non-terminals are not an issue, so in your example of
    (paragraph*)*
although either * could be matched, * is not an element, so this is
still deterministic.

It's generally easier to ignore this error than it is to check it.

I hope this helps.

Lee

-- 
Liam Quin, GroveWare Inc., Toronto;  The barefoot agitator
l i a m q u i n     at    i n t e r l o g    dot   c o m


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Sat Sep 12 07:46:09 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:41 2004
Subject: A nice propeller head XML question :-)
Message-ID: <3.0.32.19980911224522.00aa3c70@pop.intergate.bc.ca>

At 07:56 PM 9/11/98 -0400, Dean Roddey wrote:
>
>If the expansion of an entity reference is required to have a space on either
>side of the expanded text, correct? If so, then something like this must be
>invalid?
>
><!ENTITY % Foo 'ENTITY'>
><!%Foo; Bar 'Baz'>
>
>right?

Right.

> Because the expansion of Foo requires that spaces be placed on either
>side, creating this:
>
><! ENTITY Bar 'Baz'>

Right. 

>and the production for a PE declaration is:
>
>PEDecl ::=  '<!ENTITY' S '%' S Name S PEDef S? '>'
>
>which does not allow any space between the ! and ENTITY.

Right.

>Is this correct? Basically what I'm looking for here is a more general rule
>which says that I can consider any %blah; occurance at some place in the
>grammar that does not allow spaces at that point to be in error without even
>checking any further?

Seems to me you got it right based on the spec.   -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 12 09:14:40 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:41 2004
Subject: RDF site maps back up
In-Reply-To: <3.0.5.32.19980911195349.0094b820@central.cnet.com>
References: <3.0.1.16.19980911231721.36efa9c6@pop3.demon.co.uk>
 <000201bdddc7$cd2f4720$7f53a2cc@cnet.com>
 <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980912071716.28e7847e@pop3.demon.co.uk>

At 19:53 11/09/98 -0500, Daniel B. Austin wrote:
>	The files can be viewed directly in Mozilla 
>(Netscape Navigator 5.0), providing a useful and visually intuitive
>site map for CNET sites. 

Thanks very much - this sounds exactly the sort of app we are looking for.

	P.

>
>You can try it yourself if you dare:
>http://www.wynholds.com/mike/mozilla/
>
>
>Thanks to all who noted the error!
>
>Regards,
>
>D-
>
>
>
>At 11:17 PM 9/11/98, you wrote:
>>At 14:04 11/09/98 -0700, Daniel B. Austin wrote:
>>>Hi fellow XML-ers,
>>>
>>>> This is the first 'real' non-textual XML file I have
>>>> downloaded from an
>>>> 'unknown' site and I'm delighted with how it all works.
>>>
>>>
>>>	I'd like to point out that there are some other XML files online. In fact,
>>>CNET has RDF site maps for all of its sites available at:
>>>http://www.cnet.com/RDF
>>
>>I got 'this page does not exist' - any revised address?
>>
>>BTW the reasons I'm keen on finding these files is that I want to try to
>>create a 'browser' that does as much as possible to process the 'average
>>XML file on the WWW'. Yes - I know that's stupid - but apart from MathML
>>and CML it's working so far.
>>
>>	P.
>>
>>Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
>>net connection
>>VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
>>http://www.venus.co.uk/vhg
>>
>>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>>(un)subscribe xml-dev
>>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>>subscribe xml-dev-digest
>>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>>
>>
>>
>**************************************
>Daniel B. Austin
>CNET: The Computer Network
>daniela@cnet.com (415) 395-7800 x1438
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 12 09:14:42 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:41 2004
Subject: RDF site maps back up
In-Reply-To: <3.0.5.32.19980911195349.0094b820@central.cnet.com>
References: <3.0.1.16.19980911231721.36efa9c6@pop3.demon.co.uk>
 <000201bdddc7$cd2f4720$7f53a2cc@cnet.com>
 <3.0.1.16.19980911212916.09b7a506@pop3.demon.co.uk>
Message-ID: <3.0.1.16.19980912081354.2adf9c50@pop3.demon.co.uk>

At 19:53 11/09/98 -0500, Daniel B. Austin wrote:
>Hi all,
>
>
>	The CNET RDF maps are now available for the RDF-unfortified
>at http://www.cnet.com/RDF/index.html. You can look at the files
>even if they don't display directly in your browser.

They display nicely in JUMBO2 (which picks up the name attribute and uses
it as a title).

To what extent are these files RDF (other than having an RDF:RDF element
wrapping them)? Is there a DTD or other spec for the files? Although JUMBO
does a reasonable job of presenting them (i.e. it guesses that href is a
link) it will need schemas and XLink before this is formally correct.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Sat Sep 12 13:37:19 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:41 2004
Subject: XML is boring (was Re: coming clean with the SGML crowd)
Message-ID: <027801bdde42$41e6e260$0e11e391@mhklaptop.bra01.icl.co.uk>

>>Looking at XML from a data centric
>>perspective, there are things in it that are worthless,
DTD's
>>for example.
There are some things that are worthless, like the
distinction between elements and attributes.
DTD's are not worthless in this context, but they are
grossly inadequate.
I share the view that XML today is primarily interesting as
a way of storing and transmitting complex information
objects.
I hope it will some day also be useful for presenting the
information content of a document to a browser for
rendition, but that is futures.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From egonw at sci.kun.nl  Sat Sep 12 13:49:46 1998
From: egonw at sci.kun.nl (Egon Willighagen)
Date: Mon Jun  7 17:04:41 2004
Subject: XML is boring
Message-ID: <199809121149.NAA17724@wn1.sci.kun.nl>


In my humble opinion I think XML is great!

As a pre-'Master' student is am started playing with XML about a year ago and 
a few
months ago i released DatabaseAccessDefinitionML (DADML) and found that XML has
many very interesting aspects that complement current internet standards. Like
siteChanges.xml.

DADML makes it possible to connect database over the internet. It consist of 
three
components: one defines a superdatabase and links other real databases; a 
second
defines a 'real' database with definitions of the fiels, how they are indexed,
how they can be retrieved (URL, free/commercial etc.) and more; the last 
component
defines the indices for which information is available. This last file can be 
generated
automatically through CGI scripting ofcourse.

This project is pure out of enthousiasm for XML and as small as it is shows how
easy it is AND how much fun it is to work with XML. Even if it is just a 
hobby...

The first practical use of DADML is currently being developed. A preview can be
viewed at the Woordenboek Organische Chemie. Try the following link
  http://www.sci.kun.nl/cgi-bin-sigma/woc/view?azijnzuur

The links bellow the Databases picture are the fields of the 'real' databases 
written
boldly. The designation (misschien aanwezig) means 'note sure if information is
available for this compound'. That is because there is no data indexfile for 
this
field given. Although a superdatabase can also check the availability by
questioning the http deamon. If File Not Found is returned this field is
obviously not available for this compound.

More information can be found at 
  http://www.sci.kun.nl/sigma/Persoonlijk/egonw/dadml/

or email me: egonw@sci.kun.nl

XMl really is a lot of fun!

Egon

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Sat Sep 12 16:21:04 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:41 2004
Subject: Proposition: "SGML is Gumming Up the Works"
In-Reply-To: <199809112124.QAA19935@foyt.indyrad.iupui.edu>
Message-ID: <Pine.SUN.3.91.980912095138.9503B-100000@cito.uwaterloo.ca>

The hardest part of coming to a new domain is recognizing what parts of 
what we know from other domains do NOT apply.

On Fri, 11 Sep 1998, Mark Tucker wrote:
>
> Like "Samuel R. Blackburn" <sblackbu@erols.com>, I got interested
> in XML not because it is a good document formatting language,
> but on the promise that with XML, I can interchange Data!

Documents are data. Documents are pretty much the most complicated type 
of data in existence. You can ignore everything that SGML taught us about 
encoding complex data for interchange, but then you'll be forced to 
reinvent it (and probably not as well).

> I'm a computer scientist. I see XML-Data, and think: "Hey, there are
> type definitions.  And hey, this data file clearly contains instances
> of those types."
> 
> But, stepping into the XML community, I'm overwhelmed by the SGML
> history of XML.  I'm told: "No, conforming to type definitions isn't
> good enough. That is not Real Validation. You must be valid according
> to a DTD."  (Perhaps XML-DATA seems to have died because it wasn't
> DTD-ish enough; I don't know.)

XML-Data died because it was half-baked crap that did not meet real 
needs. DCD will hopefully follow it into oblivion. Nevertheless, XML-Data 
was very "DTDish". At its heart is the same grammar that you complain 
about. The type system junk is a slight extension to that grammar.

> And then, looking at DTD's, I find that they aren't even as good as
> BNF context free grammars. And BNF is much weaker than type systems,
> which we need and want.

Consider, for a moment, your brain. It has all kinds of great type 
systems. It's full of Venn diagrams and hierarchies, right? But then, 
when you want to converse, you flatten all of that out into text streams 
described not by hierarchies and Venn diagrams, but by regular 
expressions and context-free grammars.

Saying that BNF is weaker than types systems is equivalent to saying that 
hammers are weaker than screwdrivers. They are not comparable. Grammars
describes serialization syntax and the other describes a data model.

If "type systems" could replace serializations, then we wouldn't need 
XML, would we? We'd just use Java's type system.

> So, we end up jumping through hoops to write DTD's to express DATA
> which is very, very, very easily described in terms of modern
> programming language type systems.  All the while, hearing a low chant:
> "What kind of cretin are you? You don't want to *validate* your data! (shock)
> You only want well-formed documents." -- NO and YES.  I don't care
> if my document can be validated by a pitiful DTD.  I do care that 
> it conform to a real type schema!

"Bang. Bang. Bang. I think I bent my screwdriver." I hate to let you 
down, but when you serialize your data model into XML, all you have is 
characters. Characters have to be verified according to the techniques that 
God and Chomsky provided for verifying character streams: regular 
languages, context free grammars, regular tree grammars, etc.

> I'm not really mad at XML but, I think "Richard L. Goerwitz III"
> <richard@goon.stg.brown.edu> is on to something in wondering if SGML
> compatibility is going to bring down the XML effort.

That's not what Richard L. Goerwitz III said. You are projecting your own 
feelings into his messages.

> If you have to be an SGML wizard to express easy things,
> then we're in trouble.  
> 
> Much of the initial selling of XML was:
> 		You don't need DTD's to be a good citizen.


> P.S. I'm optimistic about RDF, and am afraid that DCD sold out a bit towards
> documents.  I want DATA schemas!
> 
> **************************************************
> 
> 
> [Why DTD's aren't as good as BNF]

DTDs are much better than BNF. DTDs describe XML data. BNF describes a
MUCH larger family of languages. If we were to use BNF, we would have to
put constraints on the BNF that would make it almost identical to DTDs. 

Here's the ironic part: you are right that it should be possible to use 
the same element type name in multiple contexts as long as it isn't 
ambiguous (as in C). I have a proposal for an extension to DTDs (or 
schemas) that would allow that.

The problem is, that when you try to combine this advanced facility with 
type system-based proposals (e.g. inheritance, subtyping, etc.) 
everything goes to hell. The irony is that it is people who are screaming 
for "types" instead of lexical constraints who are *weakening* the 
lexical constraints that would make DTDs (or schemas) closer in power to BNF.

Consider:

<FUNCTIONDEF><NAME>Foo</NAME><PAREN><ARGS/></PAREN>
<BODY>
    a=<PAREN>B+1</PAREN>
</BODY>

What does it mean to "subclass" the PAREN element type when it is clearly 
used in two different contexts with two different content models? The 
answer: there is no PAREN type, really. There is a PAREN "tag" that can 
be used in completely different ways in completely different contexts.

In my opinion, you must THROW OUT the notion of type to make progress on 
this front. Of course, you can then re-introduce the notion of type at 
some higher level. But I think that we should make this lexical level 
powerful enough to do everything we need it to do before we move on to 
the type level.

 Paul Prescod


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 12 20:25:22 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:41 2004
Subject: XML is boring (long --- sorry)
In-Reply-To: <35F96326.AA6D357D@technologist.com>
References: <199809110325.XAA15202@goon.stg.brown.edu>
 <199809110817.KAA26872@goofy.gr05.synopsys.com>
Message-ID: <3.0.1.16.19980912192523.4487b850@pop3.demon.co.uk>

At 12:51 11/09/98 -0500, Paul Prescod wrote:
[...]

Paul is right, of course.
>
>Furthermore, XML has no competitors. PDF cannot do what XML can do. RTF

I keep reminding myself of this - that there is nowhere else to hide.

>cannot do it. PostScript cannot do it. HTML cannot do it. People are hyped
>about XML because they have been waiting for it without knowing that they
>were doing so.
>
[...]
>
>Now I know that Peter is in a little bit of a hurry. He wants his chemist

Mainly because assumed that the hype would drive the creation of XML
resources. My strategy - within chemistry - assumed that there would be a
large number of XML applications on the WWW by the end of the year. That's
what everyone had been saying. Although the current speed is not
unreasonable for the development of a difficult new subject, I am sup=rised
that it is so difficult to find DTDs or document fragments out on the WWW.
And that things like XLink have not generated anything.

>peers to adopt XML as quickly as possible. Maybe hype would help them to
>do that. Maybe not. But with or without hype, they will eventually adopt

I have no illusions about the time it will take mainstream chemists to
adopt XML. It will be the document-driven areas that drive it (patents,
regulatory, safety, publishing). I had assumed, however, that if XML was
ubiquitous - as we were led to believe, then it would be difficult to ignore. 

So the recent posting was as much as anything a reality check to make sure
that I (and others) were not missing large amounts of public XML resources.
It seems not - a few applications that emit XML have been posted here, and
I have been told of a few likely commercial developments. I have probably
done some people a disfavour - e.g. I remembered SMIL and BSML after my
initial posting.

I would not like to think that XML ends up like X-windows - a powerful
system with 5 thick impenetrable manuals before you can actually do
anything. Because, although I understand the view that XML can be compared
to ASCII, it's much more than that and we shouldn't let that get lost.

I shall continue to try to rally the enthusiast community - there is
nothing in XML that says you can only play if you are a company - though it
often appears that way. 
Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Sat Sep 12 20:47:23 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:41 2004
Subject: Proposition: "SGML is Gumming Up the Works"
In-Reply-To: <199809112124.QAA19935@foyt.indyrad.iupui.edu> (message from Mark
	Tucker on Fri, 11 Sep 1998 16:24:43 -0500 (EST))
Message-ID: <199809121845.OAA12537@ruby.ora.com>

[Mark Tucker]
> I wish I had a clean concscience. That I could say "I'm an SGML
> expert, and I still believe ...." But I'm a newbie, and definitely
> NOT an SGML expert.

If I may be so bold, it's possible that that lack of experience is
causing you to perceive a dichotomy where there really isn't one.

> Like "Samuel R. Blackburn" <sblackbu@erols.com>, I got interested in
> XML not because it is a good document formatting language, but on
> the promise that with XML, I can interchange Data!

As Paul Prescod pointed out, documents *are* data: very complex,
non-regularized data.  An informal proof of this is that XML was
designed for documents, and proved (completely as a side-effect) to be
very useful for regularized data.

> But, stepping into the XML community, I'm overwhelmed by the SGML
> history of XML.  I'm told: "No, conforming to type definitions isn't
> good enough. That is not Real Validation. You must be valid
> according to a DTD."  (Perhaps XML-DATA seems to have died because
> it wasn't DTD-ish enough; I don't know.)

Who told you that?  The biggest difference between XML and SGML is
that you do *not* have to be valid according to a DTD.  The only kind
of validation *defined by XML* is DTD validation, but (a) that
validation is not required, and (b) validation outside of the scope of
REC-XML is allowed (and in fact encouraged by the specification).

What you may have heard was a caution about imprecise langauge, which
SGMLers tend to be picky about.  If you used the word "validation" in
an XML context, someone may have pointed out that the word is well-
and precisely-defined for XML, and that you were misusing it in that
sense.

> So, we end up jumping through hoops to write DTD's to express DATA
> which is very, very, very easily described in terms of modern
> programming language type systems.

So describe it in terms of modern programming language type systems.
I'm not sure what the problem here is:

<data>
  <type-specification>
    <int name="i"/>
    <char name="c"/>
    <float name="f"/>
  </type-specification>
  <i>5</i>
  <c>h</c>
  <f>1.541</f>
  <j>Undefined data type</j>
  <c>Type violation error</c>
</data>

Of course, you'll have to write your own program to check the
type-validity of your document, whereas you can get DTD validation for
free.  But if what you need to do goes beyond DTDs, and you haven't
the patience to wait for the various data specification efforts going
on right now, then you have to roll your own.

> All the while, hearing a low chant: "What kind of cretin are you?
> You don't want to *validate* your data! (shock) You only want
> well-formed documents." -- NO and YES.

I'm not sure where you're getting this shock and horror.  Validation
is good because, and only because, it verifies that your data is what
it claims to be, and therefore other applications may make certain
assumptions without breaking.  If your serialized data stream can be
guaranteed because it came out of a database or is the result of
literate programming, then you *have* validated your data, though not
in the XML sense.

> I don't care if my document can be validated by a pitiful DTD.  I do
> care that it conform to a real type schema!

So create your own form of validation.  It's as simple as that.
Document geeks created XML, and its built-in validation is optimized
for validating documents.  Want something else?  Make it!

> I'm not really mad at XML but, I think "Richard L. Goerwitz III"
> <richard@goon.stg.brown.edu> is on to something in wondering if SGML
> compatibility is going to bring down the XML effort.

Quite the contrary.  If XML had not been built on SGML, new
applications would have had to be written to test the ideas in the
specification.  Building on SGML, XML was usable *the moment it was
created* with existing, high-powered tools.  If this had not been
true, it might have succeeded, but not nearly so quickly.

> Much of the initial selling of XML was:
> 		You don't need DTD's to be a good citizen.
> 
> I hope we can honor that promise.

The first part ("You don't need DTDs") isn't a promise, it's a fact,
enshrined in the XML specification:

   [22] prolog ::= XMLDecl? Misc* (doctypedecl Misc*)?

The second part ("to be a good citizen") is a completely subjective
statement and depends on so many philosophical variables that I won't
attempt to address it, except to say that no one could possibly have
promised that in any meaningful way.

If you like XML, use it.  If you don't, use something else; take the
good stuff from XML and leave the bad stuff out.  That's what XML did
with SGML; you go right ahead and do the same thing.  If we like what
you do, we'll use that instead, and you'll have your picture on the
cover of _Wired_.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Sat Sep 12 21:20:11 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:42 2004
Subject: inclusiveness (was "SGML is Gumming Up the Works")
In-Reply-To: <Pine.SUN.3.91.980912095138.9503B-100000@cito.uwaterloo.ca> from Paul Prescod at "Sep 12, 98 10:14:11 am"
Message-ID: <199809121919.PAA29365@goon.stg.brown.edu>

> > I'm not really mad at XML but, I think "Richard L. Goerwitz III"
> > <richard@goon.stg.brown.edu> is on to something in wondering if SGML
> > compatibility is going to bring down the XML effort.
> 
> That's not what Richard L. Goerwitz III said. You are projecting your own 
> feelings into his messages.

Just to alleviate the fears of anyone who thinks I'm out to trash SGML com-
patibility, I recently wrote a public validation service optimized specific-
ally for people who are moving SGML documents, along with their associated
DTDs, to XML:

  http://www.stg.brown.edu/service/xmlvalid/

I work in a shop associated with several important text-base projects that
we are doing in SGML.

Yes, in fact I have some deep philosophical disagreements with some of the
ways my own co-workers do things.  And my feeling about SGML is that it in-
curs way too much overhead, especially for small workgroups.

But on the other hand, I don't think the engineering community has been able
to appreciate properly the thought and work that has gone into the way the
SGML crowd operates.  Most American engineers, in particular, don't even
speak a "foreign" language, and most haven't the faintest idea what textual
information (monolingual or not) really is.

So I'm not in either camp, really.

What I believe is that we have the ability here to settle our differences.
I really believe that namespaces offer us a way of providing a migration
path from one schema type to another, if we can find a reasonable way to
associate namespaces with schemas, to type the schemas, and to distribute
validation across multiple namespaces (and maybe also schemas).

This doesn't mean I want everyone to dump DTDs.  Frankly I don't think that
the software engineering community has given them a fair shake yet.  And
loudly proclaiming them dead while running half cocked after other schemas
will only make XML seem unstable - and scare away major vendors we need to
keep our momentum.

On the other end, I don't think it's reasonable for core W3C committee mem-
bers to sit there whining about how foolish the rest of the world is - or
worse yet, simply ignoring us.

Maybe a decent compromise is in order.  The changes I suggest to the name-
space spec are just one possibility.

I am not one of the greater minds here.  I'm sure that with the talent we
have here, people can come up with even better suggestions.

These suggestions, though, should be simple and _inclusive_.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Sat Sep 12 21:23:54 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:42 2004
Subject: A nice propeller head XML question :-)
In-Reply-To: <3.0.32.19980911224522.00aa3c70@pop.intergate.bc.ca> from Tim Bray at "Sep 11, 98 10:46:13 pm"
Message-ID: <199809121923.PAA29496@goon.stg.brown.edu>

> >If the expansion of an entity reference is required to have a space on either
> >side of the expanded text, correct? If so, then something like this must be
> >invalid?
> >
> ><!ENTITY % Foo 'ENTITY'>
> ><!%Foo; Bar 'Baz'>
> >
> >right?
> 
> Right.

Just to add to Tim's statements:  It seems to me that the syntax of conditional
sections (INCLUDE/IGNORE) was changed specifically to allow spaces where older
XML standards did not, so that parameter-entity replacement would work for them.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Sat Sep 12 22:46:27 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
Message-ID: <3.0.32.19980912134632.00ab9670@pop.intergate.bc.ca>

I talk to a lot of journalists.  The #1 question I get is: "What will the
impact of XML be from the user point of view?"  My sound-bite answer is
"The web should look about the same, but work a lot faster.  And search 
engine results should get a lot better."

I believe in both those predictions.

Is that boring?

 -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sat Sep 12 23:40:16 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
Message-ID: <000601bdde95$ebf53150$2ee044c6@arcot-main>

>I talk to a lot of journalists.  The #1 question I get is: "What will the
>impact of XML be from the user point of view?"  My sound-bite answer is
>"The web should look about the same, but work a lot faster.  And search
>engine results should get a lot better."
>
>I believe in both those predictions.
>
>Is that boring?


No.  It is just not exciting.

My motto when dealing with press is try to tell them what they want to hear:
"The web will look unrecognizably different from what it is now and work too
fast to be useful.  And search engines will return information you really
need rather than what you asked for."  <g>

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Sun Sep 13 03:22:30 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
Message-ID: <000801bddeb4$c1f54c40$4b3dfea9@laforge>

I see XML enabling a closer integration of desktop applications and the web.

For example, I see no difficulty in pointing an editor at a document and
editing it--so long as I have write access, it shouldn't matter where the
document is located. Though that takes a little more than just XML for a
transport.

Add some reasonable caching/replication, and the applications no longer need
to track the difference between a logical and a physical location. Things
start getting much simpler/faster.

But the key here is to keep things simple enough that we can comprehend the
whole and make it all work together cleanly.

Bill


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dent at highway1.com.au  Sun Sep 13 04:34:23 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:42 2004
Subject: Type is not a big deal (was Re: Proposition: "SGML is Gumming Up
 the Works")
In-Reply-To: <Pine.SUN.3.91.980912095138.9503B-100000@cito.uwaterloo.ca>
References: <199809112124.QAA19935@foyt.indyrad.iupui.edu>
Message-ID: <v04011701b220df5f9c45@[203.23.215.53]>

At 10:14 PM +0800 12/9/98, Paul Prescod wrote:
>The hardest part of coming to a new domain is recognizing what parts of
>what we know from other domains do NOT apply.
<grin>

I think this is a two-level process. First is recognising that the surface
rules from other domains don't apply. Second is identifying common patterns
and meta-rules behind both domains. Learning something new gives you
insight into old areas of expertise.

>In my opinion, you must THROW OUT the notion of type to make progress on
>this front. Of course, you can then re-introduce the notion of type at
>some higher level.
The approach I've adopted for our database mapping is
- elements map to fields, thus element names being the same in different
contexts is indicative of possibly similar content but not a guarantee. eg:
Name would have similar semantics inside Person and Class.

- data types come from attributes or DTD/some other schemata. Name would be
of character type initially. If the data type is a subclass of character
data, and Name in both contexts ends up being the same subclass then it is
vastly more likely that the data serves the same purpose in those contexts.

This is not new or distorted for an XML perspective. Data typing of fields
in databases indicates only some of the semantics of the field. The rest
are context-dependent. This is true whether using object-oriented or
relational databases.
Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Sun Sep 13 04:40:49 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
In-Reply-To: <3.0.32.19980912134632.00ab9670@pop.intergate.bc.ca>
Message-ID: <3.0.5.32.19980912194121.00c958f0@scripting.com>

I think the most important thing about XML is that it will give users
choices. 

If Microsoft, for example, were to store all their Office files in XML then
you could use any other tool to work on the files. 

So Peter's dream of an XML-based spreadsheet may not be so far away.

The web of HTML documents is good for what it is, a simple display markup
with links. But there are a lot more UIs that are well understood and none
of them are particularly relevant to HTML. 

XML is a fresh start, taking the best ideas of the web (open file formats,
cross-platform, low-techness) and bringing it to a broader range of software.

Also, IMHO, if it achieves its promise, XML should clean up text-based
exchange formats, comma-delimited, tab-indented, etc.

And it presents an opportunity to clean up various incompatibilies in wire
protocols, Apple Events, COM, CORBA, etc.

But the key to all these things is compatibility, that's the big payoff for
users.

In a way it's amazing that Microsoft is supporting it so heavily, because
it seems to be much better for those of us outside of MS.

Dave

At 01:46 PM 9/12/98 -0700, you wrote:
>I talk to a lot of journalists.  The #1 question I get is: "What will the
>impact of XML be from the user point of view?"  My sound-bite answer is
>"The web should look about the same, but work a lot faster.  And search 
>engine results should get a lot better."
>
>I believe in both those predictions.
>
>Is that boring?
>
> -Tim
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Sun Sep 13 06:15:03 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (keep the faith!)
Message-ID: <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com>

I think that the whole of this thread has been overly pesamistic.

The beauty of XML is in it's simplicity, it is the 'Mona Lisa' of the web.
As long as the underlying spec. is not fiddled with too much, it is bound to
prevail.

Admittedly some of the spin off's of XML have been kludgy and model's of
murky ambiguity, but they will suffer the demise they deserve. The amazing
thing is that some of the standards have real potential. As other writers
have pointed out, when new standards come out there are bound to be several
false trails before the true path is discovered.

The place of XML for the storage of documents and data is surely assured,
but ap's based on these fundamentals are not 'sexy'. Sexy functions almost
by definition are functions that make the press say "wow'.

I believe we have such an 'ap' right now.

Having just finished 'hacking' the IE5 support for XML and the DOM, I am
amazed. Combined they can be used to retrieve any XML document and can
display it in almost any form we want on a (IE5 compatible, ah, there's the
rub!) browser.

I have not yet hacked the mozilla version of XML, but from what I hear it
will also give internet functionality to XML.

When this happens we can really expect XML to take off.

Several writers have expressed dissapointment that there are not more XML
'ap's' out there one year (actually only 7 months!!) after the release of
the recommendation. I think this shows how warped our perspective has
become, 7 month's is a very short time, and it took at least a year after
the release of Mosaic for the web to gain real momentum.

To write a good app.takes time, and I am actually suprised at how fast tings
are moving. There are several good programs out there, admittedly of the
'experimental' kind.

I personally now store all my doc's in XML format (I used to store them as
ASCII files), and use a simple script to convert them to HTML when I want to
read/display them. I am sure that hundreds of others are doing the same. I
wrote a simple program in VB that allows me to do this. It takes me about 30
secs to convert an XML file to HTML or RTF!.

As for the schema v.DTD controvesy, I think that DTD's are wonderful. They
allow me to make sure I have crossed all the 't's' and dotted all the 'i's'
so to speak. I use the MSXML parser to validate all my XML files.

I can well see the use of inheritable xml based schemas, but I don't need
them, and I'm willing to bet that 95% of those using XML don't need them
either.

All I can say is "Don't get depressed, keep the faith!!". We may squabble on
this list, but we (or rather XML) are bound to prevail because our cause is
just!  (And also practicable, simple, elegant, and fulfils a very necessary
purpose)

Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com

-----Original Message-----
From: Peter Murray-Rust <peter@ursus.demon.co.uk>
To: <xml-dev@ic.ac.uk>
Date: Friday, September 11, 1998 3:42 AM
Subject: XML is boring (was Re: coming clean with the SGML crowd)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sun Sep 13 11:16:00 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:42 2004
Subject: Opportunities for XML-DEV
In-Reply-To: <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com>
Message-ID: <3.0.1.16.19980913101630.903f754e@pop3.demon.co.uk>

At 00:18 13/09/98 -0400, Frank Boumphrey wrote:
>I think that the whole of this thread has been overly pesamistic.

Probably :-). I certainly think we should drop the thread title. The
intention was to catalyse XML-DEV members to think of ways that we could
and can show XML in action. 

What we have discovered is that there are very few XML documents currently
being delivered over the WWW. For many of us who see XML as a communication
medium *and philosophy* this is a pity. I think it makes it harder to
develop tools to work with specs like XLink, XPointer, Namespaces because
we don't have example documents to work with. And this is cyclic, because
those creating documents don't have tools to create documents with and
don't have people who can read them. So, at the moment we can only talk
about those applications.

I'd like to think we can develop early demonstrations of some of the
exciting concepts of XML. [I think XML - as a family - is much more than
simply Moore's law - 'faster, more searchable web']. 

>
>The beauty of XML is in it's simplicity, it is the 'Mona Lisa' of the web.
>As long as the underlying spec. is not fiddled with too much, it is bound to
>prevail.
>
>Admittedly some of the spin off's of XML have been kludgy and model's of
>murky ambiguity, but they will suffer the demise they deserve. The amazing
>thing is that some of the standards have real potential. As other writers
>have pointed out, when new standards come out there are bound to be several
>false trails before the true path is discovered.
>
>The place of XML for the storage of documents and data is surely assured,
>but ap's based on these fundamentals are not 'sexy'. Sexy functions almost
>by definition are functions that make the press say "wow'.
>
>I believe we have such an 'ap' right now.

Great. 
>
>Having just finished 'hacking' the IE5 support for XML and the DOM, I am
>amazed. Combined they can be used to retrieve any XML document and can
>display it in almost any form we want on a (IE5 compatible, ah, there's the
>rub!) browser.

'almost any form' is rather optimistic unless you are restricting yourself
to human-readable material. If you want to do Math you need math-aware
software (and, of course, chemistry).

>
>I have not yet hacked the mozilla version of XML, but from what I hear it
>will also give internet functionality to XML.
>
>When this happens we can really expect XML to take off.

I was pleased to find that the documents from scripting.com and cnet.com
displayed automatically in JUMBO2 without loss of useful information. These
have virtually no 'text' - they are all structure. So I am also optimistic
that we can start sending structure over the wire - let's try some
experiments.
>
>Several writers have expressed dissapointment that there are not more XML
>'ap's' out there one year (actually only 7 months!!) after the release of
>the recommendation. I think this shows how warped our perspective has
>become, 7 month's is a very short time, and it took at least a year after
>the release of Mosaic for the web to gain real momentum.

No - it was much faster with early adopters. By end-1993 there were several
new sites daily (I have an old 'What's new at NCSA' and the doubling time
was about 10 weeks. 

The early XML timelines seem to have slipped [could WG members comment?]. I
think it was expected that by now we would have full recommendations for
XLink [it was apparently XLink that *really* excited people back in Spring
1997]. Is there a revised timescale?

>
>To write a good app.takes time, and I am actually suprised at how fast tings
>are moving. There are several good programs out there, admittedly of the
>'experimental' kind.

Perhaps we need some collation of 'The Best of XML' to be able to show to
people what it is capable of.

>All I can say is "Don't get depressed, keep the faith!!". We may squabble on
>this list, but we (or rather XML) are bound to prevail because our cause is

We don't squabble! And we continue to remain focussed on development. This
thread has perhaps just been a small amount of refocussing. I shall
continue to try to attract the enthusiasts - please point them at XML-DEV
if you find them.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Sun Sep 13 12:56:27 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
In-Reply-To: <3.0.5.32.19980912194121.00c958f0@scripting.com>
References: <3.0.32.19980912134632.00ab9670@pop.intergate.bc.ca>
	<3.0.5.32.19980912194121.00c958f0@scripting.com>
Message-ID: <199809131055.GAA03564@unready.megginson.com>

Dave Winer writes:

 > If Microsoft, for example, were to store all their Office files in
 > XML then you could use any other tool to work on the files.

In fact, they plan to store the files in HTML, and use what they call
'XML islands' within the HTML pages to store extra
application-specific information.  Presumably, then, I'll be able to
see _something_ when I look at the files in Netscape on my Linux box,
but modifying the documents or interpreting the extra information in
the XML islands will be non-trivial.

 > So Peter's dream of an XML-based spreadsheet may not be so far away.

Still a bridge too far right now.  Why doesn't someone take Sun's
spreadsheet demo and modify it to read and write XML?  It should be a
couple of evenings' work, and would make a fun demo.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Sun Sep 13 15:55:46 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:42 2004
Subject: Reusing schema vocabularies
Message-ID: <3.0.1.32.19980913155150.00de4308@ifi.uio.no>


  REUSING SCHEMA VOCABULARIES: THINKING OUT LOUD
================================================

INTRODUCTION
------------

I'm still struggling with trying to figure out why namespaces are
needed, exactly what they achieve, what the cost is and why they are
the current preferred solution of the XML WG. This is an attempt to
clarify my thoughts by writing them down and hopefully having others
inspect (and correct) them.

Note that this is not a well-written and polished paper, just a sort
of 'textual dump' of my thoughts. The headings are there to organize
the dump somewhat and make it easier to read. I'm probably also a
little bit too optimistic about architectures, but I haven't got the
time to modify those parts now.

I will use the term 'DTD' when I refer to XML schemas as they are
defined in XML 1.0, and will use 'schema' to mean 'a DTDs or a schema
in any XML schema language'. I use 'XML 1.1' to refer to 'XML 1.0 as
extended by the namespace WD'.

The namespace WD seems motivated by the need to be able to define
different schema vocabularies in a single document, or to be less
general: the need to be able to reuse element and attribute names from
different DTDs in a single document.

As far as I can see there are currently two ways to achieve this:
namespaces and architectures. I'll try to list the advantages and
disadvantages of each to see if I can understand why the WG has chosen
what it has.


EFFECTS ON OTHER STANDARDS
--------------------------

NAMESPACES

Namespaces, while superficially simple, are really a profound change
to the XML data model: one of the most basic concepts (the concept
'name') is changed from a string to a namespace identifier _and_ a
string. The reuse of schema vocabularies is enabled by this modified
concept of names, allowing processing software to pick out names
belonging to a specific schema/namespace and operate on them.

This is incompatible with the use of names in XML 1.0, which means
that validation and attribute defaulting no longer work as before. In
other words: both validating and non-validating parsers are affected,
but only in the interpretation of the names used in DTDs. (XML 1.0
documents will work with XML 1.1 parsers, but not vice versa for
namespace-using documents.)

To allow validation and attribute defaulting in XML 1.1 the schema
syntax will have to change, whether the new syntax is a modified DTD
syntax or some entirely new schema language. This means that XML 1.1
documents that use namespaces will not be valid SGML documents.

In XML 1.1 it is conceivable that different schemas can be combined
without needing to be rewritten. With the current DTD syntax this will
require a liberal use of ANY content models, which very much weakens
the benefits of validation and structured editors. It is conceivable
that a schema language with features for the extension of the content
model of elements from reused schemas. No such schema language is
available at present.

This also means that to support XML 1.1 parsers must be modified, as
must the DOM and SAX, since they depend on the concept of names, which
has changed. (DOM getElementByTag name should be namespace-aware, for
instance.) XSL and CSS2 will also have to take XML 1.1 into account if
they are to allow stylesheets written for one schema to be used with a
schema that incorporates the first schema. (XSL patterns must then
support the new names.)

XPointer will not need to be modified, since XPointers are designed to
be tailor-written to the document they address into. Any XML query
language will have to be designed for XML 1.1 (which includes XPointer
if XPointer is used as a query language, as it can be). [XLink?]

A last problem with namespaces is less technical and more practical:
namespace names are awkward to work with, since they have a complex
syntax and must be long. This means that all XML applications that
rely on namespaces will be awkward where names are concerned, which is
almost everywhere. 


XML ARCHITECTURES

XML architectures are superficially complex, but require no changes to
the XML data model. They enable the reuse of schema vocabularies by
remapping names from the original document to a new 'virtual'
document, the architectural document.

This means that XML architectures can be layered on top of current
parsers (as XAF and xmlarch.py do), and furthermore that they require
no changes to XML 1.0. This means that SGML compatibility is retained.
Furthermore, it means that DOM, SAX, XSL, CSS2 and possible query
languages will not have to take architectures into account (beyond
allowing users to declare the architecture they wish XSL/CSS2/queries
to apply to), since they operate as before, but on an architectural
document instead of the original one.

In short, XML architectures do not affect any of the standards
currently in use or under design. (As will be seen later the
architecture syntax may have to change, but the effects of this change
are very likely minor.) XML architectures do require schemas reused in
compound schemas to be rewritten.


MEETING THE NOTE-WEBARCH-EXTLANG REQUIREMENTS
---------------------------------------------

Requirement #1:
  "It must be possible to introduce a new vocabulary in part of a
   document in a way that requires changes only locally within the
   document."

Namespaces meet this requirement by allowing new vocabularies to be
introduced on each element.

XML architectures as defined in ISO 10744:1997 A.3 do not meet this
requirement. The interesting question is of course: can they be
modified to do so?

As far as I can see, the answer must be yes. One way to do it might be
to allow the declaring PI to appear anywhere in a document, but only
to have scope from its declaration until an ending PI is met.
Architecture scopes must properly nest within each other (and within
elements).

This modified version of XML architectures meets the two first cases
listed in the motivation for requirement #1 in Note-webarch-extlang,
but not the third. However, the third is not met by namespaces either
and can only be met by a change to the XML 1.0 grammar. Given such a
change, both architectures and namespaces would meet the third case.


Requirement #2:
  "The syntax must unambiguously associate an identifier in a document
   with the related schema without requiring inspection of that or
   another schema."

By using URIs as namespace identifiers namespaces meet this
requirement.

XML architectures do not meet this requirement as they stand, since
the names of two architectures may clash. The modification suggested
above enables XML architectures to meet this requirement just as well
as namespaces do.

Namespace names may not collide in the namespace documents, but
prefixes may. If prefixes collide the inner prefix shadows the outer
one. Prefix collisions do not concern applications, since they use
namespace names to identify elements and attributes.

XML architecture names may also collide, but can be specified to
shadow one another as with prefixes. To enable the unique
identification of architectures (even in the case of collisions)
architecture declaration PIs can be extended with a namespace
attribute that contain an identifying URI.


Requirement #3:
  "It should be possible to create an original document schema such
   that one can determine, without access to the extension schema,
   which uses of extensions to that document can be ignored."

I do not understand this requirement and so cannot comment on it.


SUMMARY
-------

>From this discussion I emerge believing that XML architectures are a
superior solution to the problem of reusing schema vocabularies. They
have far less impact on the XML family of standards than namespaces do
and do not require XML to be modified or that SGML compatibility be
forsaken for documents that reuse schemas.

The nesting of namespaces is slightly more natural than that of
architectures, but since this nesting is only designed for
automatically generated documents (and since heavily nested namespaces
are more or less unreadable for humans anyway) this does not really
matter.

The data model of XML architecures is also much simpler than that of
namespaces, and XML architectures provide far better control over the
data model presented to processors designed for the original schemas.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From msuzio at anecdote.com  Sun Sep 13 17:03:58 1998
From: msuzio at anecdote.com (Michael J. Suzio)
Date: Mon Jun  7 17:04:42 2004
Subject: XML is boring (long --- sorry)
References: <3.0.5.32.19980912194121.00c958f0@scripting.com>
Message-ID: <35FBDEBD.C25E1485@anecdote.com>

Dave Winer wrote:
> 
> I think the most important thing about XML is that it will give users
> choices.
> 
> If Microsoft, for example, were to store all their Office 
> files in XML then you could use any other tool to 
> work on the files.

This is a far from foregone conclusion.  Sure, it could be encoded in
XML, and the underlying data structure more easily discerned, but
then again, reverse-engineering the Word file format is a doable
task, too.  So the XML encoding makes the job easier, but you
*still* need to have an application that understands the built-in
structuring rules enough to make sense of the data.

MS Office XML file formats are helpful only insofar as they are
well-documented and parseable from an application point of view.
Without that, I can generate a nice tree from JUMBO, but 
what else can I do?  Not a lot...

My point in all this is to point out that *only* well-supported,
public DTDs (and maybe even sample code to parse example instances
of the data) are going to make the big changes happen.  When you
and I agree on what XML spreadsheet data *looks like*, then we're
onto something.

-- 
Michael J. Suzio
Interconnect of Ann Arbor
msuzio@anecdote.com / 1-734-665-5342

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Sun Sep 13 17:23:00 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:43 2004
Subject: XML is boring (long --- sorry)
In-Reply-To: <35FBDEBD.C25E1485@anecdote.com>
References: <3.0.5.32.19980912194121.00c958f0@scripting.com>
Message-ID: <3.0.5.32.19980913082326.00c5c830@scripting.com>

>>MS Office XML file formats are helpful only insofar as they are
well-documented and parseable from an application point of view.
Without that, I can generate a nice tree from JUMBO, but 
what else can I do?  Not a lot...

This is totally true, and helps illustrate the point, an important one.
Microsoft has a lot of leeway in how they support XML. It can be done in a
banner-waving way, look how cool we are, or with a real committment to open
file formats, fostering the development of compatible apps from companies
other than Microsoft. My company has put a bet down that we'll be able to
do something interesting with Microsoft's XML files, and of course any
other XML-based apps that show up. However, right now, there are very few
apps that do stuff other than XML that understand XML. Who do we work with?
Right now it seems like only Microsoft.

One of the reasons UserLand stepped up with such a large investment in XML
is that our software already does a lot more stuff than just XML, so we add
unique value to the XML world, in the same way an XML-spreadsheet or
XML-drawprogram would. I'm pretty sure that's not well understood in the
press and on this list. We're going to work on that, build some more
examples of the value of our stuff. I've gotten a few ideas from this
discussion over the last few days.

My point is that there's got to be much more to XML than web browsing and
search engines, because the former has to compete with HTML, which is a
confusing standard, but very strong nonetheless; and the latter requires a
major investment by web developers and toolmakers, and by the search engine
companies, and no such investment is visible now. To expect that there will
be a user benefit in these areas is a real stretch, imho.

I also wonder if the W3C is up to handling issues of file formats for
productivity apps,  graphics tools and groupware, or if some other forum
for discussing file format standards is necessary.

Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Sun Sep 13 18:56:17 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:43 2004
Subject: XML is boring (long --- sorry)
Message-ID: <3.0.32.19980913095604.00ab9100@pop.intergate.bc.ca>

At 08:23 AM 9/13/98 -0700, Dave Winer wrote:
>I also wonder if the W3C is up to handling issues of file formats for
>productivity apps,  graphics tools and groupware, or if some other forum
>for discussing file format standards is necessary.

Tough call.  I think that W3C's choices so far have been consistent
(HTML, XML, CSS, MathML, SMIL, etc) but I'm not sure anyone has written
down the underlying principle.  For example, I can't see W3C being
an appropriate venue to discuss spreadsheet formats, but I can't really
crystallize why that's the case. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Philippe.Le_Hegaret at sophia.inria.fr  Sun Sep 13 19:18:34 1998
From: Philippe.Le_Hegaret at sophia.inria.fr (Philippe Le H�garet)
Date: Mon Jun  7 17:04:43 2004
Subject: Deterministic Content Models ?
References: <Pine.BSI.3.96r.980912000954.25689C-100000@shell1.interlog.com>
Message-ID: <35FBFE57.C9F54B94@sophia.inria.fr>

Liam R. E. Quin wrote:
> 
> On Fri, 11 Sep 1998, Philippe Le H?garet wrote:
> 
> >  Is (paragraph*)* a determinist content model ?
> >  If yes, so I think (a+ | b)* is a deterministic content model too.
> 
> Yes, they both are.
> 
> The constraint in section 3.3.1 of the XML spec is
>     it is an error if an element in the document can match more
>     than one occurrence of an element type in the content model.
> 
> Hence, a content model can only be non-deterministic in the XML sense
> if it has a name that repeats.  For example,
>     (a*, a*)
> is non-deterministic, because the input <a> could match either "a*" in
> the content model.  In the same way,
>     (a+, b?, a+)
> is bad because <a><a> could match the first "a+" or the first a+,
> a missing b between the two elements, and the second <a/> could match
> the second a+ in the content model.
  I'm not totally agree with you, because if you write the
sequence like this :
  (a, a*)* is it still deterministic ? For me no, because there are
two states in this content model.
  (a+)* is the same case and (a+ | b)* too.

   But, you're right : (a*)* is deterministic because you jump into the
same state.

Philippe.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Daniel.Veillard at w3.org  Sun Sep 13 20:57:57 1998
From: Daniel.Veillard at w3.org (Daniel Veillard)
Date: Mon Jun  7 17:04:43 2004
Subject: XML is boring
In-Reply-To: <3.0.32.19980913095604.00ab9100@pop.intergate.bc.ca>; from Tim Bray on Sun, Sep 13, 1998 at 09:56:16AM -0700
References: <3.0.32.19980913095604.00ab9100@pop.intergate.bc.ca>
Message-ID: <19980913145708.C28935@w3.org>

Quoting Tim Bray (tbray@textuality.com):
> At 08:23 AM 9/13/98 -0700, Dave Winer wrote:
> >I also wonder if the W3C is up to handling issues of file formats for
> >productivity apps,  graphics tools and groupware, or if some other forum
> >for discussing file format standards is necessary.
> 
> Tough call.  I think that W3C's choices so far have been consistent
> (HTML, XML, CSS, MathML, SMIL, etc) but I'm not sure anyone has written
> down the underlying principle.  For example, I can't see W3C being
> an appropriate venue to discuss spreadsheet formats, but I can't really
> crystallize why that's the case. -Tim

  I guess that a simple answer to that it is somewhat out of the scope
of our main goal:

--------- http://www.w3.org/Consortium/ ----------
The W3C was founded in October 1994 to lead the World Wide Web to its
full potential by developing common protocols that promote its evolution
and ensure its interoperability. 
--------------------------------------------------

  I guess also that trying to cover all the possible application of XML
would completely dilute our efforts. How far can W3C go in the standardization
of XML applications is a difficult problem, there is resources consideration
and also technical challenges. For example it seems that a standard type
system for XML encodings would be a good things since it would enhance
interoperability and avoid duplicating efforts. But I don't think we should
be involved in a normalization of the screw and bolts references XML
encoding, not that it's not important (I may even have more impact than
productivity apps format standardization), but rather because we don't have
good knowldge of this field and this will definitely better done by people
involved in that field.
  As always in real life the border between white and black is never a
line but shades of grey, this mean that W3C may give some feedback on 
other normalization process, and we will accept submission from members.

  Back to desktop applications, I for sure would be extremely happy to
have a standard XML encoding for them supported by the major application
vendors, I'm just not sure it would be a good idea to try to get this
done within W3C. 

  Daniel

-- 
Daniel.Veillard@w3.org | W3C  MIT/LCS  NE43-344  | Today's Bookmarks :
Tel : +1 617 253 5884  | 545 Technology Square   | Linux, WWW, rpm2html,
Fax : +1 617 258 5999  | Cambridge, MA 02139 USA | badminton, Kaffe,
http://www.w3.org/People/W3Cpeople.html#Veillard | HTTP-NG and Amaya.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bobp at lightlink.com  Sun Sep 13 22:02:00 1998
From: bobp at lightlink.com (Bob Parks)
Date: Mon Jun  7 17:04:43 2004
Subject: Opportunities for XML-DEV
Message-ID: <v04003a05b221e33e787e@[205.232.34.187]>

Peter wrote:
>What we have discovered is that there are very few XML documents currently
>being delivered over the WWW. For many of us who see XML as a communication
>medium *and philosophy* this is a pity. I think it makes it harder to
>develop tools to work with specs like XLink, XPointer, Namespaces because
>we don't have example documents to work with. And this is cyclic, because
>those creating documents don't have tools to create documents with and
>don't have people who can read them. So, at the moment we can only talk
>about those applications.

I have a 50,000 headword dictionary and thesaurus of English that can
fairly easily be converted to a simple XML representation. Would
availability of this text for research help stimulate application
development? What sort of applications?
Bob Parks


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Sun Sep 13 22:20:02 1998
From: richard at goon.stg.brown.edu (Richard Goerwitz)
Date: Mon Jun  7 17:04:43 2004
Subject: Deterministic Content Models ?
References: <Pine.BSI.3.96r.980912000954.25689C-100000@shell1.interlog.com> <35FBFE57.C9F54B94@sophia.inria.fr>
Message-ID: <35FC27CD.BFE425A5@goon.stg.brown.edu>

Philippe Le H?garet wrote:

> > Is (paragraph*)* a deterministic content model ?
> > If yes, so I think (a+ | b)* is a deterministic content model too.
> > >
> > >   it is an error if an element in the document can match more
> > >   than one occurrence of an element type in the content model.
>
>   I'm not totally agree with you, because if you write the
> sequence like this:
>
>     (a, a*)*
>
> is it still deterministic ? For me no, because there are
> two states in this content model. (a+)* is the same case and
> (a+ | b)* too.

Looks like everybody is more or less correct.

The whole point of flagging nondeterministic content models (which
is what SGML did, and XML may optionally do) is that nondetermin-
istic content models often indicate logic errors by the writer.

Put somewhat differently, if a DTD writer composes a content model
that allows a given sequence of elements to be processed in more
than one way, this often indicates an error.

So, for example, with (a, a*)*, it's hard to imagine what is
intended, because a single <a/><a/> could match two instances of
(a, a*), or one instance if (a, a*), depending on how you go
through the automaton.  Processors may, incidentally, flag (a+)*
as "ambiguous", since a+ usually implemented as (a, a*).

Such ambiguities create unintended differences in how the same
input might be processed by different software.  Or they simply
lead to the input being processed in a way the surprises the user
(or worse yet, the programmer).

That's why I think it's a good idea for validators, in particular,
to flag "ambiguous" content models aggressively.

To test these sorts of things is easy enough.  Just make up a toy
DTD and run it through a good validator.  Take, for example, the
following (where elements x, y, and z should get flagged as "am-
biguous"):

<!DOCTYPE test [
  <!ELEMENT test ANY>
  <!ELEMENT a EMPTY>
  <!ELEMENT b EMPTY>
  <!ELEMENT w (a*)*>
  <!ELEMENT x (a+ | b)*>
  <!ELEMENT y (a, a*)*>
  <!ELEMENT z (a+, b?, a+)>
]>

<test></test>

Yes, as always, you can try this out with the validator at:

  http://www.stg.brown.edu/service/xmlvalid/

-- 

Richard Goerwitz
PGP key fingerprint:    C1 3E F4 23 7C 33 51 8D  3B 88 53 57 56 0D 38 A0
For more info (mail, phone, fax no.):  finger richard@goon.stg.brown.edu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Mon Sep 14 02:04:22 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:43 2004
Subject: Namespaces Revisited...
Message-ID: <35FC5DAD.BE561987@infinet.com>

One thing which I find perplexing is that we are discussing how to add
namespaces to XML when there are no real XML apps out there that I am
aware of who have wrestled with this issue.  In other words, despite
everyone crying about standards bodies being too slow, perhaps in this
case the WG is being too fast.

Until real XML applications come out and provide their own proprietary
uses of namespaces at the application level, we will all not have much
of a clue about how application developers are using namespaces and how
application developers have implemented namespaces.

My initial understanding of namespaces was that it was a way of
associating some particular element name in conjunction with a prefix of
some sort to identify a UNIQUE form of data.  In this sense, XML is more
useful as a data container than as a document model, since a document
viewer generally has some sense of how to render all of the elements
that is presented to it.

In Java at least, I had some creative ideas as to how to dynamically
instantiate element handlers by class name.  The class name would be
constructed by using its prefix as a key to lookup the application
specific package name plus any other prefix data for the class name and
combining it with the element name in the document.

In other words if I had a namespace prefix:

java.awt

and I had a replace value for the prefix:

com.dais.awt.X

If I had a namespace name:

java.awt:Dimension

I would get as a result the class name:

com.dais.awt.XDimension

which would likely be a subclass of java.awt.Dimension


With the current namespaces proposal, it does nothing for me and I am
having a very hard time trying to figure out if it will ever do anything
for most applications.  If you look at previous standards like OpenDoc,
they generally failed because no one used them.

For the particular app I have I have chosen to embed documents within
documents instead of futzing around with all of this namespaces stuff
and believe it or not it all works quite nicely.  Part of the reasons
for this are application specific (for example some objects are
dynamically created and the content within a document can only be
applied at this particular time) but nevertheless it works.

All of this talk about extending DTD's and element type inheritance
seems to totally ignore the question of possible implementation.  An
idea is just an idea until you something concrete behind it.  XML has
the years of success of SGML behind it, but this namespaces stuff has
nothing behind it.  That is why I plead for us all to go slowly with
namespaces as there seems to be much disagreement about it now.

Standards are usually built upon some sort of concensus.  If a standards
body uses a top-down approach to publishing standards, then they will
lose a lot of respect really fast among the average ISV who likes a
standards body pushing drafts down their throat (unless it is of the
alcoholic kind) about as much as having standard shoved down their
throat by some very large monopolistic ISV.  The secrecy process that
the W3C goes through makes me wonder sometimes if the W3C is simply a
corporation responsible to its shareholders (those who fork over the big
bucks to become members) rather than a real standards body for the
betterment of computing.

If the standards process that namespaces goes through were anything like
the process that of SAX or XSchema (which to an official standards body
may be considered barbaric), I think that namespaces would be much
further ahead and there would be many more people happy with it.  No
matter how many geniuses you have working on some standards draft in a
closed forum, it is my opinion that you will rarely do better than a
forum of collective opinions of those who actually need to work with
that standard.

Once you take the politics out of a standards process and only
concentrate on being pragmatic, it is amazing how successful standards
endeavours can become.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bobp at lightlink.com  Mon Sep 14 04:09:13 1998
From: bobp at lightlink.com (Bob Parks)
Date: Mon Jun  7 17:04:43 2004
Subject: Opportunities for XML-DEV
Message-ID: <v04003a03b22239496576@[205.232.34.187]>

Peter wrote:
>What we have discovered is that there are very few XML documents currently
>being delivered over the WWW. For many of us who see XML as a communication
>medium *and philosophy* this is a pity. I think it makes it harder to
>develop tools to work with specs like XLink, XPointer, Namespaces because
>we don't have example documents to work with. And this is cyclic, because
>those creating documents don't have tools to create documents with and
>don't have people who can read them. So, at the moment we can only talk
>about those applications.

I have a 50,000 headword dictionary and thesaurus of English that can
fairly easily be converted to a simple XML representation. Would
availability of this text for research help stimulate application
development? What sort of applications?
Bob Parks


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Mon Sep 14 04:16:28 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:43 2004
Subject: HotBot query to find the XML files
In-Reply-To: <v04003a03b22239496576@[205.232.34.187]>
Message-ID: <3.0.5.32.19980913191754.00c83c70@scripting.com>

>>What we have discovered is that there are very few XML documents currently
>>being delivered over the WWW. 

When we were in the final stages of Frontier 5.1.3, we wanted to see how
our parser would do with real-world XML, and we found that HotBot has a
neat feature that allows you to find all the files that point to pages
ending with ".xml". We found a bunch of test cases that way.

Here's the query:

http://www.hotbot.com/text/default.asp?SM=MC&MT=&search=SEARCH&DC=100&DE=0&A
M0=MC&AT0=words&AW0=&AM1=MN&AT1=words&AW1=&savenummod=2&date=WH&DV=0&DR=newe
r&DM=1&DD=1&DY=98&FSU=1&FS=.xml&RD=AN&Domain=&RG=all&PS=A&PD=&_v=2&OPs=MDRTP
&NUMMOD=2

Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Mon Sep 14 05:23:41 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:43 2004
Subject: Opportunities for XML-DEV
Message-ID: <003b01bddf8f$9393db60$0facdccf@ix.netcom.com>

I have a 50,000 headword dictionary and thesaurus of English that can
>fairly easily be converted to a simple XML representation. Would
>availability of this text for research help stimulate application
>development? What sort of applications?
>Bob Parks


gosh yes!!

The more fundamental stuff like this we can get in XML format the better!
Part of getting the momentum going.

I'm sure every one knows about


http://sunsite.unc.edu/pub/sun-info/xml/eg/shakespeare.1.10.xml.zip">Shakesp
eares plays

and

http://sunsite.unc.edu/pub/sun-info/xml/eg/religion.1.10.xml.zip">the Bible,
the book of Mormon and the Koran.

These were marked up by Jon Bosak. he needs to update his syntax!<grin>, he
uses <?XML instead of the lower case!!

Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com
-----Original Message-----
From: Bob Parks <bobp@lightlink.com>
To: <xml-dev@ic.ac.uk>
Date: Sunday, September 13, 1998 4:11 PM
Subject: Re: Opportunities for XML-DEV


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Mon Sep 14 05:48:33 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:43 2004
Subject: Namespaces Revisited...
Message-ID: <006e01bddf93$1111aec0$0facdccf@ix.netcom.com>

>Until real XML applications come out and provide their own proprietary
>uses of namespaces

IE5 uses name spaces see my tutorial on the DOM at, www.hypermedic.com/style
, or go to www.microsoft.com/xml .

Go to www.hypermedic.com/style
and follow the DOM links.

Look under 'XML Namespaces' under the 'Loading XML on the browser' heading
in the word or text document.


Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com


Frank
-----Original Message-----
From: Tyler Baker <tyler@infinet.com>
To: <xml-dev@ic.ac.uk>
Date: Sunday, September 13, 1998 8:08 PM
Subject: Namespaces Revisited...


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Mon Sep 14 07:57:43 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:43 2004
Subject: Opportunities for XML-DEV
Message-ID: <000d01bddfa5$0cddc3e0$a0acdccf@ix.netcom.com>


Bob,

>I can imagine that it would be fairly easy to formulate
>interesting queries to the dictionary-thesaurus when marked up in xml.  But
>I'm wondering what sort of applications this data might enable?

Of the top of my head, I havn't a clue! but if the information was out there
it would give us some thing to work with! I bet it wouldnt be long before
some one came up with a Activexcontrol or an applet to search your material!

Are there any "general" uses for dictionary data in the XML
>community?

I don't know about the XMl community (although reading some of the posts
both the dictionary and the thesaurus most definitly needs to be used
<grin/>), but it would be information of real value for the web ingeneral.

The lack of such material I think it is an example of what Peter is worried
about when he states that there is a dearth of data out ther marked up in
XML

regards,
Frank

P.S I took the liberty of posting this reply to XML-DEV as well.

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com
-----Original Message-----
From: Bob Parks <bobp@lightlink.com>
To: Frank Boumphrey <bckman@ix.netcom.com>
Date: Monday, September 14, 1998 12:39 AM
Subject: Re: Opportunities for XML-DEV


Original message

>Frank,
>I understand the desire to get as many XML documents available on the web
>as possible. I can imagine that it would be fairly easy to formulate
>interesting queries to the dictionary-thesaurus when marked up in xml.  But
>I'm wondering what sort of applications this data might enable? Are there
>other applications that might be able to take advantage of the semantics in
>the data? Are there any "general" uses for dictionary data in the XML
>community?
>Bob
>
>>I have a 50,000 headword dictionary and thesaurus of English that can
>>>fairly easily be converted to a simple XML representation. Would
>>>availability of this text for research help stimulate application
>>>development? What sort of applications?
>>>Bob Parks
>>
>>
>>gosh yes!!
>>
>>The more fundamental stuff like this we can get in XML format the better!
>>Part of getting the momentum going.
>>
>>I'm sure every one knows about
>>
>>
>>http://sunsite.unc.edu/pub/sun-info/xml/eg/shakespeare.1.10.xml.zip">Shake
sp
>>eares plays
>>
>>and
>>
>>http://sunsite.unc.edu/pub/sun-info/xml/eg/religion.1.10.xml.zip">the
Bible,
>>the book of Mormon and the Koran.
>>
>>These were marked up by Jon Bosak. he needs to update his syntax!<grin>,
he
>>uses <?XML instead of the lower case!!


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Mon Sep 14 09:40:21 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:43 2004
Subject: Namespaces Revisited...
References: <35FC5DAD.BE561987@infinet.com>
Message-ID: <35FCC436.DE586DB0@jclark.com>

Tyler Baker wrote:
> 
> One thing which I find perplexing is that we are discussing how to add
> namespaces to XML when there are no real XML apps out there that I am
> aware of who have wrestled with this issue.

XSL uses namespaces very heavily as does RDF.  There are implementations
of both of these.  WebDAV and P3P are other important users. There are
also important commercial apps that are coming out soon that make heavy
use of XML namespaces.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tiernan at tiernan.net  Mon Sep 14 10:28:31 1998
From: tiernan at tiernan.net (tiernan ray)
Date: Mon Jun  7 17:04:43 2004
Subject: XML's proof
Message-ID: <35FCFE15.EE090819@tiernan.net>

To the list:

Hi. I am a journalist putting together a popular "Internet guide" type
of article for Fortune Magazine, to appear in November. I've been
following XML since Spring of last year, and I've been following this
mailing list in the past couple of months. I must say as a casual
observer, watching from the sidelines, and with little technical
authority, I'm nonetheless quite inspired by what you folks are
discussing and by the possibilities that XML offers. I've read the W3C's
volume on XML, published by O'Reilly, in which many of you have written
excellent, thoughtful articles, and I've been playing with some of the
Mozilla stuff. While all this is still a work in progress, I generally
assume you folks are all fairly bright and will make this stuff happen,
ultimately. I plan to present XML in my article as one of the
fundamental changes that will take place on the Web over the next year
to year-and-a-half.

My primary source of inspiration is the work Netscape has done with its
UI -- the RDF work, fetching the front-end from the network, and letting
Web sites configure browser tool bars, etc. And, of course, what each of
you has written on the importance of XML for organizing  libraries of
knowledge in domains, such as math and chemistry, and for making
possible new, non-browser types of applications. At any rate, as a
journalist I think XML sounds like a great step forward for the Web and
I plan to give this a positive spin in my Fortune piece. If anyone would
like to add to/quibble with my views, please by all means do so.

Thanks, and keep up the good work.

Tiernan Ray
freelance journalist


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Mon Sep 14 10:59:53 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:43 2004
Subject: XML is boring (long --- sorry)
Message-ID: <004501bddfbe$9ac7af40$1e09e391@mhklaptop.bra01.icl.co.uk>

>If Microsoft, for example, were to store all their Office
files in XML then
>you could use any other tool to work on the files.


I regret that this statement is no more true than saying "if
they stored all their data in ASCII then you could use any
other tool...".

Storing data in XML is a useful step towards this goal, but
it is not sufficient. The XML constructs used also need to
be well-documented. Given the choice between a
well-documented XML format and a well-documented non-XML
format, the difference is not whether you can process the
data, but how easy it is.

Anyway, I question the goal. I still believe encapsulation
is a Good Thing. I don't think the world would be a better
place if every application revealed all its internal data
structures (or even just its persistent internal data).

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From arcdev at mail.matav.hu  Mon Sep 14 11:41:30 1998
From: arcdev at mail.matav.hu (Attila Torcsvari)
Date: Mon Jun  7 17:04:44 2004
Subject: No subject
Message-ID: <01BDDFD4.9C7499F0@p2>

unsubscribe xml-dev arcdev@mail.matav.hu

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Mon Sep 14 11:47:09 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:44 2004
Subject: Namespaces Revisited...
References: <35FC5DAD.BE561987@infinet.com> <35FCC436.DE586DB0@jclark.com>
Message-ID: <35FCE647.EB095E07@infinet.com>

James Clark wrote:

> Tyler Baker wrote:
> >
> > One thing which I find perplexing is that we are discussing how to add
> > namespaces to XML when there are no real XML apps out there that I am
> > aware of who have wrestled with this issue.
>
> XSL uses namespaces very heavily as does RDF.  There are implementations
> of both of these.  WebDAV and P3P are other important users. There are
> also important commercial apps that are coming out soon that make heavy
> use of XML namespaces.
>
> James

XSL is a technology on top of XML.  It is not in and of itself an application
which uses namespaces.  RDF I am unfamiliar with and all of these commercial apps
that are coming out that will make heavy use of XML namespaces are hard to
imagine in that the latest draft (a major revision) is barely a month old and to
date there have not been too many commercial apps to date that even use XML, only
promises from a bunch of ISV's on the XML bandwagon.

I think namespaces will be a very important part of XML, but on the
implementation side of things as well as the end-user side of things (a lot of
documents may allow user editing like in HTML) the current namespaces spec seems
way too complex.  My general rule of thumb is that if something is difficult for
a programmer such as I to understand, it is hard to imagine that an average
end-user will have a clue at all what is going on.  If end-users have no idea
what they are working with, we might as all just be doing markup in binary
formats like the DOC format for Word.

I personally am working on a client-side app which uses XML extensively for tasks
as simple as init files to more complex tasks like saving object state.  The
attraction to XML is that the average user can hack around the architecture to
get the settings they want.  If XML is so complex that you need to have a masters
in computer science to understand it, then there is no reason to use XML in my
app as it is less efficient than some proprietary externalization format.  The
only major attraction to XML for me is that it makes the data my program
generates much easier to work with for third party developers and end-users.
Sadly enough, as the current namespaces spec is, it is very attractive to take
out all the muckety muck XML inherits from SGML and define my own markup language
which is simpler and easier to understand.  If I do not find XML to be usable to
third-parties and end-users, then I may have to make the decision to go another
direction depsite the positive press XML has to date as well as its standardized
reputation.  I hope that will never be the case and that simplicity of XML can
once again be a stated goal.  In fact, another developer I am aware of uses his
own markup format and is considering moving to XML, but he is finding it too
complex for him to implement in his app.  This guy is a quality developer, and
even he is having problems with XML as it stands.

HTML's main success was that it was very simple for the end-user; even my
grandmother (when she was alive) could of learned it in a few short hours.  If
people code XML documents and then get errors from the application when it reads
their data which says "invalid namespace declaration" or something like that,
then they will obviously be frustrated.

One thing I do when I write software that has intended third party uses is I
always try and make the interface and usability of the utmost simplicity and
usability.  SAX's success to date I believe is based upon this notion of
simplicity.  The current namespaces spec is not.  If people do not understand
what in the world the spec means and what it is intended for in less than half an
hour, then you need to ask yourself: "Does this make sense to the mortals".

I know this is all common-sense BS which probably insults the intelligence of
many of the people on the namespaces WG, but that is far from my intention.
Perhaps a survey of poll of application developers as well as end-users on XML
namespaces would help solve these sort of issues.

Back in my college days at CMU there was a professor who was a CS/Psychology
Nobel Laureate (I cannot remember his name off the bat) who taught there and
lectured once on a very important lesson that I apply to my software
development.  Basically, that all things being equal people will choose not the
most satisfying solution, but the most satisficing one.  Satisficing is basically
a term he invented which says that you choose the solution that makes everything
(or everyone) the most happy.

In other words, if you are searching for the holy grail solution to a problem,
then you will likely never find it.  If you choose a solution that you can prove
will work as well as achieve its objectives and make everyone happy, then choose
that solution.  I think going back to the PI-based approach will be much more
satisficing in the end than the current proposal.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Mon Sep 14 11:49:47 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:44 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: Andrew Bunner's message of Thu, 10 Sep 1998 09:39:17 -0700
References: <199809090522.BAA11695@ruby.ora.com>  <199809090121.SAA15093@mail-gw.pacbell.net>  <v03102801b21c945aab3e@[203.23.215.128]> <199809101633.JAA23541@mail-gw6.pacbell.net>
Message-ID: <f5b90jnm3dl.fsf@cogsci.ed.ac.uk>

Andrew Bunner <bunner@massquantities.com> writes:

>   The moral of the story is that if your target language is not XML, then
> you have to write your own tool to take it from XML to, let's say, HTML.
> One way is to get into the XSL processor and add your own code, another
> (less clean) way is to write something that post-processes the XML
> representation of the target language.

Um, to reiterate something James said, the draft recommendation is
clear, if perhaps not as explicitly obtrusively clear as it should be,
that it specifies a process which results in an XML result tree,
representation NOT specified, not a sequence of UNICODE characters.
It also provides a rendering semantics for one family of result trees,
namely those composed of the formatting object element types and
attributes.  It nowhere states that a conformant XSL processor must be
able to output a linearised form of the result tree.  Some may choose
to.  Others may implement the rendering semantics of the formatting
objects directly, and never expose the result tree.

Nothing in the draft recommendation suggests that an application which
implements only the tree-construction component of XSL is a conformant
XSL application.  That said, it would be naive to suppose that
applications meeting this description (XT is one, of course) will not
be common.

>   Unless, of course, we change the standard.
> . . .
>   XSL seems perfectly well equipped to handle any text-based target
> language. So why not let it?
> 
>   I guess I don't see the same need to "choose something" or restrict it in
> any way other than to say "you must produce text". There must be something
> very important that we gain by insisting the target language be one thing
> or another. Help me understand what this important thing is.

As noted above, we don't WANT to require people to produce text, or
anything else except rendering according to formatting object
semantics.  XSL is a STYLE language.  It's a wonderful side benefit
that it encorporates a useful tree construction language, and of
course people will take advantage of that, but it makes sense from the
perspective of the WG's charter to use XML to describe the structure
of the result tree we need to drive the rendering process.

If you want text output, go ahead:  Define your result tree to look
like this

<wrap>
You can put any text you like in here, and it can look like
another language;

 La plume de ma tante

or ANOTHER language

 if x &lt; 3; then echo oops; exit; fi

or an other *)#)(#@) thing you like.

</wrap>

and your output procedure to strip the wrap and expand the entity
references.  You won't have a conformant application, not because you
are outputing text, but because you're not supporting the fo:
semantics.  I'm sure you'll be OK with that :-)

I hope this will clarify, wrt your very first para. above, that this
isn't an EXTRA bit of work you have to do, to "get into the XSL
processor and add your own code": ANY XSL processor which plans to
output character sequences goes beyond the draft recommendation, and
will have to include an idiosyncratic back-end.  If you know JADE,
James Clark's DSSSL engine, you will know that it has a core which
implements the standard, AND a number of backends.  A conformant XSL
implementation will probably be organised similarly: a core which
implements tree construction, one backend which renders fo:* to a
display directly using X-windows or MS foundation classes or SwingSet
or ..., perhaps another backend which renders fo:* to print using Tex
or PostScript or ..., probably a third backend which outputs the
result tree as XML, and so on.

Hope this helps

ht [speaking for myself, not the WG]
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From hirajima at tokiwa-info.co.jp  Mon Sep 14 11:52:36 1998
From: hirajima at tokiwa-info.co.jp (Masahiko Hirajima)
Date: Mon Jun  7 17:04:44 2004
Subject: No subject
Message-ID: <199809140946.AA00122@Thinkpad530cs.tokiwa-info.co.jp>

unsubscribe xml-dev hirajima@tokiwa-info.co.jp

********************************************
*      Hirajima Masahiko                   *
*      General Manager                     *
*      EDI Sales Division                  *
*      Tokiwa Information Co,. Ltd.        *
*      TEL : 813-5828-1326                 *
*      FAX : 813-5828-1175                 *
*      Email : hirajima@tokiwa-info.co.jp  *
********************************************

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Mon Sep 14 11:52:54 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:44 2004
Subject: MSXML Parser
Message-ID: <3.0.1.32.19980914114812.00dc4510@ifi.uio.no>


* Jonathan Robie
|
| Can anybody share their experiences with the MSXML parser? How does
| this compare to any other validating XML parser? Are there
| significant differences that I should be aware of?

It isn't updated to the final XML 1.0 recommendation and appears to be
abandoned. Microsoft has released a beta of their new XML parser which 
seems to be developed by DataChannel. (Strangely, DataChannel still
seem to offer DXP beta 1d.)

(See <URL:http://www.datachannel.com/xml.html>)

I don't know anything about this new parser, but I'm not too excited
about it when there are so many other good Java parsers in XML. You
can take your pick at:

<URL:http://www.stud.ifi.uio.no/~larsga/linker/xmltools/by-platform.html#Java>

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 14 12:33:07 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:44 2004
Subject: Opportunities for XML-DEV
In-Reply-To: <003b01bddf8f$9393db60$0facdccf@ix.netcom.com>
References: <003b01bddf8f$9393db60$0facdccf@ix.netcom.com>
Message-ID: <199809141031.GAA00209@unready.megginson.com>

Frank Boumphrey writes:

 > I'm sure every one knows about
 > 
 > http://sunsite.unc.edu/pub/sun-info/xml/eg/shakespeare.1.10.xml.zip">Shakesp
 > eares plays
 > 
 > and
 > 
 > http://sunsite.unc.edu/pub/sun-info/xml/eg/religion.1.10.xml.zip">the Bible,
 > the book of Mormon and the Koran.

Don't forget my old Heart of Darkness from last Fall:

  http://home.sprynet.com/sprynet/dmeggins/texts/darkness/index.html

You can start parsing directly from the following URL:

  http://home.sprynet.com/sprynet/dmeggins/texts/darkness/darkness.xml

This document makes reference to an external DTD subset and to several
external entities (each with its own encoding declaration), so it
provides a good workout.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From fernando at pix.com.br  Mon Sep 14 14:07:17 1998
From: fernando at pix.com.br (Fernando Cabral)
Date: Mon Jun  7 17:04:44 2004
Subject: Text in XML
References: <35FCFE15.EE090819@tiernan.net>
Message-ID: <35FD11F2.21193099@pix.com.br>

Hello

In order to test some characteristics of a SGML-based search engine, I need some
XML files. I would prefer having some classics of the literature, preferentially
those including attributes like emphasis, bold, italics and diacritics.

I'll be glad if any of you can send me some of them of perhaps give me
some URL I get them from.

Thank you.

-fernando


--
mailto:fernando@pix.com.br                    http://www.pix.com.br
Fernando Cabral                               Padrao iX Sistemas Abertos
Fernando@Pix.com.br                           Pix@Pix.com.br
Fone: +55 61 321-2433                         Fax: +55 61 225-3082


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Mon Sep 14 14:29:42 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:44 2004
Subject: Text in XML
Message-ID: <046301bddfdb$92594e00$c56118cb@caleb>


see http://www.xmlinfo.com/examples/


--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/

-----Original Message-----
From: Fernando Cabral <fernando@pix.com.br>

>Hello
>
>In order to test some characteristics of a SGML-based search engine, I need
some
>XML files. I would prefer having some classics of the literature,
preferentially
>those including attributes like emphasis, bold, italics and diacritics.
>
>I'll be glad if any of you can send me some of them of perhaps give me
>some URL I get them from.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eriblair at mediom.qc.ca  Mon Sep 14 14:56:37 1998
From: eriblair at mediom.qc.ca (Eric Riblair)
Date: Mon Jun  7 17:04:44 2004
Subject: ISO-LATIN and Msxml ...
Message-ID: <199809141255.IAA29635@netra.mediom.qc.ca>

Does anyone know how to submit a XML file containing iso-latin character to
the msxml applet ( ... in a DHTML file) that works !!!


Thanks for any help,
Eric


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 14 16:01:49 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:44 2004
Subject: Text in XML
In-Reply-To: <35FD11F2.21193099@pix.com.br>
References: <35FCFE15.EE090819@tiernan.net>
	<35FD11F2.21193099@pix.com.br>
Message-ID: <199809141400.KAA00911@unready.megginson.com>

Fernando Cabral writes:

 > In order to test some characteristics of a SGML-based search
 > engine, I need some XML files. I would prefer having some classics
 > of the literature, preferentially those including attributes like
 > emphasis, bold, italics and diacritics.

Please don't take this the wrong way, but I'm hoping that this search
will fail (at least, the part about "bold", "italics", etc.).  There
are special circumstances where people would mark up presentational
information like typefaces in XML (codicology and library science are
two obvious examples), but for general-purpose use, an XML literary
text would say what something *is* rather than what it should *look
like*.  For example,

BAD (usually):

  <newline>
  "What a <italic>beau</italic>!" signed Cecille.

GOOD (usually):

  <p><q>What a <foreign>beau</emphatic>!</q> sighed Cecille.</p>


A literary or linguistic scholar might add all sorts of extra
information:

  <para><q ref="Ce0020"><s type="excl">What a <foreign
   source="FR" period="s.xix" usage="m-class
   u-class">beau</emphatic>!</s></q> sighed <name
   ref="Ce0020">Cecille</cecille>.</para>  

Sure, it looks like hell, but the scholar can use this to generate an
index of proper names (usefull for a 2,000-page Victorian novel) and
index of foreign terms, and can execute queries like

  How often does Cecille use French words in an exclamatory sentence?

Don't try this at home.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 14 17:14:07 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:44 2004
Subject: Namespaces Revisited...
References: <35FC5DAD.BE561987@infinet.com>
Message-ID: <35FD32B5.BA5440E8@locke.ccil.org>

Tyler Baker wrote:

> [D]espite
> everyone crying about standards bodies being too slow, perhaps in this
> case the WG is being too fast.

Indeed.
 
> All of this talk about extending DTD's and element type inheritance
> seems to totally ignore the question of possible implementation.  An
> idea is just an idea until you something concrete behind it.  XML has
> the years of success of SGML behind it, but this namespaces stuff has
> nothing behind it.

And what's worse, there's an alternative (architectures) that actually
has an implementation (David Megginson's XAF), plus years of SGML
experience.

> The secrecy process that
> the W3C goes through makes me wonder sometimes if the W3C is simply a
> corporation responsible to its shareholders (those who fork over the big
> bucks to become members) rather than a real standards body for the
> betterment of computing.

That's *exactly* what it is.  It's a *consortium*, which serves the
interests of its members and no others.

Unfortunately, we're stuck with it, until some ISO/IEC committee
can be persuaded to take up XML (there is already one trying
to create ISO HTML).  ISO/IEC WGs may be horribly slow and politics-ridden,
but they can't just *ignore* comments if they are properly submitted.
They *have* to process all of them.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Mon Sep 14 19:14:23 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:44 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
In-Reply-To: John Cowan's message of "Fri, 11 Sep 1998 11:10:48 -0400"
References: <199809090522.BAA11695@ruby.ora.com> 				 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]> <35F7B89F.2E779FDB@jclark.com> <35F804CD.47CAAB14@locke.ccil.org> <35F8F3B3.4149A79A@jclark.com> <35F93D78.B8325124@locke.ccil.org>
Message-ID: <upvcylilo.fsf@delivery.ansa.co.uk>

John> John Cowan <URL:mailto:cowan@locke.ccil.org>

0> In article <35F93D78.B8325124@locke.ccil.org>, John wrote:

John> James Clark wrote:

>> The term "well-formed HTML" as used in section 1 of the XSL WD does
>> not mean SGML that conforms to HTML 4.0. It means well-formed XML
>> that uses element types and attributes from HTML.

John> Well and good.  But "uses element types" etc. is vague: all element
John> types, or only some of them?  It can't (straightforwardly) be all
John> of them, because SCRIPT and STYLE are CDATA elements, and so have
John> no XML equivalents.

Whether element content is CDATA or #PCDATA is relevant only to a parser.
Once you have (for example) a grove representation, an application can't
tell the difference between the subtrees representing the two types of
element.

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Mon Sep 14 19:21:35 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:44 2004
Subject: Namespaces Revisited...
Message-ID: <3.0.32.19980914102258.00d8bcb4@pop.intergate.bc.ca>

At 11:13 AM 9/14/98 -0400, John Cowan wrote:
>Unfortunately, we're stuck with it, until some ISO/IEC committee
>can be persuaded to take up XML (there is already one trying
>to create ISO HTML).  ISO/IEC WGs may be horribly slow and politics-ridden,
>but they can't just *ignore* comments if they are properly submitted.
>They *have* to process all of them.

For what it's worth, the XML process has been *extremely* open to 
comments and considered, exhaustively and at great length, the issue
of whether what we're trying to do with namespaces could be done
better with architectural forms or some variation thereon.  I and
David Megginson and others have expanded at near-infinite length
in this forum and others on these issues and the bases for the
process' conclusions.

I fully acknowledge that you disagree with the conclusion that the
committee came to, but it is incorrect and damaging to allege that 
the process ignored the input.  -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 14 19:43:42 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:44 2004
Subject: ANN: SAX ParserFilters
Message-ID: <35FD55B6.A6215D8C@locke.ccil.org>

I have released beta code for two SAX parser filters.  A parser filter
looks like a standard SAX parser, but relies on an underlying real
parser to do the work, and adds some extra service.

NamespaceFilter provides an implementation of the XML Namespaces
WD.  All element and attribute names are resolved to the form
"URI^localpart", where URI may be null.  The circumflex character
is not legal in either URIs or localparts.  The non-SAX methods
mapElementName() and mapAttributeName() convert element or attribute
names to "URI^localpart" form according to the current set of
namespace prefixes.

InheritanceFilter provides support for inheritable attributes.
By default, only xml:space and xml:lang are inheritable, but
applications may call the non-SAX method inheritable() to specify
other inheritable attributes.

The abstract class ParserFilter provides the basic mechanism for
writing parser filters.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 14 20:05:07 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:44 2004
Subject: John Cowan's XML page is at http://www.ccil.org/~cowan/XML
Message-ID: <35FD5ABD.65BE438A@locke.ccil.org>

All my current efforts, including IBTWSH and ParserFilters, are
now available there.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bckman at ix.netcom.com  Mon Sep 14 20:47:42 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:44 2004
Subject: XSL Tutorial
Message-ID: <001c01bde010$abea4ec0$89addccf@ix.netcom.com>

Hi,

to those who may be interested Part I (covering the basics) my new XSL
tutorial is ready. It is available either as a text file or as a Word97
file. Down load it from www.hypermedic.com/style. Follow the XSL links.

I would be glad of comments or criticism's

regards,
Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jctsai at fedex.com  Mon Sep 14 22:28:04 1998
From: jctsai at fedex.com (January Tsai)
Date: Mon Jun  7 17:04:44 2004
Subject: JDBC and XML
Message-ID: <199809142027.AA25354@gateway.fedex.com>

Is there a JDBC driver for XML or something that works the same but named differently?
Thanks!
-january

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From elm at arbortext.com  Mon Sep 14 22:31:30 1998
From: elm at arbortext.com (Eve L. Maler)
Date: Mon Jun  7 17:04:45 2004
Subject: 10 September 1998 version of XML spec DTD and documentation
Message-ID: <199809142030.QAA06791@doctools.com>

Hello folks-- You can now get access to the latest version of the W3C XML
specification DTD and its documentation, at the following locations:

DTD:		http://www.w3.org/XML/1998/06/xmlspec-19980910.dtd
Documentation:	http://www.w3.org/XML/1998/06/xmlspec-report-19980910.htm

Although the previous version was technically available in a public
location, I'd be surprised if anyone unearthed it...  Please send any
comments or questions to elm@arbortext.com.

Thanks,

	Eve

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From matt at veosystems.com  Mon Sep 14 22:46:17 1998
From: matt at veosystems.com (matt@veosystems.com)
Date: Mon Jun  7 17:04:45 2004
Subject: Proposition: "SGML is Gumming Up the Works"
In-Reply-To: <Pine.SUN.3.91.980912095138.9503B-100000@cito.uwaterloo.ca>
Message-ID: <Pine.LNX.3.96.980914132832.6705C-100000@archimedes.veosystems.com>


On Sat, 12 Sep 1998, Paul Prescod wrote:

> The hardest part of coming to a new domain is recognizing what parts of 
> what we know from other domains do NOT apply.
> 
> On Fri, 11 Sep 1998, Mark Tucker wrote:
> >
> 
> Saying that BNF is weaker than types systems is equivalent to saying that 
> hammers are weaker than screwdrivers. They are not comparable. Grammars
> describes serialization syntax and the other describes a data model.
> 
> If "type systems" could replace serializations, then we wouldn't need 
> XML, would we? We'd just use Java's type system.
> 
> > So, we end up jumping through hoops to write DTD's to express DATA
> > which is very, very, very easily described in terms of modern
> > programming language type systems.  All the while, hearing a low chant:
> > "What kind of cretin are you? You don't want to *validate* your data! (shock)
> > You only want well-formed documents." -- NO and YES.  I don't care
> > if my document can be validated by a pitiful DTD.  I do care that 
> > it conform to a real type schema!
> 
> "Bang. Bang. Bang. I think I bent my screwdriver." I hate to let you 
> down, but when you serialize your data model into XML, all you have is 
> characters. Characters have to be verified according to the techniques that 
> God and Chomsky provided for verifying character streams: regular 
> languages, context free grammars, regular tree grammars, etc.
> 

You forgot to ask if he likes having his programs type-checked.  Ya gotta
lex and parse and build that AST before you can hope to type-check
something.  Different layers do different things.

Not only do people not recognize what parts of what they know don't apply,
but they seem to forget we they learned as well.

> 
> DTDs are much better than BNF. DTDs describe XML data. BNF describes a
> MUCH larger family of languages. If we were to use BNF, we would have to
> put constraints on the BNF that would make it almost identical to DTDs. 
> 
> Here's the ironic part: you are right that it should be possible to use 
> the same element type name in multiple contexts as long as it isn't 
> ambiguous (as in C). I have a proposal for an extension to DTDs (or 
> schemas) that would allow that.
> 
> The problem is, that when you try to combine this advanced facility with 
> type system-based proposals (e.g. inheritance, subtyping, etc.) 
> everything goes to hell. The irony is that it is people who are screaming 
> for "types" instead of lexical constraints who are *weakening* the 
> lexical constraints that would make DTDs (or schemas) closer in power to BNF.
> 
> Consider:
> 
> <FUNCTIONDEF><NAME>Foo</NAME><PAREN><ARGS/></PAREN>
> <BODY>
>     a=<PAREN>B+1</PAREN>
> </BODY>
> 
> What does it mean to "subclass" the PAREN element type when it is clearly 
> used in two different contexts with two different content models? The 
> answer: there is no PAREN type, really. There is a PAREN "tag" that can 
> be used in completely different ways in completely different contexts.
> 

Why would anyone put a paren around args?  Args is already a grouping
construct - paren is redundant there.  In the second case, wouldn't you
rather use <EXPRESSION> than <PAREN>?  It always seemed to me that the
elements of the DTD should sit at least one level above lexing, but PAREN
is something the lexer does away with.  And doesn't it seem that ARGS and
EXPRESSION are subclasses of a parent grouping element?

> In my opinion, you must THROW OUT the notion of type to make progress on 
> this front. Of course, you can then re-introduce the notion of type at 
> some higher level. But I think that we should make this lexical level 
> powerful enough to do everything we need it to do before we move on to 
> the type level.

Are you calling for the resurrection of SHORTREFS?  Content models should
ideally address the abstract syntax tree.  Lexical constraints address
content.  If you want to cross them, you need something like SHORTREFS (or
BNF).

Matthew Fuchs
matt@veosystems.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 14 23:31:36 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:45 2004
Subject: Proposition: "SGML is Gumming Up the Works"
Message-ID: <35FD8B29.8BF59644@locke.ccil.org>

matt@veosystems.com wrote:

> Are you calling for the resurrection of SHORTREFS?  Content models should
> ideally address the abstract syntax tree.  Lexical constraints address
> content.  If you want to cross them, you need something like SHORTREFS (or
> BNF).

I have called for a simple-minded kind of SHORTREF, to address the
fact that #PCDATA content has internal markup of a type-specific
kind: decimal points in numbers, colons and hyphens in dates, etc.
etc.

Coming soon: ShortRefFilter.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jamsden at us.ibm.com  Tue Sep 15 01:17:28 1998
From: jamsden at us.ibm.com (Jim Amsden)
Date: Mon Jun  7 17:04:45 2004
Subject: XML is boring (long --- sorry)
Message-ID: <5040100022444679000002L092*@MHS>

>I talk to a lot of journalists.  The #1 question I get is: "What will the
>impact of XML be from the user point of view?"  My sound-bite answer is
>"The web should look about the same, but work a lot faster.  And search
>engine results should get a lot better."

I would add that a lot more data from a lot more sources will become available,
and you'll be able to look at the same data in a variety of ways on a number of
different devices.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Tue Sep 15 03:50:06 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:45 2004
Subject: Proposition: "SGML is Gumming Up the Works"
In-Reply-To: <Pine.LNX.3.96.980914132832.6705C-100000@archimedes.veosystems.com>
Message-ID: <Pine.SUN.3.91.980914211854.24804A-100000@cito.uwaterloo.ca>

On 14 Sep 1998 matt@veosystems.com wrote:

> > What does it mean to "subclass" the PAREN element type when it is clearly 
> > used in two different contexts with two different content models? The 
> > answer: there is no PAREN type, really. There is a PAREN "tag" that can 
> > be used in completely different ways in completely different contexts.
> > 
> 
> Why would anyone put a paren around args?  Args is already a grouping
> construct - paren is redundant there.  In the second case, wouldn't you
> rather use <EXPRESSION> than <PAREN>?  It always seemed to me that the
> elements of the DTD should sit at least one level above lexing, but PAREN
> is something the lexer does away with.  And doesn't it seem that ARGS and
> EXPRESSION are subclasses of a parent grouping element?

I used PARENs to use an example of the same token being used for 
different things that people would be familiar with.

Ar ARGS and EXPRESSION logically subclasses of a parent grouping element? 
Sure, at some level. But they don't share a content model, and they don't 
necessarily share attributes, so at the tree validation level, they are 
not really related.

Tables and figures are also related as "block-level objects" (in many
DTDs), but also do not share a content model or attributes. This is why I
feel strongly that element type subclassing is quite different from
inheritance in documents, just as in OO. 

> Are you calling for the resurrection of SHORTREFS?  Content models should
> ideally address the abstract syntax tree.  Lexical constraints address
> content.  If you want to cross them, you need something like SHORTREFS (or
> BNF.

Sorry, I was speaking loosely. I'm more interested in constraints at the 
tree level than lexical constraints. But I don't see why you think that 
lexical constraints need something like SHORTREFS or BNF. What about 
regular expressions? What would be fundamentally wrong with something 
like this:

<!ELEMENT FOO (LHS,"=",RHS)>

 Paul Prescod

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Tue Sep 15 07:42:18 1998
From: ricko at allette.com.au (Rick Jelliffe)
Date: Mon Jun  7 17:04:45 2004
Subject: Namespaces Revisited...
References: <3.0.32.19980914102258.00d8bcb4@pop.intergate.bc.ca>
Message-ID: <35FDFFB0.30994062@allette.com.au>

Tim Bray �g�D�G

> At 11:13 AM 9/14/98 -0400, John Cowan wrote:
> >Unfortunately, we're stuck with it, until some ISO/IEC committee
> >can be persuaded to take up XML (there is already one trying
> >to create ISO HTML).

The ISO HTML is in fact already created. It was created in conjunctionwithHTML 4;
it is basically strict HTML 4 with additional position contraints so
that heading elements must be used to follow the ranking number
(H1, H2, H3, etc). From memory it has OBJECT, SCRIPT removed,
and no formating elements, and no frames.  It is suitable for very conservative
technical documentation.  The ISO people involved received full cooperation
from the W3C people, as far as I am aware.

Some organisations can only quote ISO or national standards as part of
their tendering requirements: ISO HTML was developed to allow such
organisations the benefit of HTML. Note that XML, even though it
is not an ISO standard, can be specified as part of tendering requirements:
ISO 8879 specifically has facilities to bring in subset specifications
like XML (i.e. the SEEALSO parameter, see ISO 8879 Annex L:
it is less than a full profile mechanism, which was felt to be overkill
in the light of SGML's existing "toolkit" customizability .)

On the issue of whether W3C is a standards body, I think W3C has been pretty
scrupulous to call their technologies "specifications" not "standards".
Every company who invents something that they think will be widespread
calls it a "standard", but it is useful to restrict the term to only those things

which have been through some broadbased, open procedure.  In practise,
some W3C efforts (e.g. XML) have been very broadbased and open; but
once there is widespread interest in a technology there needs to be some
vetting procedure to keep idiots out--a national standards body approach
like ISO uses is such a mechanism (and thereby falls open to the accusation
of becoming an "old boys clubs", which some say about IETF).

In the XML effort, I note that now national standards bodies can make
submissions to W3C concerning XML. There is a strong level of interaction
between ISO committees and W3C and industry consortia now (which you
can see from the recent CGM report  at www.w3.org/TR, so W3C
specifications are becoming more like standards.

> ISO/IEC WGs may be horribly slow and politics-ridden,
> >but they can't just *ignore* comments if they are properly submitted.
> >They *have* to process all of them.

But it is quite possible for a committee to stiffle comments, even ifthey are
good.  To call it "politics" is just to say that technologies
exist in the human world, where there are high stakes in the direction
of technology.

> I fully acknowledge that you disagree with the conclusion that the
> committee came to, but it is incorrect and damaging to allege that
> the process ignored the input.

I can certainly vouch for that. The namespaces discussion seemed
to take almost as long as the XML discussion did; along the way
many people changed their minds.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Tue Sep 15 11:04:16 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:45 2004
Subject: XSchema: Final Review (1-3, A, B, & D)
Message-ID: <199809150851.KAA15917@berlin.dvs1.tu-darmstadt.de>

The (hopefully) final version of sections 1-3 and appendices A, B, and D oof the XSchema specification is now available for review at:

   http://www.simonstl.com/xschema/spec/xscspecv2.htm

The review period lasts until 23 September 1998.  Unless any major questions are 
raised, we will freeze these sections of the spec.  Please send comments to 
XML-DEV or privately to me at rbourret@dvs1.informatik.tu-darmstadt.de.  (Simon 
is swamped trying to get a book out, so I have temporarily taken over editorship 
of the spec.)

I will try to post Section 4, XSchema Transformation to XML 1.0 DTD, Section 5, 
Connecting XSchemas to XML Documents, and Appendix C, XSchema in XSchema early 
next week.  Hopefully, these sections (which are largely usage discussions) 
won't prove too controversial and we can release XSchema 1.0 in the near future.

-- Ron Bourret


Significant Technical Changes in 9/16 version:
----------------------------------------------
* Added UnparsedEntity element.  This is needed to validate ENTITY attributes.

* Allowed AttGroup to be nested inside AttGroup.

* Removed the Name attribute from AttGroup.  (It is a container and therefore 
only needs an id.)

* Added the id, ns, and prefix attributes to AttDef.

Significant Editorial Changes in 9/16 version:
----------------------------------------------
* Revised section 1.3, Relation to Standards

* Clarified element references in attribute definitions (section 2.4, para. 2)

* Clarified attribute namespaces (section 2.4, para. 4)

* Moved discussion of id attributes to separate section (2.8)

* Clarified namespace use (sections 3.0, 3.1, and 3.2)

* Added appendices A (References) and D (Contributors)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rajnishb at gsslco.co.in  Tue Sep 15 11:53:21 1998
From: rajnishb at gsslco.co.in (Rajnish Bharti)
Date: Mon Jun  7 17:04:45 2004
Subject: Vector Markup Language 
Message-ID: <000601bde0dd$6d804900$130310ac@dionysus.pune.gsslco.co.in>

Hi !!
I, was wondering whether anyone of you is working on VML
I, have done some ground work in it and would like to share 
some info. with others.

~Rajnish


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Tue Sep 15 13:41:19 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:45 2004
Subject: W3C and the public (was Namespaces Revisited...)
In-Reply-To: <35FDFFB0.30994062@allette.com.au>
References: <3.0.32.19980914102258.00d8bcb4@pop.intergate.bc.ca>
Message-ID: <199809151141.HAA30747@hesketh.com>

Tim Bray wrote:
>For what it's worth, the XML process has been *extremely* open to 
>comments and considered, exhaustively and at great length, the issue
>of whether what we're trying to do with namespaces could be done
>better with architectural forms or some variation thereon.  [snipped]
>
>I fully acknowledge that you disagree with the conclusion that the 
>committee came to, but it is incorrect and damaging to allege that 
>the process ignored the input.

One area in which the W3C could do a lot to improve its openness and
therefore its process is by paying attention to the public portions of its
site.  The w3.org/XML page doesn't even _list_ the latest namespaces draft;
the 'highlights' at the top is all past and gone.  The last modified date
at the bottom is 9/11/98 - maybe someone fixed a typo.

The XSL project really achieved a new level of openness with 'features'
like an openly announced schedule.  Other projects might earn a lot more
goodwill from developers (especially the freeware communities that are
producing a lot of the implementations) if they would take a similar approach.

I shouldn't have to ask on XML-Dev what the W3C is doing - the list of
activities ought to be posted publicly.  I certainly shouldn't have to rely
on finetuning.com and sunsite.unc.edu/xml for links to the latest drafts.
And it would be really, really nice to have some idea if and when there are
ever going to be new drafts of XLink and XPointer.

Even better would be a page that mentions all the W3C activities devoted to
XML development, so I wouldn't have to figure out where they hid XSL...

So, what are you crazy folks up to?  Hopefully something interesting.  I'm
sure lots of us would like to provide input, if we can only find out what's
going on...


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Tue Sep 15 14:13:26 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:45 2004
Subject: JDBC and XML
In-Reply-To: <199809142027.AA25354@gateway.fedex.com>
Message-ID: <3.0.1.32.19980915140934.00d77cd0@ifi.uio.no>


(Please don't follow-up to both lists (xml-dev and XML-L) even though I
did here. I had to do it so people would see that this question has been
answered.)

* January Tsai
>
>Is there a JDBC driver for XML or something that works the same but named
differently?

XML is not a kind of relational database, so a JDBC driver for XML would not
make much sense. XML looks a bit like HTML, even though it does have something
in common with databases: the possibility store information in a structured
way.

XML does have "something that works the same but is named differently": SAX.
SAX is a common API for XML parsers and is supported by nearly all Java XML
parsers.

You can find it at:
<URL:http://www.megginson.com/SAX/>

There is also a higher-level API called DOM, which may also be suitable for
your purposes. You can find a list of DOM implementations at:

<URL:http://www.stud.ifi.uio.no/~larsga/linker/xmltools/by-standard.html#DOM>

I hope this helped you.
--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 15 16:33:26 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:45 2004
Subject: Namespaces Revisited...
References: <3.0.32.19980914102258.00d8bcb4@pop.intergate.bc.ca> <35FDFFB0.30994062@allette.com.au>
Message-ID: <35FE7AB2.B60F8E68@locke.ccil.org>

Rick Jelliffe wrote:

> The ISO HTML is in fact already created.

Ah, I didn't know it was an International Standard already.

> On the issue of whether W3C is a standards body, I think W3C has been pretty
> scrupulous to call their technologies "specifications" not "standards".

"Recommendations", no?

> But it is quite possible for a committee to stifle comments, even if they are
> good.  To call it "politics" is just to say that technologies
> exist in the human world, where there are high stakes in the direction
> of technology.

Granted.  But AFAIK the comments still have to be considered and
replied to, even if the reply amounts to "This idea is terminally
stupid".  They can't be burked entirely.  I am not accusing the
XML WG of doing anything of the sort, merely pointing out that
W3C committees *can* do that, and by the terms of their chartering
represent the consortium members, not the general public.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 15 16:39:01 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:45 2004
Subject: [ANN] Kludgey workarounds for IE and Netscape
References: <199809090522.BAA11695@ruby.ora.com> 				 <199809090121.SAA15093@mail-gw.pacbell.net> <v03102801b21c945aab3e@[203.23.215.128]> <35F7B89F.2E779FDB@jclark.com> <35F804CD.47CAAB14@locke.ccil.org> <35F8F3B3.4149A79A@jclark.com> <35F93D78.B8325124@locke.ccil.org> <upvcylilo.fsf@delivery.ansa.co.uk> <35FD5A2A.7D22F643@locke.ccil.org> <uhfy968gn.fsf@delivery.ansa.co.uk>
Message-ID: <35FE7C03.7CCAA4C@locke.ccil.org>

Toby Speight wrote:

> We were discussing what HTML elements looked like in XML.  AFAICS, to
> represent SCRIPT or STYLE in XML, we must use #PCDATA (since there's
> no CDATA in XML) and escape the content with entities or a marked
> section.  That way, the parsed result is the same whether we parse the
> HTML-XML with an XML parser, or the HTML-4.0 with a HTML parser.

Yes, certainly.  Going back to the original point, though, I was
trying to argue for a directly supported ability to write out
XSL-generated stuff as HTML, since the required hack for doing so
is much smaller than that required for Postscript, RTF, etc. etc.

Now that the distinction between "specifies its output as XML"
and "specifies XML as its output" (or whatever it is) has been
made, the point's moot for now: it will have to be revisited
when/if HTML becomes an XML subset.
 
> Of course, if we then rewrite it as HTML 4.0, we need to be aware that
> the output is CDATA (though there's no way in the general case to deal
> with ETAGO in the data).

In the general case, no, but workarounds exist for ECMAScript,
CSS, and VBScript.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From roddey at us.ibm.com  Tue Sep 15 19:53:26 1998
From: roddey at us.ibm.com (Dean Roddey)
Date: Mon Jun  7 17:04:45 2004
Subject: Opportunities for XML-DEV
Message-ID: <5030300025183911000002L012*@MHS>


>I have a 50,000 headword dictionary and thesaurus of English that can
>fairly easily be converted to a simple XML representation. Would
>availability of this text for research help stimulate application
>development? What sort of applications?
>Bob Parks
>

We would defininitely be interested in such a file for performance testing
purposes. Please mail me a copy or let me know if you post it anywhere. Thanks.

----------------------------------------
Dean Roddey
Software Weenie
IBM Center for Java Technology - Silicon Valley
roddey@us.ibm.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From RMcDouga at JetForm.com  Tue Sep 15 23:00:30 1998
From: RMcDouga at JetForm.com (Rob McDougall)
Date: Mon Jun  7 17:04:45 2004
Subject: W3C and the public (was Namespaces Revisited...)
Message-ID: <311000B0752ED211B61700805F0D6B0905C5FD@OTTMAIL3>

The original topic seemed to be based more on questioning why the current
draft spec is the way it is rather than questioning what the W3C is
currently doing.

My take on the problem is that I think the W3C could make public some of the
twisted paths the namespace specification took to get to its final
destination.  IMO, this is typically where the majority of concerns tend to
arise ("did you examine all the alternatives?", "why was this alternative
rejected?").  I would like to see all the W3C committees be required to
produce an annotated spec (similar to Tim's excellent annotated XML spec)
rather than just a flat specification.  This will allow newcomers to the
process to feel more confidence that the committee has examined all the
alternatives and understand why certain alternatives were chosen over
others.

Rob


-----Original Message-----
From: Simon St.Laurent [mailto:simonstl@simonstl.com]
Sent: Tuesday, September 15, 1998 7:43 AM
To: XML Dev
Subject: W3C and the public (was Namespaces Revisited...)

One area in which the W3C could do a lot to improve its openness and
therefore its process is by paying attention to the public portions of its
site.  The w3.org/XML page doesn't even _list_ the latest namespaces draft;
the 'highlights' at the top is all past and gone.  The last modified date
at the bottom is 9/11/98 - maybe someone fixed a typo.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eriblair at mediom.qc.ca  Tue Sep 15 23:34:27 1998
From: eriblair at mediom.qc.ca (Eric Riblair)
Date: Mon Jun  7 17:04:45 2004
Subject: Header of XML file
Message-ID: <199809152134.RAA27413@netra.mediom.qc.ca>

Can somebody explain me the parts of the header of an Xml file and some
examples (...because I need to configure it to an iso-latin use)

Thanks for any help
Eric


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From matt at veosystems.com  Wed Sep 16 00:00:37 1998
From: matt at veosystems.com (matt@veosystems.com)
Date: Mon Jun  7 17:04:45 2004
Subject: Proposition: "SGML is Gumming Up the Works"
In-Reply-To: <Pine.SUN.3.91.980914211854.24804A-100000@cito.uwaterloo.ca> from "Paul Prescod" at Sep 14, 98 09:42:48 pm
Message-ID: <19980915215957.15874.qmail@veosystems.com>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 3635 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980915/af3811c9/attachment.bat
From bckman at ix.netcom.com  Wed Sep 16 02:13:56 1998
From: bckman at ix.netcom.com (Frank Boumphrey)
Date: Mon Jun  7 17:04:46 2004
Subject: Opportunities for XML-DEV
Message-ID: <006301bde107$6963a2c0$9eacdccf@ix.netcom.com>

I today put up some XML files together with their DTD's at


http://www.hypermedic.com/style/xml/xmlindex.htm

This should give people an 'URI' to try their user-agents on!

Two of them (tempest.htm and nt.htm) are from Jon (to whom the XML community
owes many thanks) Bosak's Zip files which are also referenced at this URL.


Frank

Frank Boumphrey

XML and style sheet info at Http://www.hypermedic.com/style/index.htm
Author: - Professional Style Sheets for HTML and XML http://www.wrox.com

-----Original Message-----
From: Dean Roddey <roddey@us.ibm.com>
To: <xml-dev@ic.ac.uk>
Date: Tuesday, September 15, 1998 1:57 PM
Subject: Re: Opportunities for XML-DEV


>
>>I have a 50,000 headword dictionary and thesaurus of English that can
>>fairly easily be converted to a simple XML representation. Would
>>availability of this text for research help stimulate application
>>development? What sort of applications?
>>Bob Parks
>>
>
>We would defininitely be interested in such a file for performance testing
>purposes. Please mail me a copy or let me know if you post it anywhere.
Thanks.
>
>----------------------------------------
>Dean Roddey
>Software Weenie
>IBM Center for Java Technology - Silicon Valley
>roddey@us.ibm.com
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Usha_R2 at verifone.com  Wed Sep 16 14:23:02 1998
From: Usha_R2 at verifone.com (Usha_R2@verifone.com)
Date: Mon Jun  7 17:04:46 2004
Subject: SAX conferment parser
Message-ID: <7BA6E16CF180D111944700A0C9979DE51D4F80@blr-nt-mail2.verifone.com>

Hi! All,
  I want to use the SAX method for parsing my XML files. Can anybody
please tell me which is the best SAX conferment parser written in Java.
I want the parser to be ONLY a SAX conferment parser i.e. it should not
be both DOM & SAX conferment. I need this since for my application size
is very important issue.


Thanks in advance.
Usha,
K. Usha Rani
 
Dept        : Applications
Phone    : (080) - 2869920
Email     :  usha_r2@verifone.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 16 15:36:47 1998
From: david at megginson.com (David Megginson)
Date: Mon Jun  7 17:04:46 2004
Subject: There is no *best* SAX-conformant parser, but...
In-Reply-To: <7BA6E16CF180D111944700A0C9979DE51D4F80@blr-nt-mail2.verifone.com>
References: <7BA6E16CF180D111944700A0C9979DE51D4F80@blr-nt-mail2.verifone.com>
Message-ID: <199809161335.JAA01012@unready.megginson.com>

Usha_R2@verifone.com writes:

 >  I want to use the SAX method for parsing my XML files.  Can
 > anybody please tell me which is the best SAX conferment parser
 > written in Java.  I want the parser to be ONLY a SAX conferment
 > parser i.e. it should not be both DOM & SAX conferment. I need this
 > since for my application size is very important issue.

There is no single best parser, because every Java-based parser can be
measured on at least six different axes:

1. Size
2. Speed
3. Feature set (i.e. external entity support, extra character encodings)
4. Error reporting (and overall conformance)
5. Legacy-browser compatibility (i.e. Netscape 3.0)
6. Licensing policy (free, GPL, non-commercial only, etc.)

Since you're worried about size, Microstar's AElfred
(www.microstar.com) is probably your best choice, since it weighs in
at only about 26K uncompressed, or around 14K in a compressed JAR.
I'd recommend that you always develop with more than one parser,
however -- with SAX, it's easy to swap parsers, and you can take
advantage of a parser with fully-conformant error reporting (like
James Clark's XP) for verification before you distribute to the
clients using AElfred: AElfred will always accept correct XML, but it
will not always tell you when something's wrong.

Isn't it nice to have too many tools to choose from, when the life of
the XML 1.0 REC can still be measured in months?


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From grove at infotek.no  Wed Sep 16 16:58:27 1998
From: grove at infotek.no (Geir Ove Gronmo)
Date: Mon Jun  7 17:04:46 2004
Subject: xmlarch: Version 0.11 released
Message-ID: <199809161457.QAA15246@mail.infotek.no>


xmlarch.py: An XML architectural forms processor written in Python

Version:  0.11
Author:   Geir Ove Gr?nmo
Email:    grove@infotek.no
Released: September 15th 1998

Homepage: http://www.infotek.no/~grove/software/xmlarch/index.html

---

What is xmlarch.py?

The xmlarch.py module contains an XML architectural forms processor written 
in Python. It allows you to process XML architectural forms using any 
parser that uses the SAX interfaces. The module allow you to process 
several architectures in one parse-pass. Architectural document events 
for an architecture can even be broadcasted to multiple DocumentHandlers. 

What's new?

There are no new features in this release. The module should now be placed 
in the xml.arch package. The demo tools have been updated to support the 
new package structure.

Problem with <?IS10744 arch ...?> not being recognized as an architecture 
use declaration is now fixed. Now both <?IS10744:arch ...?> and 
<?IS10744 arch ...?> are supported.

get_bridge_form() was called get_bridge_elem_form() a couple of places. This 
is now fixed.

---

Enjoy!

Geir Ove Gr?nmo


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 16 18:30:55 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:46 2004
Subject: ANN: DOMParser
Message-ID: <35FFE7C6.E5034DAB@locke.ccil.org>

A preliminary version of DOMParser is now available at my XML page
(http://www.ccil.org/~cowan/XML).  DOMParser is a compliant SAX
parser, except that its input comes from a DOM implementation
rather than an InputSource.  An additional method, parse(Document d),
is introduced to parse DOM Document objects.  The standard parse()
methods throw errors.  (Perhaps they should look for a real parser
and a DOM implementation to try to create a DOM Document instead?)

A demo program is also available, based on the SAX ByteStreamDemo
and using the Docuverse DOM SDK as the DOM.  Its output is identical
to that of ByteStreamDemo using Aelfred 1.2 as the parser, except
that it does not try to resolve a SystemId or set a Locator
(Locators making no sense when there is no source text).

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep 16 18:51:09 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:46 2004
Subject: SAX conferment parser
References: <7BA6E16CF180D111944700A0C9979DE51D4F80@blr-nt-mail2.verifone.com>
Message-ID: <35FFECA6.2C26DC6E@infinet.com>

Usha_R2@verifone.com wrote:

> Hi! All,
>   I want to use the SAX method for parsing my XML files. Can anybody
> please tell me which is the best SAX conferment parser written in Java.
> I want the parser to be ONLY a SAX conferment parser i.e. it should not
> be both DOM & SAX conferment. I need this since for my application size
> is very important issue.
>
> Thanks in advance.
> Usha,
> K. Usha Rani

Your best bet would be to download several parsers and test them out.  Here is
the source to a SAX Timer Test I have made up for my parser (not yet released for
various non-technical reasons).  From what I have seen, Aelfred is the smallest
of the fully conformant SAX related parsers in terms of bytecodes, but XP is the
fastest in terms of speed.  In terms of memory usage, you will be much better off
with any streaming based-parser like Aelfred or XP than a tree-based parser like
IBM's XML for Java (another SAX compliant parser).

BTW, the following code takes one or two arguments.  The first argument is a URL
to the XML file you want to parse.  The second argument is the number of times
you want to parse the file.   I recommend parsing a file at least 4 times when
using a JIT because the first pass of each method is not compiled to native form
(it is interpreted) and significant time is spent in the JIT compiling the
bytecodes to native form.  Once this happens at least once, the real speed of the
parser can be assessed.

Usage would be something like:

java -Dorg.xml.sax.parser=SAX_DRIVER_CLASS_NAME Benchmark file:/foo/bar.xml 5

// Begin
import java.io.*;

import org.xml.sax.*;
import org.xml.sax.helpers.*;

public class Benchmark {
  public static void main(String[] args) {
    try {
      Runtime rt = Runtime.getRuntime();
      int length = (args.length >= 2) ? Integer.parseInt(args[1]) : 1;

      long begin, end;
      long free;
      String input = args[0];

      HandlerBase handler = new HandlerBase();
      Parser parser;
      for (int i = 0; i < length; i++) {
        parser = ParserFactory.makeParser();
        parser.setEntityResolver(handler);
        parser.setDTDHandler(handler);
        parser.setDocumentHandler(handler);
        parser.setErrorHandler(handler);
        System.gc();
        System.gc();
        System.gc();
        free = rt.freeMemory();
        begin = System.currentTimeMillis();
        parser.parse(input);
        end = System.currentTimeMillis();
        System.out.println("Parsing Time: "  + (end - begin));
        System.out.println("Memory Used: "  + (free - rt.freeMemory()));
        System.out.println();
      }
    } catch (Exception e) {
      e.printStackTrace(System.out);
    }
  }
}
// End

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Mike_Spreitzer.PARC at xerox.com  Wed Sep 16 20:49:49 1998
From: Mike_Spreitzer.PARC at xerox.com (Mike_Spreitzer.PARC@xerox.com)
Date: Mon Jun  7 17:04:46 2004
Subject: What is a "Public Identifier"?
Message-ID: <98Sep16.114919pdt."56386(3)"@alpha.xerox.com>

I'm wondering if there is any standardization of what's in a "public
identifier" and/or of how one is resolved to whatever it refers to.  The XML
1.0 spec has very little to say about this (unless I've overlooked something)
--- it only says that white space should be normalized before attempting to
resolve.  Yet there is a lot of regularity in the examples I see.  I've never
seen anything specifying the structure used.  What am I missing?

Thanks,
Mike

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eriblair at mediom.qc.ca  Wed Sep 16 21:19:31 1998
From: eriblair at mediom.qc.ca (Eric Riblair)
Date: Mon Jun  7 17:04:46 2004
Subject: ISO-LATIN and MSXML ...
Message-ID: <199809161918.PAA20897@netra.mediom.qc.ca>

For anyone interested ...

You can eliminate the ISO-LATIN problem with MSXML ... when you use the
numeric entity of the latin character (ex: ?) not the names entity (ex:
&eacute;) in the xml file. And the xml header (<?xml version="1.0"
encoding="ISO-8859-1"?>) is not necessary when you use it in an applet in
Jscript in a HTML file.

Regards,
Eric


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From epalma at fsaa.ulaval.ca  Wed Sep 16 21:30:36 1998
From: epalma at fsaa.ulaval.ca (...................................La maquina)
Date: Mon Jun  7 17:04:46 2004
Subject: ISO-LATIN and MSXML ...
In-Reply-To: <199809161918.PAA20897@netra.mediom.qc.ca>
Message-ID: <199809161952.PAA02561@cerberus.ulaval.ca>


Thanks Eric...
very interesting
You are really good...
:-)))
La maquina


At 08:19 PM 9/16/98 , Eric Riblair wrote:
>For anyone interested ...
>
>You can eliminate the ISO-LATIN problem with MSXML ... when you use the
>numeric entity of the latin character (ex: ?) not the names entity (ex:
>&eacute;) in the xml file. And the xml header (<?xml version="1.0"
>encoding="ISO-8859-1"?>) is not necessary when you use it in an applet in
>Jscript in a HTML file.
>
>Regards,
>Eric

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 00:02:35 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:46 2004
Subject: ANN: DOMParser
References: <35FFE7C6.E5034DAB@locke.ccil.org>
Message-ID: <360035CE.3424E342@infinet.com>

John Cowan wrote:

> A preliminary version of DOMParser is now available at my XML page
> (http://www.ccil.org/~cowan/XML).  DOMParser is a compliant SAX
> parser, except that its input comes from a DOM implementation
> rather than an InputSource.  An additional method, parse(Document d),
> is introduced to parse DOM Document objects.  The standard parse()
> methods throw errors.  (Perhaps they should look for a real parser
> and a DOM implementation to try to create a DOM Document instead?)
>
> A demo program is also available, based on the SAX ByteStreamDemo
> and using the Docuverse DOM SDK as the DOM.  Its output is identical
> to that of ByteStreamDemo using Aelfred 1.2 as the parser, except
> that it does not try to resolve a SystemId or set a Locator
> (Locators making no sense when there is no source text).

Wouldn't this be better titled as a DOM Builder and a DOM Writer?

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From hyoung at fiz.huji.ac.il  Thu Sep 17 00:11:36 1998
From: hyoung at fiz.huji.ac.il (Hyoungsoo Yoon)
Date: Mon Jun  7 17:04:46 2004
Subject: What's the Difference between ExceptionCode and Node Type in DOM spec?
Message-ID: <360037ED.D1525A52@fiz.huji.ac.il>

Hi everybody,

ExceptionCode is declared as enumerator, whereas
node types are simply defined constants in the level 1 DOM spec.
I would expect node types also to be enumerators, or else
just define ExceptionCode as constants for consistency.

Is there a reason why these two variables are declared differently?
Thanks, youngsoo

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Thu Sep 17 01:57:01 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:46 2004
Subject: OO Schemas
In-Reply-To: <19980915215957.15874.qmail@veosystems.com>
Message-ID: <Pine.SUN.3.91.980916184904.14370B-100000@cito.uwaterloo.ca>

On 15 Sep 1998 matt@veosystems.com wrote:

> Don't you mean "In my mind's eye they don't share a content model"?
> You are making statements about the characteristics of systems you
> haven't seen yet (OO Schemas or whatever) and certainly haven't used.
> Once you _can_ declare them subclasses of a parent grouping element,
> you might find you start doing things differently.  You might not, but
> you can't really make such statements until we've got some proposals
> on the table.

The real question is what makes two elements have the same type? In XML,
two elements have the same type if they share a GI. Since elements may or
may not have a DTD, they may or not share a content model or attributes. 
In SGML, we know that two elements with the same GI can have different
effective content models (exceptions) and different allowed attributes
(CONREF). We also know that people use these features (especially 
exceptions, which are used even in HTML). We also know that elements with 
the same GI can have different semantics and behaviours (especially 
rendering behaviour...consider titles in sections vs. titles of chapters!).

So in both SGML and XML, the fundamental thing that "binds" two elements 
to the same type is the GI. What's interesting is that this isn't the 
case with other languages. That was the point of the PAREN example. In 
C++, the "(" or "=" tokens can mean radically different things in 
different contexts (to say nothing of "const"). The advisability of 
reusing tokens is another issue, of course. (that's what an XML GI is, 
BTW, a tree-level token).

> I know your opinion here.  But inheritance is just a subset of
> subclass relationships (subclass is an as-a relationship, inheritance
> is an is-a relationship, and all is-a relationship are also as-a
> relationships).

I don't think that there is anything in the word "inheritance" that 
implies an is-a relationship, though I agree that sometimes it is used 
that way. As the OO FAQ says:

"Defining inheritance (with a thorough description or denotational 
semantic definition, or both) can avoid confusion about which inheritance 
scheme is being used (especially in OOD), because inheritance has many 
variations and combinations of state and environment (sometimes with 
complex rules). Inheritance can also be used for typing, where a type or 
class can be used to specify required attributes of a matching object 
(see sections 2.1, 2.7 and [Cardelli 85]). It would be more judicious to 
have discussions on how inheritance should be defined instead of over 
what it is, since it has many existing uses and semantics."

So let me do so:

XML element types have three interesting properties: content models,
attributes and GIs. So to me, "inheritance" between element types would be
about borrowing some or all of another element types content model,
attributes or GIs.

Subclassing, on the other hand, would be about having an element of one 
type "play the part of" an element of another type, such as a 
cross-reference "playing" an XLink.

> > <!ELEMENT FOO (LHS,"=",RHS)>
> > 
> 
> Datatag?
> 
> What's wrong is it doesn't identify the "=" as an operator, so either
> you know it's an = sign by default, in which case it's redundant,
> or you have an expression but no way to know what the operator is.
> You are mixing levels - you've got parsing and lexing mixed, which is
> what made SGML so twisted.

It has nothing to do with either parsing OR lexing. It simply says that 
the node between the LHS element node and the RHS element node must be 
the data character "=". This simple case could only be a mmenonmic for a 
human author. But you could also do this:

<!ELEMENT FOO (LHS,"=",OP1,("+"|"-"|"/"|"*"),OP2)>

Again, there is no parsing or lexing involved. Instead of asking if the 
node between the OPs has a GI property of "PLUS" or "MULT", you'd ask if the 
char property of the datachar node is "+" or "*".

 Paul Prescod

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From matt at veosystems.com  Thu Sep 17 02:26:49 1998
From: matt at veosystems.com (matt@veosystems.com)
Date: Mon Jun  7 17:04:46 2004
Subject: OO Schemas
In-Reply-To: <Pine.SUN.3.91.980916184904.14370B-100000@cito.uwaterloo.ca> from "Paul Prescod" at Sep 16, 98 07:49:04 pm
Message-ID: <19980917002535.9401.qmail@veosystems.com>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 2214 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980917/9d3bbdd0/attachment.bat
From dent at highway1.com.au  Thu Sep 17 03:42:48 1998
From: dent at highway1.com.au (Andy Dent)
Date: Mon Jun  7 17:04:46 2004
Subject: OO Schemas
In-Reply-To: <Pine.SUN.3.91.980916184904.14370B-100000@cito.uwaterloo.ca>
References: <19980915215957.15874.qmail@veosystems.com>
Message-ID: <v04011701b22618579c87@[203.23.215.42]>

At 7:49 AM +0800 17/9/98, Paul Prescod wrote:
>So to me, "inheritance" between element types would be
...
>
>Subclassing, on the other hand
I think it is extremely dangerous to have different definitions for
subclassing vs inheritance. The chance of confusion is way too high. Please
use two terms which are less likely to be mixed in communication.

A somewhat relevant anecdote - for a year or more I argued that
'aggregation' and 'association' were bad choices of terms in the growing
UML standard, due to the chance of confusion. Nobody seemed to get my
point, until another discussion on the mailing list led me to realize that
Americans prononounce these words with very different leading syllables.
Thus the verbal similarity that I perceived (as an Australian of English
background) was not there.
Andy Dent BSc MACS AACM, Software Designer, A.D. Software, Western Australia
OOFILE - Database, Reports, Graphs, GUI for c++ on Mac, Unix & Windows
PP2MFC - PowerPlant->MFC portability
http://www.highway1.com.au/adsoftware/crossplatform.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Thu Sep 17 04:24:55 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:47 2004
Subject: OO Schemas
In-Reply-To: <19980917002535.9401.qmail@veosystems.com>
Message-ID: <Pine.SUN.3.91.980916212142.14370D-100000@cito.uwaterloo.ca>

On 16 Sep 1998 matt@veosystems.com wrote:
> 
> > I don't think that there is anything in the word "inheritance" that 
> > implies an is-a relationship, though I agree that sometimes it is used 
> > that way. 
> > 
> 
> "Sometimes" meaning rarely, or "sometimes" meaning almost always, but
> I'm not ready to concede the point? (I, too, can split hairs!  ;-)

"Sometimes" meaning "usually, even though it obfuscates, like using 'tag' 
to mean element type, GI, attribute and tag'." Another way to express the 
difference is "type inheritance" vs. "implementation inheritance".

http://www.neurop2.ruhr-uni-bochum.de/personal/cozzi/sather-style_toc.html#SEC42

I don't like that terminology, because it perpetuates the idea that they 
are the same thing.

> Inheritance and subclassing are both about substitutability. 

How is C++ private inheritance (for example) about substitutability.

http://hpsalo.cern.ch/TaligentDocs/TaligentOnline/DocumentRoot/1.0/Docs/books/WM/WM_23.html

> I think
> it would be fair to say that inheritance has _almost always_ been
> associated with wholesale borrowing of the structure of the thing
> being inherited from, either through copy or reference (through
> delegation).  

I agree. Borrowing of structure, not necessarily interface.

> The exceptions, like C++'s private inheritance, are at
> the margins.  In what language that you are aware of does inheritance
> or subclassing not imply substitutability?

Well, you mentioned C++, but another is Sather. Dynamically typed OO
languages usually allow the distinction also. In Python, Smalltalk, 
etc., interfaces are described implicitly. In Python, at least, you can 
inherit and yet violate the interface of your superclass by deleting 
methods. JavaScript is the same. I don't know if there is a way to do it 
in Smalltalk.

Here's info on Sather:

"Separate Implementation and Type Inheritance 
In most object-oriented languages inheritance both defines the subtype 
relation and causes the descendant to use an implementation provided by 
the ancestor. These are quite different notions and confounding them 
often causes semantic problems. For example, one reason why Eiffel's type 
system is not statically checkable is that it mandates "covariant" 
conformance for routine argument types (Meyer, 1992). This means that a 
routine in a descendant must have argument types which are subtypes of 
the corresponding argument types in the ancestor. Because of this choice, 
the compiler cannot ensure argument expressions conform to the argument 
type of the called routine at compile time. In Sather, inheritance from 
abstract classes defines subtyping while inheritance from other classes 
is used solely for implementation inheritance. This allows Sather to use 
the statically type-safe contravariant rule for routine argument 
conformance."


> > XML element types have three interesting properties: content models,
> > attributes and GIs. So to me, "inheritance" between element types would be
> > about borrowing some or all of another element types content model,
> > attributes or GIs.
> > 
> 
> If you only borrow some, you lose substitutability.  

That isn't true. For instance, you can clearly supply your own content 
model without disturbing substitutability, as long as it is a compatible 
content model. The model can describe a sub-language, or it can describe 
a superclass language that can be converted into some sublanguage.

>If you want to
> borrow without substitution, just use pe's.  You haven't gained
> anything semantically interesting.

Structured, portable, explicit code (declaration) reuse is not
interesting? I strongly agree that "implementation" inheritance is not as
interesting as type inheritance in markup languages. But I think that
clearly "expressing intent" with respect to content model reuse is also
important. 

Note that I think that this "implementation inheritance" is
what most people mean when they say that XML "really needs inheritance."
It is a demonstration of your clear understanding of the issues that you
recognize that there is a much bigger picture than declaration reuse. 

If you, or anyone else, proposes a schema language with subtyping but 
not inheritance, I would probably support it if it was done right. If 
someone proposed the opposite, however, I would argue that it is not much 
of an improvement.

> > Subclassing, on the other hand, would be about having an element of one 
> > type "play the part of" an element of another type, such as a 
> > cross-reference "playing" an XLink.
> > 
> 
> Inheritance means playing the part of the element you are inheriting
> from (with a nod towards awkward exceptions at the margin).

Type inheritance, yes. Inheritance "in general", no. The word inheritance 
is vague and any property of one type could be inherited by another if a 
language allows it. There is no reason that all of the various types of 
inheritance should be tied together just because that is often (but not 
always) useful. It costs nothing to separate them, and buys power and 
expressiveness: the ability to specify intent.

 Paul Prescod

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Thu Sep 17 04:48:57 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:47 2004
Subject: OO Schemas
In-Reply-To: <v04011701b22618579c87@[203.23.215.42]>
Message-ID: <Pine.SUN.3.91.980916221900.14370E-100000@cito.uwaterloo.ca>

On Thu, 17 Sep 1998, Andy Dent wrote:

> At 7:49 AM +0800 17/9/98, Paul Prescod wrote: > >So to me, "inheritance"
between element types would be > ... > > > >Subclassing, on the other hand
> I think it is extremely dangerous to have different definitions for >
subclassing vs inheritance. The chance of confusion is way too high.
Please > use two terms which are less likely to be mixed in communication. 
The fact that theAt 7:49 AM +0800 17/9/98, Paul Prescod wrote:
> >So to me, "inheritance" between element types would be
> ...
> >
> >Subclassing, on the other hand
> I think it is extremely dangerous to have different definitions for
> subclassing vs inheritance. The chance of confusion is way too high. Please
> use two terms which are less likely to be mixed in communication.

The difference in meanings is inherent. The word subtype (perhaps not 
subclass) is very formally defined. The word inherit is perhaps not 
formally defined, but has a very common English-language meaning. There 
is essentially no relationship between those meanings. I don't "inherit" 
my type from my father when he dies. I get "stuff" from him.

It is worth noting that Simula did not call its "class extensions" 
inheritance. 

http://www-leland.stanford.edu/class/cs242/outlines-95/simula.html

I presume that Smalltalk was what changed the names. In Smalltalk, the
difference between inheritance and subclassing would be more or less
irrelevant. In statically typed languages (like XML) the difference is
huge. 

 Paul Prescod


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 07:46:19 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
Message-ID: <3600A279.AC87AF1E@infinet.com>

One thing that is slightly confusing with select patterns is selecting
elements containing parent anchors.

For example, say I have:

<xsl:template match="book">
  <fo:block>
    <xsl:process select="../../../heading"/>
  </fo:block>
</xsl:template>

>From what I understand, this would say first go to the third parent node
of the current node and select all heading elements and process them.
This would seem to be an error since another template may already have
processed these heading elements.

Match patterns seem to be pretty straightforward in how you use them as
all you need to really do is start at the right-most pattern component
and work left.  If everything matches up then finally make sure that the
anchor matches up with the parent of the node that matched the left most
pattern.  If everything still holds, then apply the template rule to
this particular element in the source tree when spitting out the result
tree.

Now select patterns it seems from first glance that you would instead
start from the current node and work left to right instead of right to
left as in the case of match patterns.  Essentially, you would start
from the current node and recursively process all of the descendants
that end up matching the ancestry pattern from left to right.  For
ancestry patterns that do not contain an immediate ancestor operator
this process would be rather cheap.

But in the above example, what do you do when relative or absolute
anchors withing select patterns anchor a node which is an ancestor of
the current node in context.  In this case, it seems as if you can have
multiple template rules acting on the same elements.

Another question is what to do with Absolute Anchors.  I would think
that for select patterns it would not make sense for this to be allowed
as the entire template match then has nothing to do with the actual
processing.  For example if I match a particular element and then select
a set of nodes who are anchored at the root, this would be like doing
global processing independent of the match argument.

I am sure many of these questions are things which I do not understand
due to my relative inexperience with stylesheet languages (most people
are probably also in this boat) and the fact that the most recent spec
is very much incomplete, but it would be nice to know if these are
errors overlooked in the spec, or just my complete misunderstandings on
these issues.

Regards,

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From liamquin at interlog.com  Thu Sep 17 08:45:18 1998
From: liamquin at interlog.com (Liam R. E. Quin)
Date: Mon Jun  7 17:04:47 2004
Subject: Undeclared elements error
In-Reply-To: <Pine.SUN.3.91.980916184904.14370B-100000@cito.uwaterloo.ca>
Message-ID: <Pine.BSI.3.96r.980917023936.16901A-100000@shell1.interlog.com>

A quick question about error messages...

The XML spec says (in section 3.2)
    An element type declaration constrains the element's content.

    Element type declarations often constrain which types can appear
    as children of the element.

    At user option, an XML processor may issue a warning when a
    declaration mentions an element type for which no declaration
    is provided, but this is not an error.

Should the following produce a warning?

    <!ELEMENT CHAPTER (LOCATION,DESCRIPTION)>
    <!--* error: LOCATION has not been declared *-->
    <!--* error: DESCRIPTION has not been declared *-->

    <!ELEMENT LOCATION (#PCDATA)>
    <!ELEMENT DESCRIPTION (#PCDATA)>

It seems to me that the intended interpretation is that the warning should
only be issued after all declarations have been processed, but this
is nowhere stated, and the interpretation suggested by my comments
above appears to be legal.

You could say that market forces would eliminate any such XML processor,
I suppose.

Has anyone implemented this warning/

Lee

-- 
Liam Quin, GroveWare Inc., Toronto;  The barefoot agitator
l i a m q u i n     at    i n t e r l o g    dot   c o m


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Thu Sep 17 08:55:08 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
References: <3600A279.AC87AF1E@infinet.com>
Message-ID: <3600A5F6.BAE81794@jclark.com>

Tyler Baker wrote:
> 
> One thing that is slightly confusing with select patterns is selecting
> elements containing parent anchors.
> 
> For example, say I have:
> 
> <xsl:template match="book">
>   <fo:block>
>     <xsl:process select="../../../heading"/>
>   </fo:block>
> </xsl:template>
> 
> >From what I understand, this would say first go to the third parent node
> of the current node

ie the grandparent's parent.

> and select all heading elements and process them.

ie all heading children of the grandparent's parent

> This would seem to be an error since another template may already have
> processed these heading elements.

It would only be an error if they had already been processed (because
that would get you in a loop).

> Match patterns seem to be pretty straightforward in how you use them as
> all you need to really do is start at the right-most pattern component
> and work left.  If everything matches up then finally make sure that the
> anchor matches up with the parent of the node that matched the left most
> pattern.  If everything still holds, then apply the template rule to
> this particular element in the source tree when spitting out the result
> tree.

Only if the element was selected for processing by an xsl:process or
xsl:process-children.

> Now select patterns it seems from first glance that you would instead
> start from the current node and work left to right instead of right to
> left as in the case of match patterns.  Essentially, you would start
> from the current node and recursively process all of the descendants
> that end up matching the ancestry pattern from left to right.

A variety of strategies are possible.  For example, if you have

  select="foo|bar"

you can walk the children and process those which are of type foo or
bar.  If you have

  select="foo/bar|foo/baz"

you can walk the children and then for each child that's of type foo
walk its children and process those which are of type bar or baz.  If
you have

 select=".//foo"

you could walk all descendant elements and process those that are of
type foo.

>  For
> ancestry patterns that do not contain an immediate ancestor operator
> this process would be rather cheap.
> 
> But in the above example, what do you do when relative or absolute
> anchors withing select patterns anchor a node which is an ancestor of
> the current node in context.  In this case, it seems as if you can have
> multiple template rules acting on the same elements.

Huh?

> Another question is what to do with Absolute Anchors.  I would think
> that for select patterns it would not make sense for this to be allowed
> as the entire template match then has nothing to do with the actual
> processing. 

Not so.  Typically you would be using document level information in the
processing of some element.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Thu Sep 17 09:14:18 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:47 2004
Subject: Undeclared elements error
References: <Pine.BSI.3.96r.980917023936.16901A-100000@shell1.interlog.com>
Message-ID: <3600B27C.10579B57@jclark.com>

Liam R. E. Quin wrote:

> Should the following produce a warning?
> 
>     <!ELEMENT CHAPTER (LOCATION,DESCRIPTION)>
>     <!--* error: LOCATION has not been declared *-->
>     <!--* error: DESCRIPTION has not been declared *-->
> 
>     <!ELEMENT LOCATION (#PCDATA)>
>     <!ELEMENT DESCRIPTION (#PCDATA)>

Absolutely not.

> It seems to me that the intended interpretation is that the warning should
> only be issued after all declarations have been processed,

Right.

> but this
> is nowhere stated, and the interpretation suggested by my comments
> above appears to be legal.

A processor can issue any warnings it wants, but it's ridiculous to
interpret the XML spec as recommending something so stupid.  Since you
can have mutually recursive content models, it wouldn't even always be
possible to reorder the declarations in a DTD so as to avoid such a
warning.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 10:27:17 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
References: <3600A279.AC87AF1E@infinet.com> <3600A5F6.BAE81794@jclark.com>
Message-ID: <3600C7FB.278083C8@infinet.com>

James Clark wrote:

> Tyler Baker wrote:
> >
> > One thing that is slightly confusing with select patterns is selecting
> > elements containing parent anchors.
> >
> > For example, say I have:
> >
> > <xsl:template match="book">
> >   <fo:block>
> >     <xsl:process select="../../../heading"/>
> >   </fo:block>
> > </xsl:template>
> >
> > >From what I understand, this would say first go to the third parent node
> > of the current node
>
> ie the grandparent's parent.
>
> > and select all heading elements and process them.
>
> ie all heading children of the grandparent's parent
>
> > This would seem to be an error since another template may already have
> > processed these heading elements.
>
> It would only be an error if they had already been processed (because
> that would get you in a loop).

In this case, I guess that all Nodes in the source tree would need to be flagged
whenever they have been directly processed.  If you encounter a Node that has
been flagged to be invalid (i.e. it has already been processed), then throw an
error.  Am I right in assuming that to conform to the spec you would either have
to maintain this flag value in a special purpose element node, or else have a
list of processed element nodes maintained in the stylesheet (this would seem
like the inefficient solution).

All of this might be useful info for DOM implementors as they might provide a
special flag integer in each element which can have multiple flags optionally set
to it for the benefit of technologies like XSL.

> > Match patterns seem to be pretty straightforward in how you use them as
> > all you need to really do is start at the right-most pattern component
> > and work left.  If everything matches up then finally make sure that the
> > anchor matches up with the parent of the node that matched the left most
> > pattern.  If everything still holds, then apply the template rule to
> > this particular element in the source tree when spitting out the result
> > tree.
>
> Only if the element was selected for processing by an xsl:process or
> xsl:process-children.

Sorry, I was assuming the default template rule applied in the context I was
referring to.  I guess I was not clear here.

> > Now select patterns it seems from first glance that you would instead
> > start from the current node and work left to right instead of right to
> > left as in the case of match patterns.  Essentially, you would start
> > from the current node and recursively process all of the descendants
> > that end up matching the ancestry pattern from left to right.
>
> A variety of strategies are possible.  For example, if you have
>
>   select="foo|bar"

Well for complex OrPatterns I would think this does not work too well.  I
basically just break them up into a list of AncestryPatterns.  For OrPatterns
right now in templates I just clone the template for each additional
AncestryPattern in the OrPattern.

> you can walk the children and process those which are of type foo or
> bar.  If you have
>
>   select="foo/bar|foo/baz"
>
> you can walk the children and then for each child that's of type foo
> walk its children and process those which are of type bar or baz.  If
> you have
>
>  select=".//foo"
>
> you could walk all descendant elements and process those that are of
> type foo.

This was the non-cheap traversal I was referring to.  Someone who used XT said
that for a 2K file and a 2K spreadsheet it was taking them 20 seconds or
something ridiculous to write out the output.  Well considering that XT is only a
reference implementation and that from a quick look see of XT it looked like
probably most of the processing time is spent in String creation with
String.substring(), etc.  I told this person that if Mr. Clark really spent a lot
of time trying to whip this into a commercial product, he would likely find his
processing time less than a second.

Nevertheless, the biggest thing I worry about with XSL is the possible runtime (I
am referring to O Notation) of the various pattern searches that can be
conducted.  For large documents, patterns which frequently use the ancestor
operator can obviously become very expensive.  Efficient indexing of the source
tree among other optimizations can significantly decrease some of these search
times, but it is a real worry to me that a client's expectations of XSL's
processing capabilities when presented with large source trees and complex
stylesheets are greater than reality.  It would not be good for me or any other
person currently involved in XSL software to have to explain to clients that
their HTML layout should be restricted to the look and feel of HTML 2.0 web pages
simply because a high-level of complexity will bring browsers or server-side XSL
Processors to a crawl.

> >  For
> > ancestry patterns that do not contain an immediate ancestor operator
> > this process would be rather cheap.
> >
> > But in the above example, what do you do when relative or absolute
> > anchors withing select patterns anchor a node which is an ancestor of
> > the current node in context.  In this case, it seems as if you can have
> > multiple template rules acting on the same elements.
>
> Huh?

I was referring basicly back to my previous comment about being able to "select"
ancestors (instead of just descendants) of the current node in context.  I guess
I can sort of understand now how this can be useful (say you want to reinsert a
title that may be the first node in the tree) now.  I know this sort of question
may have been a bit immature, but I like many other people are trying to first
understand XSL and how it can be creatively applied in ways that do not just
involve processing XML to HTML.


> > Another question is what to do with Absolute Anchors.  I would think
> > that for select patterns it would not make sense for this to be allowed
> > as the entire template match then has nothing to do with the actual
> > processing.
>
> Not so.  Typically you would be using document level information in the
> processing of some element.

I can see what you are saying now and I think I see the light (-:

Thanx very much for this reply as it has helped me personally understand some of
these questions a lot better and has also shed some light on how powerful XSL can
really be.  My only real concern right now is processing efficiency of XML not
necessarily in terms of implementation, but in terms of the general runtime
expense of pattern matching and selecting.

Situations like:

<process select 'ancestor('/')//foo'/>

in very large documents basically say to go to the root and recursivly traverse
the entire document tree and look for "foo" elements.  For an average size source
tree and a good number of templates which do this, performance problems should be
evident.

It would be very beneficial if you could index the source tree before doing
template matching if after all of the import actions that there were special
commands to instruct the XSL Processor to index certain frequently looked up
elements that either end match patterns.  I suppose this could be done in a
proprietary way using PI's, but a standard way would do everyone a lot of good.
Perhaps this all may be a knee-jerk so I suppose we will all have to first find
out what kind of patterns tend to cause users processing problems and then either
warn stylesheet writers (is this the accurate term) about what XSL's processing
limitations may be.

Again much thanx,

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 13:24:12 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:47 2004
Subject: More on XSL Patterns...
Message-ID: <3600F17E.A006920E@infinet.com>

In XSL there are two kinds of patterns: match and select.

Match patterns may only contain absolute anchors while select patterns
can contain both absolute anchors and relative anchors.

Now for conditional processing in the case of xsl:if and xsl:when
instructions, the patterns tested are select patterns and the contents
of these instructions are processed if one or more nodes are selected.

What this sounds like to me is that the patterns for xsl:if and xsl:when
are really match patterns which can have relative anchors.  No they are
not really match patterns but they are something different because once
you encounter an occurrence of at least one node which matches the
pattern, then the condition is satisfied.

Perhaps there should be a new pattern in the next release of the spec in
addition to the match pattern and the select pattern called a
conditional pattern where a pattern is satisfied upon the first
occurrence of a match.

This would make a lot of sense IMHO as far as clarifications go, even
though it would not really change anything fundamental to the current
spec with respect to patterns.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Thu Sep 17 13:39:48 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
In-Reply-To: <3600A279.AC87AF1E@infinet.com>
Message-ID: <Pine.SUN.3.91.980917072424.14370F-100000@cito.uwaterloo.ca>

On Thu, 17 Sep 1998, Tyler Baker wrote:

>     <xsl:process select="../../../heading"/>
> 
> >From what I understand, this would say first go to the third parent node
> of the current node and select all heading elements and process them.
> This would seem to be an error since another template may already have
> processed these heading elements.

What's wrong with processing the same element twice? That is necessary in
many cases (e.g. processing a title in the context of a cross reference, a
TOC, and in its natural locataion)

> Another question is what to do with Absolute Anchors.  I would think
> that for select patterns it would not make sense for this to be allowed
> as the entire template match then has nothing to do with the actual
> processing.  For example if I match a particular element and then select
> a set of nodes who are anchored at the root, this would be like doing
> global processing independent of the match argument.

What's wrong with that? The WD provides an example of where you would want
to do that. Do a search for "CFO". 

 Paul Prescod


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 13:47:08 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
References: <Pine.SUN.3.91.980917072424.14370F-100000@cito.uwaterloo.ca>
Message-ID: <3600F642.76A761A5@infinet.com>

Paul Prescod wrote:

> On Thu, 17 Sep 1998, Tyler Baker wrote:
>
> >     <xsl:process select="../../../heading"/>
> >
> > >From what I understand, this would say first go to the third parent node
> > of the current node and select all heading elements and process them.
> > This would seem to be an error since another template may already have
> > processed these heading elements.
>
> What's wrong with processing the same element twice? That is necessary in
> many cases (e.g. processing a title in the context of a cross reference, a
> TOC, and in its natural locataion)
>
> > Another question is what to do with Absolute Anchors.  I would think
> > that for select patterns it would not make sense for this to be allowed
> > as the entire template match then has nothing to do with the actual
> > processing.  For example if I match a particular element and then select
> > a set of nodes who are anchored at the root, this would be like doing
> > global processing independent of the match argument.
>
> What's wrong with that? The WD provides an example of where you would want
> to do that. Do a search for "CFO".

Jim's previous post answered all of these questions.  Not being tremendously
familiar with stylesheet languages in the past, I was previously under the
impression that you could only process content in the source tree once.

Regards,

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Thu Sep 17 14:45:21 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:47 2004
Subject: Undeclared elements error
In-Reply-To: <3600B27C.10579B57@jclark.com> from James Clark at "Sep 17, 98 01:55:56 pm"
Message-ID: <199809171245.IAA32283@goon.stg.brown.edu>

> Liam R. E. Quin wrote:
> 
> > Should the following produce a warning?
> > 
> >     <!ELEMENT CHAPTER (LOCATION,DESCRIPTION)>
> >     <!--* error: LOCATION has not been declared *-->
> >     <!--* error: DESCRIPTION has not been declared *-->
> > 
> >     <!ELEMENT LOCATION (#PCDATA)>
> >     <!ELEMENT DESCRIPTION (#PCDATA)>
> 
> Absolutely not.

In an attempt to flesh out James Clark's somewhat abrupt note, let
me just give you a quick example illustrating why your intuitions
were right about it being okay to use element names in content models
before they are declared:

  <!ELEMENT text (#PCDATA | italic | bold)*>
  <!ELEMENT italic (#PCDATA | bold)*>
  <!ELEMENT bold (#PCDATA | italic)*>

Other questions you might wonder about include:

  1) What if I use an element in a content model and I don't
     declare it?
  2) What if I declare an element in a content model that I
     don't use in any content model (and the element isn't the
     root element)?

By my reading of the XML spec, these are not clearly stated to be
errors.  So they aren't errors (unless you try to use the elements
in a document - in which case, other constraints exclude them).

I may have missed something, of course.

In such situations, it's tempting to cheat and just use your know-
ledge of SGML.  One would hope that future standards will answer
such questions definitively, removing this temptation (confusion).

Richard Goerwitz
STG


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From JD1 at FADavis.com  Thu Sep 17 16:17:35 1998
From: JD1 at FADavis.com (Joe Davidyock)
Date: Mon Jun  7 17:04:47 2004
Subject: xml/xsl and MS activeX control
Message-ID: <AD6A45394A48D211A0CF00805F78EEF2013535@ipb10.fadavis.com>

Greetings all.
I am new (very new) to the XML realm. I am currently using Microsoft's XSL
control to convert a xml file to html. This is working fine on a static xml
document. My problem is that once implemented I need to apply the same xsl
sheet to a dynamically generated xml file (the result of an object query),
whose name and existence is not always fixed. How can i get the xsl to apply
to each xml file as they are generated? I have been using the following
piece of code in my html which has been successful on the static file
"test.xml":
<OBJECT ID="XSLControl" CLASSID="CLSID:2BD0D2F2-52EC-11D1-8C69-0E16BC000000"
CODEBASE="http://www.microsoft.com/xsl/xsl/msxsl.cab" STYLE="display:none">
<PARAM NAME="documentURL" VALUE="test.xml">
<PARAM NAME="styleURL" VALUE="test.xsl">        
</OBJECT>
Can I assign the first parameter with a value that points to a particular
directory where these files are? Examples/hints/references?
Thank you in advance for any help!
---------------------------
Joseph P. Davidyock
jd1@fadavis.com
---------------------------

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From graham.moore at dpsl.co.uk  Thu Sep 17 16:55:34 1998
From: graham.moore at dpsl.co.uk (Graham Moore)
Date: Mon Jun  7 17:04:47 2004
Subject: xml/xsl and MS activeX control
Message-ID: <TFSMNYSZ@dpsl.co.uk>>


you can use some script to say...

XSLControl.documentURL = urlString

etc

so if you wanted a really dynamic system you could make a request to the 
server which returned the url to connect to and then set  the url property.

When you set the prop it goes off and gets the XML, then just call, 
something like

htmlElemId.innerHTML = XSLControl.htmlText

to display the new content.

Also consider using the java stuff from jjc and others as this is more 
up-to-date than the ms offering, IMHO provides you with alot more power and 
control.

graham.

gdm@dpsl.co.uk


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From thomas.dimock at medtronic.com  Thu Sep 17 17:09:18 1998
From: thomas.dimock at medtronic.com (Thomas Dimock)
Date: Mon Jun  7 17:04:47 2004
Subject: Please unsubscribe me
Message-ID: <s600df7d.096@mspeos0.corp.medtronic.com>

unsubscribe xml-dev

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Thu Sep 17 17:53:57 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:47 2004
Subject: Processing Select Patterns in XSL...
Message-ID: <3.0.32.19980917105006.00cd7194@pophost.arbortext.com>

At 07:45 1998 09 17 -0400, Tyler Baker wrote:
>Paul Prescod wrote:
>
>> On Thu, 17 Sep 1998, Tyler Baker wrote:
>>
>> >     <xsl:process select="../../../heading"/>
>> > . . .

Please note the existence of xsl-list@mulberrytech.com
which is the public list for discussion of XSL issues.

XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Thu Sep 17 18:34:35 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:47 2004
Subject: xml/xsl and MS activeX control
In-Reply-To: <TFSMNYSZ@dpsl.co.uk>>
Message-ID: <199809171634.JAA28226@mail-gw6.pacbell.net>

>Also consider using the java stuff from jjc and others as this is more 
>up-to-date than the ms offering, IMHO provides you with alot more power and 
>control.

  I would have to disagree.

  MSXSL gives you the <define-script> tag which puts it leaps and bounds
ahead of the other XSL processors in terms of control. What's more, MSXSL
allows you to include the <SCRIPT> tag in your generated file without
escaping all your comparison operators.

  I'm using XT, but only because I wanted to learn the working draft. In my
opinion, the proposed note that MSXSL implements is more immediately
useful. I've made XT work the way I want it to by post-processing its
output (if I had a Java development environment, I would probably have
altered the way XT spits out the result tree)

  I'm told that the second way (altering how the XSL processor spits out
the result tree) is the right way to use XSL to generate HTML.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 17 20:41:45 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:48 2004
Subject: ANN: DOMParser
References: <35FFE7C6.E5034DAB@locke.ccil.org> <360035CE.3424E342@infinet.com>
Message-ID: <360157EE.B3F611E5@locke.ccil.org>

Tyler Baker wrote:

> Wouldn't this be better titled as a DOM Builder and a DOM Writer?

I don't see why.  DOMParser is a SAX parser, just like any other,
except that instead of reading an XML source file, it walks a DOM.
It does not build a DOM, nor does it write anything.

There is a similar module in SAXON, I believe, but not packaged as
a SAX parser.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 17 21:33:51 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:48 2004
Subject: John Cowan's XML presentation now available in PDF format
Message-ID: <360163FE.C3EEACF9@locke.ccil.org>

from http://www.ccil.org/~cowan/XML/xml.pdf, courtesy of
Lars Marius Garshol, who did the conversion.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 17 21:44:14 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:48 2004
Subject: Undeclared elements error
References: <199809171245.IAA32283@goon.stg.brown.edu>
Message-ID: <3601666D.5F1CCD0D@locke.ccil.org>

Richard L. Goerwitz III wrote:

>   1) What if I use an element in a content model and I don't
>      declare it?

>From clause 3.2:

# At user option, an XML processor may issue a warning when a
# declaration mentions an element type for which
# no declaration is provided, but this is not an error.

>   2) What if I declare an element in a content model that I
>      don't use in any content model (and the element isn't the
>      root element)?

I know of no context, XML or otherwise, where it is an outright error
to declare something and not use it.  At most, that provokes a warning.

	We *mention* "our vast nuclear arsenal" so that
	we won't have to *use* it.
		-- Douglas Hofstadter on the use/mention distinction

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Thu Sep 17 22:39:33 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:48 2004
Subject: ANN: DOMParser
References: <35FFE7C6.E5034DAB@locke.ccil.org> <360035CE.3424E342@infinet.com> <360157EE.B3F611E5@locke.ccil.org>
Message-ID: <360173A8.9B7D4470@infinet.com>

John Cowan wrote:

> Tyler Baker wrote:
>
> > Wouldn't this be better titled as a DOM Builder and a DOM Writer?
>
> I don't see why.  DOMParser is a SAX parser, just like any other,
> except that instead of reading an XML source file, it walks a DOM.
> It does not build a DOM, nor does it write anything.
>
> There is a similar module in SAXON, I believe, but not packaged as
> a SAX parser.

Sorry my confusion lies in that parsing refers to taking data of one unmanageable
form and converting it into a manageable form.  In other words, you cannot do
anything useful with an XML document until you parse it, all it is a set of bytes
that follow a particular pattern.  Parsing a DOM tree is an oxymoron IMHO unless
you are converting the DOM tree into some other tree.  I guess I would have to
take a closer look at what you have.  From second glance it seems as if what you
have are utility methods for searching and sorting a DOM tree.  In this case you
would have a DOM Manager...

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 17 22:47:34 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:48 2004
Subject: ANN: DOMParser
References: <35FFE7C6.E5034DAB@locke.ccil.org> <360035CE.3424E342@infinet.com> <360157EE.B3F611E5@locke.ccil.org> <360173A8.9B7D4470@infinet.com>
Message-ID: <3601756D.853FA98B@locke.ccil.org>

Tyler Baker wrote:

> Sorry my confusion lies in that parsing refers to taking data of one unmanageable
> form and converting it into a manageable form.

Yes.  

> Parsing a DOM tree is an oxymoron IMHO unless
> you are converting the DOM tree into some other tree.

No, what I have is code that converts a DOM *tree* into a
SAX *event stream*.  The result is "manageable" to applications
that expect SAX events.

For example, you can use the non-API methods of Docuverse DOM SDK
(or any other Java DOM implementation that supports SAX)
in cooperation with a real SAX parser to capture an XML document
as a DOM tree.  You can then use DOMParser to generate those same
SAX parse events repeatedly by walking the tree repeatedly.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Thu Sep 17 23:18:52 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
Message-ID: <199809172118.RAA29626@hesketh.com>

I'm writing about namespaces (sooner or later I get to write about
everything in XML, it seems), and I'm trying to figure out something
different about namespaces than the usual stuff we've been arguing about here.

If I may be so nosy, where did this idea come from?  It doesn't seem to
have come in as a NOTE, and so far as I know it lacks the usual SGML
ancestry.  Glimmerings of it are visible in the XML 1.0 spec (xml:lang and
xml:space), but otherwise it seems to have arrived fully born.

I'd love to know where this one came from... explaining these things
without some kind of story is difficult at best.


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Thu Sep 17 23:26:50 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
References: <199809172118.RAA29626@hesketh.com>
Message-ID: <36017EAC.78BE7F73@locke.ccil.org>

Simon St.Laurent wrote:

> If I may be so nosy, where did this idea come from?  It doesn't seem to
> have come in as a NOTE, and so far as I know it lacks the usual SGML
> ancestry.  Glimmerings of it are visible in the XML 1.0 spec (xml:lang and
> xml:space), but otherwise it seems to have arrived fully born.

Search http://www13.w3.org/XML/9712-reports.html for "namespace".
Basically, the old PI-based system seems to have arrived full-blown
in the XML WG meeting of 1 October 1997.  xml:space existed before
this under the name -XML-SPACE.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Fri Sep 18 00:10:12 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
In-Reply-To: <199809172118.RAA29626@hesketh.com> (simonstl@simonstl.com)
Message-ID: <199809172208.SAA08423@ruby.ora.com>

[Simon St.Laurent]
> I'm writing about namespaces (sooner or later I get to write about
> everything in XML, it seems), and I'm trying to figure out something
> different about namespaces than the usual stuff we've been arguing
> about here.  If I may be so nosy, where did this idea come from?  It
> doesn't seem to have come in as a NOTE, and so far as I know it
> lacks the usual SGML ancestry.  Glimmerings of it are visible in the
> XML 1.0 spec (xml:lang and xml:space), but otherwise it seems to
> have arrived fully born.

There was a requirement from other WGs that documents be able to
unambiguously refer to the semantics of those WGs' specifications in a
document.  For instance, "When I say <rdf>, I mean <rdf> as defined by
the RDF WG of the W3C."  Without something performing the functions
that namespaces perform, then browsers would be required to guess
based on the element type names.

This is a matter of record in the XML WG and SIG archives, but sadly,
they are not available to those not members of the W3C.

The purpose of W3C privacy is to give members a concrete advantage
over non-members (otherwise there's no point in joining).  But WGs are
required to make their work public within three months of inception;
at this point, it might do much more good than harm to open the older
archives to the public.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Fri Sep 18 00:52:16 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
Message-ID: <199809172250.SAA09150@ruby.ora.com>

It's come to my attention that my last message wasn't very clear.

First of all, I incorrectly identified the main reason for keeping W3C
discussions private: many W3C members have more vocal lawyers than my
employer does, and contributors to the discussion fora would have to
clear everything through their lawyer before posting it if the
archives were, or ever would be open to the public.  This is a
compelling reason to keep them private forever.

I also want to make it clear that I wasn't questioning the W3C's
decision to keep its archives private.  It is a private organization,
and doesn't have any moral or legal obligation to share work paid for
by its members.  That it chooses to make its specifications public at
all is commendable (though fulfilling its mission otherwise would be
difficult), and the small lead time its members get in implementing
specifications is the (bargain-basement-cheap) price of the work
getting funded at all.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 18 00:54:25 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:48 2004
Subject: ANN: DOMParser
References: <35FFE7C6.E5034DAB@locke.ccil.org> <360035CE.3424E342@infinet.com> <360157EE.B3F611E5@locke.ccil.org> <360173A8.9B7D4470@infinet.com> <3601756D.853FA98B@locke.ccil.org>
Message-ID: <36019320.3DABF942@infinet.com>

John Cowan wrote:

> Tyler Baker wrote:
>
> > Sorry my confusion lies in that parsing refers to taking data of one unmanageable
> > form and converting it into a manageable form.
>
> Yes.
>
> > Parsing a DOM tree is an oxymoron IMHO unless
> > you are converting the DOM tree into some other tree.
>
> No, what I have is code that converts a DOM *tree* into a
> SAX *event stream*.  The result is "manageable" to applications
> that expect SAX events.

Ahh, I see.  In this sense I suppose it is a parser.  I suppose I just was not sure
what you meant at first.  In this case you have a DOMSAX for better lack of a term.  I
can see how this would be useful to apps that build a DOM tree straight out of a
database and then need to have the data be sent into some XML framework without having
to first write the contents of the DOM tree out to a stream and then read it in the
output as input into a SAX compliant parser.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 18 04:16:50 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
In-Reply-To: <199809172250.SAA09150@ruby.ora.com>
Message-ID: <199809180216.WAA32197@hesketh.com>

At 06:50 PM 9/17/98 -0400, Chris Maden wrote:
>First of all, I incorrectly identified the main reason for keeping W3C
>discussions private: many W3C members have more vocal lawyers than my
>employer does, and contributors to the discussion fora would have to
>clear everything through their lawyer before posting it if the
>archives were, or ever would be open to the public.  This is a
>compelling reason to keep them private forever.

Well, let's just hope they open it up to the historians after we're all
dead. I figure the historians will want to know how the foundations of
their data storage and transfer systems, all of which are grounded in XML,
came about.

Am I being too optimistic about XML?


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Fri Sep 18 06:34:47 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:48 2004
Subject: Namespaces from where?
In-Reply-To: <199809172118.RAA29626@hesketh.com>
References: <199809172118.RAA29626@hesketh.com>
Message-ID: <13825.58204.240209.554488@localhost.localdomain>

Simon St.Laurent writes:

 > If I may be so nosy, where did this idea come from?  It doesn't
 > seem to have come in as a NOTE, and so far as I know it lacks the
 > usual SGML ancestry.  Glimmerings of it are visible in the XML 1.0
 > spec (xml:lang and xml:space), but otherwise it seems to have
 > arrived fully born.

You'll probably be able to trace the history on Robin Cover's XML Page 
(now at OASIS) -- I know that Andrew Layman had written something up,
and that there was a whole slew of namespace-like proposals floating
around last Summer.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 18 08:54:15 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:04:48 2004
Subject: XSL ConstantRefs and MacroArgRefs?
Message-ID: <360203C4.EC04927@infinet.com>

For attribute value templates as well as for xsl:value-of actions the
spec states:

 "It is an error to refer to a macro argument that has not been
declared."

The only unclear thing is whether or not this means that the macro
argument has to be declared in a define-macro element that is previous
to the template element in which the MacroArgRef occurs.  In other
words, can you declare all of your constant and define-macro statements
at the end of the XSL stylsheet.

This is important to know because it basicly defines whether you can
parse attribute value templates within attribute values of the
stylesheet in a one-pass or a two-pass fashion.  If you can assume that
all define-constant and define-macro expressions are at the beginning of
the stylesheet and follow the similiar construction rules to entities in
DTD's, then all of this can be done in one-pass.  Otherwise multiple
passes are required.  If this is the case, perhaps in the XSL DTD, the
stylesheet elementdecl should be changed from:

<!ELEMENT xsl:stylesheet
 (xsl:import*,
  (xsl:include
  | xsl:id
  | xsl:strip-space
  | xsl:preserve-space
  | xsl:define-macro
  | xsl:define-attribute-set
  | xsl:define-constant
  | xsl:template)*)
>

to something like:

<!ELEMENT xsl:stylesheet
 (xsl:import*,
  (xsl:id
  | xsl:strip-space
  | xsl:preserve-space
  | xsl:define-macro
  | xsl:define-attribute-set
  | xsl:define-constant)*,
  (xsl:include
  | xsl:template)*)
>

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From arcpub at arcanum.com  Fri Sep 18 09:29:19 1998
From: arcpub at arcanum.com (Attila Torcsvari)
Date: Mon Jun  7 17:04:48 2004
Subject: String expressions in XSL
Message-ID: <36020BAD.80BD664@arcanum.com>

XSLers,

in my XML file I have
<subclass id=A01B>
...
<ref to="A01B001/00"/>
...
</subclass>
...
<subclass id=A01C>
...
<ref to="A01B001/00"/>
...
</subclass>

I have to generate the following HTML sequences:

from the first reference:
<A href="@A01B001/00">1/00</A>

...because I am in subclass A01B, thus the user should see only
1/00
which is sufficient for the user to recognize the referred location.

from the second reference:
<A href="@A01B001/00">A01B 1/00</A>
...because it is in another subclass, thus the user _must_ now that the
reference points to another subclass.

(The "@" designates "late-bound reference", used by another tool which
will resolve it a separate step.)

I have got about 800.000 of such references in about 20*15 MB XML data.

Have I got in XSL _any_ chance to generate these sequences?

I guess _no_. There seems to be _no_ string expression language neither
for patterns nor for "value-of".

Do I have to contaminate my source XML code with redundant data
(which I do now but I do not like it)
or
should I generate temporal XML files in Java, which is a neverending
story due to the sizes of the files
or
is it planned that at least regexp patterns will appear in XSL
or
is there a chance that there will be _again_ ECMAScript in XSL?
or
is there any XML transformation tool which is less painful to use than a
set of Java/Python/C/AWK/Perl programs?

Samples can be reached at http://www.arcanum.com/patclass/index.htm
These files were generated with proprietary tools which I plan to change
to XML tools, if I can.

I guess my problems are not too special.

Thanks for help.

Attila Torcsvari
Arcanum Development

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From graham.moore at dpsl.co.uk  Fri Sep 18 09:45:30 1998
From: graham.moore at dpsl.co.uk (Graham Moore)
Date: Mon Jun  7 17:04:48 2004
Subject: xml/xsl and MS activeX control
Message-ID: <TFSGZGHX@dpsl.co.uk>>


>  I'm using XT, but only because I wanted to learn the working draft. In my
> opinion, the proposed note that MSXSL implements is more immediately
> useful.

I would agree with that.

>  (if I had a Java development environment, I would probably have
> altered the way XT spits out the result tree)

Would you not consider this to be providing  you with more control? Beyond 
simply producing the output you require.

I guess my comments were aimed at the nature of the control and 
implementation as opposed to which XSL was being used. I find that the XT 
java approach allows the construction of scalable and  managable solutions. 
For example, its easy to pass an InputStream to XT from either a URL or a 
serialised Grove or something else. The MSXSL ctrl's COM interface is 
limited and non-extensible.

I believe that seeing how a thing works and having the ability to modify or 
extend it is a powerful thing. Consider a situation with a large XML or  XSL 
file, if the MSXSL ctrl can't handle it, it can't handle it and there's not 
alot you can do without the solution being contrived. With an open OO 
solution you have the power to solve the problem without breaking the model 
or imposing future constraints.

graham.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From aheitor at ef.pt  Fri Sep 18 11:24:53 1998
From: aheitor at ef.pt (Ana Heitor)
Date: Mon Jun  7 17:04:49 2004
Subject: unsubscribe
In-Reply-To: <s600df7d.096@mspeos0.corp.medtronic.com>
Message-ID: <Pine.LNX.3.96.980918084750.2660B-100000@hercules.ef.pt>


unsubscribe xml-dev


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Fri Sep 18 11:26:46 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:49 2004
Subject: What is a "Public Identifier"?
References: <98Sep16.114919pdt."56386(3)"@alpha.xerox.com>
Message-ID: <wkaf3xu629.fsf@ifi.uio.no>


* Mike Spreitzer
|
| I'm wondering if there is any standardization of what's in a "public
| identifier" 

The SGML standard (ISO 8879) defines something called a 'formal public
identifier' (clause 10.2), but whether public identifiers must be FPIs
or not is defined by the SGML declaration used.

In other words: public identifiers can be whatever you want, but you
can declare them to be FPIs.

The FPI syntax is described with BNF at

<URL:http://www.tiac.net/users/bingham/sgmlsyn/sgmlsyn.htm#P79>

| and/or of how one is resolved to whatever it refers to.

The usual way to resolve a public identifier is to use a so-called
catalog file. Currently there are two syntaxes for catalog files SGML
Open Catalog files (supported by lots of SGML tools plus DXP and
xmlproc) and XCatalogs (supported by xmlproc).

<URL:http://www.sgmlopen.org/html/a401.htm>
<URL:http://www.ccil.org/~cowan/XML>

| Yet there is a lot of regularity in the examples I see.  I've never
| seen anything specifying the structure used.  What am I missing?

Clause 10.2 of the ISO 8879 standard. The best place to get hold of it
is Goldfarbs The SGML Handbook, which has a lot of explanatory text
supplementing the standard itself.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 18 14:09:43 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:49 2004
Subject: Namespaces from where?
In-Reply-To: <13825.58204.240209.554488@localhost.localdomain>
References: <199809172118.RAA29626@hesketh.com>
 <199809172118.RAA29626@hesketh.com>
Message-ID: <199809181209.IAA03010@hesketh.com>

Many thanks to all who wrote, publicly or privately, with tales of
namespaces' origins.  The stories of the origins are much more consistent
than the discussions of their impact, so at least there will be something
solid in the discussion!

Thanks again!  XML-Dev is indeed an amazing resource.  If only all my
research was that easy...

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 18 14:29:16 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:49 2004
Subject: ANN: DOMParser
Message-ID: <011001bde300$8968e4e0$1e09e391@mhklaptop.bra01.icl.co.uk>

>There is a similar module in SAXON, I believe, but not
packaged as
>a SAX parser.


Indeed so. There is a class in SAXON that does this, it's
available for use, but not actively promoted as a feature,
since it was written for internal use.

One thing I needed to do in SAXON was to allow the SAX
application to get a reference to the DOM element currently
being processed; I did this by subclassing the SAX Locator
class. Does DOMParser provide anything comparable?

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rodig at sdm.de  Fri Sep 18 14:30:47 1998
From: rodig at sdm.de (Steffen Rodig)
Date: Mon Jun  7 17:04:49 2004
Subject: Characters having an ASCII value > 127
Message-ID: <199809181228.OAA16525@sunfi1.fi.sdm.de>

Hello,

imagine a plain text file which I want to markup using XML. Now it could be
that there are characters in this file whose ASCII value is greater than
127 (in PCDATA sections).

If I try to use expat on the generated XML file, it tells me that it is
not wellformed at the position where such a character occurs. Does the
XML spec say anything about not permitting characters with high ASCII
values? If so, where?

I guess, to correctly interpret and display those characters I have to
know the character set which was used to encode the original text file.
How can I communicate this character set to an XML parser?

I would be happy if anybody could point me to somewhere I could start
reading about this issue.

Thanks and have a nice weekend,
--
Steffen Rodig
rodig@sdm.de


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at cogsci.ed.ac.uk  Fri Sep 18 14:45:40 1998
From: richard at cogsci.ed.ac.uk (Richard Tobin)
Date: Mon Jun  7 17:04:49 2004
Subject: Characters having an ASCII value > 127
In-Reply-To: Steffen Rodig's message of Fri, 18 Sep 1998 14:29:15 +0200
Message-ID: <199809181245.NAA25767@cogsci.ed.ac.uk>

> I guess, to correctly interpret and display those characters I have to
> know the character set which was used to encode the original text file.
> How can I communicate this character set to an XML parser?

You can do this by putting an encoding declaration in the XML
declaration at the start of the file.  For example, if the document
is in ISO Latin 1, officially named ISO-8859-1, you can use

 <?xml version="1.0" encoding="ISO-8859-1"?>

Without an encoding declaration (or a mime type if the document comes
from an http server) a conforming parser will treat it as UTF-8, and
any character above 127 will be misinterpreted.

Of course, any particular parser may not support the character set you
happen to be using.

-- Richard

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Fri Sep 18 14:47:56 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:49 2004
Subject: Characters having an ASCII value > 127
In-Reply-To: <199809181228.OAA16525@sunfi1.fi.sdm.de>
References: <199809181228.OAA16525@sunfi1.fi.sdm.de>
Message-ID: <wkogsdsi83.fsf@ifi.uio.no>


* Steffen Rodig
| 
| If I try to use expat on the generated XML file, it tells me that it
| is not wellformed at the position where such a character occurs.
| Does the XML spec say anything about not permitting characters with
| high ASCII values? If so, where?

It doesn't. However, the XML spec _does_ say that unless XML entities
have an XML declaration with an encoding declaration parsers are to
assume that the entity is UTF-8-encoded.

This means that if you have used ISO 8859 you may get problems, since
these characters will either be mapped to a (seemingly) random Unicode
code point or simply be invalid bit sequences that do not resolve to
any character at all.
 
| I guess, to correctly interpret and display those characters I have
| to know the character set which was used to encode the original text
| file. 

Bingo. 

| How can I communicate this character set to an XML parser?

You do this on the XML declaration, like so:

<?xml version="1.0" encoding="iso-8859-1"?>

| I would be happy if anybody could point me to somewhere I could
| start reading about this issue.

Rick Jelliffe devotes a large part of The SGML/XML Cookbook to
character sets and how they are used in XML and SGML. Other than that
I don't know of any good resources apart from good old-fashioned
digging in various places.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Fri Sep 18 14:59:24 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:04:49 2004
Subject: ANN: DOMParser
Message-ID: <004e01bde303$d47fe340$1375fea9@laforge>

I had the same problem. I ended up extending document with
a reference to context. (Context is null when parsing is complete.)

Context then allows access to locator and a few other things specific
to coins. One driving issue for me was that an element needed to
throw a SAXParseException at endElement time and needed
access to the locator to be able to identify the element where the
error occurs.

Bill

-----Original Message-----
From: Michael Kay <M.H.Kay@eng.icl.co.uk>
>One thing I needed to do in SAXON was to allow the SAX
>application to get a reference to the DOM element currently
>being processed; I did this by subclassing the SAX Locator
>class. Does DOMParser provide anything comparable?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 18 15:01:52 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:49 2004
Subject: Characters having an ASCII value > 127
Message-ID: <016401bde304$f6ac4020$1e09e391@mhklaptop.bra01.icl.co.uk>


>imagine a plain text file which I want to markup using XML.
Now it could be
>that there are characters in this file whose ASCII value is
greater than
>127 (in PCDATA sections).


If your file contains a code higher than 127 then it is not
ASCII -- ASCII stops at 127.

For example, it might be ISO 8859-1 (the code that Microsoft
refer to as "ANSI"). Many XML parsers will accept a file
containing characters from 8859-1 if you use an encoding
declaration at the start of the file:

<?xml encoding='ISO-8859-1'?>

However, the only encodings that XML parsers are obliged to
accept are the UTF-8 and UTF-16 encodings of ISO 10646
(informally, Unicode).

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Fri Sep 18 15:05:10 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:49 2004
Subject: Characters having an ASCII value > 127
In-Reply-To: Steffen Rodig's message of "Fri, 18 Sep 1998 14:29:15 +0200"
References: <199809181228.OAA16525@sunfi1.fi.sdm.de>
Message-ID: <uogsd4lkb.fsf@delivery.ansa.co.uk>

Steffen> Steffen Rodig <URL:mailto:rodig@sdm.de>

0> In article <199809181228.OAA16525@sunfi1.fi.sdm.de>, Steffen wrote:

Steffen> imagine a plain text file which I want to markup using
Steffen> XML. Now it could be that there are characters in this file
Steffen> whose ASCII value is greater than 127 (in PCDATA sections).

No character has an ASCII value greater than 127: ASCII is a 7-bit
encoding.  Of course, it's possible to use characters beyond ASCII,
since the Document Character Set for XML is Unicode.


Steffen> If I try to use expat on the generated XML file, it tells
Steffen> me that it is not wellformed at the position where such a
Steffen> character occurs.

Perhaps your XML declaration doesn't agree with the actual encoding
of the document (you don't say what either of these are for your
document).  See Sections 2.8 and 4.3.3, and Appendix F.


Steffen> I guess, to correctly interpret and display those characters
Steffen> I have to know the character set which was used to encode the
Steffen> original text file.

Of course - the parser is unlikely to be able to tell the difference
between the various parts of ISO 8859, for instance.


Steffen> How can I communicate this character set to an XML parser?

In the encoding declaration, <?xml encoding="utf-8"?> (or whatever).

You may prefer to write the problematic characters as entities or
character references, if they are rare in your source.  This may
allow you to write your documents in a smaller character set.  (As an
example, I find it easiest to author in ISO-8859-1, but I need to
define entities for the Welsh characters, which lie in the Latin-2
plane.)

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 18 15:28:02 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:50 2004
Subject: String expressions in XSL
Message-ID: <018b01bde308$be486160$1e09e391@mhklaptop.bra01.icl.co.uk>

>There seems to be _no_ string expression language neither
>for patterns nor for "value-of".
>I guess my problems are not too special.
>
I too was very surprised by the omission of string
manipulation facilities in XSL. I think XSL has been
designed rather on the assumption that your XML document
contains the character strings you want the user to see, and
the purpose of the stylesheet is to control where and how to
display them. Hence also the omission of features such as
sorting and totalling, date localisation, etc.

I would solve the problem by using my SAXON library to
convert the XML document to another XML document that
contains precisely the character content you want to
display, and then use XSL to render it. Or use SAXON to
generate the target HTML directly, if it's not too complex.

SAXON is on http://home.iclweb.com/icl2/mhkay/saxon.html

Regards, Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 18 16:00:58 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:50 2004
Subject: Public Identifiers
Message-ID: <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>

>> I'm wondering if there is any standardization of what's
in a "public
>> identifier"
>
>The SGML standard (ISO 8879) defines something called a
'formal public
>identifier' (clause 10.2), but whether public identifiers
must be FPIs
>or not is defined by the SGML declaration used.
>
>In other words: public identifiers can be whatever you
want, but you
>can declare them to be FPIs.


I think this response is referring to SGML rather than XML.
There is no SGML declaration in XML. There is no normative
link between XML and SGML, and therefore no normative link
between XML Public Identifiers and SGML FPIs.

XML does not require a Public Identifier to be either public
or an identifier; you can put anything in there that you
like, and it has no defined meaning. Tim Bray's annotated
XMl spec (on www.xml.com) has this to say:

... public identifiers are a trick inherited from SGML that
are probably only useful to people who already have working
SGML software installed. Remember that if you use public
identifiers within your own organization, that's perfectly
OK, but if you want to interchange XML documents with
anybody external, they have the right to demand, and you
have the obligation to provide, a working system identifier
(URI) for each external entity.


Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Fri Sep 18 16:50:20 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:50 2004
Subject: Public Identifiers
In-Reply-To: <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>

At 03:05 PM 9/18/98 +0100, Michael Kay wrote:

>I think this response is referring to SGML rather than XML.
>There is no SGML declaration in XML. There is no normative
>link between XML and SGML, and therefore no normative link
>between XML Public Identifiers and SGML FPIs.

This is not true. From the published recommendation (italics mine):

1. Introduction

Extensible Markup Language, abbreviated XML, describes a class of data
objects called XML documents and partially describes the behavior of
computer programs which process them. *XML is an application profile or
restricted form of SGML, the Standard Generalized Markup Language [ISO
8879]. By construction, XML           documents are conforming SGML
documents.* 

>XML does not require a Public Identifier to be either public
>or an identifier; you can put anything in there that you
>like, and it has no defined meaning. Tim Bray's annotated
>XMl spec (on www.xml.com) has this to say:

Here I agree. Public identifiers are, conceptually, the same as URNs, that
is, they are names that are intended to be indirected to their actual
system ID, rather than being direct references to storage locations, as
URLs normally are. However, as Dan Connoly and Tim B-L have argued, there's
no *functional* difference between a URN and URL because persistence is
always a function of the owner of the resource and cannot be guaranteed
simply by the choice of name. Thus, at most, the URN/URL or public
ID/system ID distinction can only express *intent*, it cannot guarantee
results.

It is a fact of life that any storage addressing scheme (or, in fact, any
addressing scheme at all) must include some notion of indirection. Both
SGML and HTTP do this and *neither* define the mechanism by which the
indirection is implemented or managed. In SGML, there is a requirement that
entity managers provide some mechanism for resolving public IDs to system
IDs, but ISO 8879 does not define a mechanism. Likewise, HTTP provides a
mechanism by which a server can report that a URL has been redirected (the
300-series messages) but doesn't define the mechanism by which a server
actually manages the redirection itself.

Thus, the unavoidable conclusion is that system IDs can be just as
indirect, and just as persistent, as so-called "public" IDs.  The only real
difference is what bit of software gets the value of the ID to resolve.
There is a useful notion of "published" names, that is names that the
resource owner or name owner (they may not be the same entity) assert will
be persistent, but there is no standard or even convention for making that
assertion.  The original idea in SGML was that public IDs would be used for
the names of "published" things, that is, resources that are available
beyond the local scope of the resource owner. However, that original intent
got lost in the more immediate need for general name indirection that
public IDs provided (because SGML systems are required to provide some sort
of mechanism). 

My conclusion at this point is that the URN/public ID distinction is not
helpful because it merely confuses the issue without actually solving any
problems. The only thing public IDs did was force vendors to provide *a
way* to do name indirection, which you do need on brain-dead operating
systems that lack something like symbolic links (which includes both VM/CMS
and DOS/Windows). If operating-system filename indirection was a universal
service, you'd just use that to manage redirection of entity storage IDs.
At the time SGML was developed, it certainly wasn't universal and it may
not have even been known outside of Bell Labs (I don't remember precisely
when Unix went public).

In hindsight, it's clear to me that we never should have allowed public IDs
in XML.  Oh well.

This is not to say that the URN idea is totally useless--it's very useful
to have a syntax for saying what name space a particular name is unique
within, which is really what URNs do.  However, I do have a problem with
putting all of that information in a single string--it too severely limits
your choice of syntaxes.  I would much rather have some sort of name
structure, such as:

<urn:address id="local-id-for-remote-resource">
<urn:name-domain>ISBN</urn:name-domain>
<urn:name>ISBN 0-1233456-123-0</urn:name>
</urn:address>

The "name-domain" element names the domain of names in which the name is
unique (e.g., ISBN numbers in this example). The "name" element holds the
name itself. By using element content rather than an attribute, there are
no syntactic restrictions on the name (it could even have structuring
subelements).  You could also combine names together to form larger,
multi-part addresses, if necessary.

Now I can refer to any resource in any name space regardless of the syntax
the name-space uses for its names. Of course, there is still a problem with
naming the name spaces, but that can be solved either by providing a
general "name space registration service" ala DNS or by simply defining in
the relevant standards what the naming authories are (as ISO 9070
does--9070 being the standard that defines the rules for SGML public
identifiers). [Note that I don't use the term "naming authority"--the same
name space may recognize several naming authorities, as is the case for
SGML public IDs.]

Remember: there's no magic to URLs or URNs--they're just identifiers that
some piece of software has to map to bytes at some point.  The only real
question is "is the pointer to the bytes also meaningful to humans or is it
only for machines?" URLs are intended to be "opaque", meaning that there is
no reliable intelligence in them. URNs are intended to be "meaningful" such
that a human observer might have some clue as to what the resource is at
the other end of it.  This is a useful distinction but it doesn't require
making the distinction at the point of reference (e.g., the PUBLIC/SYSTEM
distinction SGML and XML make). It is sufficient to have the distinction be
inherent in the form of address you're using, which means you need a way to
declare what the form is, which is what my example above does.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From elharo at sunsite.unc.edu  Fri Sep 18 16:51:02 1998
From: elharo at sunsite.unc.edu (Elliotte Rusty Harold)
Date: Mon Jun  7 17:04:50 2004
Subject: Namespaces from where?
In-Reply-To: <199809172250.SAA09150@ruby.ora.com>
Message-ID: <v03102806b2281fdd782b@[168.100.203.234]>

At 6:50 PM -0400 9/17/98, Chris Maden wrote:

>I also want to make it clear that I wasn't questioning the W3C's
>decision to keep its archives private.  It is a private organization,
>and doesn't have any moral or legal obligation to share work paid for
>by its members.

Legal, perhaps not. Moral, I completely disagree. A lot of people are VERY
uncomfortable with a few, private, for-profit corporations being allowed to
set the standards that all of us have to live with. I think the Web is too
important to be designed int the best interests of Microsoft and Netscape.
The W3C may be a vendor consortium, but perhaps it should not be.  It is
unconscionable that the interests of users and developers are not
represented in the standards process.

Quibbling about legal distinctions between specifications and standards is
irrelevant. What the W3C produces are more effective standards than much
paper that comes out of ISO or other standards bodies.


+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@sunsite.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|        XML: Extensible Markup Language (IDG Books 1998)            |
|   http://www.amazon.com/exec/obidos/ISBN=0764531999/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://sunsite.unc.edu/javafaq/ |
|  Read Cafe con Leche for XML News: http://sunsite.unc.edu/xml/     |
+----------------------------------+---------------------------------+


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 18 17:12:49 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:50 2004
Subject: Characters having an ASCII value > 127
References: <199809181228.OAA16525@sunfi1.fi.sdm.de>
Message-ID: <36027854.797C8093@locke.ccil.org>

Steffen Rodig wrote:

> imagine a plain text file which I want to markup using XML. Now it could be
> that there are characters in this file whose ASCII value is greater than
> 127 (in PCDATA sections).
> 
> If I try to use expat on the generated XML file, it tells me that it is
> not wellformed at the position where such a character occurs. Does the
> XML spec say anything about not permitting characters with high ASCII
> values? If so, where?

Expat, like a proper XML parser, is assuming the UTF-8 charset.
You need to specify Latin-1 or whatever you are using.
 
> I guess, to correctly interpret and display those characters I have to
> know the character set which was used to encode the original text file.
> How can I communicate this character set to an XML parser?

Put "<?xml encoding="8859-1" ?>" as the very first line.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 18 17:20:26 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:50 2004
Subject: Public Identifiers
References: <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <36027A49.D857AB1E@locke.ccil.org>

Michael Kay wrote:

> I think this response is referring to SGML rather than XML.
> There is no SGML declaration in XML. There is no normative
> link between XML and SGML, and therefore no normative link
> between XML Public Identifiers and SGML FPIs.

Normative, no, but James Clark's SGML declaration
(http://www.w3.org/TR/NOTE-sgml-xml-971215) has some
standing, since it is referred to in the XML Rec.
That says "FEATURES OTHER FORMAL NO".
Nevertheless, most PublicIds in examples are FPIs.
 
> ... public identifiers are a trick inherited from SGML that
> are probably only useful to people who already have working
> SGML software installed.

Coming soon: an SAX EntityResolver that processes Socats.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 18 17:34:41 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:50 2004
Subject: Namespaces from where?
Message-ID: <3.0.32.19980918083202.0091a7c0@pop.intergate.bc.ca>

At 10:38 AM 9/18/98 -0400, Elliotte Rusty Harold wrote:
>Legal, perhaps not. Moral, I completely disagree. A lot of people are VERY
>uncomfortable with a few, private, for-profit corporations being allowed to
>set the standards that all of us have to live with. I think the Web is too
>important to be designed int the best interests of Microsoft and Netscape.
>The W3C may be a vendor consortium, but perhaps it should not be.  It is
>unconscionable that the interests of users and developers are not
>represented in the standards process.

Users and developers *can* be represented; but they have to pay for the
privilege.  Since the amounts aren't exorbitant, it seems like a good
practical bozo-filter to me.  Speaking as one who has been inside the
process for the last couple of years, it is absolutely *not* the case
that the discussions are dominated in any practical way by Netscape
and Microsoft, or that those guys always get what they want. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Fri Sep 18 18:16:49 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:50 2004
Subject: String expressions in XSL
In-Reply-To: "Michael Kay"'s message of "Fri, 18 Sep 1998 14:32:11 +0100"
References: <018b01bde308$be486160$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <f5bn27xcs67.fsf@cogsci.ed.ac.uk>

1) xsl-list@mulberrytech.com is the preferred place for XSL
discussions; there is a mechanism in place to ensure that points
raised there are called to the attention of the XSL Working Group.

2) The (first) XSL draft recommendation says quite clearly that an
expression language will be provided:  we just weren't ready with one
in time.  If you're keen to see string manipulation included, see
point (1) above.

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bunner at massquantities.com  Fri Sep 18 18:26:19 1998
From: bunner at massquantities.com (Andrew Bunner)
Date: Mon Jun  7 17:04:50 2004
Subject: String expressions in XSL
In-Reply-To: <018b01bde308$be486160$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <199809181626.JAA09086@mail-gw5.pacbell.net>


[... moved from bottom ...]
>I would solve the problem by using my SAXON library to
>convert the XML document to another XML document that
>contains precisely the character content you want to
>display, and then use XSL to render it.

  I had an almost identical problem, fortunately there was a lot less data
involved though ;)

  The way I solved it was I defined a macro to spit out the links and I
provided the macro with some arguments to let it know the context in which
the links were bring defined. In your case, the arguments might be
"subclass A01B" or something.

  Then I got kludgey. Real kludgey.

  I made up my own tag called <define-script> to include in my XSL
document. For those who are wondering, I wanted to name it
<extra-xsl:define-script> (or something), but I couldn't figure out how to
do namespaces properly and after a few minutes, I gave up.

  Anyway, I let XT process the XSL file as best it can and then I run it
through my post-processor which sends the contents of the <define-script>
tags to a Perl interpreter and then replaces the tags with whatever the
Perl script sends to STDOUT.

  Which is to say, there's no easy way to do what you want.

>>There seems to be _no_ string expression language neither
>>for patterns nor for "value-of".
>>I guess my problems are not too special.

  Actually, I'd say your problem is representative of a big hole in the
working draft. I think they plan on incorporating some specifications for
how XSL can be extended with a scripting language in a later working draft.
That's the sort of thing that would solve this...

>I too was very surprised by the omission of string
>manipulation facilities in XSL. I think XSL has been
>designed rather on the assumption that your XML document
>contains the character strings you want the user to see

  It's probably safe to say that the spec just isn't done. These features
will have to be added later because they're too important to expressly
leave out. I hope.

-- Andrew

   Andrew Bunner
   President, Founder Mass Quantities, Inc.
   Professional Supplements for the Perfect Physique
   http://www.massquantities.com 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Fri Sep 18 18:28:39 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:50 2004
Subject: IRAL (was Public Identifiers)
Message-ID: <3.0.32.19980918092812.00ae3890@pop.intergate.bc.ca>

At 09:49 AM 9/18/98 -0500, W. Eliot Kimber wrote:
>Here I agree.
...
>Thus, at most, the URN/URL or public
>ID/system ID distinction can only express *intent*, it cannot guarantee
>results.

What Eliot said.  Every word of it.

>In hindsight, it's clear to me that we never should have allowed public IDs
>in XML.  Oh well.

Well yeah, and a large majority of the WG agreed, and in fact we voted
'em down no less than 3 times as I recall, but the SIG howled until blood
ran from our ears and it was obvious that they weren't going to stop, so
we eventually decided that they weren't actually damaging and if that
many people wanted them that badly, they ought to have them.

>This is not to say that the URN idea is totally useless ...
> I would much rather have some sort of name
>structure, such as:

Now there's a really good idea.  It's severely irritating that UR*'s
have all sorts of internal markup that most application are required
to pretend isn't there... in fact, anything that is going to be generally 
useful for addressing across the internet is probably going to have all 
sorts of internal structure, why not publish it?

Takers for developing IRAL (Internet Resource Addressing Language),
an application of XML?  Then you could have, instead of a URI plus
an XPointer, an IRAL plus an XPointer, and you'd really have something
you could do some work with. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 18 18:40:27 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:50 2004
Subject: Opportunities for XML-DEV
In-Reply-To: <3.0.1.16.19980913101630.903f754e@pop3.demon.co.uk>
References: <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com>
Message-ID: <199809181640.MAA05701@hesketh.com>

Nearly a week ago, Peter Murray-Rust wrote:
>What we have discovered is that there are very few XML documents currently
>being delivered over the WWW. For many of us who see XML as a communication
>medium *and philosophy* this is a pity. I think it makes it harder to
>develop tools to work with specs like XLink, XPointer, Namespaces because
>we don't have example documents to work with. And this is cyclic, because
>those creating documents don't have tools to create documents with and
>don't have people who can read them. So, at the moment we can only talk
>about those applications.

I'm moving www.simonstl.com into XML syntax, though still using the HTML
vocabulary.  All future postings will be well-formed, and sometime soon I
hope to build a supplement to John Cowan's IBTWSH that will allow me to
validate my documents as well.  I'll be posting an article sometime soon
(before October?) detailing what's involved in the transition.

I realize this isn't exactly what most of us have in mind with 'XML over
the Web', but it's an important first step.  Cleaning up the top pages of
my site took about 10 minutes.  Other pages will require more work - yes, I
plan to go back and fix everything except perhaps the 100+ XSchema fragments.

Getting HTML developers used to the syntactic constraints of XML is an
important first step toward getting out the word.  Once we have real
browser support for generic XML, we can start developing more meaningful
vocabularies.

Frank Boumphrey wrote:
>Having just finished 'hacking' the IE5 support for XML and the DOM, I am
>amazed. Combined they can be used to retrieve any XML document and can
>display it in almost any form we want on a (IE5 compatible, ah, there's the
>rub!) browser.

If IE5 can display straight-up XML documents, no scripting required, using
CSS for formatting, as Netscape is starting to do, I'll consider changing
my current strategy.  If it's data islands and access via the DOM only, I'm
not supporting it.  (I'll document it, since that's my job, but it ain't
going to show up on my site.)

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Fri Sep 18 19:14:16 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:50 2004
Subject: Transition to XML (was Re: Opportunities for XML-DEV)
References: <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com> <199809181640.MAA05701@hesketh.com>
Message-ID: <36029E7E.65DA85DE@finetuning.com>

Simon St. Laurent wrote:

  Cleaning up the top pages of
> my site took about 10 minutes.  Other pages will require more work - yes, I
> plan to go back and fix everything except perhaps the 100+ XSchema fragments.

It took me WEEKS just to clean up the (pretty horrid) HTML on my little
site -- but now it is at least HTML 4.0 valid.  I feel this is an
important first step to first get everyone to at least be well-formed
HTML, and THEN start pushing for xml transition.  

The XML transition itself will be more easily accomplished from a web of
well-formed HTML anyway.  In fact, if I have my dithers, much of it will
be automatable.   

For now, remember that if web documents that are not well formed are
near-useless to machines.  So it just depends on whether you want the
information on your site -- all of it -- to be machine-accessible -- and
that's not even getting into real accesibility issues (WAI, ICAAD,
etc.)-- many of which also depend on such well-formedness.  

Many sites that I have been trying to access with xml-enabled apps are
dead-ends due to non-well-formed pages.  In order to accomplish creating
an XML directory, for example, I would like to be able to index at the
very least the websites of the members of this list -- but when I tried
many were inaccessible ;-(

So then I am forced to either "scrape" the data off of your sites, and
regenerate versions of it that can be indexed -- which is another pain
(shame on you all ;-) -- or keep looking for other well-formed sites --
a frustrating search which usually just leads me back to my own site.

Simon's other point (about the need to start migrating ourselves) hits a
sore spot with me because I've been getting called on this lately --
It's pretty embarrassing when some one calls you on not practicing what
you preach -- especially with what some still view (unfortunately) as
the a "religion" of xml. 

Most HTML programmers can understand the sacredness of well-formed docs
(-- although they might not get the value of syntax preservation in
general.)  I'm not sure why the well-formedness thing gets across, but
it does.  We can use this to everyone's advantage.

So soon I will be releasing my own little indexing system for my own
little site (of well-formed HTML, at first, not xml pages -- and then
mirrored xml'd versions of those same pages -- but i would love to
integrate the content of others -- i've just grown tired of looking for
well-formed pages)

So if your site has well-formed pages -- and you'd like to be included
in a little experiment -- let me know privately.  

Thanks!

lisa

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 18 19:43:17 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:50 2004
Subject: Transition to XML (was Re: Opportunities for XML-DEV)
In-Reply-To: <36029E7E.65DA85DE@finetuning.com>
References: <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com>
 <199809181640.MAA05701@hesketh.com>
Message-ID: <4.0.1.19980918132913.00e4b870@pop.hesketh.net>

At 10:55 AM 9/18/98 -0700, Lisa Rein wrote:
>It took me WEEKS just to clean up the (pretty horrid) HTML on my little
>site -- but now it is at least HTML 4.0 valid.  I feel this is an
>important first step to first get everyone to at least be well-formed
>HTML, and THEN start pushing for xml transition.  

Fortunately, I'd picked up the gospel of well-formedness from my early days
with dynamic HTML.  It's a lot easier to script elements when their
boundaries are well-defined.  Most of my pages are hand-coded, so they're
fairly clean.  The ones from MS Word - well, we'll see.

I'm actually not bothering with being valid to the HTML 4 DTD, since it's
an SGML DTD.  I'll carve out my own little XML subset for now (building on
John Cowan's IBTWSH), and watch for the W3C's HTML modules to appear.

>The XML transition itself will be more easily accomplished from a web of
>well-formed HTML anyway.  In fact, if I have my dithers, much of it will
>be automatable.   

This is definitely true.  The more we can report on how easy and
automatable this process is, the more likely it is that others will join
the fun.

>Many sites that I have been trying to access with xml-enabled apps are
>dead-ends due to non-well-formed pages.  In order to accomplish creating
>an XML directory, for example, I would like to be able to index at the
>very least the websites of the members of this list -- but when I tried
>many were inaccessible ;-(
>
>So then I am forced to either "scrape" the data off of your sites, and
>regenerate versions of it that can be indexed -- which is another pain
>(shame on you all ;-) -- or keep looking for other well-formed sites --
>a frustrating search which usually just leads me back to my own site.
>
>Simon's other point (about the need to start migrating ourselves) hits a
>sore spot with me because I've been getting called on this lately --
>It's pretty embarrassing when some one calls you on not practicing what
>you preach -- especially with what some still view (unfortunately) as
>the a "religion" of xml. 

This is definitely the case.  (I got called on using MS Word's HTML output
for XSchema over the summer, which got me thinking.)  XML needs momentum,
and its stronger supporters are a reasonable place to start.  I don't think
we need or want XML police (although that's what validating parsers are,
more or less), but the more valid and well-formed material out there, the
better.  I can't wait to move beyond the HTML vocabulary, but it's a start.

Making these sites indexable would be another gigantic boost to the cause
of XML, worthy in its own right.

>Most HTML programmers can understand the sacredness of well-formed docs
>(-- although they might not get the value of syntax preservation in
>general.)  I'm not sure why the well-formedness thing gets across, but
>it does.  We can use this to everyone's advantage.

I'm finding it gets through more and more as developers hit walls.  Dynamic
HTML was a key area for this, CSS is starting to get through (largely
thanks to positioning, in my experience), and corporate intranet developers
frustrated by the weakness of the client end of their applications need to
find a better way. Validation is harder to explain, but I think it may just
take time.

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From johnm at magnet.com  Fri Sep 18 20:04:23 1998
From: johnm at magnet.com (John Mitchell)
Date: Mon Jun  7 17:04:50 2004
Subject: Transition to XML (was Re: Opportunities for XML-DEV)
In-Reply-To: <36029E7E.65DA85DE@finetuning.com>
Message-ID: <Pine.SGI.3.96.980918135641.16417M-100000@lemur.magnet.com>

On Fri, 18 Sep 1998, Lisa Rein wrote:

> Simon St. Laurent wrote:
> 
>   Cleaning up the top pages of
> > my site took about 10 minutes.  Other pages will require more work - yes, I
> > plan to go back and fix everything except perhaps the 100+ XSchema fragments.
> 
> It took me WEEKS just to clean up the (pretty horrid) HTML on my little
> site -- but now it is at least HTML 4.0 valid.  I feel this is an
> important first step to first get everyone to at least be well-formed
> HTML, and THEN start pushing for xml transition.  
> 
> The XML transition itself will be more easily accomplished from a web of
> well-formed HTML anyway.  In fact, if I have my dithers, much of it will
> be automatable.   


I'd strongly recommend this HTML-sanitizer:

	http://www.w3.org/People/Raggett/tidy

It automatically closes tags and other things to make your HTML *much*
more XML-friendly.  Another switch will remove "font" etc tags and
automatically replace them with CSS!


- j


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 18 20:08:08 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:51 2004
Subject: Transition to XML (was Re: Opportunities for XML-DEV)
References: <Pine.SGI.3.96.980918135641.16417M-100000@lemur.magnet.com>
Message-ID: <3602A18B.E69EACF3@locke.ccil.org>

John Mitchell wrote:

> I'd strongly recommend this HTML-sanitizer:
> 
>         http://www.w3.org/People/Raggett/tidy

I second the recommendation.  Tidy is open-source and is available
in compiled form for several systems, including DOS/Windows.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dalapeyre at mulberrytech.com  Fri Sep 18 20:59:29 1998
From: dalapeyre at mulberrytech.com (Deborah Aleyne Lapeyre)
Date: Mon Jun  7 17:04:51 2004
Subject: Public Identifiers
In-Reply-To: <36027A49.D857AB1E@locke.ccil.org>
References: <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <v03020949b2285b3c4957@DialupEudora>

John Coan wrote:

>Coming soon: an SAX EntityResolver that processes Socats.

Thank you!

>From someone to whom fpis are still incredibly useful, and to
whom socats (on admittedly brain-dead but very common operating
systems) provide a real service.

--Debbie

======================================================================
Deborah Aleyne Lapeyre               mailto:dalapeyre@mulberrytech.com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9633
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
----------------------------------------------------------------------
  Mulberry Technologies: A Consultancy Specializing in SGML and XML
======================================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dalapeyre at mulberrytech.com  Fri Sep 18 20:59:35 1998
From: dalapeyre at mulberrytech.com (Deborah Aleyne Lapeyre)
Date: Mon Jun  7 17:04:51 2004
Subject: IRAL (was Public Identifiers)
In-Reply-To: <3.0.32.19980918092812.00ae3890@pop.intergate.bc.ca>
Message-ID: <v0302094ab2285c5e8d88@DialupEudora>

Tim Bray wrote:

>Takers for developing IRAL (Internet Resource Addressing Language),
>an application of XML?  Then you could have, instead of a URI plus
>an XPointer, an IRAL plus an XPointer, and you'd really have something
>you could do some work with.

Yes please.  I want that too, please.  But that is tomorrow.  If I need to
get product out the door today, I want access to all the tricks that work,
right out of the box, even the gross ones.  It's not a question of
functionality, truth, beauty, or elegance; it's a question of what can
off-the-shelf products support.

End rant.  Sorry.

--Debbie

======================================================================
Deborah Aleyne Lapeyre               mailto:dalapeyre@mulberrytech.com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9633
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
----------------------------------------------------------------------
  Mulberry Technologies: A Consultancy Specializing in SGML and XML
======================================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 18 21:21:28 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:51 2004
Subject: ANN: Updated DOMParser
Message-ID: <3602B2B5.FBB6EAB2@locke.ccil.org>

At the suggestion of Mike Kay, I have enhanced DOMParser to support
SAX Locators and InputSources properly.  Reiterating, DOMParser
conforms to the definition of a SAX parser, but reads from a
DOM Document representing an XML document, rather than from a
textual representation of XML.

A new class called DOMSource, a subclass of InputSource, can now be
used to encapsulate a DOM Document for SAX parsing.  Obviously, SAX
parsers other than DOMParser will be clueless if passed such an
InputSource!

DOMParser also now implements Locator, although it returns no useful
information from it.  However, the non-interface method
DOMParser.getCurrentNode() can be used to determine the current DOM Node
being parsed.

All this is available at http://www.ccil.org/~cowan/XML .

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Fri Sep 18 21:55:49 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:51 2004
Subject: Public Identifiers
In-Reply-To: <v03020949b2285b3c4957@DialupEudora>
References: <36027A49.D857AB1E@locke.ccil.org>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.5.32.19980918145429.0093b100@dns.isogen.com>

At 02:50 PM 9/18/98 -0400, Deborah Aleyne Lapeyre wrote:
>John Coan wrote:
>
>>Coming soon: an SAX EntityResolver that processes Socats.
>
>Thank you!
>
>From someone to whom fpis are still incredibly useful, and to
>whom socats (on admittedly brain-dead but very common operating
>systems) provide a real service.

But note that with the second edition of the SOCAT spec, you can remap
system IDs just as you can public IDs. So even there, FPIs provide no
unique facility, although the SOCAT mechanism itself does (redirection).

Formal public identifiers have value because they are intended to be human
meaningful and, when using registered owner names, guaranteed unique--but
not because they are indirect.  They are indirect only because, as far as I
know, SGML formal public IDs are not valid file names in any common
operating system. If they were, then you could use them as direct system
IDs.  It wouldn't matter whether their invocation was preceded by the
keyword PUBLIC or SYSTEM.  But note that, for example, within the scope of
dedicated repositories, I would expect to be able to use FPIs as the
primary name for resources, knowing that the redirection to the real
resource name, the private repository ID, is transparent, just as the
redirection of filenames to internal storage locations (e.g, i-nodes in
Unix) is transparent in operating systems.

The real issue is one of generally-available name redirection services ala
DNS. We take DNS for granted because the Internet would be really
inconvenient to use without it.  We could have a similar system for
resource names if the Internet community was willing to step up to funding
the development and maintenance of it. Unfortunately, because the Internet
is a distributed resource with no central management and largely hidden
shared costs, it's difficult to get a general resource in place if people
can get along with out it. You can't get along without DNS, so we have it.
You can get along without generalized resource name redirection, so we
don't have it.  

Note that things like PURLs, while useful, don't really solve the problem
because they are really nothing more than server-side redirects, which
we've always had and which anyone can provide unilaterally.  The problem
isn't really solved until we have a general service that everyone can take
for granted.  It may be, in the spirit of 80/20 solutions, that using
server-side redirection is all we will ever really need or have. Providing
a generalized resource name redirection service poses some difficult
technical and social challenges that might prove more expensive to solve
than the benefit provided can justify.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eric at hellman.net  Fri Sep 18 23:57:59 1998
From: eric at hellman.net (Eric Hellman)
Date: Mon Jun  7 17:04:51 2004
Subject: namespaces for name attribute values?
Message-ID: <v04011702b22875ecf674@[192.168.1.1]>

Here's the problem:

We're designing an XML DTD. We want the documents to be valid. We would
like to provide for Dublin Core Metadata.

One way to do this is:
<DC xmlns="DC=http://purl.org/metadata/dublin_core">
   <DC:Creator>Eric Hellman</DC:Creator>
</DC>

This declares the nomenclature of "Creator" in an unambiguous way. (xmlns
could be declared as a default in the DTD) The problem, is that we have no
control over Dublin Core. Dublin Core will get extensions, and we'd rather
not revise our dtd every time Dublin Core changed.

The second way to do this is the HTML way:

<meta name="DC.creator" content="Eric Hellman">

This encapsulates the Dublin core element set from our DTD, but the
nomenclature declaration is absent.

Could a "namespace" declaration be added to an element to declare the
nomenclature for attribute values?:
<meta name="DC:Creator" content="Eric Hellman"
xmlns="DC=http://purl.org/metadata/dublin_core">

Would this new meaning for a namespace declaration be useful?


Eric
Eric Hellman
Openly Informatics, Inc.
http://www.openly.com/           Tools for 21st Century Scholarly Publishing

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From andrewl at microsoft.com  Sat Sep 19 00:13:02 1998
From: andrewl at microsoft.com (Andrew Layman)
Date: Mon Jun  7 17:04:51 2004
Subject: namespaces for name attribute values?
Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA7B3@RED-MSG-08>

Perhaps you do not really have a problem.  What I'm thinking is that if you
wrote something similar to the example you showed,

<foo xmlns:DC="http://purl.org/metadata/dublin_core">
   <DC:Creator>Eric Hellman</DC:Creator>
</foo>

then the meaning of DC:Creator is not affected by any additions made to
Dublin Core.  Your DTD remains valid, simply not as extensive as the
(expanded) Dublin Core.

This is certainly not a solution to all the issues that might arise when
combining elements from multiple namespaces, but it appears to work fine for
the example you cite.

Hope this is helpful,

Andrew Layman 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From hans at miracle.se  Sat Sep 19 01:22:57 1998
From: hans at miracle.se (Hans Carlsson)
Date: Mon Jun  7 17:04:51 2004
Subject: Q: DTD versioning
Message-ID: <99767B93CB69D111B5D1000000014EB507F1@adam.miracle.se>

Excuse me for blundering in to the exciting xml-dev discussions going
on...

I'm fairly new to XML, I have read some books, XML Complete by Seven
Holzner (a complete disaster), XML by Elliotte Rusty Harold (good
introduction). I'm not going to be a an XML tool developer, I guess I'm
going to be an XML practitioner. I confess I haven't read the W3 XML
spec.

Is it possible to have versions of XML DTDs? I'm wondering about the
'simple' issue of versioning for XML DTD's... How is this being
addressed by the XML spec (community), tools to be developed etc.?

Take for example an an INVOICE DTD. After a while someone in the
organization realizes that an INVOICE should contain a <PHONE> tag, or
that they should be printed on pink paper <PAPER COLOR="pink">. Do you
define a new DTD, <IMPROVED_INVOICE>, or do you add an attribute,
<INVOICE VERSION="2">?

Does the XML spec address this? or is it conferred to the realm of
application builders?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gkholman at CanadaMail.com  Sat Sep 19 07:32:17 1998
From: gkholman at CanadaMail.com (G. Ken Holman)
Date: Mon Jun  7 17:04:51 2004
Subject: page header footer resolution
In-Reply-To: <003101bdddfa$a94e85e0$b3acdccf@ix.netcom.com>
Message-ID: <Version.32.19980918192435.00ef1cd0@CraneSoftwrights.com>

At 98/09/11 23:08 -0400, Frank Boumphrey wrote:
>>I've seen slightly conflicting suggestions. In some postings it's been
>>suggested that there is not currently a standard way to HIDE content.
>
>my understanding was that if you didn't want to display an object you just
>ommited to process the children.

Not to my understanding ... one would have to omit the template content to
truly hide the element and have nothing at all displayed.

>Thus
><greeting>Hello XSL!</greeting>
>
><xsl:stlesheet>
>    <xsl:template match="greeting">
>        <fo:block font-size="16pt">
>            <process-children/>
>        </fo:block>
>    </xsl:template>
></xsl:stlesheet>
>
> would result in a styled text flow object, "Hello XSL!"
>
>whereas:
>
><xsl:template match="greeting">
>    <fo:block font-size="16pt">
>        <!--<process-children/>-->
>    </fo:block>
></xsl:template>
>
>would not.

But your example above would, I think, produce a 16pt high paragraph block
from the formatter.  Thus, though the characters of the content are hidden,
the presence of the <greeting> element would still be visible.

I would use the following to completely hide the element.

<xsl:template match="greeting"/>

I hope this helps.

.............. Ken

p.s. As Paul pointed out, this should be discussed on the XSL-List
(http://www.mulberrytech.com/xsl/xsl-list) ... I answered here since the
thread was started here.


--
G. Ken Holman               mailto:gkholman@CanadaMail.com
Crane Softwrights Ltd.  http://www.CraneSoftwrights.com/x/
Box 266,                                V: +1(613)489-0999
Kars, Ontario CANADA K0A-2E0            F: +1(613)489-0995
Training:   http://www.CraneSoftwrights.com/x/schedule.htm
Resources: http://www.CraneSoftwrights.com/x/resources.htm
Shareware: http://www.CraneSoftwrights.com/x/shareware.htm


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ht at cogsci.ed.ac.uk  Sat Sep 19 12:17:07 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:51 2004
Subject: Public Identifiers
In-Reply-To: "W. Eliot Kimber"'s message of "Fri, 18 Sep 1998 14:54:29 -0500"
References: <36027A49.D857AB1E@locke.ccil.org>  <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk> <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
Message-ID: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>

"W. Eliot Kimber" <eliot@dns.isogen.com> writes:

> [names without a public resolution mechanism can never be really universal]

So has the W3C, as the obvious entity with a budget and an interest in 
a solution to this problem, ever showed its hand wrt this issue?

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From digitome at iol.ie  Sat Sep 19 14:14:52 1998
From: digitome at iol.ie (Sean Mc Grath)
Date: Mon Jun  7 17:04:51 2004
Subject: Switching between DOM and SAX
Message-ID: <3.0.6.32.19980919094744.009157b0@gpo.iol.ie>

[John Cowan]
>..what I have is code that converts a DOM *tree* into a SAX *event stream*.
> The result is "manageable" to applications that expect SAX events.

The reverse conversion is also very valuable. I wonder how it would
work with SAX/DOM?

Explanation:
When processing large documents building entire trees is
resource intensive and time consuming. Sometimes you only
the power of tree navigation stuff for sub-parts of the
source document.

I find it very useful to start out processing events and then switch
into tree building for branches of the source document. I have a way of doing
this in a Python based SGML toolkit (LumberJack) that I have hacked up and
find
it really really useful. I wonder how it would work with the SAX/DOM
standards. FWIW, here is how it looks in my stuff (pseudo Python):--

def TABLE_HANDLER
	# Hit a table element - need a tree to process these things
	if start of table:
		EventSource.RollBack() # Roll back the start event
		TableTree = TreeBuilder (EventSource)

def FOO_HANDLER
	Other event handling functions here


Sean Mc Grath

def Get_URI_Of_Superlative_Scripting_Language():
	return "http://www.python.org"


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Sat Sep 19 14:24:08 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:51 2004
Subject: Q: DTD versioning
In-Reply-To: <99767B93CB69D111B5D1000000014EB507F1@adam.miracle.se>
Message-ID: <4.0.1.19980919081959.00e57d60@pop.hesketh.net>

At 01:22 AM 9/19/98 +0200, Hans Carlsson wrote:
>Is it possible to have versions of XML DTDs? I'm wondering about the
>'simple' issue of versioning for XML DTD's... How is this being
>addressed by the XML spec (community), tools to be developed etc.?

You might want to take a look at XSchema's XSchema element - we used a
#FIXED Version attribute in the DTD to identify that all documents using
this DTD are using version 1.0 of XSchema.  When we develop another
version, we can create a new DTD and update the value to 1.1 or 2.0 or
something else more interesting.  (Schemas created using XSchema will need
to implement their own versioning mechanism, possibly using this same
technique in their root element.)

It's only guaranteed to work in a validating environment (where we can
count on the parser to read in the external DTD and apply the value), but
it's pretty convenient. (The XML spec only provides for XML's own version
functionality.)  Hope it helps!


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Sat Sep 19 20:14:19 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:51 2004
Subject: Switching between DOM and SAX
In-Reply-To: <3.0.6.32.19980919094744.009157b0@gpo.iol.ie> from "Sean Mc Grath" at Sep 19, 98 09:47:44 am
Message-ID: <199809191820.OAA00814@locke.ccil.org>

Sean Mc Grath scripsit:

> The reverse conversion is also very valuable. I wonder how it would
> work with SAX/DOM?

The Docuverse DOM SDK (as well as any competitors that may eventually
appear, no doubt) already supports this.

> When processing large documents building entire trees is
> resource intensive and time consuming. Sometimes you only
> the power of tree navigation stuff for sub-parts of the
> source document.

This is difficult with the DOM, because every Node must have
a pointer to the root (Document) Node.

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From elharo at sunsite.unc.edu  Sat Sep 19 21:51:08 1998
From: elharo at sunsite.unc.edu (Elliotte Rusty Harold)
Date: Mon Jun  7 17:04:51 2004
Subject: Namespaces from where?
In-Reply-To: <3.0.32.19980918083202.0091a7c0@pop.intergate.bc.ca>
Message-ID: <v03102807b229ba7b0b08@[168.100.203.234]>

Tim Bray wrote:
>
>Users and developers *can* be represented; but they have to pay for the
>privilege.  Since the amounts aren't exorbitant, it seems like a good
>practical bozo-filter to me.  Speaking as one who has been inside the
>process for the last couple of years, it is absolutely *not* the case
>that the discussions are dominated in any practical way by Netscape
>and Microsoft, or that those guys always get what they want. -Tim

I can't join. The W3C won't take my money. Various non-profit groups I
belong to like WWWAC can join, but the W3C still allow me or any other
members a voice in the process, because we aren't allowed to particpate in
the working groups.  The W3C process is designed to serve the interests of
commercial software developers, and these are the only people who are
allowed to be members in any practical sense.


+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@sunsite.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|        XML: Extensible Markup Language (IDG Books 1998)            |
|   http://www.amazon.com/exec/obidos/ISBN=0764531999/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://sunsite.unc.edu/javafaq/ |
|  Read Cafe con Leche for XML News: http://sunsite.unc.edu/xml/     |
+----------------------------------+---------------------------------+


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From elharo at sunsite.unc.edu  Sat Sep 19 22:20:58 1998
From: elharo at sunsite.unc.edu (Elliotte Rusty Harold)
Date: Mon Jun  7 17:04:51 2004
Subject: Opportunities for XML-DEV
In-Reply-To: <199809181640.MAA05701@hesketh.com>
References: <3.0.1.16.19980913101630.903f754e@pop3.demon.co.uk>
 <003e01bddecd$a17e1aa0$8baddccf@ix.netcom.com>
Message-ID: <v0310280ab229c0c1845f@[168.100.203.234]>

>Nearly a week ago, Peter Murray-Rust wrote:
>>What we have discovered is that there are very few XML documents currently
>>being delivered over the WWW.

I've just posted all the full examples from XML: Extensible Markup Language
at http://sunsite.unc.edu/xml/books/xml/examples

They should all be well-formed and many of the later ones should validate.
However, many of the earlier examples in the book don't have DTDs. There
are also a number of examples of the old XSL style syntax.


+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@sunsite.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|        XML: Extensible Markup Language (IDG Books 1998)            |
|   http://www.amazon.com/exec/obidos/ISBN=0764531999/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://sunsite.unc.edu/javafaq/ |
|  Read Cafe con Leche for XML News: http://sunsite.unc.edu/xml/     |
+----------------------------------+---------------------------------+


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Sat Sep 19 22:33:11 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
In-Reply-To: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
References: <36027A49.D857AB1E@locke.ccil.org>
	<019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
	<3.0.5.32.19980918145429.0093b100@dns.isogen.com>
	<f5blnngcsq4.fsf@cogsci.ed.ac.uk>
Message-ID: <13827.43488.151526.744687@megginson.com>

Henry S. Thompson writes:

 > "W. Eliot Kimber" <eliot@dns.isogen.com> writes:
 > 
 > > [names without a public resolution mechanism can never be really
 > > universal]
 > 
 > So has the W3C, as the obvious entity with a budget and an interest in 
 > a solution to this problem, ever showed its hand wrt this issue?

Internet hostnames have a distributed and efficient public resolution
mechanism, so they easily meet Eliot's criterion (as do URLs, more
generally but with a few limitations); the problem with hostnames is
not that they are not universal, but that they are not persistent: a
hostname may have only one owner and resolve to only one IP address at
any given moment, but next week the owner and IP address can be
different.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sat Sep 19 23:42:18 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:52 2004
Subject: Switching between DOM and SAX
Message-ID: <004501bde416$5d005c00$2ee044c6@arcot-main>

>> When processing large documents building entire trees is
>> resource intensive and time consuming. Sometimes you only
>> the power of tree navigation stuff for sub-parts of the
>> source document.
>
>This is difficult with the DOM, because every Node must have
>a pointer to the root (Document) Node.

In PR3 of the Docuverse DOM SDK, there will be a couple of SAX event filters
for pruning the tree before it is converted into a DOM document.  This
solves a good part of the very large document problems.

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Sat Sep 19 23:44:45 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
In-Reply-To: <13827.43488.151526.744687@megginson.com>
References: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
 <36027A49.D857AB1E@locke.ccil.org>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
 <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
 <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
Message-ID: <3.0.5.32.19980919164312.0091ed80@dns.isogen.com>

At 08:59 AM 9/19/98 -0400, david@megginson.com wrote:
>Henry S. Thompson writes:
>
> > "W. Eliot Kimber" <eliot@dns.isogen.com> writes:
> > 
> > > [names without a public resolution mechanism can never be really
> > > universal]
> > 
> > So has the W3C, as the obvious entity with a budget and an interest in 
> > a solution to this problem, ever showed its hand wrt this issue?
>
>Internet hostnames have a distributed and efficient public resolution
>mechanism, so they easily meet Eliot's criterion (as do URLs, more
>generally but with a few limitations); the problem with hostnames is
>not that they are not universal, but that they are not persistent: a
>hostname may have only one owner and resolve to only one IP address at
>any given moment, but next week the owner and IP address can be
>different.

But doesn't "persistent" mean "when I request a thing, I get one"?
Persistence is defined by the resource owner--if I transfer ownership of
drmacro.com to someone else and they serve it from a different machine with
a different IP address, it's still drmacro.com if we say it is, and if we
do, then the drmacro.com resource is persistent. If we say "no, it's a new
and different drmacro.com, then the resource isn't persistent.  But
changing the IP address and ownership of the resource doesn't necessarily
affect the persistence.

Of course, there can always be a mismatch between the expectations and
desires of resource users with regards to persistence and the expectations
and desires of resource owners. The owner of the LA Dodgers baseball team
probably considers the Dodgers to have exhibited persistence as a team
since it started life as the Brooklyn Dodgers--fans from Brooklyn may not
agree.

Persistence in a network environment can really only mean "it's more likely
to be there than not" or "I get what I expect to get". Nothing is truly
persistent.  I don't see much profit in getting too existential about the
term "persistence".  I think the real issue is about management of
persistence: how easy is it for resource owners to manage names so that the
use of a given name gives the appropriate result for the appropriate length
of time and how easy is it for resource users to invoke those names.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Jon.Bosak at eng.Sun.COM  Sun Sep 20 00:19:34 1998
From: Jon.Bosak at eng.Sun.COM (Jon Bosak)
Date: Mon Jun  7 17:04:52 2004
Subject: Opportunities for XML-DEV
Message-ID: <199809192216.PAA09951@boethius.eng.sun.com>

[Frank Boumphrey:]

| These were marked up by Jon Bosak. he needs to update his
| syntax!<grin>, he uses <?XML instead of the lower case!!

Hey, they were perfectly good files back in 1992...

I'm (slowly) revising the Religion set and hope to have a new version
out soon.  I would really appreciate getting corrections right now,
either to the markup or the text itself.

Jon


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Sun Sep 20 00:40:59 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:52 2004
Subject: URN status? (was Re: Public Identifiers)
In-Reply-To: <3.0.5.32.19980919164312.0091ed80@dns.isogen.com>
References: <13827.43488.151526.744687@megginson.com>
 <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
 <36027A49.D857AB1E@locke.ccil.org>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
 <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
 <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
Message-ID: <3.0.5.32.19980919173933.008c2bb0@dns.isogen.com>

As a result of prodding by Terry Allen (which I shouldn't have needed), I
did a little research into URNs (starting with
<http://www.ietf.org/html.charters/urn-charter.html>).

My conclusions from this research are:

1. The URN people seemed to have put a lot of good thought into the problem
and have developed what appear to be pretty solid proposals and/or
explorations of problems yet to be solved.

2. That the proposal for using DNS to map URN name spaces to resolvers
seems reasonable and could probably lead to a reasonable URN resolution
infrastructure given three things:
  - Support in DNS servers for the proposed new records (don't know what the
    status of this is)
  - Support in clients for using the new records (I assume this means 
    things like Web browers--again, don't know the status)
  - People to set up servers for resolving particular name spaces (e.g.,
    PURL-type servers)

None of these seem beyond the realm of possibility, but it is a lot to get
without centralized coordination.

3. That existing naming schemes such as SGML formal public IDs can be used
within a URN context if you're willing to escape lots of special characters
(but we're used to that with URLs anyway).

4. That URNs cannot be generally used today because there is no
generally-available resolution service.  There is a "Real Soon Now" promise
of a service, at least experimentally, but no indication from what I found
that one is available for general use.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Sun Sep 20 01:39:46 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:04:52 2004
Subject: Switching between DOM and SAX
In-Reply-To: <199809191820.OAA00814@locke.ccil.org>
References: <3.0.6.32.19980919094744.009157b0@gpo.iol.ie>
Message-ID: <3.0.3.32.19980919193703.02f78ae0@pop.mindspring.com>

At 02:20 PM 9/19/98 -0400, John Cowan wrote:
 
>> When processing large documents building entire trees is
>> resource intensive and time consuming. Sometimes you only
>> the power of tree navigation stuff for sub-parts of the
>> source document.
>
>This is difficult with the DOM, because every Node must have
>a pointer to the root (Document) Node.

Lazy evaluation is perfectly OK in the DOM, which says absolutely nothing
about the physical representation of a Node, e.g. it does not say that a
Node must have a pointer, merely that it be able to return a reference to
the root. Every Node must be capable of returning such a pointer when
asked, but until someone asks for the root node, there is no need to
construct such a node. Similarly, a Node must be able to return references
to parent or child nodes, but those nodes need not be constructed until the
reference is asked for.

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mrc at allette.com.au  Sun Sep 20 08:20:02 1998
From: mrc at allette.com.au (Marcus Carr)
Date: Mon Jun  7 17:04:52 2004
Subject: Namespaces from where?
References: <3.0.32.19980918083202.0091a7c0@pop.intergate.bc.ca>
Message-ID: <36049EF3.B26E3841@allette.com.au>

Tim Bray wrote:

> Users and developers *can* be represented; but they have to pay for
> the
> privilege.  Since the amounts aren't exorbitant, it seems like a good
> practical bozo-filter to me.

Perhaps a clarification of "bozo-filter" wouldn't go astray here. Is a
bozo one who contributes time and energy to these initiatives even
though they can't realise an immediate financial gain (and therefore
can't justify the annual membership), or is a bozo one who is so
involved with the promulgation of their own software that they'll spend
the money on the off-chance that they'll be able to influence (sorry, I
mean contribute to) the process? In short, is the filter supposed to
exclude bozos, or facilitate their import? Is the code available? I can
think of many other uses for such a clever piece of work.


--
Regards,

Marcus Carr                 email:  mrc@allette.com.au
_______________________________________________________________
Allette Systems (Australia) email:  info@allette.com.au
Level 10, 91 York Street    www:    http://www.allette.com.au
Sydney 2000 NSW Australia   phone:  +61 2 9262 4777
                            fax:    +61 2 9262 4774
_______________________________________________________________


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Sun Sep 20 15:57:32 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:52 2004
Subject: Namespaces from where?
Message-ID: <3.0.32.19980920065601.00b22100@pop.intergate.bc.ca>

At 04:21 PM 9/20/98 +1000, Marcus Carr wrote:
>Tim Bray wrote:
>
>> Users and developers *can* be represented; but they have to pay for
>> the
>> privilege.  Since the amounts aren't exorbitant, it seems like a good
>> practical bozo-filter to me.
>
>Perhaps a clarification of "bozo-filter" wouldn't go astray here. 

Marcus' cynicism is quite reasonable, and my comments were uncalled-for,
sorry.  I was talking about the kind of bozo who makes most unmoderated 
discussion groups useless; someone with apparently infinite time and 
energy, but little to add to the design process.  XML-dev is a delightful 
exception.

In the W3C, either someone is paying you to be there, or you're an 
explicitly invited expert.  In practice this eliminates a certain amount 
of time-wasting. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Sun Sep 20 17:50:45 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:52 2004
Subject: Switching between DOM and SAX
In-Reply-To: <3.0.3.32.19980919193703.02f78ae0@pop.mindspring.com> from "Jonathan Robie" at Sep 19, 98 07:37:03 pm
Message-ID: <199809201557.LAA24630@locke.ccil.org>

Jonathan Robie scripsit:

> Lazy evaluation is perfectly OK in the DOM, which says absolutely nothing
> about the physical representation of a Node, e.g. it does not say that a
> Node must have a pointer, merely that it be able to return a reference to
> the root. Every Node must be capable of returning such a pointer when
> asked, but until someone asks for the root node, there is no need to
> construct such a node. Similarly, a Node must be able to return references
> to parent or child nodes, but those nodes need not be constructed until the
> reference is asked for.

Sure, but in SAX (event stream) to DOM conversion, you need to capture
*all* the SAX events if you are to be able to satisfy the guarantees
that the DOM model makes.  You can't just decided to start DOMifying
at some random element, because by then you have forgotten what the
parent element is.  You either have to reify the SAX events and store
them as such, or else create the DOM Nodes on the fly whether the
user claims to want them or not.

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Sun Sep 20 18:16:01 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
Message-ID: <360524E4.C63C7E92@technologist.com>

W. Eliot Kimber wrote:
> 
> Here I agree. Public identifiers are, conceptually, the same as URNs, that
> is, they are names that are intended to be indirected to their actual
> system ID, rather than being direct references to storage locations, as
> URLs normally are. However, as Dan Connoly and Tim B-L have argued, there's
> no *functional* difference between a URN and URL because persistence is
> always a function of the owner of the resource and cannot be guaranteed
> simply by the choice of name. Thus, at most, the URN/URL or public
> ID/system ID distinction can only express *intent*, it cannot guarantee
> results.

But this is not a precedent in XML. Similarly, we cannot guarantee that an
application, even a conforming one, will treat processing instructions and
comments differently. A conforming (but annoying) XML editor could remove
all processing instructions. But we know that an author would have used
processing instructions for a reason: to signal a particular intent. Thus
editors should not remove them.
 
> Thus, the unavoidable conclusion is that system IDs can be just as
> indirect, and just as persistent, as so-called "public" IDs.  The only real
> difference is what bit of software gets the value of the ID to resolve.

This is also the difference between processing instructions and comments.
In one, the XML processor has the right to interpret and remove the
construct and in the other, the application does.

> My conclusion at this point is that the URN/public ID distinction is not
> helpful because it merely confuses the issue without actually solving any
> problems. The only thing public IDs did was force vendors to provide *a
> way* to do name indirection, which you do need on brain-dead operating
> systems that lack something like symbolic links (which includes both VM/CMS
> and DOS/Windows). If operating-system filename indirection was a universal
> service, you'd just use that to manage redirection of entity storage IDs.
> At the time SGML was developed, it certainly wasn't universal and it may
> not have even been known outside of Bell Labs (I don't remember precisely
> when Unix went public).

Unix has been public since the 70s. Nevertheless, your "only thing public
IDs did...." is an odd statement. If I may paraphrase: "FPIs only provided
a reliable way to interchange SGML data between heterogenous systems for
the last 15 years, and will continue to for the next 5 that it takes
symbolic linking to become popular on Microsoft platforms." To me, the
word "only" is out of place in such a statement.
 
In a fantasy world where:

 * URNs are deployed and work
 * XML inter-document entity references allow multiple 
 * all major systems have reliable symbolic links
 * Redirection can be accomplished through reliable, well-defined
*documents*, not through HTTP server-specific magic

Public identifiers are no longer useful in XML. When that world comes
about, I will gladly get rid of them.

> But note that with the second edition of the SOCAT spec, you can remap
> system IDs just as you can public IDs. So even there, FPIs provide no
> unique facility, although the SOCAT mechanism itself does (redirection).

There are major differences:

First, there is the specification of intent: do you *intend* for this
thing to be redirected, because it is a public resource, or do you intend
for it to be a direct resource, that turns out to be redirected because of
some system limitation (e.g. a disconnect from the Internet).

Second, there is the likelihood of implementation. You, of all people,
understand vendor's reluctance to implement indirection. The only way to
force this implementation is through standardization. SOCATs would never
have come about were it not for Public Identifiers. Indirection, in turn,
would probably never have come about. On the Web, HTTP does standardize a
protocol for redirection, but the means of specifying an indirection is
not standardized in any standard.

Third, it is not proper that every reference to a system identifier should
require lookups in a variety of catalogs. That strikes me as a waste of
processing time. If the author has said explcitly "Here is where to find
this thing" then the system should not waste time trying to indirect it at
the source of the reference (in the processor) though it might do so at
the target of the reference (in the filesystem, at the HTTP server, etc.).
In other words, I think that the SYSTEM declaration in SOCAT is probably a
bad idea.

When I use a symbolic name, I recognize that I am invoking some (perhaps
expensiv) lookup process. When I use an address, I should not invoke such
a process. Of course, if any portion of the address IS a symbolic name,
then that portion will require a lookup, but the name as a whole should
not.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

http://www.capitol.state.tx.us/txconst/sections/cn000100-000400.html

"No religious test shall ever be required as a qualification to any
office, or public trust, in this State; nor shall any one be
excluded from holding office on account of his religious sentiments,
provided he acknowledge the existence of a Supreme Being."
                         - Texas Constitution, Article 1, Section 4

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Sun Sep 20 18:16:02 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
Message-ID: <36052527.C03F726C@technologist.com>

W. Eliot Kimber wrote:
> 
> Here I agree. Public identifiers are, conceptually, the same as URNs, that
> is, they are names that are intended to be indirected to their actual
> system ID, rather than being direct references to storage locations, as
> URLs normally are. However, as Dan Connoly and Tim B-L have argued, there's
> no *functional* difference between a URN and URL because persistence is
> always a function of the owner of the resource and cannot be guaranteed
> simply by the choice of name. Thus, at most, the URN/URL or public
> ID/system ID distinction can only express *intent*, it cannot guarantee
> results.

But this is not a precedent in XML. Similarly, we cannot guarantee that an
application, even a conforming one, will treat processing instructions and
comments differently. A conforming (but annoying) XML editor could remove
all processing instructions. But we know that an author would have used
processing instructions for a reason: to signal a particular intent. Thus
editors should not remove them.
 
> Thus, the unavoidable conclusion is that system IDs can be just as
> indirect, and just as persistent, as so-called "public" IDs.  The only real
> difference is what bit of software gets the value of the ID to resolve.

This is also the difference between processing instructions and comments.
In one, the XML processor has the right to interpret and remove the
construct and in the other, the application does.

> My conclusion at this point is that the URN/public ID distinction is not
> helpful because it merely confuses the issue without actually solving any
> problems. The only thing public IDs did was force vendors to provide *a
> way* to do name indirection, which you do need on brain-dead operating
> systems that lack something like symbolic links (which includes both VM/CMS
> and DOS/Windows). If operating-system filename indirection was a universal
> service, you'd just use that to manage redirection of entity storage IDs.
> At the time SGML was developed, it certainly wasn't universal and it may
> not have even been known outside of Bell Labs (I don't remember precisely
> when Unix went public).

Unix has been public since the 70s. Nevertheless, your "only thing public
IDs did...." is an odd statement. If I may paraphrase: "FPIs only provided
a reliable way to interchange SGML data between heterogenous systems for
the last 15 years, and will continue to for the next 5 that it takes
symbolic linking to become popular on Microsoft platforms." To me, the
word "only" is out of place in such a statement.
 
In a fantasy world where:

 * URNs are deployed and work
 * XML inter-document entity references allow multiple 
 * all major systems have reliable symbolic links
 * Redirection can be accomplished through reliable, well-defined
*documents*, not through HTTP server-specific magic

Public identifiers are no longer useful in XML. When that world comes
about, I will gladly get rid of them.

> But note that with the second edition of the SOCAT spec, you can remap
> system IDs just as you can public IDs. So even there, FPIs provide no
> unique facility, although the SOCAT mechanism itself does (redirection).

There are major differences:

First, there is the specification of intent: do you *intend* for this
thing to be redirected, because it is a public resource, or do you intend
for it to be a direct resource, that turns out to be redirected because of
some system limitation (e.g. a disconnect from the Internet).

Second, there is the likelihood of implementation. You, of all people,
understand vendor's reluctance to implement indirection. The only way to
force this implementation is through standardization. SOCATs would never
have come about were it not for Public Identifiers. Indirection, in turn,
would probably never have come about. On the Web, HTTP does standardize a
protocol for redirection, but the means of specifying an indirection is
not standardized in any standard.

Third, it is not proper that every reference to a system identifier should
require lookups in a variety of catalogs. That strikes me as a waste of
processing time. If the author has said explcitly "Here is where to find
this thing" then the system should not waste time trying to indirect it at
the source of the reference (in the processor) though it might do so at
the target of the reference (in the filesystem, at the HTTP server, etc.).
In other words, I think that the SYSTEM declaration in SOCAT is probably a
bad idea.

When I use a symbolic name, I recognize that I am invoking some (perhaps
expensiv) lookup process. When I use an address, I should not invoke such
a process. Of course, if any portion of the address IS a symbolic name,
then that portion will require a lookup, but the name as a whole should
not.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

http://www.capitol.state.tx.us/txconst/sections/cn000100-000400.html

"No religious test shall ever be required as a qualification to any
office, or public trust, in this State; nor shall any one be
excluded from holding office on account of his religious sentiments,
provided he acknowledge the existence of a Supreme Being."
                         - Texas Constitution, Article 1, Section 4

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Sun Sep 20 18:24:52 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
In-Reply-To: <360524E4.C63C7E92@technologist.com> from "Paul Prescod" at Sep 20, 98 10:53:08 am
Message-ID: <199809201631.MAA25471@locke.ccil.org>

Paul Prescod scripsit:

> If the author has said explcitly "Here is where to find
> this thing" then the system should not waste time trying to indirect it at
> the source of the reference (in the processor) though it might do so at
> the target of the reference (in the filesystem, at the HTTP server, etc.).

Sometimes, however, the user of the document knows better than the
author.  For example, a downloaded document with relative links can
benefit from an *ad hoc* catalog that expands them.  Similarly,
a *pro forma* SystemId on an overworked server may be usefully
converted to a local cached copy.

> In other words, I think that the SYSTEM declaration in SOCAT is probably a
> bad idea.

I take a more limited view:  "Avoid SYSTEM entries in publicly available
catalogs."

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at goon.stg.brown.edu  Sun Sep 20 18:38:11 1998
From: richard at goon.stg.brown.edu (Richard L. Goerwitz III)
Date: Mon Jun  7 17:04:52 2004
Subject: Public Identifiers
In-Reply-To: <199809201631.MAA25471@locke.ccil.org> from John Cowan at "Sep 20, 98 12:30:26 pm"
Message-ID: <199809201637.MAA16042@goon.stg.brown.edu>

> For example, a downloaded document with relative links can
> benefit from an *ad hoc* catalog that expands them.  Similarly,
> a *pro forma* SystemId on an overworked server may be usefully
> converted to a local cached copy.

ISPs, proxy servers, and your local machine all do caching.  If
a system ID doesn't change much, then setting reasonable expira-
tion times on it will help.  There is no need to work in another
resolution mechanism, in my opinion.

We need to try to make XML simpler, not more complex.  Just be-
cause PubIDs (barely) made it into the standard doesn't mean we
must immediately begin pushing the envelope of what they can do.
They're there primarily for legacy SGML compatibility.

Work would be better spent on URNs.

Richard Goerwitz

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From droddey at charmedquark.com  Sun Sep 20 21:39:31 1998
From: droddey at charmedquark.com (Dean Roddey)
Date: Mon Jun  7 17:04:52 2004
Subject: An invitation to build
Message-ID: <000001bde4cf$1a0350b0$6679a8c0@cqs_pdc.charmedquark.com>

Hi folks, I'm posting here (as my fully civilian self, just in case anyone
recognizes the name, since I have to be very careful here to keep the twain
separate and whatnot) to invite anyone who is interested in taking a look at
my C++ class libraries as the foundation for an XML parser in C++.

I probably won't have time to do one myself, and doing one would get me into
possibly sticky issues considering what I do for money and for whom I do it,
but my class libraries would be an awesome foundation to base one on. So
anyone who would like to do a freeware or shareware XML processor in C++ and
wants a powerful and modern class framework to build it on, please contact
me and lets talk about it. I can certainly offer plenty of advice though I
can't contribute to it directly myself and can't give you any insider
information ect...

The current release you'll find on my web site is NT and VC++ only, but the
next version (which I'm wrapping up now) will take it from 'easily portable'
to being able to easily support multiple platforms simultaneously. The
reference version I release for this next version will still be for NT only,
since getting all the other new goodies out there for existing customers is
important. But as soon as I get that out, I'll follow up quickly with a
Win98 version. And, as soon as Visual Age C++ 4.0 comes out for OS/2, I'll
have an OS/2 version out (actually it started its life on OS/2 many moons
ago.)

Ports to things like Linux I'll leave to others since I have no experience
in Unixy world, but the point is that your work will be very portable and
you won't be painting yourself into a corner or anything (though NT/VC++ is
a pretty big corner by itself :-)

CIDLib has all of the goodies you'd need. The only significant issues I can
think of are that the text streams only support ASCII/Unicode right now.
I'll definitely put UTF-8 on the next release wish list. Also there is no
URL class right now, so that goes on the wish list for the next release too.
But everything else you need for extremely serious development is there
(there will be about 300 back end oriented classes in the upcoming release.)

See my home page below for details (though you'll be seeing the old version
there.) The major (publically visible) changes since that posting are TCP/IP
support, regular expression engine, and MD5 support in the encryption
frameworks (DOM-Hash support maybe?).

So let me know if you are interested. Since I'm interested in getting
serious development going on CIDLib now, I want to find a couple of folks to
do serious non-commercial projects. In return for being my early adopters,
you'll get plenty of support and no licensing charges. Note that this next
release is going to be 1.0, but its been in progress for almost 7 years now
and had 6 beta releases and been ported once already, so its very mature and
well developed. Full source code is also available for free upon
registration, but of course you blessed folks will get that too of course.

If you find it the greatest thing since sliced bread and want to go
commercial, that's fine too though at that point you'd definitely come under
the licensing agreements (though they will be quite reasonable.)

Sorry if this sounded too much like an ad for this mailing list, but I want
to encourage someone to accept the challenge, and it really is just an
incredibly well designed, consistently architectured, powerful product. And
a server side XML parser engine is exactly the kind of thing that it would
be extremely well targeted towards. It does have GUI classes, but they are
only experimental right now. Its primarily back end oriented right now,
though of course it could be behind the scenes on the client side as well.

TIA.

--------------------------
Dean Roddey
The CIDLib Class Libraries
Charmed Quark Software
droddey@charmedquark.com
http://www.charmedquark.com

"100% Substance Free. Less Content, more cost. Just the way you like it"


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sun Sep 20 21:51:37 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:52 2004
Subject: Switching between DOM and SAX
Message-ID: <001a01bde4d0$0eb279b0$2ee044c6@arcot-main>

>Sure, but in SAX (event stream) to DOM conversion, you need to capture
>*all* the SAX events if you are to be able to satisfy the guarantees
>that the DOM model makes.  You can't just decided to start DOMifying
>at some random element, because by then you have forgotten what the
>parent element is.  You either have to reify the SAX events and store
>them as such, or else create the DOM Nodes on the fly whether the
>user claims to want them or not.


Right.  Lazy evaluation is not possible when building DOM using SAX.

Don


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Sun Sep 20 22:27:18 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:53 2004
Subject: URLs
Message-ID: <360563EB.139751D0@technologist.com>

All of this talk of identifiers has made me wonder if there is any good
reason that a URL in XML code should be restricted to "safe",
"non-reserved" characters. Wouldn't it make more sense to require the XML
processor to do the necessary escaping before transmitting the URL across
the wire? It seems simpler to let the processing software deal with it
rather than forcing the human to do so (the current case).

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Sun Sep 20 22:29:57 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:53 2004
Subject: Public Identifiers
References: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
	 <36027A49.D857AB1E@locke.ccil.org>
	 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
	 <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
	 <f5blnngcsq4.fsf@cogsci.ed.ac.uk> <3.0.5.32.19980919164312.0091ed80@dns.isogen.com>
Message-ID: <360561E9.5E2A2B3@technologist.com>

W. Eliot Kimber wrote:
> 
> But doesn't "persistent" mean "when I request a thing, I get one"?
> Persistence is defined by the resource owner--if I transfer ownership of
> drmacro.com to someone else and they serve it from a different machine with
> a different IP address, it's still drmacro.com if we say it is, and if we
> do, then the drmacro.com resource is persistent. 

There is a difference between YOU transferring ownership and INTERNIC
transferring ownership. Internic has no knowledge of what commitments
DRMACRO made, and thus will not require the new owner to live by them.
This isn't just a sociological issue, nor purely technical: it could have
serious legal ramifications.

> If we say "no, it's a new
> and different drmacro.com, then the resource isn't persistent.  But
> changing the IP address and ownership of the resource doesn't necessarily
> affect the persistence.

It's a different resource because the people who made the original
commitments (perhaps legally binding commitments) to make certain
resources available no longer have control over the resource and do not
have the opportunity to choose whether to continue to live up to their
commitments.

An FPI is persistent because ISO legally constracts to not reassign them.
This may mean nothing technically, but neither does the fact that the
American government legally asserts that e-commerce transactions based on
USD have value. If either ISO or the American government goes out of
business, their promises are worthless, but by then we'll have other,
bigger problems than our links breaking (which, I guess, is the real
point).
 
 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Sun Sep 20 23:24:47 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:53 2004
Subject: URLs
In-Reply-To: <360563EB.139751D0@technologist.com> from "Paul Prescod" at Sep 20, 98 03:22:03 pm
Message-ID: <199809202131.RAA03962@locke.ccil.org>

Paul Prescod scripsit:

> All of this talk of identifiers has made me wonder if there is any good
> reason that a URL in XML code should be restricted to "safe",
> "non-reserved" characters. Wouldn't it make more sense to require the XML
> processor to do the necessary escaping before transmitting the URL across
> the wire? It seems simpler to let the processing software deal with it
> rather than forcing the human to do so (the current case).

The trouble is that "http://whoever.net/foo?bar" doesn't mean the
same thing as "http://whoever.net/foo%3Fbar"; the latter induces the
server to look for a file named "foo?bar", whereas the former queries
the resource "foo" with the request "bar".

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Sun Sep 20 23:45:11 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:04:53 2004
Subject: SGML char entities XML-ized?
Message-ID: <3.0.32.19980920102155.00b23e80@pop.intergate.bc.ca>

Are there XML-ized versions of some of the character entity sets in 
the common vernacular?  I.e. the following:

 <!ENTITY % ISOlat2 PUBLIC
                       "ISO 8879-1986//ENTITIES Added Latin 1//EN">

 <!ENTITY % ISOlat2 PUBLIC
                       "ISO 8879-1986//ENTITIES Added Latin 2//EN">

 <!ENTITY % ISOgrk3 PUBLIC

Etc... I'm pretty sure I saw someone posting about them here. -Tim


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Mon Sep 21 00:07:23 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:53 2004
Subject: SGML char entities XML-ized?
Message-ID: <001201bde4e2$fe33c770$2ee044c6@arcot-main>

I think John Cowan has something like that on his XML page:

http://www.ccil.org/~cowan/XML/

Don Park
Docuverse

-----Original Message-----
From: Tim Bray <tbray@textuality.com>
To: xml-dev@ic.ac.uk <xml-dev@ic.ac.uk>
Date: Sunday, September 20, 1998 2:51 PM
Subject: SGML char entities XML-ized?


>Are there XML-ized versions of some of the character entity sets in
>the common vernacular?  I.e. the following:
>
> <!ENTITY % ISOlat2 PUBLIC
>                       "ISO 8879-1986//ENTITIES Added Latin 1//EN">
>
> <!ENTITY % ISOlat2 PUBLIC
>                       "ISO 8879-1986//ENTITIES Added Latin 2//EN">
>
> <!ENTITY % ISOgrk3 PUBLIC
>
>Etc... I'm pretty sure I saw someone posting about them here. -Tim
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Mon Sep 21 00:41:37 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:53 2004
Subject: String expressions in XSL
References: <018b01bde308$be486160$1e09e391@mhklaptop.bra01.icl.co.uk> <f5bn27xcs67.fsf@cogsci.ed.ac.uk>
Message-ID: <36058363.18BD50B1@technologist.com>

Henry S. Thompson wrote:
> 
> 2) The (first) XSL draft recommendation says quite clearly that an
> expression language will be provided:  we just weren't ready with one
> in time.  If you're keen to see string manipulation included, see
> point (1) above.

The spec. says that there must be some extensibility mechanism, but not
that it be an "expression language" in the sense that that word is used in
DSSSL and in the old XSL. James described something quite different that
he was thinking about in the XSL-List recently.

Or do you mean something much smaller by "expression language" than
"extension mechanism"? (i.e. a small set of non-programmable primitives)

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Mon Sep 21 00:56:34 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:53 2004
Subject: URLs
References: <199809202131.RAA03962@locke.ccil.org>
Message-ID: <36058566.14CED624@technologist.com>

John Cowan wrote:
> 
> The trouble is that "http://whoever.net/foo?bar" doesn't mean the
> same thing as "http://whoever.net/foo%3Fbar"; the latter induces the
> server to look for a file named "foo?bar", whereas the former queries
> the resource "foo" with the request "bar".

That's true, I should have thought of that. Of course URLs are a
string-based language, and the language has to have reserved characters.
But what of the characters that are "unsafe because gateways and other
transport agents are known to sometimes modify such characters". I was
thinking more about those. It seems that they should be handled
transparently, as MIME encoding is in a modern mail program. Of course ~
is usually bandied about despite its "unsafeness", but I would be
surprised if browsers are smart enough to encode it during transmission
for "safety."

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Mon Sep 21 01:56:05 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:04:53 2004
Subject: Public Identifiers
In-Reply-To: <360524E4.C63C7E92@technologist.com> (message from Paul Prescod
	on Sun, 20 Sep 1998 10:53:08 -0500)
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com> <360524E4.C63C7E92@technologist.com>
Message-ID: <199809202357.SAA02319@bruno.techno.com>

Regarding FPIs (formal public identifiers) and their role (or lack
thereof) in Web applications:

(Note: in the following, I use the term "namespace" in its generic
technical sense, without reference to XML "Namespaces".  As far as I
know, this note has nothing to do with XML "Namespaces".)

Will URNs permit pointing to things that aren't now and may never be
on the web?  I mean, things that their owners never intended to be on
the web and either that their owners do not want to appear on the web,
or that their owners may not (currently) see any interest in putting
on the web?

I ask this partly because one of the interesting things in the
forthcoming Topic Navigation Map standard is the use of FPIs to point
at so-called "public topics" -- topics that are identifiable by a name
in some namespace maintained by any arbitrary authority.  All you need
is an unambiguous way to point to the authority, the namespace
maintained (or the namespace that was once created) by the authority,
and the name in that namespace.

For example, to consider a certain obsolete farm implement as a topic:

Authority: Sears, Roebuck & Co.
Namespace: 1922 Farm Catalog Number
     Name: R205

According to the current Topic Navigation Map draft (soon to be CD
13250), this would appear as the following FPI:

-//Sears, Roebuck & Co.//NONSGML TOPIC 1922 Farm Catalog Number : R205//EN

Can URNs do that?  I sure wouldn't want XML to be unable to do this
kind of thing.  If it couldn't, that would rule out the use of public
topics in XML-based topic maps.  Public topics are very useful for
correlating the knowledge contained in disparate topic maps, so the
concept of "public topics" seems pretty important to me.

You may ask, "What does non-web information have to do with the Web?"
Good question.  Personally, I think it has plenty to do with it, but I
suspect others might disagree with me.  I would venture to say that,
even today, a significant fraction of all FPIs are not intended to be
resolved, but rather simply to sit there and be pointers, documenting
the sources of authoritative material that bears on the actual online
content, but to which direct access is not needed in order for
applications to run.  One example is the FPIs of Architecture
Definition Documents in Base Architecture Declarations.  

[To HyTime aficionados, using other words, I would say that one of the
most important applications of FPIs is in lieu of biblocs.  (A HyTime
"bibloc" is a pointer to an offline resource; this facility allows
pointing to things that belong to authorities that have not endowed
them with online addresses of any kind.)]

I'm not just being provocative here; I'm really interested to hear
what the readers of this list have to say about this.  Part of the
issue is whether and how to honor the reality that people and
institutions may be regarded as authorities and keepers of
online-significant namespaces, whether or not they want to be so
regarded, and whether or not their namespaces actually exist online.
The existence of FPIs that identify namespaces that are not online
could, in sufficient numbers, and in applications of sufficient
economic importance, actually have the effect of bringing such
namespaces online.  If we take that view that the purpose of W3C
standards is to enhance human productivity by increasing the
availability of knowledge, then it's clearly desirable to have this
kind of bellwether indicator of business opportunity.  It seems to me
that we should consider the ability to reference names in offline
namespaces a requirement for XML, and so I'm glad that public
identifiers exist in XML.  Please let's not deprecate FPIs; instead,
let's understand and celebrate the difference between FPIs and URNs,
even if/when URNs are terrifically indirect.  For me, the essential
difference between URIs and FPIs has nothing to do with any particular
scheme of indirect addressing, cataloging, or algorithm for
resolution.  On the contrary, FPIs remain essential to XML precisely
because URIs, including URNs, are really system addresses, where the
system is the Web, if we consider the Web as including some array of
standardized URI resolution facilities.  FPIs are different from URIs
precisely because, for FPIs, no machine-executable resolution
algorithm is standardized, specified, or even necessarily understood,
and it's useful and vital to be able to reference things in such a
fashion.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 21 02:36:40 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:53 2004
Subject: Public Identifiers
Message-ID: <199809210043.UAA10167@locke.ccil.org>

Steven R. Newcomb scripsit:

> Will URNs permit pointing to things that aren't now and may never be
> on the web? I mean, things that their owners never intended to be on
> the web and either that their owners do not want to appear on the web,
> or that their owners may not (currently) see any interest in putting
> on the web?

Clearly yes.  RFC 1737, "Functional Requirements for Uniform Resource Names",
says:

# A URN identifies a resource or
# unit of information.  It may identify, for example, intellectual
# content, a particular presentation of intellectual content, or
# whatever a name assignment authority determines is a distinctly
# namable entity.  A URL identifies the location or a container for an
# instance of a resource identified by a URN.  The resource identified
# by a URN may reside in one or more locations at any given time, may
# move, or may not be available at all.

Note especially the last phrase.

> -//Sears, Roebuck & Co.//NONSGML TOPIC 1922 Farm Catalog Number : R205//EN

Does this refer to an actual gadget, the class of such gadgets, or
the description of it? The "EN" suggests that it refers to the
description only.

> Please let's not deprecate FPIs; instead,
> let's understand and celebrate the difference between FPIs and URNs,
> even if/when URNs are terrifically indirect.

RFC 1737 contemplates FPIs as a particular case of URNs:

# For example, ISBN numbers, ISO
# public identifiers, and UPC product codes seem to satisfy the
# functional requirements, and allow an embedding that satisfies
# the syntactic requirements described here.

A suitable URN representation of the above FPI would be:

urn:fpi:-%2E%2ESears,%20Roebuck%20&%20Co.%2E%2ENONSGML%20TOPIC%201922%20Farm%20Catalog%20Number%20:%20R205%2E%2EEN

encoded to remove illegal characters (namely spaces and slashes).
It's also necessary to encode "#", "?", and "%" when they appear in FPIs.
These rules are documented in RFC2141, "URN Syntax".

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 21 02:45:09 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:53 2004
Subject: Socat issues for XML
Message-ID: <199809210051.UAA10435@locke.ccil.org>

I've just finished the first cut of my implementation of Socats
as a SAX EntityResolver, and two points have come up:

1) With the minor exception of notation declarations, every public id
in XML has an accompanying system id.  Therefore, the OVERRIDE entry
does not make sense: it must be ignored, and the default must be YES
rather than NO.

To explicate OVERRIDE: in SGML Socats, OVERRIDE NO means that when
an explicit system id is present, the catalog entries are ignored;
OVERRIDE YES means the catalog entries override an explicit system id.

2) As another consequence of system ids being always present and always
URLs, a usable Socat implementation must not search the whole
public catalog space for SYSTEM entries.  When should the search stop?
In some sense "when going offsite", but just when is that?
Any suggestions?

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 21 04:42:33 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:53 2004
Subject: Switching between DOM and SAX
In-Reply-To: <001a01bde4d0$0eb279b0$2ee044c6@arcot-main>
References: <001a01bde4d0$0eb279b0$2ee044c6@arcot-main>
Message-ID: <13829.47860.433932.90549@localhost.localdomain>

Don Park writes:

 > >Sure, but in SAX (event stream) to DOM conversion, you need to
 > >capture *all* the SAX events if you are to be able to satisfy the
 > >guarantees that the DOM model makes.  You can't just decided to
 > >start DOMifying at some random element, because by then you have
 > >forgotten what the parent element is.  You either have to reify
 > >the SAX events and store them as such, or else create the DOM
 > >Nodes on the fly whether the user claims to want them or not.
 > 
 > 
 > Right.  Lazy evaluation is not possible when building DOM using SAX.

The problem is really with the XML parsing model as much as with SAX
-- XML is designed to be parsed only from the top down, though in the
past I have suggested ways that the parsing could be parallelised.  It 
is easy to imagine lazy evaluation on top of a database, but very hard 
to imagine it from a physical XML document.

That said, I can think of two possible solutions to the
lazy-evaluation problem, including the one which Don has already
named:

1. Cache the event stream in a compact format.

2. Reparse the document on demand (i.e. when the user climbs out of
   the subtree and/or tries to do something with the document node) -- 
   in this case, you need only store the treeloc of the element.

The first solution is still greedy, but would be reasonably fast and
less resource-hungry than a naive DOM implementation.  The second
solution could be very slow when someone tried to climb out of a
subtree, but it would not make great demands on memory.

Essentially, you should choose #1 if you think that users are likely
to need to climb out of the subtree fairly often, and #2 if you think
that it will be rare.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 21 05:22:17 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:53 2004
Subject: Public Identifiers
In-Reply-To: <360561E9.5E2A2B3@technologist.com>
References: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
 <36027A49.D857AB1E@locke.ccil.org>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
 <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
 <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
 <3.0.5.32.19980919164312.0091ed80@dns.isogen.com>
Message-ID: <3.0.5.32.19980920215503.0095e870@dns.isogen.com>

At 03:13 PM 9/20/98 -0500, Paul Prescod wrote:

[...]

>An FPI is persistent because ISO legally constracts to not reassign them.
>This may mean nothing technically, but neither does the fact that the
>American government legally asserts that e-commerce transactions based on
>USD have value. If either ISO or the American government goes out of
>business, their promises are worthless, but by then we'll have other,
>bigger problems than our links breaking (which, I guess, is the real
>point).

I think we need to be careful what thing we're talking about when we use
the term "persistent": the name or the resource.

A name is persistent if it is never re-used, that is, if once assigned, it
always gets you to the "same" thing and, if that thing ceases to exist,
gets you to nothing.  A thing is persistent if it exists for as long as we
care about its existence. 

SGML Formal public identifiers are not necessarily persistent names because
there is nothing in ISO 8879 or ISO 9070 that requires them to be (nor
could such a requirement be enforced or validated).  All that ISO 9070
provides is a process for registering *owner identifiers*, which are,
presumably, persistent (at least as defined by the assigning body).
However, the name owner is responsible for managing the names within their
slice of the FPI name space and can do whatever they want with them,
including reassigning them without regard for persistence at all.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Mon Sep 21 05:48:42 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:04:53 2004
Subject: Do you or Dont you buy Tim Bray's Namespace Validation Algorithm?
References: <199809101953.OAA02029@foyt.indyrad.iupui.edu>
Message-ID: <3605CE8E.337771F0@mecomnet.de>

greetings,

please excuse my delay in responding to this.
i was away for several weeks.

"tim's" algorithm does work. it is analogous to the mechanisms used to manage
interned symbols in symbolic processing systems. (eg lisp).

in fact, there is no literal "rewrite" required. if you manage the symbols
properly, sets of symbols are "prefixed" - not individual symbols. the
"rewrite" (ie the binding 
/scoping rules) is done by augmenting, changing, etc the names for the sets of
symbols. auxiliary names are generated to ensure uniqueness and serve as the
prefixes should the dom/combined-dtd be reserialized.

the mechanism does require - as noted in the past, that a means be provided to
bind prefixes to uri's with a scope which extends over a dtd. contrary to
other observers, i see nothing wrong with using a pi for this. although it is
not within the scope of xml1.0+namespaces, the pi method is not precluded by same.

Mark Tucker wrote:
> 
> Wait, People,
> 
> 
>         I don't see anything kludgy in it. (modulo my preference to
>         use expanded names directly in the processor's symbol table.)

i agree; this would be kludgy. names are better managed as elements of named sets.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Mon Sep 21 06:14:50 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:53 2004
Subject: SGML char entities XML-ized?
Message-ID: <00dd01bde516$ba7df4e0$e76118cb@caleb>

-----Original Message-----
From: Tim Bray <tbray@textuality.com>

>Are there XML-ized versions of some of the character entity sets in
>the common vernacular?

They were at my old site for the last year (courtesy of Rick Jelliffe) but
someone recently pointed out I'd dropped them in my transfer.

You can now find them at:

    http://www.schema.net/entities/

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Mon Sep 21 06:16:44 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:04:53 2004
Subject: Summary of Namespaces and Validation
References: <199809101630.LAA27616@foyt.indyrad.iupui.edu>
Message-ID: <3605D4D6.A0C6D2C0@mecomnet.de>

Mark Tucker asked:
> 
> Well,
> 

> *****************************************
>         Camp 3 : Implemented Namespace Aware Code
> *****************************************
> 
>                         What does code out there do?

one of the xml processors which is available with the cl-http server handles
namespaces with a mechanism similar to that which tim has described here. the
names are managed as symbols interned in packages. the packages are given two
names with indefinite extent - the uri and a unique name which is used as a
prefix when reserializing. in addition to these static bindings, both
attribute-based and pi-based bindings are supported - the first to be
conformant, the second in order to support dtd-based validation and attribute
defaulting. these bindings have dynamic scope and extent in the parsing process.

during the parse all names to be retained as part of the dom/dtd are interned
in the package which is named by the respective prefix. the prefixes are not retained.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 21 06:20:21 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:54 2004
Subject: Public Identifiers
In-Reply-To: <36052527.C03F726C@technologist.com>
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
Message-ID: <3.0.5.32.19980920231815.00963d00@dns.isogen.com>

At 10:54 AM 9/20/98 -0500, Paul Prescod wrote:

>If I may paraphrase: "FPIs only provided
>a reliable way to interchange SGML data between heterogenous systems for
>the last 15 years, and will continue to for the next 5 that it takes
>symbolic linking to become popular on Microsoft platforms." To me, the
>word "only" is out of place in such a statement.

I would argue that it has not in fact ever been generally possible to
interchange SGML data among heterogeneous systems in the way I think you
mean.  If you send me a package of SGML entities, it is up to me, the
receiver, to make sense of them, including reworking any entity
declarations and/or public identifier mappings that may be necessary. I
spent the last years of my tenure at IBM transferring SGML documents
between OS/2, DOS, and VM/CMS systems, and it was non-trivial to manage.
To make it work, I had to set up what was essentially a homogenous system.
The only hope SGML ever had for interchange was the SDIF mechanism, which
defines a standard for packaging entities together so that automatic
processes can unpack them on the target system--but even there the standard
assumes rewriting of entity declarations to update system identifiers.
Unfortunately, nobody has ever fully implemented SDIF in a
publicly-available system (even though it shouldn't be that hard to do and
would be mighty useful if done).  [NOTE: ISO 9070 *DOES NOT* require the
use of ASN1. Do not be fooled. You can use any mechanism you want for
representing the package, even tar or Zip.]

The idea that you might be able to interchange documents that refer to
public entities (that is, entities that are somehow publicly available) is
a nice one, but without a generally-available networking system for
accessing those entities, it's only an idea.  Today, only URLs come close
to providing a useful way to name truly-public entities.  Which is not to
say that URLs are the best choice, just that they're our only option at the
moment.

It is the use of entity declarations that provides even a hope of
interchange for SGML, not formal public identifiers.  Public identifiers
(formal or not, doesn't matter) help by giving you the option of being even
more indirect but that's not requirement for deriving most of the benefit
from entity declarations (centralizing the mapping from references to
storage objects in the document instance to the storage objects
themselves--that is, avoiding "embedded filenames" in instances).  

Paul is right that if SGML hadn't required some form of indirection,
vendors never would have provided it, certainly not with the level of
consistency we have today with SOCATS. But even there, I don't have a
complete solution, because not all useful tools support SOCATs and not all
support the latest version (ADEPT*Editor, for example, only supports the
first version of the SOCAT spec, while SP supports the second).

Steve Newcomb asks if there is a difference between FPIs and URNs generally
and the answer from John Cowan was, correctly, "no, there's no difference".
 Public identifiers, and formal public identifiers in particular, are just
a special case of URN. They have no unique properties beyond those of URNs
generally (except, see below). They are not magic. They do nothing special
(except part you with 80 or 90 US dollars if you want to have a registered
owner name that is not an ISBN publisher prefix).  ISO 9070 does
standardize owner name registration mechanisms and there are three such
currently implemented: ISBN numbers, ISO registered owner names
(administered by the GCA, see <www.gca.org>), and Internet domain names
(with TC 2 to ISO 8879).  This has value because it does provide a pretty
solid infrastructure for management of name ownership, one of the
requirements for URNs generally.

That said, I must admit that Paul's arguments, along with things others
have said, have made me rethink my original statement that there's no
useful distinction between URLs and URNs (but see below).  It is still true
that URLs can be just as persistent as URNs. However, it is also the case
that we need a formal mechanism for associating names with name spaces,
which is what URNs do. URLs have a built-in name space, namely the universe
of resources on the Web (which is tautologically defined by those things
you can address by URL, but no matter).

In other words, we need to be able to say where to go to look up a name. It
doesn't matter how direct or indirect that lookup is.  Indirection isn't
the issue (because we always have some amount of it, regardless of the
addressing scheme--even a phone number is an indirection even though we
tend to think of it as a direct address).  It is always up to the machine
doing the resolution of names in a particular space to provide appropriate
optimizations--we shouldn't care what they might be when we specify a
pointer to something.

Thus, the concept of URN as a binding of name-space name to name is very
useful, in fact, essential.  Because we need to be able to point to things
that exist in different universes (as Steve wants to do in his Topic Map
example) and we want different ways of naming things (FPI, ISBN number,
URL, etc.).

But...

I think that several errors of design have been made getting here:

1. The expectation a name engenders as to its persistence is a function of
the name, not its use. Therefore, the PUBLIC/SYSTEM distinction made by
SGML (and XML) is inappropriate as a matter of syntax.  A name is a name
and there should be exactly one declared for each entity.  Within an SGML
context, the formal system identifier mechanism (Annex A.6 of ISO/IEC
10744, see
<ftp://ftp.ornl.gov/pub/sgml/wg8/document/n1920/html/clause-A.6.html>)
could be used to distinguish formal public IDs from other forms of name, e.g.:

<!-- Declare notations that represent storage managers: -->
<?IS10744 FSIDR IS9070>

<!-- Declare a storage manager, in this case, formal public identifiers: -->
<!NOTATION IS9070 SYSTEM "ISO 9070//DOCUMENT ...//EN" >

<!-- Now declare an entity that uses that storage manager: -->
<!ENTITY foo SYSTEM "<is9070>+//IDN drmacro.com//..." NDATA SGML >

2. URLs are a special case of URN. Thus the term URI, meaning "URN or URL"
is unnecessary and misleading. There are only URNs, of which URL is a
special case where the prefix "urn:url:" can be omitted.  URLs can be
recognized because they don't start with "urn:", which all other URNs must.
 URLs are really an optimization of URN where the name space resolver is
already known and all Web browsers must know how to resolve URLs (thus
there's no need to apply the more general "look up the name space resolver"
mechanism you must use with any other form of URN).  If this design had
been used from the start on the Web, then "urn:url:http://www.drmacro.com"
would be recognized by all Web clients.  

Of course, URLs have this special status only within the context of Web
browsers and data formats that give special meaning to the syntactic things
that hold URLs (e.g., the "href" attribute of HTML).  Outside this context,
a URL would be no more privileged than anything else. In a different
context, other forms of names could be privileged (as public IDs are in an
SGML context).

Finally, note that URNs as currently defined are simply *a syntax* (of an
infinite possible number of syntaxes) for representing the binding of
name-space to name.  The formal system identifier example above is another
and my suggestion of a few days ago for a <urn:name> element is a third.
The current URN syntax is appropriate for use in HREF attributes, but it
shouldn't be seen as the one and only way to do this binding.  URN
resolution mechanisms should be independent of the syntax used for the
binding--they should simply expect two arguments, a name-space name and a
name in that name space. How the client that makes the resolution request
gets those two arguments is its business.  Particular data representations
can then define their own conventions for representing the binding, whether
it's the current URN syntax or something different.

3. We've confused the persistence of names with the persistence of
resources, which has lead us to think that URLs (and system IDs) are
somehow fundamentally different from URNs (and public IDs).  We've set the
expectation that the naming method can solve problems when in fact it
can't. The evidence that this expectation has been set is the fact that
everything I read about so-called "persistent names" has gone out of its
way to stress that names alone can't guarantee persistence. They wouldn't
have to say this if people didn't expect it to be the case.

Given that my analysis is correct, here's what I'd like to see happen:

1. A general recognition of the need for name-space/name bindings in data
representation standards, regardless of the kind of data.  If these
bindings are further standardized along the URN lines (its semantics, not
its syntax, necessarily), so much the better.

2. Given item (1), data management systems (including operating systems and
networking systems) providing generalized name-space-to-resolver services
that reflect the general approach defined by item (1).  For Internet-based
resources, the DNS proposal is probably appropriate and reasonable.

3. Web clients upgraded to accept "urn:url:" as a prefix to otherwise
normal URLs.

4. People and enterprises providing non-URL name resolution servers.  These
could be along the lines of the PURL services currently being provided (and
could probably be implemented with the existing PURL software).  For
example, Oasis could fund a couple of public identifier servers.  Note that
these services needn't be free--it costs money to maintain machines and it
would be reasonable to charge people who wanted to provide published names
for their resources a reasonable fee for it.

And now, having said that SGML formal public identifiers have no special
properties, let me point out that the fact that registered formal public
identifiers are registered means that you could use owner names to direct
public ID resolution to servers maintained by the name owner, rather than
relying on a central FPI resolution server (that is, "DNS for FPIs"). If I
understand the DNS-for-URN resolution proposal (which I very well may not,
not being an Internet expert by any stretch), the ability to do this is
inherent in the proposal.

If we could do these things, and none of them seem to me to be that
onerous, then we would, I think, be well on our way to realizing the dream
of "universal names" with some hope that persistence, whatever you want
that to mean, could be provided by those that care to. [As Robin Cover
pointed out in private mail to me, we will always be dependent on human
nature for these systems to work, and it is not always human nature to
provide persistence for names, at least not outside the scope of your own
Web server.]

Cheers,

Eliot
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Mon Sep 21 07:02:29 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:54 2004
Subject: SGML char entities XML-ized? AND Re: Public Identifiers
Message-ID: <010001bde51d$63eb01c0$e76118cb@caleb>

Bringing two threads together...

I've put up a catalog at

    http://www.schema.net/public-text/catalog.soc

that contains entries for each of the entity sets available at schema.net
Just DELEGATE your local catalog to the above and you can use the FPIs for
those entity sets.

Furthermore:

1) All public text I make available at schema.net will have an entry in the
catalog (with an IDN FPI assigned, if necessary). If you want somewhere to
house your public text, send it my way.

2) I am more than willing to include a DELEGATE to others who want to use an
FPI for their XML-related entities (whether they be documents, DTDs or
entity sets) and maintain a catalog of their own.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Mon Sep 21 08:00:25 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:54 2004
Subject: "OO" Schemas (with examples!)
References: <19980917002535.9401.qmail@veosystems.com>
Message-ID: <3605DD9A.3C16DD18@technologist.com>

matt@veosystems.com wrote:
> 
> The exceptions, like C++'s private inheritance, are at
> the margins.  In what language that you are aware of does inheritance
> or subclassing not imply substitutability?

I've been thinking a little further:

I think that it is good to allow inheritance (attribute and content model
sharing) without substitutability (the ability to plug elements of one
type in for elements of another). But since inheritance is always a labour
saving device, I can just avoid it when I don't want substitutability.
This will encourage bad design in some cases (as it does in Java), but
that's okay, we could add "private inheritance" later, as you could in
Java.

To make this concrete, consider these statements:

<!ELEMENT A (EMPTY)>
<!ATTLIST A ATTR CDATA #IMPLIED>

<!ELEMENT B INHERITS-FROM A>

<!ELEMENT C (A)>

Questions:

Inheritance: Does B have an ATTR attribute?
Substitutability: Can a C contain a B? 

The answer to the former is probably "true" in a reasonable inheritance
system for XML types. B got a property (the existance of an attribute)
"for free" from A. This is a nice feature that eases maintenance and
reduces code duplication. The answer to the latter is more complicated. If
you say yes, then you are saying that inheritance implies
substittuability, like Java, but unlike C++, Smalltalk, etc. If you say
no, then you are saying that inheritance and substitutability are two
different features and are more or less unrelated, except that they often
work together.

I am saying that I prefer the second answer, but if the first answer wins
out then I can just avoid inheritance when I don't mean to imply
substitutability, and just declare the attributes twice. Similarly, in
Java I can build proxies to implement code reuse without inheritance. I
can live with that.

The really important thing is that substitutability should not require
inheritance. Note that Java (C++, Smalltalk, etc.) do not make this
mistake. In Java, substitutability can be defined with interfaces so that
no particular implementation is required or implied. This is just as
important for documents. We are very, very close to having a clear
language for describing substitutability, but we must not get distracted
by the (comparatively unimportant) issue of inheritance. Let's forget
about inheritance for a second, get substitutability right, and we can go
back and fiddle with code saving devices later! We must especially not
confuse the two! 

Consider, for example, an element that wants to be substitutable for an
HTML A element. Perhaps it is a special hypertext bibliography element
that gets its URL from a child element named URL:

<BIBLIO>
	<ISBN>...</ISBN>
	<PUBLISHER>...<PUBLISHER>
	<URL>http://....</URL>
</BIBLIO>

We can easily translate this to an A HREF element using XSL's
transformation language:

<A href="{URL}">
	<B><xsl:process select="ISBN"/></B>
	<I><xsl:process select="PUBLISHER"/></I>
</A>

Thus, this:

<BIBLIO>
	<ISBN>XXXXXX</ISBN>
	<PUBLISHER>Prentice Hall<PUBLISHER>
	<URL>http://....</URL>
</BIBLIO>

would become:

<A href="http://....">
	<B>XXXX</B>
	<I>Prentice Hall</I>
</A>

The implications of this for e-commerce and document interchange are be
clear. You can define these "substituability mappings" into schemas
created by other people, and just "plug in" elements named according to
your own vocabulary. You can even use a schema to verify that anything
that conforms to your vocabulary will come out of this mapping conforming
to the containing application's schema.

But note that I've done no "inheriting" here. My BIBLIO type got no
property from the A type. In fact, it may have been invented completely
independently. I can "impose" substitutability on a type from outside,
just as I can in an OO language with a "proxy." The XSL rule is my proxy.
I fear that these important features will get lost in the rush to
"inheritance".

So what I ask is that you recognize that a) substitutability without
inheritance is at least as important as in in Java, b) as long as you are
going to provide that, you might as well provide inheritance without
substituatability and c) we need advanced kinds of substitutability such
as the one described above.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Mon Sep 21 11:18:33 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:54 2004
Subject: URLs
In-Reply-To: <36058566.14CED624@technologist.com>
References: <199809202131.RAA03962@locke.ccil.org> <36058566.14CED624@technologist.com>
Message-ID: <wkogs94yg6.fsf@ifi.uio.no>


* Paul Prescod
|
| Of course ~ is usually bandied about despite its "unsafeness", but I
| would be surprised if browsers are smart enough to encode it during
| transmission for "safety."

They are not.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Mon Sep 21 11:20:24 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:54 2004
Subject: XSchema: Element references and namespaces
Message-ID: <199809210918.LAA12494@berlin.dvs1.tu-darmstadt.de>

I was writing section 4 (conversions) yesterday and realized we have a problem 
referring to elements by name from Ref, AttDef, and AttGroup elements.  For 
example:

   <ElementDecl Name="foo" ns="http://myfoo.org" prefix="myfoo">
      ...
   </ElementDecl>

   <ElementDecl Name="foo" ns="http://yourfoo.org" prefix="yourfoo">
      ...
   </ElementDecl>

   <ElementDecl Name="refersToFoo">
      <Model>
         <Ref Element="foo"/>
      </Model>
   </ElementDecl>

Which foo does the Element attribute in the content model of refersToFoo refer 
to?  I see two solutions to this:

1) Refer to elements by ID.  I don't like this from a usability standpoint -- it 
is far more natural to refer to elements by name.

2) Add nsElement and (possibly) prefixElement attributes to Ref, AttDef, and 
AttGroup.  This matches our namespace implementation, which uses a two-part 
naming system (Name + ns).

Note 1: We cannot name these attributes ns and prefix, as those already exist on 
AttDef and AttGroup and apply to the value of the Name attribute.

Note 2: prefixElement is not strictly necessary -- you could get the prefix from 
the ElementDecl, AttGroup, or AttDef.  Although it makes it possible for the 
user to introduce conflicting prefixes, it also makes conversion to a DTD much 
easier.  Because references can occur before declarations, you can't guarantee a 
successful prefix lookup when you need it.  Thus, you must either make two 
passes over the XSchema or build an in-memory tree before outputting anything.  
I would rather add prefixElement and have the checker check that it matches the 
relevant prefix.

Note 3: Can nsElement inherit from ns and prefixElement inherit from prefix?  In 
many (most?) cases, the content model of elements will refer to elements in the 
same namespace as the element being defined and attributes will belong to the 
same namespace as the element to which they apply.  Thus, it would be nice to 
omit nsElement and just apply ns from the enclosing ElementDecl, AttGroup, or 
AttDef.  However, inheriting from one attribute to another is definitely not in 
the spirit of XML and makes me a bit queasy.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Mon Sep 21 11:32:32 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:54 2004
Subject: XSchema review ends 9/23/98
Message-ID: <199809210928.LAA12604@berlin.dvs1.tu-darmstadt.de>

This is just a reminder that the (almost) final review of sections 1-3 and 
appendixes A, B, and D of XSchema ends 9/23/98.  You can find the current spec 
at:

   http://www.simonstl.com/xschema/spec/xscspecv2.htm

Based on comments so far, it looks like we will make the following changes:

* Add an Enumeration element above EnumerationValue and (?) add Enumeration to 
the content model of XSchema.  This improves grouping of enumerations and allows 
for easier reuse.

* Eliminate Model from the content model of Model, but leave it in Choice and 
Seq.  There seems to be no good reason for directly nested Models.

* Add More and Doc to the content model of UnparsedEntity.  This matches the 
content model of Notation and was an oversight in the original design.

* Clarify that if an XSchema document does not use prefixes on any elements 
except More and Doc (and always uses XSC: for them), a non-namespace-aware 
processor can interpret the document.  This is possible because the document 
will match the XSchema DTD and because there are no GI collisions between 
XSchema and IBTWSH.

* Somehow resolve namespaces when referring to elements from Ref, AttGroup, and 
AttDef (see separate mail).

Any changes we make will be open for another week of review, but everything else 
will be frozen.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Mon Sep 21 13:38:51 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:04:54 2004
Subject: Public Identifiers
In-Reply-To: <199809210043.UAA10167@locke.ccil.org> (message from John Cowan
	on Sun, 20 Sep 1998 20:43:28 -0400 (EDT))
References: <199809210043.UAA10167@locke.ccil.org>
Message-ID: <199809211142.GAA00670@bruno.techno.com>

Thanks, John Cowan, for your clear answer, but the language you cite
from RFC 1737 only re-opens the question and casts doubt on your
answer, at least in my mind.

> From: John Cowan <cowan@locke.ccil.org>
> Date: Sun, 20 Sep 1998 20:43:28 -0400 (EDT)
> 
> Steven R. Newcomb scripsit:
> 
> > Will URNs permit pointing to things that aren't now and may never be
> > on the web? I mean, things that their owners never intended to be on
> > the web and either that their owners do not want to appear on the web,
> > or that their owners may not (currently) see any interest in putting
> > on the web?
> 
> Clearly yes.  RFC 1737, "Functional Requirements for Uniform Resource Names",
> says:
> 
> # A URN identifies a resource or
> # unit of information.  It may identify, for example, intellectual
> # content, a particular presentation of intellectual content, or
> # whatever a name assignment authority determines is a distinctly
> # namable entity.  A URL identifies the location or a container for an
> # instance of a resource identified by a URN.  The resource identified
> # by a URN may reside in one or more locations at any given time, may
> # move, or may not be available at all.
> 
> Note especially the last phrase.

The problems with the quotation from RFC 1737 are these:

* Who defines what constitutes a name assignment authority?  If it's
  the end user, in an ad hoc fashion, that's fine, I'm satisfied.  But
  the language of the RFC, 

> # For example, ISBN numbers, ISO
> # public identifiers, and UPC product codes seem to satisfy the
> # functional requirements, and allow an embedding that satisfies
> # the syntactic requirements described here.

  ...indicates otherwise.  Here, in all three examples, there is a
  name registration authority; the end user is evidently not allowed
  to specify the Sears 1922 Farm Catalog unless this has already
  become a formally-cataloged public entity of some kind.  (Note that
  it's not clear whether "ISO public identifiers" means "public
  identifiers in ISO syntax" or "public identifiers that begin with
  the letters 'ISO' and that define public text entities that were
  created under the auspices of the ISO".  There is a very small set
  of the latter -- a set that has little or nothing to do with what
  I'm concerned about here.)

* There is an even more problematic statement: "A URL identifies the
  location or a container for an instance of a resource identified by
  a URN."  This strongly implies that there must be a URL behind every
  URN, even if that URL is fictitious or doesn't happen to work.  It
  also implies that there is some sort of locatable online container.

> These rules are documented in RFC2141, "URN Syntax".

The fact that you make such a point of demonstrating the conversion of
my FPI into a URN makes me wonder whether you understood my question.
In the scenario I'm asking about, there is never any need to transmit
the FPI, so there's no need to convert it.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From deke at tallent.com  Mon Sep 21 16:46:25 1998
From: deke at tallent.com (Deke Smith)
Date: Mon Jun  7 17:04:54 2004
Subject: Mix encodings in a document?
Message-ID: <1305751328-169403786@tallent.com>

I think I know the answer I am going to get, but I'll ask anyway.

Within a single XML document, is it possible to have the text encoding 
change from element to element? 

For example:

<?xml version="1.0"?>
<PHRASES>
<PHRASE encoding="ISO-8859-1" xml:lang="en">Hello!</PHRASE>
<PHRASE encoding="X-EUC-TW" xml:lang="zh-TW"><!--chinese language text 
here--></PHRASE>
</PHRASES>

At the least, I can imagine XML browsers and parsers will cough up a hair 
ball on this. My feeling is that this should NOT be valid, but I don't 
know for sure. The way I see that the specs allow for this is for the 
character encoding to be UTF-16 for the whole document:

<?xml version="1.0" encoding="UTF-16"?>
<PHRASES>
<PHRASE xml:lang="en">Hello!</PHRASE>
<PHRASE xml:lang="zh-TW"><!--chinese language text here--></PHRASE>
</PHRASES>

Deke


-----------------------------------------------------------------
Deke Smith
Tallent Communications Group, Brentwood TN
deke@tallent.com, 615-661-9878
-----------------------------------------------------------------
" The best way to predict the future is to invent it. " 
       - Alan Kay 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Mon Sep 21 16:54:55 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:54 2004
Subject: http://www.scripting.com/98/09/News21.xml
In-Reply-To: <199809210918.LAA12494@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3.0.5.32.19980921075615.00c7fe50@scripting.com>

I'm working on my server today, watching hit counts on various pages, and
note that there are only a couple of other servers reading our daily XML
news pages. 

It would be great if other people would use our feed, we're the first to do
it, certainly at some point Reuters or AP is going to do it, or CNN, or the
NY Times, or... We went first because we wanted to understand the
opportunity, and to help solve the chicken and egg problem.

Anyway, if you're looking for XML files to test with, we have them. 

Today's file is at:

http://www.scripting.com/98/09/News21.xml

They go back to April 1997 and will be automatically created by our content
management system as long as www.scripting.com is operating. 

The files are updated in synch with the HTML version, usually about 10-20
times a day. We cover scripting, US politics, web development, and (of
course) XML.

I encourage other people to use this resource. Please ask questions if it's
not clear what we're doing. Let's have fun!

Dave Winer


--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Mon Sep 21 17:10:50 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:54 2004
Subject: Mix encodings in a document?
Message-ID: <003601bde572$5ded30e0$e36118cb@caleb>

-----Original Message-----
From: Deke Smith <deke@tallent.com>

>I think I know the answer I am going to get, but I'll ask anyway.
>
>Within a single XML document, is it possible to have the text encoding
>change from element to element?

The way you've phrased the question, the answer is yes, but given your
examples, I suspect you are really asking whether is it possible to have the
text encoding change WITHIN A SINGLE ENTITY, in which case the answer is no.

There is nothing to stop you having

<?xml version="1.0"?>
<!DOCTYPE PHRASES [
<!ENTITY phrase2 SYSTEM "phrase2.xml">
]>
<PHRASES>
<PHRASE encoding="ISO-8859-1" xml:lang="en">Hello!</PHRASE>
&phrase2;
</PHRASES>

where phrase2.xml is

<?xml encoding="X-EUC-TW"?>
<PHRASE xml:lang="zh-TW"><!--chinese language text
here--></PHRASE>

This is within the one document (but two entities).

Mind you, your example:

<?xml version="1.0"?>
<PHRASES>
<PHRASE encoding="ISO-8859-1" xml:lang="en">Hello!</PHRASE>
<PHRASE encoding="X-EUC-TW" xml:lang="zh-TW"><!--chinese language text
here--></PHRASE>
</PHRASES>

won't cause any problems, simply because there is nothing special about the
attribute "encoding". Note that it's not valid but that isn't because of the
encoding attribute, it's simply because there isn't a DTD declaring the
elements and attributes used.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Keith.Corder at fritolay.com  Mon Sep 21 17:53:52 1998
From: Keith.Corder at fritolay.com (Keith.Corder@fritolay.com)
Date: Mon Jun  7 17:04:55 2004
Subject: http://www.scripting.com/98/09/News21.xml
Message-ID: <199809211553.AA18785@interlock.fritolay.com>

I'm very interested in seeing how this works.  How do I access the site?
I've tried IE and Jumbo - but no success so far.  (I'm behind a proxy
firewall, so if Jumbo is the right tool, how would I set the proxy info?)

Thanks, Keith Corder


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Mon Sep 21 18:41:21 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:55 2004
Subject: Public Identifiers
In-Reply-To: <199809211142.GAA00670@bruno.techno.com>
References: <199809210043.UAA10167@locke.ccil.org>
 <199809210043.UAA10167@locke.ccil.org>
Message-ID: <3.0.5.32.19980921093859.00952100@library.berkeley.edu>

At 06:42 AM 9/21/98 -0500, Steven R. Newcomb wrote:
>> # A URN identifies a resource or
>> # unit of information.  It may identify, for example, intellectual
>> # content, a particular presentation of intellectual content, or
>> # whatever a name assignment authority determines is a distinctly
>> # namable entity.  A URL identifies the location or a container for an
>> # instance of a resource identified by a URN.  The resource identified
>> # by a URN may reside in one or more locations at any given time, may
>> # move, or may not be available at all.
>> 
>> Note especially the last phrase.
>
>The problems with the quotation from RFC 1737 are these:
>
>* Who defines what constitutes a name assignment authority?

The URN working group of the IETF is working on an Internet Draft
addressing the issue of how name assignment authorities are registered
(or not).  Quoting from the draft: "In a nutshell, a template for
the definition of the namespace is completed for deposit with IANA,
and a NID (namespace identifier) is assigned."  The draft contemplates 
three levels of name spaces: experimental, informal, and formal.
Experimental are not explicitly registered with IANA, and take
the form of x-<NID>; no provision is made for avoiding collision
of experimental NIDs.  Informal are registered with IANA and assigned
a number sequence as an identifier, in the format "iana-"<number>
where <number> is chosen by the IANA on a First Come First Served basis.
Formal identifiers are processed through an RFC review process; a template
containing registration information would be sent to urn-nid@apps.ietf.org
to allow for a 2 week discussion period; then the template should be
sent to iana@iana.org.  The template will request a particular NID
string, which is assigned by IETF consensus.  So, essentially anyone
can produce an experimental URN; formal ones are a bit more work.
See the Internet Draft for details.

>* There is an even more problematic statement: "A URL identifies the
>  location or a container for an instance of a resource identified by
>  a URN."  This strongly implies that there must be a URL behind every
>  URN, even if that URL is fictitious or doesn't happen to work.  It
>  also implies that there is some sort of locatable online container.

This does not appear to be what's contemplated for URNs at the moment.
Having spent several long, painful weeks trying to wade through all
of the RFCs and Drafts regarding URNs and name resolution services,
there does not appear to be any requirement anywhere that a URN
resolve to any URL.  This makes sense if you think about it; if you
register the ISBN name space, you're going to have an awful lot of
ISBNs which are not available online, and aren't likely to be.
Granted you probably wouldn't go to the trouble of registering a namespace
with IANA if you didn't contemplate making at least a portion of the items
identified within that space available online, but there's no requirement
that you do so.


Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Mon Sep 21 19:12:35 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:55 2004
Subject: Socat issues for XML
Message-ID: <3.0.32.19980921115505.00ea1c90@pophost.arbortext.com>

At 20:51 1998 09 20 -0400, John Cowan wrote:
>I've just finished the first cut of my implementation of Socats
>as a SAX EntityResolver, and two points have come up:
>
>1) With the minor exception of notation declarations, every public id
>in XML has an accompanying system id.  Therefore, the OVERRIDE entry
>does not make sense: it must be ignored, and the default must be YES
>rather than NO.
>
>To explicate OVERRIDE: in SGML Socats, OVERRIDE NO means that when
>an explicit system id is present, the catalog entries are ignored;
>OVERRIDE YES means the catalog entries override an explicit system id.

I'm not understanding why OVERRIDE NO doesn't make sense.  Perhaps
I'm missing something about SAX or your implementation.  (Assume I 
understand TR9401, since I edited it.)

>
>2) As another consequence of system ids being always present and always
>URLs, a usable Socat implementation must not search the whole
>public catalog space for SYSTEM entries.  When should the search stop?
>In some sense "when going offsite", but just when is that?
>Any suggestions?

I don't understand what the problem is, and I don't understand how--if
there really is a problem--anything about XML makes it a problem that
isn't a problem with SGML in general (XML is SGML, you know).

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 21 19:38:03 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:55 2004
Subject: Public Identifiers
In-Reply-To: <199809211142.GAA00670@bruno.techno.com> from "Steven R. Newcomb" at Sep 21, 98 06:42:02 am
Message-ID: <199809211744.NAA29846@locke.ccil.org>

Steven R. Newcomb scripsit:

> * Who defines what constitutes a name assignment authority?  If it's
>   the end user, in an ad hoc fashion, that's fine, I'm satisfied.  But
>   the language of the RFC, 

Global uniqueness *is* a requirement of URNs, in the sense that two
distinct things ought not be described by the same URN, and someone
has to define what counts as "distinct things".

My understanding is that the widespread use of unregistered FPIs
merely reflects the lack of easy access to registration until recently.

> > # public identifiers, and UPC product codes seem to satisfy the
> > # functional requirements, and allow an embedding that satisfies
> > # the syntactic requirements described here.
> 
>   ...indicates otherwise.  Here, in all three examples, there is a
>   name registration authority; the end user is evidently not allowed
>   to specify the Sears 1922 Farm Catalog unless this has already
>   become a formally-cataloged public entity of some kind.

I think that Sears itself might be quite unhappy about people invading
its FPI namespace in this fashion.  Such an FPI is more like a
prose description of the resource.

>   (Note that
>   it's not clear whether "ISO public identifiers" means "public
>   identifiers in ISO syntax" or "public identifiers that begin with
>   the letters 'ISO' and that define public text entities that were
>   created under the auspices of the ISO".  There is a very small set
>   of the latter -- a set that has little or nothing to do with what
>   I'm concerned about here.)

I think that the identifiers of ISO 9070 are intended here.

> * There is an even more problematic statement: "A URL identifies the
>   location or a container for an instance of a resource identified by
>   a URN."  This strongly implies that there must be a URL behind every
>   URN, even if that URL is fictitious or doesn't happen to work.  It
>   also implies that there is some sort of locatable online container.

Not so.  You take that sentence out of its context, which makes clear that
it's quite possible for an URN to never have any corresponding URLs.

> In the scenario I'm asking about, there is never any need to transmit
> the FPI, so there's no need to convert it.

I don't understand this point.  An FPI that's never transmitted
remains within the brain of its inventor only.  The point of 
URI-encoding is simply to make clear where the delimiters of the URI are.
Allowing embedded spaces is good for human readability, but not ffor
parsability.

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 21 19:59:38 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:55 2004
Subject: Socat issues for XML
In-Reply-To: <3.0.32.19980921115505.00ea1c90@pophost.arbortext.com>
References: <3.0.32.19980921115505.00ea1c90@pophost.arbortext.com>
Message-ID: <13830.36646.800447.224698@localhost.localdomain>

Paul Grosso writes:

 > >To explicate OVERRIDE: in SGML Socats, OVERRIDE NO means that when
 > >an explicit system id is present, the catalog entries are ignored;
 > >OVERRIDE YES means the catalog entries override an explicit system
 > >id.
 > 
 > I'm not understanding why OVERRIDE NO doesn't make sense.  Perhaps
 > I'm missing something about SAX or your implementation.  (Assume I
 > understand TR9401, since I edited it.)

I think that the point is that if OVERRIDE NO were allowed, the public
identifiers in the catalogue would never be used (since all entities,
at least, must have system identifiers in XML).  In SGML, some
entities may not have explicit system IDs, and those can still default 
to the mappings in the catalogue.

I'm not certain that I agree (I'd have to think about the problem
further), but I *am* fairly certain that this is the reasoning behind
the original statement.

 > >2) As another consequence of system ids being always present and
 > >always URLs, a usable Socat implementation must not search the
 > >whole public catalog space for SYSTEM entries.  When should the
 > >search stop?  In some sense "when going offsite", but just when is
 > >that?  Any suggestions?
 > 
 > I don't understand what the problem is, and I don't understand
 > how--if there really is a problem--anything about XML makes it a
 > problem that isn't a problem with SGML in general (XML is SGML, you
 > know).

I'd guess that this is a problem of efficiency: when catalogues are on
the other end of relatively slow network connection, you don't want to 
retrieve a dozen catalogues unnecessarily.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 21 20:03:34 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:55 2004
Subject: Socat issues for XML
In-Reply-To: <3.0.32.19980921115505.00ea1c90@pophost.arbortext.com> from "Paul Grosso" at Sep 21, 98 12:10:08 pm
Message-ID: <199809211810.OAA00628@locke.ccil.org>

Paul Grosso scripsit:

> I'm not understanding why OVERRIDE NO doesn't make sense.  Perhaps
> I'm missing something about SAX or your implementation.  (Assume I 
> understand TR9401, since I edited it.)

Ah, good.  Perhaps it is I who does not understand.  If so, please
enlighten me.

I quote James Clark's docs, since they are pretty clear:

# A PUBLIC, ENTITY, DOCTYPE, LINKTYPE, or NOTATION entry
# with an overriding mode of YES will be used whether or not the
# external identifier has an explicit system identifier;
# those with an overriding mode of NO will be ignored if external identifier
# has an explicit system identifier.

In the XML context, as I said, every external identifier has
an explicit system identifier (with the minor exception of
notation declarations).  Therefore, any entries with an
overriding mode of NO will be unconditionally ignored.  Since this
is the default, any catalog not beginning with OVERRIDE YES will
be ignored *in toto* (except for SYSTEM entries).
This seems less than sensible.

> I don't understand what the problem is, and I don't understand how--if
> there really is a problem--anything about XML makes it a problem that
> isn't a problem with SGML in general (XML is SGML, you know).

XML documents are SGML documents, but XML is not SGML, because of the
additional constraints it imposes.  All XML system ids are URLs, and
in general are to be taken at face value.  SYSTEM entries serve as a
private URL-URL mapping scheme, but must the whole of a public-id-
resolution infrastructure be searched for each and every URL referred
to in a XML document?

XML, unlike SGML, has the Web as its implicit background.

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 21 20:21:12 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:55 2004
Subject: URNs, FPIs, and RFC 1737
Message-ID: <199809211827.OAA01152@locke.ccil.org>

I would urge anyone who still thinks that FPIs should be freely creatable by
random persons to read RFC 1737 (ftp://ftp.isi.edu/in-notes/rfc1737.txt
and many other places around the net) and then think about the following
question:

Would it be perfectly all right for someone to use
"-//IETF//NONSGML DOCUMENT RFC822//EN" as an FPI for the
Independent Counsel's Report (URL http://icreport.loc.gov/icreport)?

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Mon Sep 21 20:49:56 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:55 2004
Subject: Mixed Content Models
Message-ID: <3.0.5.32.19980921114754.00968d00@library.berkeley.edu>

Hello all,

	In looking over the XML spec (3.2.2) on mixed content
models, something isn't clear to me.  I'm hoping someone
here can enlighten me.

	I've inherited a DTD for development that was originally
intended to be an SGML DTD, and has been converted to XML.
Contained within it is the following:

	<!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
							  backward?, ivuInstr*))*

Is this a legitimate content model under XML section 3.2.2?
Msxml doesn't have a problem with it, and nsgmls using the -wxml flag
also happily parses the DTD.  IBM's xml4j, however, complains:
"Codebook.dtd: 1256, 33: This content model is not matched with the
mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?, qstnLit?,
postQTxt?,forward?,backward?,ivuInstr*))*".

I suppose this boils down to, should the parser ignore what's within
a content group when evaluating whether someone is trying to constrain
the order or number of occurrences of 'child elements.'  If the
only 'child elements' to be considered in the above case are:

	A.	#PCDATA, and
	B.   (preQTxt?, qstnLit?, postQTxt?, forward?, backward?, ivuInstr*)

Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Mon Sep 21 21:02:04 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:55 2004
Subject: Mixed Content Models (PART II)
Message-ID: <3.0.5.32.19980921120001.0096a210@library.berkeley.edu>

[Apologies for the earlier, partial message.  Fingers thought emacs
when I was actually in Eudora.]

Hello all,

	In looking over the XML spec (3.2.2) on mixed content
models, something isn't clear to me.  I'm hoping someone
here can enlighten me.

	I've inherited a DTD for development that was originally
intended to be an SGML DTD, and has been converted to XML.
Contained within it is the following:

	<!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
							  backward?, ivuInstr*))*

Is this a legitimate content model under XML section 3.2.2?
Msxml doesn't have a problem with it, and nsgmls using the -wxml flag
also happily parses the DTD.  IBM's xml4j, however, complains:
"Codebook.dtd: 1256, 33: This content model is not matched with the
mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?, qstnLit?,
postQTxt?,forward?,backward?,ivuInstr*))*".

I suppose this boils down to, should the parser ignore what's within
a content group when evaluating whether someone is trying to constrain
the order or number of occurrences of 'child elements.'  If the
only 'child elements' to be considered in the above case are:

	A.	#PCDATA, and
	B.   (preQTxt?, qstnLit?, postQTxt?, forward?, backward?, ivuInstr*)

then the above content model simplies to (A | B)* and doesn't appear
to conflict with section 3.2.2.  If 'child elements' is interpreted
as meaning *any* element contained within the content model, however,
then actually the content model doesn't look valid to me.  But if
'child elements' means any element, even those within a group, then
my content model is probably bogus.  Can someone tell me which
is the correct interpretation?
 

Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Mon Sep 21 21:06:36 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:55 2004
Subject: Mixed Content Models
In-Reply-To: <3.0.5.32.19980921114754.00968d00@library.berkeley.edu> (message
	from Jerome McDonough on Mon, 21 Sep 1998 11:47:54 -0700)
Message-ID: <199809211904.PAA16835@ruby.ora.com>

[Jerome McDonough]
>       In looking over the XML spec (3.2.2) on mixed content models,
> something isn't clear to me.  I'm hoping someone here can enlighten
> me.
> 
>       I've inherited a DTD for development that was originally
> intended to be an SGML DTD, and has been converted to XML.
> Contained within it is the following:
> 
> <!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
>                                                 backward?, ivuInstr*))*
> 
> Is this a legitimate content model under XML section 3.2.2?

No.  See production [51].  A mixed content declaration MUST be of the
forms

<!ELEMENT e-type1 (#PCDATA | sub1 | sub2 | sub3)*>
<!ELEMENT e-type2 (#PCDATA)>

This is not optional.

> Msxml doesn't have a problem with it, and nsgmls using the -wxml
> flag also happily parses the DTD.  IBM's xml4j, however, complains:
> "Codebook.dtd: 1256, 33: This content model is not matched with the
> mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?,
> qstnLit?, postQTxt?,forward?,backward?,ivuInstr*))*".

I'm a little surprised that nsgmls doesn't catch this; however, the
-wxml option warns about some, or even most, XML errors, but not all
of them.

Fortunately, your content model is equivalent to

<!ELEMENT qstn (#PCDATA | preQTxt | qstnLit | postQTxt | forward |
                backward | ivuInstr)*>

so this isn't a real problem.  In some cases, it is true that content
models will need to be either tightened or loosened to be expressed as
XML (notably models involving exceptions).

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Mon Sep 21 21:09:47 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:04:55 2004
Subject: Mixed Content Models
In-Reply-To: <3.0.5.32.19980921114754.00968d00@library.berkeley.edu>
References: <3.0.5.32.19980921114754.00968d00@library.berkeley.edu>
Message-ID: <wkww6x2slg.fsf@ifi.uio.no>


* Jerome McDonough
| 
| I've inherited a DTD for development that was originally intended to
| be an SGML DTD, and has been converted to XML.  Contained within it
| is the following:
| 
| <!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
| 						  backward?, ivuInstr*))*
| 
| Is this a legitimate content model under XML section 3.2.2?

No, it is not. XML mixed content models must be of the form

<!ELEMENT qstn  (#PCDATA | child1 | child2 | child3 ...)*>

| Msxml doesn't have a problem with it, 

MSXML is not updated to the latest specification.

| and nsgmls using the -wxml flag also happily parses the DTD.

Hmmm. This deviation is not documented in the SP documentation.

| IBM's xml4j, however, complains:
| "Codebook.dtd: 1256, 33: This content model is not matched with the
| mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?, qstnLit?,
| postQTxt?,forward?,backward?,ivuInstr*))*".

This is correct behaviour. (Note that it also gives an example of a
correct mixed content model.)
 
| I suppose this boils down to, should the parser ignore what's within
| a content group when evaluating whether someone is trying to
| constrain the order or number of occurrences of 'child elements.'
| If the only 'child elements' to be considered in the above case are:
| 
| 	A.	#PCDATA, and
| 	B.   (preQTxt?, qstnLit?, postQTxt?, forward?, backward?, ivuInstr*)
|
| then the above content model simplies to (A | B)* and doesn't appear
| to conflict with section 3.2.2.

Well, it does, you see, because it conflicts with the grammar, so this
is actually a well-formedness error.

| But if 'child elements' means any element, even those within a
| group, then my content model is probably bogus.  Can someone tell me
| which is the correct interpretation?

Your content model is bogus. :)

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Mon Sep 21 21:58:05 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:04:55 2004
Subject: Mix encodings in a document?
Message-ID: <005001bde59a$66d4fa80$e36118cb@caleb>

-----Original Message-----
From: Gavin Thomas Nicol <gtn@eps.inso.com>
>This is not strictly correct. You *could* mix encodings in a single text
>entity, but any such behaviour would fall outside the XML specification
(i.e. >it would fall below the entity manager layer).

Yes, agreed. I started typing something to this effect (that if the
character codes were legal under the declared encoding, even if their
intended characters were different, you technically would still have a well
formed document).

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Mon Sep 21 22:11:45 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:55 2004
Subject: Socat issues for XML
Message-ID: <3.0.32.19980921150814.006ffcb8@pophost.arbortext.com>

I also received John Cowan's reply, but I'm using David's
since he included the necessary history.  I quote from both.

> > I'm not understanding why OVERRIDE NO doesn't make sense.  Perhaps
> > I'm missing something about SAX or your implementation.  (Assume I
> > understand TR9401, since I edited it.)
>
>I think that the point is that if OVERRIDE NO were allowed, the public
>identifiers in the catalogue would never be used (since all entities,
>at least, must have system identifiers in XML). 

It is true that the catalog entries of type PUBLIC, ENTITY, DOCTYPE, 
LINKTYPE, or NOTATION occurring between an OVERRIDE NO entry and the 
subsequent OVERRIDE YES entry will be ignored.

I see three options:

1.  say that your subset of TR9401 catalogs doesn't include OVERRIDE;
2.  say that your subset "recognizes" OVERRIDE entries but ignores them;
3.  say that your subset handles OVERRIDE.

Option 1 means that existing catalogs will cause your implementations
to give errors; option 2 means that they will cause your implementation 
to behave differently (perhaps subtlely and surprisingly) from existing 
TR9401 implementations; option 3 means some extra work for your 
implementations.

Looking at the pros and cons, I'd opt for option 3:  a little more work
for your implementations seems preferable to the problems 1 and 2 will
mean for end users. 

>I quote James Clark's docs, since they are pretty clear:

That's fine, but the text in TR9401 is normative (and, at least in
the case you quote, almost identical).  Note that, last I checked,
James had not implemented support for the complete TR9401:1997
(not that I'm saying your effort can't subset TR9401:1997--just
that you might want to be fully aware of what TR9401:1997 says 
to use as a resource in your efforts).

>In the XML context, as I said, every external identifier has
>an explicit system identifier (with the minor exception of
>notation declarations).  Therefore, any entries with an
>overriding mode of NO will be unconditionally ignored.  Since this
>is the default, any catalog not beginning with OVERRIDE YES will
>be ignored *in toto* (except for SYSTEM entries).

No on two counts:
a.  OVERRIDE NO is not the default per TR9401, and
b.  Even when reading a file starting in OVERRIDE NO mode, the
    catalog will not be ignored in toto; not only are there
    SYSTEM entries, as you mention, but there can be an OVERRIDE YES
    entry which means the rest of the catalog will be processed.

More on my first "no"; from TR9401:1997:

  An application must provide some way (e.g., a runtime argument, 
  environment variable, preference switch) that allows the users 
  to specify which of these modes [prefer system IDs or prefer
  public IDs] to use in the absence of any occurrences of an
  OVERRIDE catalog entry.

Note that the initial setting of OVERRIDE is reset for each
catalog entry file:

  The initial search strategy in force at the beginning of each
  catalog entry file depends on the preference as determined by
  the application.

TR9401 went to great lengths not to specify the initial default
for OVERRIDE.  Most people involved in writing the Resolution
leaned toward a default of YES, but some leaned toward NO.  We
agreed not to decide this point.  In fact, several important
implementations currently default OVERRIDE to YES, which is
what you could do for your purposes, since as you point out
this makes more sense for XML.

>
> > >2) As another consequence of system ids being always present and
> > >always URLs, a usable Socat implementation must not search the
> > >whole public catalog space for SYSTEM entries.  When should the
> > >search stop?  In some sense "when going offsite", but just when is
> > >that?  Any suggestions?
> > 
> > I don't understand what the problem is, and I don't understand
> > how--if there really is a problem--anything about XML makes it a
> > problem that isn't a problem with SGML in general (XML is SGML, you
> > know).
>
>I'd guess that this is a problem of efficiency: when catalogues are on
>the other end of relatively slow network connection, you don't want to 
>retrieve a dozen catalogues unnecessarily.

>All XML system ids are URLs, and
>in general are to be taken at face value.  SYSTEM entries serve as a
>private URL-URL mapping scheme, but must the whole of a public-id-
>resolution infrastructure be searched for each and every URL referred
>to in a XML document?

Sorry, I haven't followed your subset of TR9401 (is there a pointer
to some doc?); which one(s) of the DELEGATE and CATALOG entry types
do you support?  These are the only two entry types that can send
an implementation off to another catalog entry file, and if the
catalog writer put one of them into the catalog entry file, it 
sounds like s/he wants you to go there.

Generally, if there is no match in a given catalog entry file
(for any entry type) and the external identifier includes a
system id (as would be the case with XML), the system id is used.
The only reason another catalog would be searched is when:
1.  there has been no SYSTEM or PUBLIC match in that catalog, 
  and
2.  there is a DELEGATE entry that matches the external id's 
    public id OR there is a CATALOG entry.

I'm guessing you've got a scenario in mind where there is
no SYSTEM or PUBLIC match in a given catalog entry file and
where there HAS been a matching DELEGATE or CATALOG entry,
BUT for some reason you want to ignore the DELEGATE or CATALOG 
entry that was put into the catalog (why?) and instead just 
give up now and use the system id in the external identifier.

I see three options (and if I didn't at first, I'd invent a third
option, since one is always supposed to have three options):

1.  follow all DELEGATE and CATALOG entries as specified until 
    a match or you run out of things to follow (this is what 
    TR9401 says and what the catalog writer presumably had in
    mind when they put the DELEGATE/CATALOG entries in);
2.  leave DELEGATE and CATALOG entry types out of your subset,
    since you don't seem to want to follow them anyway;
3.  invent a new catalog entry type that says "if you get to
    the end of this catalog entry file without a match for
    anything except maybe DELEGATE and CATALOG entries and
    the external identifier has a system id, ignore the
    DELEGATE and CATALOG entries and use that system id."

Option 2 seems internally consistent, but I suspect you want
the DELEGATE and CATALOG entry capability.  Option 3 seems
odd--if you can put an entry in the catalog that says ignore
DELEGATE and CATALOG entries in this file, then why don't you
just omit the DELEGATE and CATALOG entries from this file?
That leaves option 1.

Perhaps I've not captured the scenario you're really considering.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Mon Sep 21 23:33:00 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:55 2004
Subject: http://www.scripting.com/98/09/News21.xml
In-Reply-To: <199809211553.AA18785@interlock.fritolay.com>
Message-ID: <3.0.5.32.19980921143432.012adbe0@scripting.com>

Keith, I don't know why you can't get thru. It's just a website, a normal
file, no CGIs, it's on port 80, a very plain vanilla HTTP server. Dave


At 10:55 AM 9/21/98 -0500, Keith.Corder@fritolay.com wrote:
>I'm very interested in seeing how this works.  How do I access the site?
>I've tried IE and Jumbo - but no success so far.  (I'm behind a proxy
>firewall, so if Jumbo is the right tool, how would I set the proxy info?)
>
>Thanks, Keith Corder
>
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Mon Sep 21 23:39:48 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:04:55 2004
Subject: String expressions in XSL
In-Reply-To: <f5bn27xcs67.fsf@cogsci.ed.ac.uk>
References: <"Michael Kay"'s message of "Fri, 18 Sep 1998 14:32:11 +0100">
 <018b01bde308$be486160$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.1.16.19980921223929.33171926@pop3.demon.co.uk>

At 17:16 18/09/98 +0100, Henry S. Thompson wrote:
>1) xsl-list@mulberrytech.com is the preferred place for XSL
>discussions; there is a mechanism in place to ensure that points
>raised there are called to the attention of the XSL Working Group.

Thanks very much Henry - I have been unable to post for 3-4 days and would
have sent a similar message otherwise. Crossposting makes it very difficult
to follow threads and means that the discussions can't easily be located in
the archives or digests.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Mon Sep 21 23:43:39 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:04:55 2004
Subject: http://www.scripting.com/98/09/News21.xml
Message-ID: <00a801bde5a8$d9f54c00$2ee044c6@arcot-main>

>At 10:55 AM 9/21/98 -0500, Keith.Corder@fritolay.com wrote:
>>I'm very interested in seeing how this works.  How do I access the site?
>>I've tried IE and Jumbo - but no success so far.  (I'm behind a proxy
>>firewall, so if Jumbo is the right tool, how would I set the proxy info?)

Could it be that the Firewall is very strict with allowed content type?
Dave, what are you setting as content type?  text/html or text/xml?

I think this is a very interesting problem.  Are there any other Firewall
related problems with XML?

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eric at hellman.net  Mon Sep 21 23:48:42 1998
From: eric at hellman.net (Eric Hellman)
Date: Mon Jun  7 17:04:55 2004
Subject: xml-dev Digest V1 #121
In-Reply-To: <E0zKuL0-0005zk-00@bowmore.cc.ic.ac.uk>
Message-ID: <v04011706b22c7a2ba77d@[192.168.1.1]>

James Tauber's site.

http://www.schema.net/entities/


>From: Tim Bray <tbray@textuality.com>
>Date: Sun, 20 Sep 1998 14:44:44 -0700
>Subject: SGML char entities XML-ized?
>
>Are there XML-ized versions of some of the character entity sets in
>the common vernacular?  I.e. the following:
>
> <!ENTITY % ISOlat2 PUBLIC
>                       "ISO 8879-1986//ENTITIES Added Latin 1//EN">
>
> <!ENTITY % ISOlat2 PUBLIC
>                       "ISO 8879-1986//ENTITIES Added Latin 2//EN">
>
> <!ENTITY % ISOgrk3 PUBLIC
>
>Etc... I'm pretty sure I saw someone posting about them here. -Tim
Eric Hellman
Openly Informatics, Inc.
http://www.openly.com/           Tools for 21st Century Scholarly Publishing

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Tue Sep 22 01:51:18 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:56 2004
Subject: http://www.scripting.com/98/09/News21.xml
In-Reply-To: <00a801bde5a8$d9f54c00$2ee044c6@arcot-main>
Message-ID: <199809212351.TAA32570@hesketh.com>

At 02:43 PM 9/21/98 -0700, Don Park wrote:
>Could it be that the Firewall is very strict with allowed content type?
>Dave, what are you setting as content type?  text/html or text/xml?
>
>I think this is a very interesting problem.  Are there any other Firewall
>related problems with XML?

I had problems like this when I was dealing with (i.e. figuring it out to
document it) a firewall.  The firewall came with a large menu of MIME
types, many of which I'd never seen, but admins had to add new ones by
hand.  text/xml and application/xml are the least of the problems to be
faced; think of all the new types that will come out of standards built on
XML.

Hmm... maybe it's worth an article.

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 22 04:33:47 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:04:56 2004
Subject: Socat issues for XML
In-Reply-To: <3.0.32.19980921170150.00770aa8@pophost.arbortext.com> from "Paul Grosso" at Sep 21, 98 05:01:57 pm
Message-ID: <199809220240.WAA20094@locke.ccil.org>

Paul Grosso scripsit:

> I'm not sure I follow.  Are you saying that, even if you set the initial
> default of OVERRIDE to YES and a catalog writer explicitly puts in an
> OVERRIDE NO entry, you think it is more "useful" to assume they didn't
> mean it and ignore it?  That there is, in fact, no way to say "please
> really use the system ids in my document" (which is, of course, precisely
> what the "real" browsers will do until and unless they actually implement
> a catalog, which I don't expect from MS&NS in the near term)?

If you really want to use the system ids, why make use of a catalog at all?
In SGML, you may want it for standalone public ids, but not in XML.

> Perhaps you need (and maybe this is what you meant above) some 
> sort of entry like CATALOG but that ignores SYSTEM entries in
> the catalog-to-be-read.

That is what I meant.

> But that still sends you off to read
> that catalog, so you aren't saving any search time as you implied
> in your earlier message.

Not necessarily.  In particular, if only a system id is supplied (no
public id) then that catalog need not be examined at all.
(I am also assuming that catalogs only accessible from such a catalog
need not be searched for SYSTEM entries either.)

> All you're doing is inhibiting any
> SYSTEM entries from matching, so you're making it even more likely
> that your search will continue longer and wider. 

Again, not necessarily.  I might have a local catalog containing
PUBLIC entries for the documents I maintain, and SYSTEM entries
for documents I cache privately.  I can then defer everything
else to the root catalog maintained by fpi.org using CATALOG.
This catalog in turn uses DELEGATE entries to delegate to me
and other catalog owners.

Now consider what happens when one of my documents refers to
"PUBLIC <foo> <bar>".  If <bar> is a system id for a document I cache
then the cached version will be specified. If <foo> is a public ID
for one of my documents, the right document will be fetched after
searching my catalog.  If <foo> is someone else's public ID, the
root catalog will delegate to the correct catalog and I will eventually
get a usable system ID.  All is well.

BUT, if my document refers to "SYSTEM <bar>", then my catalog is searched,
and if it's not one that I cache, then the root catalog will also be
searched before going to the URL.  If the root catalog is known not to
contain SYSTEM entries, as it probably should not (that would amount to
a global URL remap for all XML applications that use the catalog
structure), then there should be a way to indicate not to search it.

> You're right that the more global a catalog, the less likely
> it is that it will have a SYSTEM type entry.  But if global
> catalogs have no SYSTEM entries, you don't have a problem, and
> if they do, maybe it makes sense.

[example snipped]

Your example is compelling.  But I think there is still a need,
for efficiency's sake, to add CATALOG entries that are marked "do not
search this catalog if *not* looking for a public id".

-- 
John Cowan					cowan@ccil.org
		e'osai ko sarji la lojban.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From avirr at LanMinds.Com  Tue Sep 22 05:19:35 1998
From: avirr at LanMinds.Com (Avi Rappoport)
Date: Mon Jun  7 17:04:56 2004
Subject: http://www.scripting.com/98/09/News21.xml
In-Reply-To: <199809212351.TAA32570@hesketh.com>
References: <00a801bde5a8$d9f54c00$2ee044c6@arcot-main>
Message-ID: <v0401170ab22cc55b9573@[207.33.50.55]>

At 4:53 PM -0700 9/21/98, Simon St.Laurent wrote:
>
> I had problems like this when I was dealing with (i.e. figuring it out to
> document it) a firewall.  The firewall came with a large menu of MIME
> types, many of which I'd never seen, but admins had to add new ones by
> hand.  text/xml and application/xml are the least of the problems to be
> faced; think of all the new types that will come out of standards built on
> XML.
>

I finally found some good sources for MIME type info.

the comp.mail.mime FAQ, part 4:

http://www.netmeg.net/faq/computers/mail/mime/04.html

A list of IANA media types:

ftp://ftp.isi.edu/in-notes/iana/assignments/media-types/media-types
 (it's a text file, Netscape will display it)

A memo, RFC2376, "XML Media Types", July 1998, by Whitehead & Murata

http://www.cis.ohio-state.edu/htbin/rfc/rfc2376.html

Hope these are useful to all.

Avi
________________________________________________________________
Avi Rappoport, Web Site Search Tools Maven <mailto:avirr@lanminds.com>
Search Tools Consulting Site: <http://www.searchtools.com>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 22 06:02:17 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:04:56 2004
Subject: URNs, FPIs, and RFC 1737
In-Reply-To: <199809211827.OAA01152@locke.ccil.org>
Message-ID: <3.0.5.32.19980921225359.009a6550@dns.isogen.com>

At 02:27 PM 9/21/98 -0400, John Cowan wrote:
>I would urge anyone who still thinks that FPIs should be freely creatable by
>random persons to read RFC 1737 (ftp://ftp.isi.edu/in-notes/rfc1737.txt
>and many other places around the net) and then think about the following
>question:
>
>Would it be perfectly all right for someone to use
>"-//IETF//NONSGML DOCUMENT RFC822//EN" as an FPI for the
>Independent Counsel's Report (URL http://icreport.loc.gov/icreport)?

Only if that someone was an authorized agent of the IETF charged with the
authority to assign public IDs within the -//IETF name space to resources.
Otherwise no, but only because it is wrong to use names in a space you
don't control.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 22 06:15:07 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:04:56 2004
Subject: Public Identifiers
In-Reply-To: <199809211744.NAA29846@locke.ccil.org> (message from John Cowan
	on Mon, 21 Sep 1998 13:44:41 -0400 (EDT))
References: <199809211744.NAA29846@locke.ccil.org>
Message-ID: <199809220407.XAA00891@bruno.techno.com>

[John Cowan:]

> Global uniqueness *is* a requirement of URNs, in the sense that two
> distinct things ought not be described by the same URN, and someone
> has to define what counts as "distinct things".

OK. I necessarily conclude from your answer that URNs can't do what
FPIs can do.

> My understanding is that the widespread use of unregistered FPIs
> merely reflects the lack of easy access to registration until
> recently.

My understanding is that there is a need to refer to things that
nobody has registered and that nobody who needs them to be registered
has the authority to register.  Surely you're not proposing that Joe
User should take it upon himself to register Sears and/or it's 1922
Farm Catalog on behalf of Sears, if Sears hasn't done this already for
itself.

> I think that Sears itself might be quite unhappy about people invading
> its FPI namespace in this fashion.  Such an FPI is more like a
> prose description of the resource.

In my hypothetical example, it doesn't matter what Sears thinks about
it.  Sears has published a resource (the 1922 Farm Catalog) that some
person regards as authoritative.  Sears doesn't get to say whether
people are allowed to regard its publications as authoritative.
(Nobody gets to control what others believe, and, at least in the
U.S., free speech and free thought are constitutionally protected.)  I
gather from what you say, however, that URNs can only be used to
reference things that their owners have arranged to be referencable
via URIs, by going to the expense and trouble of registering
themselves and/or their published information assets.  This constraint
excludes -- and may *forever* exclude -- most knowledge from being
referencable.  If URIs can't reference most of the knowledge in
existence, I'll use FPIs and/or HyTime biblocs to do that.

By the way, my example FPI is *not* a prose description of a resource.
It is a formal identification of an authority, a namespace created by
that authority, and a name within that namespace.  These three
elements are machine parsable, and they are not expressed in natural
language.

> I think that the identifiers of ISO 9070 are intended here.

OK.  If the intent is to conform to ISO 9070, it would be a good idea
to say so.  It would be extraordinary to expect people to understand
that 9070 applies to a standard, if that standard doesn't establish
9070 as a normative reference.

> > * There is an even more problematic statement: "A URL identifies the
> >   location or a container for an instance of a resource identified by
> >   a URN."  This strongly implies that there must be a URL behind every
> >   URN, even if that URL is fictitious or doesn't happen to work.  It
> >   also implies that there is some sort of locatable online container.
> 
> Not so.  You take that sentence out of its context, which makes clear that
> it's quite possible for an URN to never have any corresponding URLs.

The sentence and its context contradict one another.  I'm glad that
you can divine which one is supposed to be regarded as authoritative,
but I personally don't have any basis for making that judgment.

> > In the scenario I'm asking about, there is never any need to transmit
> > the FPI, so there's no need to convert it.

> I don't understand this point.  An FPI that's never transmitted
> remains within the brain of its inventor only.  The point of
> URI-encoding is simply to make clear where the delimiters of the URI
> are.  Allowing embedded spaces is good for human readability, but
> not for parsability.

You're right, you don't understand my point.  Consider: if two FPIs
reference the same thing, even if they are never resolved, we know
something about both of them, and what we know about them may well be
the only thing we need to know: that whatever one of them specifies,
the other specifies, too.  In the case of Topic Navigation Maps, this
is a critical piece of information, and it is often the only piece of
information needed by the underlying processing system in order to
perform its function.  Such FPIs are never resolved; they are never
transmitted to a server.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Philippe.Le_Hegaret at sophia.inria.fr  Tue Sep 22 10:28:21 1998
From: Philippe.Le_Hegaret at sophia.inria.fr (Philippe Le H�garet)
Date: Mon Jun  7 17:04:56 2004
Subject: [Announce] Koala XML activities
Message-ID: <36075F99.270796DF@sophia.inria.fr>

Hello,

 I am please to announce a new XML service on the Web.
  http://koala.inria.fr:8080/

 The goal of the Koala XML services is to show our
current work on XML.
 Current services are :

 - an XML validation service. This is my own work.
the current version has some troubles with entities.

 - an XSL engine by Jeremy Calles.
the current version doesn't support formatting objects 
yet. This web service can be invoked directly by the
XSLOnline Java program (see attachment).

 Hope you will find this helpful,

Jeremy Calles and Philippe Le Hegaret.
-------
Have fun with Koalas! :-)
http://www.inria.fr/koala/
-------------- next part --------------
import java.net.*;
import java.io.*;

/**
 * This class is a little tool to invoke the XSLEngine.
 * 
 */
public class XSLOnline {

    public static String baseURL = "http://koala.inria.fr:8080/Java/XSLEngine?";

    public static void main(String[] args) {
	try {
	    if ((!args[0].equals("-r")) || (args.length > 3)) {
		throw new ArrayIndexOutOfBoundsException();
	    }
	    String command = "";
	    command += "XMLuri=" + args[2];
	    command += "&XSLuri=" + args[1];
	    
	    URL url = new URL(baseURL + command);

	    InputStream in = openStream(url);
	    int i = 0;
	    
	
	    while ((i = in.read()) != -1) {
		System.err.print( (char) i );
	    }
	} catch (ArrayIndexOutOfBoundsException e) {
	    System.err.println("XSLOnline v 1.0 - "
			       + "jcalles@sophia.inria.fr");
	    System.err.println("Usage: -r xslUrl [xmlUrl]");
	} catch (FileNotFoundException e) {
	    System.err.println("File not found: " + e.getMessage());
	} catch (IOException e) {
	    System.err.println("I/O Error: " + e.getMessage());
	} catch (Exception e) {
	    System.err.println("Error: ");
	    System.err.println(e.toString());
	}
    }

    private static InputStream openStream(URL url) throws IOException {
        HttpURLConnection connection = (HttpURLConnection) url.openConnection();
        connection.setRequestProperty("Pragma", "no-cache"); // @@deprecated
        connection.setRequestProperty("Cache-Control", "no-cache");
        connection.setRequestProperty("Accept", "text/plain");
        connection.setRequestProperty("User-Agent", "Koala_XSLOnline/1.0");

	if (connection.getResponseCode() !=  HttpURLConnection.HTTP_OK) {
	    if (connection.getResponseMessage() != null) {
		throw new IOException(url + ": " + 
				      connection.getResponseMessage());
	    } else {
		throw new IOException(url + ": " + 
				      connection.getResponseCode());
	    }
	}
	return connection.getInputStream();
    }
}
From ht at cogsci.ed.ac.uk  Tue Sep 22 11:44:17 1998
From: ht at cogsci.ed.ac.uk (Henry S. Thompson)
Date: Mon Jun  7 17:04:56 2004
Subject: Mixed Content Models (PART II)
In-Reply-To: Jerome McDonough's message of "Mon, 21 Sep 1998 12:00:01 -0700"
References: <3.0.5.32.19980921120001.0096a210@library.berkeley.edu>
Message-ID: <f5bemt4cwih.fsf@cogsci.ed.ac.uk>

Here is the relevant production from the spec (be sure you always look 
at the real thing:  http://www.w3.org/TR/REC-xml#sec-mixed-content):

Mixed ::= '(' S? '#PCDATA' (S? '|' S? Name)* S? ')*' 
          | '(' S? '#PCDATA' S? ')' 

Note that what appears in the disjunction is NAMES, not anything else.

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep 22 11:46:04 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:04:56 2004
Subject: URNs, FPIs, and RFC 1737
Message-ID: <003f01bde60e$4022e040$1e09e391@mhklaptop.bra01.icl.co.uk>

Eliot Kimber:
>...it is wrong to use names in a space you don't control.

I want to play devil's advocate here.


Giving a thing a deceptive name in order to cause someone to
use it in preference to some other thing of the same name is
a time-honoured technique, used whenever the system has not
given us enough levels of indirection to play with. For
example, if I want to monitor the internal performance of a
java class library I might substitute the component
com.sun.foo with a similarly-named component of my own; I
might well want to do similar things with documents, perhaps
for very good reasons. For example, I might want to
substitute the intended DTD of a document with a stricter
one.

If this is "wrong", I think you need to distinguish whether
you mean "it does not conform to standard XYZ", or "it is
usually bad engineering practice", or "it is against
European Law", or "it is contrary to ethical norms".

In the case of XML Public Identifiers, I'm not sure the
practice is intrinsically "wrong" on any of these counts; it
is only wrong if it hurts someone.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Tue Sep 22 12:05:43 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:56 2004
Subject: URLs
In-Reply-To: Lars Marius Garshol's message of "21 Sep 1998 11:17:13 +0200"
References: <199809202131.RAA03962@locke.ccil.org> <36058566.14CED624@technologist.com> <wkogs94yg6.fsf@ifi.uio.no>
Message-ID: <ur9x4a290.fsf@delivery.ansa.co.uk>

Lars> Lars Marius Garshol <URL:mailto:larsga@ifi.uio.no>
Paul> Paul Prescod

Paul> Of course ~ is usually bandied about despite its "unsafeness",
Paul> but I would be surprised if browsers are smart enough to encode
Paul> it during transmission for "safety."

0> In article <wkogs94yg6.fsf@ifi.uio.no>, Lars wrote:

Lars> They are not.

Nor are they, in general, smart enough to even _display_ the safe form
(for the benefit of those who cut-n-paste URIs into email).  Grrr.

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Tue Sep 22 12:27:02 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:04:56 2004
Subject: Public Identifiers
In-Reply-To: "Steven R. Newcomb"'s message of "Mon, 21 Sep 1998 06:42:02 -0500"
References: <199809210043.UAA10167@locke.ccil.org> <199809211142.GAA00670@bruno.techno.com>
Message-ID: <uogs8a19m.fsf@delivery.ansa.co.uk>

Steven> Steven R. Newcomb <URL:mailto:srn@techno.com>

0> In article <199809211142.GAA00670@bruno.techno.com>, Steven wrote:

Steven> The problems with the quotation from RFC 1737 are these:
Steven>
Steven> * Who defines what constitutes a name assignment authority?
Steven>   If it's the end user, in an ad hoc fashion, that's fine, I'm
Steven>   satisfied.  But the language of the RFC,

When a URN scheme (namespace) is registered at IANA, one of the data
required for registration is how names are assigned.  In the case of
FPIs, the registration document would simply refer to the assignment
rules in the FPI definition.  (I don't believe that an FPI URN scheme
has yet been proposed - and if it is, I expect that it won't include
all possible FPIs, since IDN FPIs are not necessarily persistent).


Steven> [quote RFC 1737:]

#> For example, ISBN numbers, ISO public identifiers, and UPC product
#> codes seem to satisfy the functional requirements, and allow an
#> embedding that satisfies the syntactic requirements described here.

Steven>   ...indicates otherwise.  Here, in all three examples, there
Steven>   is a name registration authority; the end user is evidently
Steven>   not allowed to specify the Sears 1922 Farm Catalog unless
Steven>   this has already become a formally-cataloged public entity
Steven>   of some kind.

Just the same as for FPIs, yes?


Steven>   (Note that it's not clear whether "ISO public identifiers" means
Steven>   "public identifiers in ISO syntax" or "public identifiers that
Steven>   begin with the letters 'ISO' and that define public text entities
Steven>   that were created under the auspices of the ISO".  There is a
Steven>   very small set of the latter -- a set that has little or nothing
Steven>   to do with what I'm concerned about here.)

Agreed.


Steven> * There is an even more problematic statement: "A URL identifies
Steven>   the location or a container for an instance of a resource
Steven>   identified by a URN."  This strongly implies that there must
Steven>   be a URL behind every URN, even if that URL is fictitious or
Steven>   doesn't happen to work.

I'm not sure that I see that implication - to me it explains the
meaning of having an URL for a given URN.  AFAICT from reading RFC
1737 and the URN mailing list, resolution of a URN may result in zero
or more URLs.


Steven> The fact that you make such a point of demonstrating the
Steven> conversion of my FPI into a URN makes me wonder whether you
Steven> understood my question.  In the scenario I'm asking about,
Steven> there is never any need to transmit the FPI, so there's no
Steven> need to convert it.

Storing it into a document (as a URI) counts as transmission in my
book.  If you want to represent an FPI in a XML system identifier, you
need to use the URN syntax, since the XML spec says that system IDs
must be URIs.  Similarly for any other use of an FPI in a context
where a URI is expected.

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jjc at jclark.com  Tue Sep 22 12:57:30 1998
From: jjc at jclark.com (James Clark)
Date: Mon Jun  7 17:04:56 2004
Subject: Mixed Content Models
References: <3.0.5.32.19980921114754.00968d00@library.berkeley.edu>
Message-ID: <3607198A.3D4E54A0@jclark.com>

Jerome McDonough wrote:

>         <!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
>                                                           backward?, ivuInstr*))*
> 
> Is this a legitimate content model under XML section 3.2.2?
> Msxml doesn't have a problem with it, and nsgmls using the -wxml flag
> also happily parses the DTD.

It's not legal and the current version of nsgmls gives a warning with
-wxml.

James


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Usha_R2 at verifone.com  Tue Sep 22 15:42:55 1998
From: Usha_R2 at verifone.com (Usha_R2@verifone.com)
Date: Mon Jun  7 17:04:56 2004
Subject: Information needed -- Please help
Message-ID: <7BA6E16CF180D111944700A0C9979DE51D4F94@blr-nt-mail2.verifone.com>

Hi! All,
   Can you give me some information regarding the following questions. I
won't be able to proceed further until these questions are clarified.
Please give me as much of information as possible on these questions.

Question 1 : 
A wide range of tools exists that offer HTML design and editing
facilities, including the multi-language support intrinsically available
on current Windows Environment. 
How do support Multi language in XML. Are there any editing tools
available which help in doing this. 

Question 2 :
 What is the use of using a <TABLE> tag in HTML file. Can any anybody
please give information on this and how is it useful? 

Question 3 :
 A parsed XML file can be specified using a TABLE tag in HTML. What
advantage does this give and for what purposes can I use this?

Question 4:
 An XSL sheet can be used for defining the format in which the contents
of XML file have to presented. Instead of using an XSL sheet can I put
the information associated for presenting the content in XML file only,
using attributes. What is the advantage of using XSL over the second
approach. Please clarify.

To make it more clear consider the following example:

I defined a receipt in the following manner:

<?xml version="1.0"?>              
<!DOCTYPE receipt SYSTEM "receipt.dtd">

<receipt> 
        <ITEM     Name="H1"         Text="Receipt"  Alignment="CENTER"
Font="BOLD"   Size="7" />
        <ITEM     Name="amount"  Value="1000"    Alignment="RIGHT"
Font="PLAIN"   Size="4" />  
        <ITEM     Name="tip"          Value="10"
Alignment="RIGHT"     Font="PLAIN"   Size="4" />   
</receipt>                               

I want to know whether it is a good idea to put Alignment, Font, Size as
attributes of ITEM ELMENT or put all this information an XSL sheet.
Please let me know the advantages and disadvantages in this approach.

Is it a better idea to do the same process using HTML using the FONT tag
etc. ( Going through XSL I found that the <FONT> tag is deprecated).

I am very sorry for posting so many question and also questions
regarding HTML. I am trying to find out which of the two (XML or HTML)
is a better approach. 

Waiting for a fast reply.

Thanks in advance
Usha
K. Usha Rani
 
Dept        : Applications
Phone    : (080) - 2869920
Email     :  usha_r2@verifone.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Keith.Corder at fritolay.com  Tue Sep 22 15:47:21 1998
From: Keith.Corder at fritolay.com (Keith.Corder@fritolay.com)
Date: Mon Jun  7 17:04:56 2004
Subject: http://www.scripting.com/98/09/News21.xml
Message-ID: <199809221347.AA20078@interlock.fritolay.com>

I should have been more specific.  When I point my browser at the site, I
get the following:

Copyright 1997-1998 UserLand Software, Inc. 1.0 Mon, 21 Sep 1998 07:00:00
GMT
Mon, 21 Sep 1998 15:07:24 GMT
http://www.scripting.com/frontier5/xml/scriptingNews.html
Thea's Galleria tours a German online CD catalog that includes audio
samples.
http://www.scripting.com/thea/ Thea's Galleria CNN: Americans watch Clinton

Videotape. I'm going to watch it later. I'm having fun playing god on
Nirvana.
http://www.cnn.com/ALLPOLITICS/stories/1998/09/21/tape/ Americans watch
Clinton
Videotape XML Evangelism: If you want XML files to test with, we have
them...
http://betty.userland.com/stories/daveWiner/98/09/xmlEvangelism.html XML
Evangelism
SJ Merc: Silicon Valley giving cold shoulder to Clinton.
http://www.sjmercury.com/columnists/nolan/docs/cn092198.htm Silicon Valley
giving
cold shoulder to Clinton We worked on Nirvana over the weekend.
<i>Lots more digging to do!</i>
http://nirvana.userland.com/system/default.wsf worked


>From looking at the page source, this looks like all of the text on the
site with all of the tags ignored.
What should I be seeing?

My proxy firewall comment really related to Jumbo - and I don't know if
Jumbo is even the right tool.
Would Jumbo only work with CML, or will it work with any XML
implementation?

Thanks, Keith Corder


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jbolles at homeaccount.com  Tue Sep 22 15:51:59 1998
From: jbolles at homeaccount.com (Jack Bolles)
Date: Mon Jun  7 17:04:56 2004
Subject: a small question on xml
References: <Pine.SGI.3.96.980817115823.14612C-100000@lassie.cs.umbc.edu>
Message-ID: <3607ABD7.E48712BC@homeaccount.com>

You have a lot of help on the second expression, so I'll give an answer to the first expresion. You can use either a single quote ' or a double quote " to encapsulate text. But if you want to encapsulate a quote (single or double) it must be encaspulated by its opposite. In other words, rewrite the first expression as either:

        <someexpression original-from="(relation 5 'abc')">
or
        <someexpression original-from='(relation 5 "abc")'>

Jack
------------------------------------------------------
Jack Bolles
Software Engineer - UI Designer
Home Account Networks
------------------------------------------------------


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jbolles at homeaccount.com  Tue Sep 22 16:07:12 1998
From: jbolles at homeaccount.com (Jack Bolles)
Date: Mon Jun  7 17:04:57 2004
Subject: http://www.scripting.com/98/09/News21.xml
References: <199809221347.AA20078@interlock.fritolay.com>
Message-ID: <3607AF7C.B8940533@homeaccount.com>

Keith.Corder@fritolay.com wrote:

> I should have been more specific.  When I point my browser at the site, I
> get the following:
>
> Copyright 1997-1998 UserLand Software, Inc. 1.0 Mon, 21 Sep 1998 07:00:00
> GMT
>
> <snip/>
>
> Videotape XML Evangelism: If you want XML files to test with, we have
> them...
> http://betty.userland.com/stories/daveWiner/98/09/xmlEvangelism.html XML
>
> <snip/>

On any given day, follow the link. Today's excerpt is:

http://www.scripting.com/98/09/News21.xml

Jack

------------------------------------------------------
Jack Bolles
Software Engineer - UI Designer
Home Account Networks
------------------------------------------------------


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From deke at tallent.com  Tue Sep 22 16:15:46 1998
From: deke at tallent.com (Deke Smith)
Date: Mon Jun  7 17:04:57 2004
Subject: Mix encodings in a document?
Message-ID: <1305666774-174489322@tallent.com>

Gavin Thomas Nicol, gtn@eps.inso.com said on 9/21/98 2:42 PM:

>The two required encodings are UTF-16 and UTF-8. You can use any other
>encoding you like, so long as the system you are working with supports
>it.
>
>Remember: byte != character code != character != glyph

It may be slightly off topic, but do you mind expanding on that last 
line? I would be interested.


IANA character encoding spec I found at 
ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets do not 
explicitly name UTF-16, but does name several flavors of Unicode(?):

ISO-10646-UCS-2
ISO-10646-UCS-4
ISO-10646-UTF-1
ISO-10646-Unicode-Latin1
ISO-10646-J-1
UNICODE-1-1
UNICODE-1-1-UTF-7
UTF-7
UTF-8

Which is an alias for UTF-16?

-----------------------------------------------------------------
Deke Smith
Tallent Communications Group, Brentwood TN
deke@tallent.com, 615-661-9878
-----------------------------------------------------------------
" The best way to predict the future is to invent it. " 
       - Alan Kay 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Tue Sep 22 16:34:23 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:57 2004
Subject: Socat issues for XML
Message-ID: <3.0.32.19980922093247.011a33d0@pophost.arbortext.com>

[John and I have been having a dialog on xml-dev@ic.ac.uk, but he
sent me one reply in the chain just to me, not to the list, so I
replied just to him.  He then replied to that message cc-ing the
list, so I figure I better send my posting to the list for context.
Sorry about this being somewhat out of order.  paul]

At 17:35 1998 09 21 -0400, John wrote:
>Paul Grosso scripsit:
>
>> I see three options:
>> 
>> 1.  say that your subset of TR9401 catalogs doesn't include OVERRIDE;
>> 2.  say that your subset "recognizes" OVERRIDE entries but ignores them;
>> 3.  say that your subset handles OVERRIDE.
>
>My current choice is 2.  For reference, my implementation understands
>PUBLIC, SYSTEM, DELEGATE, CATALOG, and BASE, and recognizes-but-ignores
>all other entries whether documented in 9401:1997 or not, as long as they
>follow the grammar given there:  (Name, Name?, Quoted-String*).

That is logically reasonable, though it suffers from the problem that
users will get unexpected results when they use existing catalogs.

>
>> Looking at the pros and cons, I'd opt for option 3:  a little more work
>> for your implementations seems preferable to the problems 1 and 2 will
>> mean for end users. 
>
>The objection to #3 is that I expect that system ids in XML documents
>will often be pro forma, and it will be more useful to use the catalog
>to find a local equivalent.

I'm not sure I follow.  Are you saying that, even if you set the initial
default of OVERRIDE to YES and a catalog writer explicitly puts in an
OVERRIDE NO entry, you think it is more "useful" to assume they didn't
mean it and ignore it?  That there is, in fact, no way to say "please
really use the system ids in my document" (which is, of course, precisely
what the "real" browsers will do until and unless they actually implement
a catalog, which I don't expect from MS&NS in the near term)?

>> I'm guessing you've got a scenario in mind where there is
>> no SYSTEM or PUBLIC match in a given catalog entry file and
>> where there HAS been a matching DELEGATE or CATALOG entry,
>> BUT for some reason you want to ignore the DELEGATE or CATALOG 
>> entry that was put into the catalog (why?) and instead just 
>> give up now and use the system id in the external identifier.
>
>Because XML system ids have defined semantics, whereas public ids
>don't.  I would expect that a local catalog would be willing to
>defer public id interpretation to a "root catalog" using CATALOG,
>without necessary wanting all system ids (URLs) to be decoded
>thereby.
>
>What I really want is a CATALOG-OF-PUBLIC-IDS-ONLY entry.

So why not put only public ids in that catalog?

You're only going to get to that catalog if you're still trying
to resolve an external identifier.  If you get to that catalog,
you've got to read/process it, so you're not saving any time.

It sounds like what you want is "ignore any SYSTEM type entries
in this catalog" but if the way to do that is to put a special
entry into that catalog, you already have to be able to access
and modify that catalog, so why not just omit the SYSTEM entries.

Perhaps you need (and maybe this is what you meant above) some 
sort of entry like CATALOG but that ignores SYSTEM entries in
the catalog-to-be-read.  But that still sends you off to read
that catalog, so you aren't saving any search time as you implied
in your earlier message.  All you're doing is inhibiting any
SYSTEM entries from matching, so you're making it even more likely
that your search will continue longer and wider. 

You're right that the more global a catalog, the less likely
it is that it will have a SYSTEM type entry.  But if global
catalogs have no SYSTEM entries, you don't have a problem, and
if they do, maybe it makes sense.  For example, say you download
the MathML Rec from http://www.w3.org/TR/REC-MathML/ to your
machine and then browse it.  There will be a graphic/icon on the
top that doesn't resolve.  If you look in the source, you'll see

  <DIV ALIGN=RIGHT>
  <A HREF="http://www.w3.org/"><IMG SRC="images/w3c_home.gif"
    ALT="W3C" BORDER=0 HEIGHT=48 WIDTH=72 ALIGN=LEFT></A>
  <B>REC-MathML-19980407</B>
  </DIV>

Note the <IMG SRC="images/w3c_home.gif"> which doesn't work
on your machine.  But if the W3C had a "global" catalog that 
had the entry:
  SYSTEM "images/w3c_home.gif"  "http://www.w3.org/images/w3c_home.gif"
and you had a catalog with the entry:
  CATALOG "www.w3.org/catalog"
then you'd get things resolving properly when you browse W3C 
documents that you've downloaded to you local machine.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Tue Sep 22 16:50:07 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:04:57 2004
Subject: Public Identifiers
In-Reply-To: <199809220407.XAA00891@bruno.techno.com> (srn@techno.com)
Message-ID: <199809221448.KAA03627@ruby.ora.com>

[Steven R. Newcomb]
> [John Cowan:]
> > Global uniqueness *is* a requirement of URNs, in the sense that
> > two distinct things ought not be described by the same URN, and
> > someone has to define what counts as "distinct things".
> 
> OK. I necessarily conclude from your answer that URNs can't do what
> FPIs can do.

I am not sure how this follows.  Are you saying that two distinct
things *ought* to be described by the same FPI?  I'm having a hard
time thinking of a case in which this would be a good idea, except
maybe -//crism//DOCUMENT Random Text of the Day//EN, in which case the
FPI does, in some sense, point to a single resource (the semantic
concept of a randomly selected document).

> > My understanding is that the widespread use of unregistered FPIs
> > merely reflects the lack of easy access to registration until
> > recently.
> 
> My understanding is that there is a need to refer to things that
> nobody has registered and that nobody who needs them to be
> registered has the authority to register.  Surely you're not
> proposing that Joe User should take it upon himself to register
> Sears and/or it's 1922 Farm Catalog on behalf of Sears, if Sears
> hasn't done this already for itself.

I don't understand this statement.  There is a need to refer to things
that nobody has registered, yes.  But you then say that Joe User
shouldn't register Sears, but that's exactly what he did in your
-//Sears//... example.  If Sears has not created an FPI for their 1922
catalog, then Joe User should say -//Joe User//NONSGML Sears Roebuck
and Co. 1922 Catalog//EN.

> I gather from what you say, however, that URNs can only be used to
> reference things that their owners have arranged to be referencable
> via URIs,

This is a truism, since URNs are a subset of URIs.  I think you meant
URLs, but...

> by going to the expense and trouble of registering themselves and/or
> their published information assets.

This isn't true.  The URN spec, as John quoted, specifically allows
the case of URNs that are not resolvable to a concrete electronic
resource.  And since FPIs can be used as URNs,
urn:fpi:-//Joe%20User//NONSGML%20Sears%20Roebuck%20and%20Co.%201922%20Catalog//EN
is a perfectly legitimate URN that points to the lump of paper in my
outhouse.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From grk at arlut.utexas.edu  Tue Sep 22 17:10:11 1998
From: grk at arlut.utexas.edu (Glenn R. Kronschnabl)
Date: Mon Jun  7 17:04:57 2004
Subject: XP syntax error - including cals-tbl.dtd in simple xml dtd
Message-ID: <199809221506.KAA09705@ns1.arlut.utexas.edu>

I have a simple DTD that includes that cals-tbl.dtd from docbook3 and am 
trying to use jclark's XP.   Here is the relevant snippets:

-- simple.dtd --

<?xml version="1.0" encoding="ISO-8859-1"?>

<!ENTITY % calstbls PUBLIC "-//USA-DOD//DTD Table Model 951010//EN" 
"cals-tbl.dt
d">

%calstbls;

etc.

Here is the output of a XP command (all XP apps  give the same error):

grk@magellan$ java com.jclark.xml.apps.Time june98.xml 
file:/home/grk/sgmldocs/reports/nims/cals-tbl.dtd:124:29: syntax error
0.613

This is using the latest XP.  

Should this work in XML?  If so, is this a bug in XP?

Cheers,
Glenn                                  
--------------------
Glenn R. Kronschnabl
Applied Research Laboratories        | grk@arlut.utexas.edu (PGP/MIME ok)
The University of Texas at Austin    | http://www.arlut.utexas.edu/~grk
PO Box 8029, Austin, TX 78713-8029   | (Ph) 512.835.3642 (FAX) 512.835.3808
10,000 Burnet Road, Austin, TX 78758 | ... but an Aggie at heart!


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sduda at hhcc.com  Tue Sep 22 17:14:56 1998
From: sduda at hhcc.com (sduda@hhcc.com)
Date: Mon Jun  7 17:04:57 2004
Subject: (un)subscribe xml-dev
Message-ID: <85256687.0053DCBE.00@arlington.hhcc.com>


"Glenn R. Kronschnabl" <grk@arlut.utexas.edu> on 09/22/98 11:05:29 AM

Please respond to "Glenn R. Kronschnabl" <grk@arlut.utexas.edu>

To:   xml-dev@ic.ac.uk
cc:    (bcc: Stacey Duda/Hill Holliday Advertising Inc./US)
Subject:  XP syntax error - including cals-tbl.dtd in simple xml dtd


I have a simple DTD that includes that cals-tbl.dtd from docbook3 and am
trying to use jclark's XP.   Here is the relevant snippets:
-- simple.dtd --
<?xml version="1.0" encoding="ISO-8859-1"?>
<!ENTITY % calstbls PUBLIC "-//USA-DOD//DTD Table Model 951010//EN"
"cals-tbl.dt
d">
%calstbls;
etc.
Here is the output of a XP command (all XP apps  give the same error):
grk@magellan$ java com.jclark.xml.apps.Time june98.xml
file:/home/grk/sgmldocs/reports/nims/cals-tbl.dtd:124:29: syntax error
0.613
This is using the latest XP.
Should this work in XML?  If so, is this a bug in XP?
Cheers,
Glenn
--------------------
Glenn R. Kronschnabl
Applied Research Laboratories        | grk@arlut.utexas.edu (PGP/MIME ok)
The University of Texas at Austin    | http://www.arlut.utexas.edu/~grk
PO Box 8029, Austin, TX 78713-8029   | (Ph) 512.835.3642 (FAX) 512.835.3808
10,000 Burnet Road, Austin, TX 78758 | ... but an Aggie at heart!


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Tue Sep 22 17:17:11 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:57 2004
Subject: http://www.scripting.com/98/09/News21.xml
In-Reply-To: <199809221347.AA20078@interlock.fritolay.com>
Message-ID: <3.0.5.32.19980922081735.012e7750@scripting.com>

Do a View Source to see the XML tags. Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From paul at arbortext.com  Tue Sep 22 17:36:15 1998
From: paul at arbortext.com (Paul Grosso)
Date: Mon Jun  7 17:04:57 2004
Subject: Socat issues for XML
Message-ID: <3.0.32.19980922103419.01143a48@pophost.arbortext.com>

At 22:40 1998 09 21 -0400, John Cowan wrote:
>Paul Grosso scripsit:
>> I'm not sure I follow.  Are you saying that, even if you set the initial
>> default of OVERRIDE to YES and a catalog writer explicitly puts in an
>> OVERRIDE NO entry, you think it is more "useful" to assume they didn't
>> mean it and ignore it?  That there is, in fact, no way to say "please
>> really use the system ids in my document" (which is, of course, precisely
>> what the "real" browsers will do until and unless they actually implement
>> a catalog, which I don't expect from MS&NS in the near term)?
>
>If you really want to use the system ids, why make use of a catalog at all?
>In SGML, you may want it for standalone public ids, but not in XML.

You omitted my compatibility argument which is summarized by:

>users will get unexpected results when they use existing [TR9410]
>catalogs.

I have not heard a convincing argument for not including OVERRIDE
in your subset of TR9401.  There are many TR9401 catalogs in use,
implementing OVERRIDE is trivial, and users are used to using it.
If you don't include it, then the existing catalogs--and users
who write new catalogs based on their understanding of TR9401
catalogs as they exist--will get subtlely different results with
no warning because you would be ignoring the OVERRIDE NO entries.

>> Perhaps you need (and maybe this is what you meant above) some 
>> sort of entry like CATALOG but that ignores SYSTEM entries in
>> the catalog-to-be-read.
>
>That is what I meant.
>. . .
>Your example is compelling.  But I think there is still a need,
>for efficiency's sake, to add CATALOG entries that are marked 
>"do not search this catalog if *not* looking for a public id".

I see your point.  You want something like a PUBLIC-CATALOG entry
type with the same semantics as the CATALOG entry type except 
additionally with the semantic "ignore if the external identifier
has no public identifier."  Note that the referenced catalog entry
file could still have SYSTEM (and ENTITY and other) entry types, and
if the catalog is ever processed, all those entry types are significant,
it's just that no catalog referenced by a PUBLIC-CATALOG entry would 
be processed if the current external identifier being resolved has 
no public id.

Note, you can't say "looking for" (or "not looking for") a public
id, because you are never looking for a match to a public id per se.
You are always looking for a match for the set of info that you have
for the current external identifier, and that set of info includes
one or more of (1) public id, (2) system id, (3) entity name.  In
particular, for XML, you always (except for notations) have both a
system id and an entity name and sometimes you also have a public id.

I think you'd also want the standard CATALOG entry type, therefore
PUBLIC-CATALOG would be a new entry type.  The standard CATALOG 
entry would address my "compelling example" as well as give
compatibility with TR9401 catalogs.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Tue Sep 22 18:07:28 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:57 2004
Subject: Mix encodings in a document?
In-Reply-To: <1305666774-174489322@tallent.com>
References: <1305666774-174489322@tallent.com>
Message-ID: <13831.48304.32182.557900@localhost.localdomain>

Deke Smith writes:

 > Gavin Thomas Nicol, gtn@eps.inso.com said on 9/21/98 2:42 PM:
 > 
 > >The two required encodings are UTF-16 and UTF-8. You can use any other
 > >encoding you like, so long as the system you are working with supports
 > >it.
 > >
 > >Remember: byte != character code != character != glyph
 > 
 > It may be slightly off topic, but do you mind expanding on that last 
 > line? I would be interested.

I'll let Gavin respond to the question, but I find it interesting that
this is somewhat (but not exactly) equivalent to the
phone/phoneme/graph/grapheme distinction in linguistics.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From thillai at ix.netcom.com  Tue Sep 22 18:32:56 1998
From: thillai at ix.netcom.com (Thillai)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
Message-ID: <01BDE624.4689F7A0@thillai>

Hi,

I know only little bit about XML.  I like to know whether a CORBA operation
can be expressed in XML document type definition?  

For example if the IDL has something like

interface z
{
	typedef sequence<String, 5>  t1;

	void x(in int p1,  in t1 p2);
}

and I want to store the parameters for the operation in a file.

I like to have a XML data file like
<interface>
z
<operation>
x
<inparameter>
<param>
p1
<value>10</value>
</param>
<param>
p2
<value><nelems>5</nelems><0>abc</0><1>efg</1><2>hij</2><3>
klm</3><4>nop</4></value>
</param>
</inparameter>
</operation>
</interface>

If there is no maximum limit for the type t1 then no. of elements in the
sequence might vary.

For this file is it possible to write DTD.  (I will read about DTD and find. 
Before that any expert comment will be helpful)

If it is possible then is it possible to write XSL for getting values from the 
user.  (no. of elements in the sequence might vary at runtime).

Thillai
AT&T
Middletown, NJ

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Tue Sep 22 18:36:59 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:58 2004
Subject: Mix encodings in a document?
In-Reply-To: <1305666774-174489322@tallent.com>
Message-ID: <3.0.5.32.19980922093407.00aac560@library.berkeley.edu>

At 09:15 AM 9/22/98 -0500, Deke Smith wrote:
>Gavin Thomas Nicol, gtn@eps.inso.com said on 9/21/98 2:42 PM:
>
>>The two required encodings are UTF-16 and UTF-8. You can use any other
>>encoding you like, so long as the system you are working with supports
>>it.
>>
>>Remember: byte != character code != character != glyph
>
>It may be slightly off topic, but do you mind expanding on that last 
>line? I would be interested.
>

Glyphs represent the various shapes that a character may have when
rendered or displayed; a single character may have multiple glyphs,
and it's possible for a single glyph to represent several different
characters.  Arabic, as an example, has many different glyphs for
representing a single character.  So glyphs are not the same as
characters.

Unicode defines characters as "the smallest components of written
language that have semantic value," while character codes represent
characters as "values that reside only in a memory representation, as
strings in memory, or on disk."  Different character encoding standards will
have different character codes for the same character, and even within
Unicode, the same character code may have different encodings (UTF-7, 
UTF-8, etc.).  So, character codes are not the same as characters.

And Unicode represents a character as a single 16 bit word, so bytes
do not represent characters (even in UTF-8, where a character encoding
may be one to four bytes).
  
>IANA character encoding spec I found at 
>ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets do not 
>explicitly name UTF-16, but does name several flavors of Unicode(?):
>
>ISO-10646-UCS-2
>ISO-10646-UCS-4
>ISO-10646-UTF-1
>ISO-10646-Unicode-Latin1
>ISO-10646-J-1
>UNICODE-1-1
>UNICODE-1-1-UTF-7
>UTF-7
>UTF-8
>
>Which is an alias for UTF-16?
>

ISO-10646-UCS-2 (the 2-octet Basic Multilingual Plane) is the
same as Unicode (which is a 16-bit chararacter encoding), so
that would be your "UTF-16." (I don't think that, technically,
the 16-bit encoding gets referred to as a UCS Transmission Format).


Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sr at ecs.soton.ac.uk  Tue Sep 22 18:46:31 1998
From: sr at ecs.soton.ac.uk (Sigi Reich)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
In-Reply-To: <01BDE624.4689F7A0@thillai>
Message-ID: <3.0.5.32.19980922174852.00801820@penelope.ecs.soton.ac.uk>

Thillai,

you might be interested in these two references:

- Dave Winer describes how a remote procedure call (rpc) can be realised by
encoding the call (method, parameters, ...) in an XML message (document)
that is transferred via http. See
http://www.scripting.com/davenet/98/02/rpcOverHttpViaXml.html

- The Open Hypermedia Systems Working Group (OHSWG) addresses a similar
problem as you described by specifying interfaces for different domains of
hypermedia and communicating them as XML documents (the documents are
transfered over TCP/IP using a similar mechanism as HTTP). For testing
purposes also a communication using Java's RMI has been realised. See
http://www.csdl.tamu.edu/ohs/ and
http://www.mmrg.ecs.soton.ac.uk/~sr/ohs/ohpindex.html for further details.

Sigi


At 12:25 22/09/98 -0400, you wrote:
>Hi,
>
>I know only little bit about XML.  I like to know whether a CORBA operation
>can be expressed in XML document type definition?  
>
>For example if the IDL has something like
>
>interface z
>{
>	typedef sequence<String, 5>  t1;
>
>	void x(in int p1,  in t1 p2);
>}
>
>and I want to store the parameters for the operation in a file.
>
>I like to have a XML data file like
><interface>
>z
><operation>
>x
><inparameter>
><param>
>p1
><value>10</value>
></param>
><param>
>p2
><value><nelems>5</nelems><0>abc</0><1>efg</1><2>hij</2><3>
>klm</3><4>nop</4></value>
></param>
></inparameter>
></operation>
></interface>
>
>If there is no maximum limit for the type t1 then no. of elements in the
>sequence might vary.
>
>For this file is it possible to write DTD.  (I will read about DTD and find. 
>Before that any expert comment will be helpful)
>
>If it is possible then is it possible to write XSL for getting values from
the 
>user.  (no. of elements in the sequence might vary at runtime).
>
>Thillai
>AT&T
>Middletown, NJ
-------------------------------------------------
Sigi Reich, Research Fellow
Multimedia Research Group
Department of Electronics and Computer Science, Bldg. 59
University of Southampton, Southampton S017 1BJ, UK
phone +44 (0)1703 59       fax +44 (0) 1703 59 2865
email sr@ecs.soton.ac.uk   http://www.mmrg.ecs.soton.ac.uk/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Tue Sep 22 18:49:11 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
References: <01BDE624.4689F7A0@thillai>
Message-ID: <3607DEA9.37620148@finetuning.com>

Hi Thillai:

You should look at the WebBroker work that John Tigue has done at:

http://www.w3.org/TR/1998/NOTE-webbroker/

Where he uses DTDs to express both DCOM and CORBA objects.

Cheers,

lisa rein
http://www.finetuning.com

Thillai wrote:
> 
> Hi,
> 
> I know only little bit about XML.  I like to know whether a CORBA operation
> can be expressed in XML document type definition?
> 
> For example if the IDL has something like
> 
> interface z
> {
>         typedef sequence<String, 5>  t1;
> 
>         void x(in int p1,  in t1 p2);
> }
> 
> and I want to store the parameters for the operation in a file.
> 
> I like to have a XML data file like
> <interface>
> z
> <operation>
> x
> <inparameter>
> <param>
> p1
> <value>10</value>
> </param>
> <param>
> p2
> <value><nelems>5</nelems><0>abc</0><1>efg</1><2>hij</2><3>
> klm</3><4>nop</4></value>
> </param>
> </inparameter>
> </operation>
> </interface>
> 
> If there is no maximum limit for the type t1 then no. of elements in the
> sequence might vary.
> 
> For this file is it possible to write DTD.  (I will read about DTD and find.
> Before that any expert comment will be helpful)
> 
> If it is possible then is it possible to write XSL for getting values from the
> user.  (no. of elements in the sequence might vary at runtime).
> 
> Thillai
> AT&T
> Middletown, NJ
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Tue Sep 22 19:04:57 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
Message-ID: <3.0.32.19980922130406.00ada54c@gw1.webmethods.com>

At 12:25 PM 9/22/98 -0400, Thillai wrote:
>I know only little bit about XML.  I like to know whether a CORBA operation
>can be expressed in XML document type definition?  

You are looking for an Interface Definition Language (IDL) expressed in
XML.  There are now a few of them floating around.

webMethods has one called WIDL (Web IDL).  It is an 80/20 IDL -- 80% of the
functionality of conventional IDLs with 20% of the complexity.  It can't
express any CORBA interface, but it can express about 80% of the ones that
show up in practice.  You can learn a little about this IDL from this URL:

    http://www.webmethods.com/xml/widl_wp.html

Unfortunately, this URL is a bit dated and does not distinguish between
WIDL specifications and WIDL mappings.  A WIDL mapping is a way to
implement a specification so that the interface wraps up an HTML or XML web
site.

You might instead read a chapter I wrote Charles Goldfarbs'
_The_XML_Handbook_.  The chapter is titled "WIDL and XML RPC."  For an
example application of WIDL with XML RPC, see the chapter titled "Supply
chain integration."

John Tigue of Datachannel also developed an IDL in XML.  It's part of a
well-thought-out framework of XML technologies for RPC.  You can find his
work at this URL:

    http://www.w3.org/Submission/1998/07/

His IDL and RPC format are intended to serve as a superset of both CORBA
and DCOM.  An amazing feat.  It quite a bit more complicated than WIDL, but
you'll probably want it if your problem space falls into the 20% area that
WIDL is missing.
--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From graham.moore at dpsl.co.uk  Tue Sep 22 19:34:03 1998
From: graham.moore at dpsl.co.uk (Graham Moore)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
Message-ID: <TFSOIANH@dpsl.co.uk>>


I have implemented a distributed object system that used XML for object 
migration and distributed messaging.

There is only a small amount of meta data (DTD definition) required to drive 
such a system. The kind of thing you scribbled may need a few refinements 
but is essentially all that is required. In addition to a dynamic invocation 
mechanism.

If one takes the view that the Object and thus it's serialised form  (the 
XML) IS the message / operation then it all just falls out.

As a general point it would be good to see XML become the standard for 
object serialisation / migration  and distributed messaging. Consider how 
JINI / distributed services in general could become more open if all the 
services, and calls were not only described in XML but could be invoked 
using "XML Messaging".

Graham.

gdm@dpsl.co.uk

scheme / XML : the instance is the code


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Tue Sep 22 20:13:29 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:04:58 2004
Subject: Preference Files
Message-ID: <199809221813.OAA09232@hesketh.com>

Has anyone started work on a standard DTD for preference files, or is the
expectation that everyone's going to do their own differently?  It seems
like something that might be useful, especially in the context of some of
the browser development issues that have been discussed here lately.

If anyone knows of an existing one and wants to promote it, I'm writing a
chapter...  The current material uses a custom-built preference file, but
I'd like to generalize it.

Thanks!

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Tue Sep 22 20:31:29 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:58 2004
Subject: Mix encodings in a document?
In-Reply-To: <000701bde64d$b252e810$577670c6@endymion.eps.inso.com>
References: <3.0.5.32.19980922093407.00aac560@library.berkeley.edu>
Message-ID: <3.0.5.32.19980922112823.00b6d4b0@library.berkeley.edu>

At 01:23 PM 9/22/98 -0400, Gavin Thomas Nicol wrote:
>> And Unicode represents a character as a single 16 bit word, so bytes
>> do not represent characters (even in UTF-8, where a character encoding
>> may be one to four bytes).
>
>This is no longer true. 

I'll accept that (I haven't seen Unicode 2.1 yet), but which part is no
longer true?

>> ISO-10646-UCS-2 (the 2-octet Basic Multilingual Plane) is the
>> same as Unicode (which is a 16-bit chararacter encoding), so
>> that would be your "UTF-16." (I don't think that, technically,
>> the 16-bit encoding gets referred to as a UCS Transmission Format).
>
>This is not correct. UTF-16 has not (to the best of my knowledge)
>been registered yet. UTF-16 differs from UCS-2 in some ways, the most
>significant of which is that it allows surrogate pairs (two 16 bit
>codes that represent a single logical character code).
>

OK, I shouldn't answer e-mail before coffee.  But let me check
this with you to see if I've got the spec. right (and make sure
they didn't change this in 2.1 as well).  Under Unicode version 2.0,
what I should've said is:

	Unicode == ISO-10646-UCS-2 != UTF-16

as Unicode and 10646 in UCS-2 format should be identical, but UTF-16
differs from both of these in it allows the use of code surrogate
pairs to enable encoding the BMP and next 16 planes of UCS-4.  From
what I can see at Unicode's home page, it now looks like Unicode is
dropping UCS-2 character encoding and now only endorses UTF-8 and 
UTF-16, so that the situation now is:

	Unicode != ISO-10646-UCS-2

and Unicode sometimes does/sometimes does not equal UTF-16.  Is that
more or less the case at the moment?


Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Tue Sep 22 20:40:07 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
In-Reply-To: <3.0.32.19980922130406.00ada54c@gw1.webmethods.com>
Message-ID: <3.0.5.32.19980922114036.012fce80@scripting.com>

Here's another data point, an XML-based object serialization format is part
of the XML-RPC protocol we've been working on with WebMethods and others. 

The spec is here:

http://www.scripting.com/frontier5/xml/code/rpc.html

Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simpson at polaris.net  Tue Sep 22 20:51:51 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun  7 17:04:58 2004
Subject: XP syntax error - including cals-tbl.dtd in simple xml dtd
Message-ID: <3.0.32.19980922144938.006a68bc@polaris.net>

At 10:05 AM 9/22/98 -0500, Glenn R. Kronschnabl wrote:
><?xml version="1.0" encoding="ISO-8859-1"?>
>
><!ENTITY % calstbls PUBLIC "-//USA-DOD//DTD Table Model 951010//EN"
"cals-tbl.dtd">
>%calstbls;
	<snip>
>Here is the output of a XP command (all XP apps  give the same error):
>
>grk@magellan$ java com.jclark.xml.apps.Time june98.xml 
>file:/home/grk/sgmldocs/reports/nims/cals-tbl.dtd:124:29: syntax error
>0.613

Well, I'm guessing, but it looks to me as though you're simply missing the
SYSTEM keyword in your entity declaration.

====================
John E. Simpson
Just XML (ISBN 0-13-943417-8)
Available in September from Prentice Hall PTR

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simpson at polaris.net  Tue Sep 22 21:22:46 1998
From: simpson at polaris.net (John E. Simpson)
Date: Mon Jun  7 17:04:58 2004
Subject: XP syntax error - including cals-tbl.dtd in simple xml dtd
Message-ID: <3.0.32.19980922152102.006a482c@polaris.net>

At 12:05 PM 9/22/98 -0700, Murray Altheim wrote:
>John E. Simpson <simpson@polaris.net> writes:
>> 
	<snip>
>> Well, I'm guessing, but it looks to me as though you're simply missing the
>> SYSTEM keyword in your entity declaration.
>
>No, SYSTEM would not be correct syntax if PUBLIC is also specified in the
>declaration -- the syntax of the declaration is fine. The problem with 
>using CALS unmodified is that it is an *SGML* DTD which includes SGML 
>features not allowed in XML, such as declaration-based SGML comments, tag
>minimization features, etc.

Thanks, Murray. I probably wouldn't have gone out on a limb if someone else
had already answered his question -- he sounded pretty desperate. :) Your
reply makes perfect sense.

====================
John E. Simpson
Just XML (ISBN 0-13-943417-8)
Available in September from Prentice Hall PTR

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 22 22:09:28 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:04:58 2004
Subject: Public Identifiers
In-Reply-To: <3.0.5.32.19980921093859.00952100@library.berkeley.edu> (message
	from Jerome McDonough on Mon, 21 Sep 1998 09:38:59 -0700)
References: <199809210043.UAA10167@locke.ccil.org>
 <199809210043.UAA10167@locke.ccil.org> <3.0.5.32.19980921093859.00952100@library.berkeley.edu>
Message-ID: <199809221839.NAA01168@bruno.techno.com>


Many thanks for the extremely helpful note!

[Jerome McDonough:]

> The URN working group of the IETF is working on an Internet Draft
> addressing the issue of how name assignment authorities are registered
> (or not).  Quoting from the draft: "In a nutshell, a template for
> the definition of the namespace is completed for deposit with IANA,
> and a NID (namespace identifier) is assigned."  The draft contemplates 
> three levels of name spaces: experimental, informal, and formal.
> Experimental are not explicitly registered with IANA, and take
> the form of x-<NID>; no provision is made for avoiding collision
> of experimental NIDs.

Do you think I would be correct in assuming that I could use the
above-described x-<NID> form to assign the "1922 Sears Farm Catalog"
to a namespace I invent that I could call, in effect, "Sears, Roebuck
& Co."?

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Tue Sep 22 22:36:09 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:04:58 2004
Subject: Preference Files
In-Reply-To: <199809221813.OAA09232@hesketh.com>
References: <199809221813.OAA09232@hesketh.com>
Message-ID: <13831.60777.133705.973152@localhost.localdomain>

Simon St.Laurent writes:

 > Has anyone started work on a standard DTD for preference files, or
 > is the expectation that everyone's going to do their own
 > differently?  It seems like something that might be useful,
 > especially in the context of some of the browser development issues
 > that have been discussed here lately.
 > 
 > If anyone knows of an existing one and wants to promote it, I'm
 > writing a chapter...  The current material uses a custom-built
 > preference file, but I'd like to generalize it.

It sounds like a good application of RDF, if you're a believer.  

Actually, one of the problems is that a generalised document type will
always mean more work than a customised one.  For example, it might be
easier for users to understand


<netscape-config>
 <security>
  <allow-cookies>no</allow-cookies>
  <enable-javascript>no</enable-javascript>
 </security>
</netscape-config>

than

<config type="netscape>
 <section>
  <title>Security</title>
  <boolean-option>
   <name>allow-cookies</name>
   <value>no</value>
  </boolean-option>
  <boolean-option>
   <name>enable-javascript</name>
   <value>no</value>
  </boolean-option>
 </section>
</config>

On the other hand, it's easier to write reusable software that deals
with the second case.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Wed Sep 23 00:09:30 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:04:58 2004
Subject: Public Identifiers
In-Reply-To: <199809221839.NAA01168@bruno.techno.com>
References: <3.0.5.32.19980921093859.00952100@library.berkeley.edu>
 <199809210043.UAA10167@locke.ccil.org>
 <199809210043.UAA10167@locke.ccil.org>
 <3.0.5.32.19980921093859.00952100@library.berkeley.edu>
Message-ID: <3.0.5.32.19980922150732.009f23d0@library.berkeley.edu>

At 01:39 PM 9/22/98 -0500, Steven R. Newcomb wrote:
>> The URN working group of the IETF is working on an Internet Draft
>> addressing the issue of how name assignment authorities are registered
>> (or not).  Quoting from the draft: "In a nutshell, a template for
>> the definition of the namespace is completed for deposit with IANA,
>> and a NID (namespace identifier) is assigned."  The draft contemplates 
>> three levels of name spaces: experimental, informal, and formal.
>> Experimental are not explicitly registered with IANA, and take
>> the form of x-<NID>; no provision is made for avoiding collision
>> of experimental NIDs.
>
>Do you think I would be correct in assuming that I could use the
>above-described x-<NID> form to assign the "1922 Sears Farm Catalog"
>to a namespace I invent that I could call, in effect, "Sears, Roebuck
>& Co."?
>

Yup.  So, something like:

	urn:x-SearsRoebuck:1992_Sears_Farm_Catalog

is perfectly legitimate.  There are limitations on what characters
you're allowed to use in <NID>s (see RFC 2141); only a-z, A-Z, 0-9, and
the dash character.  So, no white spaces, ampersands, punctuation, etc.
But within those restraints, you're pretty much free to do whatever
you want with experimental URNs.


Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dkuhlman at netcom.com  Wed Sep 23 00:56:03 1998
From: dkuhlman at netcom.com (G. David Kuhlman)
Date: Mon Jun  7 17:04:58 2004
Subject: Corba data ->   XML
In-Reply-To: <3607DEA9.37620148@finetuning.com> from "Lisa Rein" at Sep 22, 98 10:30:17 am
Message-ID: <199809222255.PAA18211@netcom8.netcom.com>

All these proposals (WebMethods, WebBroker, and also ILU http) seem
to revolve around the same thing, specifically requesting (object)
services and doing distributed computing across HTTP.

Has anyone done a comparison of them.  Are they all replacements
for CORBA and DCOM?  That would be great; CORBA is way too complex. 

Are we talking about representing distributed objects in XML? Or is
the idea to represent the requests sent to distributed objects in
XML?  Or, maybe, both of the above.

And how does one develop an object or an application that provides
the service?  Is there a framework that it runs on top of?  The ILU
HTTP proposal seems to be saying that a server object is
implemented as a CORBA object that runs on the ILU ORB which
accepts HTTP requests.

Are these all XML App Servers?  OK. OK. I admit that I'm trying to
categorize through the application of buzzwords.

Here are some of the related links:

  http://www.w3.org/TR/1998/NOTE-webbroker/

  http://www.scripting.com/frontier5/xml/code/rpc.html

  ILU over HTTP -- ftp://ftp.parc.xerox.com/pub/ilu/misc/webilu.html
  An example is in the ILU distribution --
    ftp://ftp.parc.xerox.com/pub/ilu/ilu.html

This one is not XML-related but has interesting ideas:

  http://udell.roninhouse.com/download/dhttp.html

And I have not been able to grasp the concept behind Casbah at all:

  http://www.ntlug.org/casbah/index.shtml

Comments and comparisons that help me understand the above will be
appreciated greatly.

Dave
dkuhlman@netcom.com


> 
> Hi Thillai:
> 
> You should look at the WebBroker work that John Tigue has done at:
> 
> http://www.w3.org/TR/1998/NOTE-webbroker/
> 
> Where he uses DTDs to express both DCOM and CORBA objects.
> 
> Cheers,
> 
> lisa rein
> http://www.finetuning.com
> 
> Thillai wrote:
> > 
> > Hi,
> > 
> > I know only little bit about XML.  I like to know whether a CORBA operation
> > can be expressed in XML document type definition?
> > 
> > For example if the IDL has something like
> > 
> > interface z
> > {
> >         typedef sequence<String, 5>  t1;
> > 
> >         void x(in int p1,  in t1 p2);
> > }
> > 
> > and I want to store the parameters for the operation in a file.
> > 
> > I like to have a XML data file like
> > <interface>
> > z
> > <operation>
> > x
> > <inparameter>
> > <param>
> > p1
> > <value>10</value>
> > </param>
> > <param>
> > p2
> > <value><nelems>5</nelems><0>abc</0><1>efg</1><2>hij</2><3>
> > klm</3><4>nop</4></value>
> > </param>
> > </inparameter>
> > </operation>
> > </interface>
> > 
> > If there is no maximum limit for the type t1 then no. of elements in the
> > sequence might vary.
> > 
> > For this file is it possible to write DTD.  (I will read about DTD and find.
> > Before that any expert comment will be helpful)
> > 
> > If it is possible then is it possible to write XSL for getting values from the
> > user.  (no. of elements in the sequence might vary at runtime).
> > 
> > Thillai
> > AT&T
> > Middletown, NJ
> > 
> > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> > Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> > To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> > (un)subscribe xml-dev
> > To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> > subscribe xml-dev-digest
> > List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
> 
> 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Wed Sep 23 01:05:41 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:59 2004
Subject: XSchema: Section 4
Message-ID: <199809222304.BAA27676@berlin.dvs1.tu-darmstadt.de>

Here is the long rumored section 4, which discusses the interaction of XSchema documents and 
DTDs and how to convert between the two.  An HTML version will be available in a 
day or two.  Please send comments to the list or directly to me 
(rbourret@dvs1.informatik.tu-darmstadt.de).

-- Ron Bourret


4 XSchema Documents and DTDs

An XSchema document is related to two different DTDs: the DTD of the XSchema 
document itself and the DTD of the document described by the XSchema document. 
This section discusses the relationship of XSchema documents to these DTDs and 
describes what conversions are possible between the XSchema document and the 
latter DTD. There is no requirement that either DTD actually exist.

4.1 DTDs in XSchema Documents

An XSchema document may include a DTD as an internal subset, external subset, or 
both. If included, this DTD must include all of the markup declarations in 
Appendix B, "XSchema DTD." It may also include additional markup declarations, 
such as declarations of elements to be used under the More element.

The main reason to include a DTD in an XSchema document is so an XSchema-unaware 
XML parser can supply default attribute values and determine the system and 
public identifiers of notations and unparsed general entities. Default attribute 
values are used in the XSchema DTD defined in Appendix B. Notations and unparsed 
general entities can be used by user-defined elements under the More element.

Secondary reasons for including a DTD in an XSchema document are to declare 
parsed entities (see Section 5.2.1, "Parsed Entities in XSchema Documents") and 
to allow the document to be validated by XSchema-unaware software. 

4.2 DTDs in Documents Described by XSchema Documents

A document described by an XSchema document may include a DTD as well as  
processing instructions that refer to XSchema documents (see section 5.1.1, 
"XSchema Processing Instruction"). This DTD can describe the same information as 
the XSchema documents as well as additional information.

The main reason to include a DTD in a document described by an XSchema document 
is so an XSchema-unaware XML parser can supply default attribute values and 
determine the system and public identifiers of notations and unparsed general 
entities. Secondary reasons are so that the document can be used with both 
XSchema-aware and -unaware software, to define the root element in the document, 
and to declare parsed general entities.

If an XML document includes both a DTD and processing instructions that refer to 
XSchema documents, it is the responsibility of the document author to ensure 
that the information common to both is the same. If the common information is 
different, it might not be possible to use the document with both XSchema-aware 
and -unaware software. For example, it might not be possible to validate the 
document against both the DTD and the XSchema documents.

If an XSchema processor is built on top of an XML parser, the XSchema processor 
is not required to process the DTD of the XML document. If an XSchema processor 
also functions as an XML parser, it is required to process the DTD only to the 
extent required of a non-validating parser.

4.3 Converting Between XSchema Documents and DTDs

Schema information can be converted between XSchema documents and DTDs, although 
some information may be lost. Most logical information (such as element and 
attribute declarations) can be converted from DTDs to XSchema documents, while 
some logical information (such as attribute declarations not assigned to  
elements) cannot be converted from XSchema documents to DTDs. In general, 
physical information (such as parsed entity declarations and use, the order of 
declarations, and the distribution of declarations among different files) either 
cannot be converted or is converted only at the option of the converter.

4.3.1 Converting DTDs to XSchema Documents

The following DTD structures must be converted to the corresponding XSchema 
structures:

* Element, attribute, notation, and unparsed entity declarations. The order of 
the resulting elements, including whether attribute declarations are placed 
inside element declarations, is the choice of the converter.

* Namespace prefixes in element and attribute declarations. These must be 
stripped from the element or attribute name and stored in the appropriate 
attribute (prefix or ElementPrefix). The converter may prompt the user for the 
URI of the namespace to be stored in the corresponding ns or ElementNS 
attribute.

The following DTD structures may be converted to the corresponding XSchema 
structures or discarded:

* Comments. These may be converted to Doc elements. The position of resulting 
Doc elements in the XSchema document is the choice of the converter. For  
example, a converter might place comments in a Doc element inside the following 
element, attribute, notation, or unparsed entity declaration.

* Parameter entity declarations and use. These may be converted to parsed 
general entity declarations and use.

The following DTD structures cannot be converted to XSchema structures because 
such structures do not exist:

* Duplicate attribute and unparsed entity declarations.

* Parsed general entity declarations and use.

* Conditional sections in external parameter entities.

QUESTION: What happens to the following:
* Processing instructions
* Text encoding declarations

4.3.2 Converting XSchema Documents to DTDs

The following XSchema structures must be converted to the corresponding DTD 
structures:

* All information in element, notation, and unparsed entity declarations and 
attribute declarations that apply to a particular element, except as noted 
elsewhere. The order of the resulting declarations is the choice of the 
converter.

* Namespace prefixes declared in prefix and ElementPrefix attributes. These must 
be prepended to the element or attribute name.

The following XSchema structures may be converted to the corresponding DTD 
structures or discarded:

* Doc elements. These may be converted to comments. The position of resulting 
comments in the DTD is the choice of the converter.

* Parsed entities declared in the DTD of the XSchema document. These may be 
converted to parsed general entities or parameter entities as appropriate.

The following XSchema structures cannot be converted to DTD structures because 
such structures do not exist:

* More elements, AttDef and AttGroup elements that do not apply to a particular 
element, and Model and Enumeration elements nested directly beneath an XSchema 
element.

* All id attributes, all attributes of the XSchema element except for prefix, 
all ns and ElementNS attributes, and the Root attribute of the ElementDecl 
element.

* Nesting of schema information provided by nested XSchema elements.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Wed Sep 23 01:05:40 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:59 2004
Subject: XSchema: Sections 5.0 and 5.1
Message-ID: <199809222304.BAA27678@berlin.dvs1.tu-darmstadt.de>

And also the long rumored Section 5: Using XSchema Documents.  This describes 
the processing instruction used to associate XSchema documents with XML instance 
documents. It also describes an experimental method for inlining XSchema  
elements, suggested by Don Park.

An HTML version will be available in a day or two.  Please send comments to the 
list or directly to me (rbourret@dvs1.informatik.tu-darmstadt.de).

-- Ron Bourret

5 Using XSchema Documents

This section describes how to associate XSchema documents with XML documents and 
suggests ways to use XSchema documents.

5.1 Associating XSchema Documents with XML Documents

An XSchema document can define a class of XML documents in the same way a DTD 
defines a class of XML documents. A document declares that it conforms to a 
class by including the XSchema processing instruction. A document fragment can 
declare that it conforms to a class by including a nested XSchema element; this 
latter usage is experimental.

5.1.1 XSchema Processing Instruction

The XSchema processing instruction is similar to the SYSTEM declaration in a 
DOCTYPE statement. It states that the document conforms to the class of 
documents described by the XSchema document. The processing instruction has the 
following form:

[1] XSchemaPI ::= '<?xschema' S XSchemaID S? '?>'
[2] XSchemaID ::= 'xschema' Eq SystemLiteral

where S, Eq, and SystemLiteral are the same as in [XML].

An XSchema processing instruction must occur before the root element to be used; 
any XSchema processing instructions that occur after the root element will be 
ignored.

An XML document may include multiple XSchema processing instructions. The effect 
is as if a superior root XSchema element contains the root XSchema element of 
each XSchema document. This allows a document to conform to elements in many 
existing XSchema documents. For more information, see Section 5.2.5, "Reusing 
Element Declarations with Entities or Processing Instructions."

5.1.2 Inline XSchema Elements (Non-Normative)

NOTE: Inline XSchema elements are considered experimental and may change in the 
future.

In some applications it is useful to repeatedly change the schema of the XML 
document at run time.
For example, consider a system that continuously logs data 
in XML format. From an XML standpoint, it is as if a root element was started 
when the system was started, all incoming information is nested beneath the root 
element, and the root element ends only when the system is stops. For practical 
purposes, the root element might not actually exist.

If the system logs information from different sources, the format (schema) of 
the nested elements might be different for each source. XSchema elements can be 
interspersed in this stream to describe the format of following information:

<Root>
   <XSchema>...schema #1...</XSchema>
   ...log information that conforms to schema #1...
   <XSchema>...schema #2...</XSchema>
   ...log information that conforms to schema #2...
   ...
</Root>

Because such use is not well-defined today, XSchema processors that use inline 
XSchema elements should follow these rules for the greatest chance of forward 
compatibility:

* The schema information in an XSchema element applies to all following elements 
at the same level until the next XSchema element at that level is encountered.

* The schema information in an XSchema element completely replaces the schema 
information in the previous XSchema element at the same level. That is, no 
partial replacement of schema information is allowed.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Wed Sep 23 01:06:51 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:04:59 2004
Subject: XSchema: Section 5.2
Message-ID: <199809222304.BAA27680@berlin.dvs1.tu-darmstadt.de>

This is (except for the XSchema in XSchema appendix), the FINAL section of the 
XSchema specification.  It suggests various uses for XSchema documents and 
points at some directions we might go in the future.  Except for the section on 
validation, it is non-normative and therefore might fit better in an appendix or 
separate document.

This is where we get to talk about reusing schema information and building Java 
classes from XSchema documents and so on, so have at it.

An HTML version will be available in a day or two.  Please send comments to the 
list or directly to me (rbourret@dvs1.informatik.tu-darmstadt.de).

-- Ron Bourret


5.2 Suggested Uses of XSchema Documents (Non-Normative except as noted)

The following sections suggest possible uses of XSchema documents. Except for 
Section 5.2.3, "Validation", they are not binding on XSchema processors or 
documents.

5.2.1 Parsed Entities in XSchema Documents

Parsed general entities are used in XSchema documents for the same reasons they 
are used in XML documents: to distribute documents across multiple files, to 
enable multiple character encodings, to act as text substitution macros, and so 
on. They can also be used in a manner similar to parameter entities in a DTD. 
For example, suppose the DTD contains the following declaration:

<!ENTITY latinattribute "<AttDef Name='Latin' Type='CData' Required='No'/>" >

This can be used in the content of the XSchema document to add the Latin  
attribute to an element:

<ElementDecl Name="Species">
   ...additionalElementInformation...
   <AttGroup>
      &latinattribute;
   </AttGroup>
</ElementDecl>

Because parameter entities are used only in the DTD, they offer no special 
advantages to XSchema documents.

5.2.2 DTD Replacement

As was noted in Section 5.1, an XSchema document can define a class of XML 
documents. In this respect, it fulfills the logical functions of a DTD. That is, 
an XSchema processor can validate an XML document against an XSchema document 
and an XSchema-aware XML parser can retrieve information about the XML document, 
such as default attribute values and the system and public identifiers of 
notations and unparsed general entities.

5.2.3 Validation (Normative)

An XSchema processor can validate an XML document against an XSchema document. 
Because XSchema does not support parsed entity declarations, this validation is 
slightly less comprehensive than that defined in [XML]. XSchema processors that 
perform validation must enforce all Validity Constraints in [XML] except:

Proper Declaration/PE Nesting
Standalone Document Declaration
Proper Group/PE Nesting
Entity Declared

When enforcing the Root Element Type constraint, the XSchema processor first 
checks if there is a DOCTYPE statement in the XML document. If so, it uses the 
root element type declared there. If not, it searches the XSchema document for 
element declarations in which the Root attribute has a value of Recommended. The 
root element of the XML document must be one of these elements. If no element 
declarations have a Root attribute with a value of Recommended, the validation 
fails.

An XSchema processor that validates an XML document is not required to parse 
that document.

5.2.4 Schema Repository

XSchema documents are not required to define a particular class of XML 
documents. For example, an XSchema document might consist of nothing but  
attribute definitions. In this manner, an XSchema document can function as a 
repository for schema definitions, which can then be reused by other XSchema 
documents.  Note that while an XSchema document that defines a class of XML 
documents can always act as a repository, the converse is not always true.

5.2.5 Reusing Element Declarations with Entities or Processing Instructions

Element declarations in one XSchema document can be reused by referring to them 
in a Ref element in second XSchema document. For example, suppose an XSchema 
repository defines a FullName element:

<ElementDecl Name="FullName">
   <Model>
      <Seq>
         <Ref Element="LastName"/>
         <Ref Element="FirstName"/>
         <Ref Element="MiddleName" Frequency="ZeroOrMore"/>
      </Seq>
   </Model>
</ElementDecl>

The XSchema document that describes Letter documents might include FullName by 
reference, where the first instance is the author of the letter and the second 
instance is the recipient:

<ElementDecl Name="Letter">
   <Model>
      <Seq>
         <Ref Element="FullName"/>
         <Ref Element="FullName"/>
         <Ref Element="Paragraph" Frequency="OneOrMore"/>
      </Seq>
   </Model>
</ElementDecl>

The referenced declaration can be resolved in one of two ways. First, the second 
XSchema document can include the first, either by cutting and pasting or through 
an external parsed general entity. For example:

<!DOCTYPE XSchema [
   <!ENTITY nameRepository SYSTEM "names.xsc">
]>
<XSchema>
   &nameRepository;
   ... other declarations ...
</XSchema>

Second, a Letter (instance) document that can included processing instructions 
that point to both XSchema documents. For example:

<!DOCTYPE Letter>
<?xschema xschema="names.xsc" ?>
<?xschema xschema="letter.xsc" ?>
<Letter>
   ...
</Letter>

Note that including an XSchema processing instruction in letter.xsc that points 
to names.xsc will not have the intended effect. Rather than including the 
names.xsc, this processing instruction states that letter.xsc (an XSchema 
document) conforms to the elements declared names.xsc. This is unlikely to be 
true.

5.2.6 Reusing Schema Definitions through XLinks

In the future, it should be possible to reuse schema definitions in an XSchema 
document through XLinks. Although the exact manner in which this works cannot be 
determined until the XLink and XPointer specifications are complete, the example 
from section 5.2.5 might be performed as follows:

<ElementDecl Name="Letter">
   <Model>
      <Seq>
         <Ref Element="FullName"/>
         <Ref Element="FullName"/>
         <Ref Element="Paragraph" Frequency="OneOrMore"/>
      </Seq>
   </Model>
</ElementDecl>

<ElementDecl xml:link="simple"
             href="names.xsc#id(FullName)"
             inline="true"
             show="replace"/>

The second ElementDecl element points to, and is replaced by, the ElementDecl 
element for the FullName element in names.xsc. This eliminates the need to 
include the names.xsc through cut-and-paste, an entity, or a processing 
instruction.

XSchema has been designed with such linking in mind. It is partially because of 
this that the container elements AttGroup, Enumeration, and Model exist and can 
be directly or indirectly nested inside themselves. For example, a new AttGroup 
might be constructed by nesting multiple AttGroup elements inside it, each of 
which contains an XLink to an AttGroup in a different XSchema document.

5.2.7 Authoring

XSchema documents support authoring tools (editors) by providing human-readable 
documentation and a template for legal document structures.

A typical editing session using an XSchema-aware editor might proceed as  
follows:

1) The editor displays a list of available XSchema documents. The user chooses 
an XSchema document to use as a template.

2) The editor reads the chosen document and displays a list of elements for 
which the Root attribute has a value of Recommended. The user chooses a starting 
element.

3) The editor prompts the user for element content and attributes based on the 
information in the XSchema document. When the user requests help about a  
particular structure, the editor retrieves it from the corresponding Doc  
element.

4) When the user is done, the editor saves the new document. At the start of 
this document, the editor inserts a DOCTYPE statement declaring the root element 
type and an XSchema processing instruction declaring the template XSchema 
document.

An editor could also support schema building and modification. For example, it 
might allow the user to construct a new XSchema document from elements in 
existing XSchema documents or add new elements to existing XSchema documents.

5.2.8 General Schema Information

Because XSchema documents can contain information about a class of documents, 
they can be used by tools that work with (as opposed to on) these documents. For 
example, a database tool might read an XSchema document and construct a database 
schema or a programming tool might read an XSchema document and create Java 
classes for each element.  XSchema documents can also be used as starting points 
for search engines, which can use them to construct query-by-example interfaces.

5.2.9 Custom Uses

The More element in XSchema provides a way for users to customize their XSchema 
documents. For example, subelements of the More element might be used to assign 
the data type (integer, date, string, etc.) of PCData elements or associate Java 
classes with elements.

Note: There are a number of existing proposals for data types in XML and it is 
hoped that the W3C (and therefore XSchema) will adopt one of these in the 
future. For example, see [DCD].

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Wed Sep 23 02:27:33 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:04:59 2004
Subject: Public Identifiers
References: <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
		 <36027A49.D857AB1E@locke.ccil.org>
		 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
		 <3.0.5.32.19980918145429.0093b100@dns.isogen.com>
		 <f5blnngcsq4.fsf@cogsci.ed.ac.uk>
		 <3.0.5.32.19980919164312.0091ed80@dns.isogen.com> <3.0.5.32.19980920215503.0095e870@dns.isogen.com>
Message-ID: <3607FFC1.FD6B3E51@technologist.com>

W. Eliot Kimber wrote:
> 
> SGML Formal public identifiers are not necessarily persistent names because
> there is nothing in ISO 8879 or ISO 9070 that requires them to be (nor
> could such a requirement be enforced or validated).  All that ISO 9070
> provides is a process for registering *owner identifiers*, which are,
> presumably, persistent (at least as defined by the assigning body).
> However, the name owner is responsible for managing the names within their
> slice of the FPI name space and can do whatever they want with them,
> including reassigning them without regard for persistence at all.

That's true. But Internic does not go even that far. It does not promise
not to reassign domain names. In fact, it does so regularly. I don't even
have the option of legally promising my clients that resources at my site
will be persistent, because I don't know if my contract will be renewed in
a year.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

It's such a 
Bore
Being always
Poor
LANGSTON HUGHES
http://www.northshore.net/homepages/hope/engHughes.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webmethods.com  Wed Sep 23 04:07:53 1998
From: jlapp at webmethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:59 2004
Subject: XML IDL and XML RPC
In-Reply-To: <199809222255.PAA18211@netcom8.netcom.com>
References: <3607DEA9.37620148@finetuning.com>
Message-ID: <199809230207.WAA14026@bag-2.mail.digex.net>

(This is a renaming of the thread titled "Corba data -> XML")

At 03:55 PM 9/22/1998 -0700, G. David Kuhlman wrote:
>All these proposals (WebMethods, WebBroker, and also ILU http) seem
>to revolve around the same thing [...]
>Has anyone done a comparison of them.  Are they all replacements
>for CORBA and DCOM?  That would be great; CORBA is way too complex. 

I'll give a brief lo-down on the webMethods architecture.  It's
really very simple by comparison to conventional architectures.
These are the workings of the webMethods Business-to-Business
Integration Server (B2B):

(1) The object and method are given in the URL.  Our terminology
uses the terms "interface" and "service," but both of these terms
are somewhat overloaded so "object" and "method" do a better job
of explaining what we've done.  We also support naming the method
within the request message (outside the URL).

(2) Interface specifications are defined using WIDL.  Once you
have a spec you can use it to either generate code for various
programming languages (client or server).  It's a very simple
IDL that has proven amazingly useful despite its simplicity.
This has sort of become a theme for webMethods -- we focus
primarily on solutions that are both powerful and simple.

(3) An RPC occurs as an HTTP request/response pair.  Our server
will accept requests in any of the following formats:

  (a) CGI query parameters submitted via HTTP GET
  (b) CGI query parameters submitted via HTTP POST
  (c) XML documents submitted via HTTP POST
  (d) binary data submitted via HTTP POST

We break (c) down into the following document types:

  (c1) Generic XML RPC encodings -- This includes the RPC that

       both webMethods and Userland support, and it includes
       John Tigue's webBroker encoding, though we don't support
       this latter encoding (at this time, anyway).  Such
       encodings are capable of representing any RPC message.
  (c2) Specific XML RPC encodings -- These are XML document
       types that are specific to the problem domain.  e.g.
       a doc type for purchase orders.  Send a PO document to
       a URL that names the object and method, and we'll
       interpret that document as containing the parameters
       of the method call.
       
(4) Our server returns responses in any of the following formats
(via the standard HTTP response mechanism):

  (a) Generic XML RPC encoding
  (b) Specific XML RPC encoding
  (c) binary encodings
  (d) HTML pages

One may use templates on the server to generate the XML.  We have
a really simple template language.  (d) is interesting because the
B2B server has an admin interface (read: admin object).  It exposes
a bunch of services (read: methods) that are invoked via URL with
CGI query parameters.  Our admin interface consists of HTML web
pages generated by templates in response to service invocations.

(5) Users write Integration Modules (IMs) on the server side.  They
conform to the WIDL specifications and access the back-end system.

(6) B2B uses WIDL mappings to bridge the impedance mismatch that
exists between programming languages and documents.  Programmers
don't want to have to navigate a document to get at data.  They
just want to issue function calls and get back return parameters.

WIDL makes this happen using the WOM query language (webMethods
Object Model -- not to be confused with the DOM API).  WIDL mappings
(XML files that extend WIDL specifications) contain WOM queries
and construct programming language structurs from document info.
We'll build strings, arrays, records, etc.

(7) Because B2B uses HTTP and XML, it need only sit on the sides
that don't already speak HTTP and XML.  Everybody's got an HTTP
stack, and XML parsers are easy to come by.  Unlike distributed
computing architectures that implement generic encodings, you
don't need an ORB sitting on both ends.  You only need agreement
on the semantics of XML tags.  You'd use B2B to web-enable and
XML-enable an application (our clients are typically enabling
large legacy ERP systems).  The clients can be as stupid or as
smart as you desire.

(8) Since WIDL mappings implement specifications to bridge between
document structure and programming constructs, programs need have
no knowledge of XML tag names or XML data structure.  WIDL will
hide all of this.  The DTD for the RPC encoding could completely
change, and any side that is shielded by B2B is shielded from
these changes.  One need merely update the declarative WIDL mapping.

(9) All of the above not only provides a way to use XML over HTTP.
It also provides what we call a "ladder of integration."

Say you already have a web site whose CGI programs already capture
so much of the business logic that you need to enforce.  You want to
give your clients automated access to your data, but you don't want to
write anything new.  No problem: put B2B on top of it and use WIDL
to make your web site look like a bunch of object interfaces.

Suppose you later want to go directly into your backend database
and skip the slow CGI programs.  No problem: without affecting
existing automated clients at all, replace the WIDL mappings with
an integration module.

Suppose you now want to migrate to Microsoft's Site Server.  No
problem: Just put Site Server on the backend and B2B will keep the

clients from having to change in order to speak to the new Site
Server interfaces.

The B2B architecture allows you to automate your clients now while
postponing your backend architecture decision.  It allows you to
take the easy migration path from whatever you have now to the
future end-all be-all Microsoft Site Server business platform.

So I'm more inclined to conclude that the B2B architecture is more
an architecture for bridging architectures.  It provides a safe
way to isolate changing systems and still provide communication
between the systems as they change.  We just want applications to
play nicely together regardless of native architecture or interfaces.

>And how does one develop an object or an application that provides
>the service?  Is there a framework that it runs on top of?  [...]

One can deploy B2B applications by writing XML and marshalling code
by hand, or one can do it with our special IDE called the B2B
Developer.  Both are extremely easy to do.  The Developer also goes
a step further by allowing you to interact with the web site that
you wish to wrap inside of program interfaces.

>Are these all XML App Servers?  OK. OK. I admit that I'm trying to
>categorize through the application of buzzwords.

We call B2B an "integration server."  We focus on interoperability
and network access/security/management issues.  We don't focus at
all on application management.  We usually expect the applications
to be self-contained and generally ignorant that our server exists.
B2B is a super-sophisticated bridge between applications.  The B2B
architecture focuses on giving applications XML smarts, HTTP smarts,
network security smarts, protocol translation smarts, and smarts
to allow the admin to manage all of this.

So that's the rundown on the webMethods architecture and on how I
think it compares to and works with CORBA and DCOM.  I go into a bit
more detail in the "WIDL and XML RPC" and "Supply chain integration"
chapters in _The_XML_Handbook_.  Also, I'll be giving a speech on
the WIDL framework at XML '98 in November, should you be interested.
--
Joe Lapp             | Senior Engineer
jlapp@webmethods.com | webMethods, Inc.
jlapp@acm.org        | http://www.webMethods.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rexb at rbcomdesign.com  Wed Sep 23 04:42:22 1998
From: rexb at rbcomdesign.com (Rex Brooks)
Date: Mon Jun  7 17:04:59 2004
Subject: XML IDL and XML RPC
Message-ID: <199809230247.TAA24023@transbay.net>

Hi all,

I only recently subscribed to this mailing list in order to better
understand xml and how it was developing in relation to IDL and distributed
computing in general. I am beginning to be gravely disappointed at the
extent to which this is a list mainly populated by self-serving vendors and
authors. Is this really true, or am I just stepping in during a
particularly ugly patch of competing offers?

Soon to unsubscribe:
Rex Brooks

Rex Brooks              http://www.rbcomdesign.com
1361-A Addison          rexb@rbcomdesign.com
Berkeley, CA 94702      Vox: 510-849-2309
Virtual Reality         Fax: 510-849-1306
Modelling Language      chair: Content Development Working Group
Consortium (VRMLC)      co-chair: VRML-IPR Task Group


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gkholman at CanadaMail.com  Wed Sep 23 04:48:06 1998
From: gkholman at CanadaMail.com (G. Ken Holman)
Date: Mon Jun  7 17:04:59 2004
Subject: Public Identifiers
In-Reply-To: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
References: <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <Version.32.19980922190714.00f81a30@CraneSoftwrights.com>

I've hesitated jumping in to the discussion since the SIG went through it
in detail so long ago, but something discussed back then has not been
raised in this forum where there are other participants who didn't see the
earlier discussions.

At 98/09/18 06:49 -0500, W. Eliot Kimber wrote:
>In hindsight, it's clear to me that we never should have allowed public IDs
>in XML.  

Two uses of Public Identifiers that have, I believe, been overlooked
throughout this discussion, and I feel I personally need, are (1)
receiver/user resolution when system id unavailable and (2) copy
identification (e.g. versioning).

(1) Consider I have an XML document on a CD-ROM (or password-protected for
write, or behind a firewall for write permissions, or some other condition
rendering it read-only).  It might be very large, so large I don't want to
copy it to a local environment where I can modifiy it.  I may not even have
permission to make a local copy in which I can modify it.

Now, some external resource refered to in the read-only file is identified
by both a SYSTEM identifier and a PUBLIC identifier:

 <!DOCTYPE pres
   PUBLIC "+//ISBN 1-894049::CSL::Presentations//DTD Presentation//EN"
   SYSTEM "http://www.CraneSoftwrights.com/shareware/presdev/pres.dtd">

The XML Processor discovers it cannot access the URL for whatever reason
(server down, firewall, whatever).

A Public Identifier Resolution Mechanism, if it were available in the XML
Processor, can obtain the necessary local copy if I give it information
about a copy I made some previous time when I did have access to the remote
resource.  And by changing the information to the resolution mechanism, I
can move it around freely without changing the read-only resource.

Without a public identifier, and without the ability to modify the
read-only resource, I would have to resort to some kind of System
Identifier remapping mechanism if it were available in the XML Processor.

I would think it "cleaner" to do public identifer mapping, rather than
system identifier remapping, though I will acknowledge both will end up
with the same results.

(2) In the life of a DTD, I may create a number of instances conforming to
different versions:

One day:

 <!DOCTYPE RDF
   PUBLIC "+//IDN w3.org//DTD RDF Version 1.0//EN"
   SYSTEM "file:///s|/rdf/rdf.dtd">

Three months later:

 <!DOCTYPE RDF
   PUBLIC "+//IDN w3.org//DTD RDF Version 1.1//EN"
   SYSTEM "file:///s|/rdf/rdf.dtd">

Three months after that:

 <!DOCTYPE RDF
   PUBLIC "+//IDN w3.org//DTD RDF Version 2.0//EN"
   SYSTEM "file:///s|/rdf/rdf.dtd">

Each one pointing to the same file that has been updated on the fly.

One feature of an XML Processor might be that if an error is detected
through the use of the resource found through the SYSTEM id, offer to the
user to begin processing again through the resource found through the
PUBLIC id without obliging the user to change the source file.

I'm emphasizing the temporary nature of this feature, perhaps because this
is a one-off run for the user.  For conformance reasons the governing DTD
is the one found through the SYSTEM identifier, so it shouldn't be default
behaviour of a processor to categorically "try again" with the PUBLIC
identifier when there is a fault using the SYSTEM identifier ... but a
courtesy "restart with override" feature might be offered once the
conformance requirement to stop has been met.  

This second issue is fuzzy, but I hope I've conveyed how the following
resolution of the public identifiers would help:

   PUBLIC "+//IDN w3.org//DTD RDF Version 1.0//EN"
          "file:///s|/rdf/rdf-1-0.dtd"
   PUBLIC "+//IDN w3.org//DTD RDF Version 1.1//EN"
          "file:///s|/rdf/rdf-1-1.dtd"

I'm more interested in the utility of the first issue.

To me it makes more sense to map public identifiers than remap system
identifiers.

............. Ken

--
G. Ken Holman               mailto:gkholman@CanadaMail.com
Crane Softwrights Ltd.  http://www.CraneSoftwrights.com/x/
Box 266,                                V: +1(613)489-0999
Kars, Ontario CANADA K0A-2E0            F: +1(613)489-0995
Training:   http://www.CraneSoftwrights.com/x/schedule.htm
Resources: http://www.CraneSoftwrights.com/x/resources.htm
Shareware: http://www.CraneSoftwrights.com/x/shareware.htm


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From kenchi at utj.co.jp  Wed Sep 23 05:51:22 1998
From: kenchi at utj.co.jp (Kenji Chichii)
Date: Mon Jun  7 17:04:59 2004
Subject: XML Database
Message-ID: <19980923034301254.AAA136@[192.168.1.1]>

Hi to all.

We sometimes encounter a client who belives that making XML(SGML)Documents
means making Database. What they believe is if they make their document in
XML they will easily be able to find information in it, and to manage it. 

We are having hard time to make them understand if they need data base,
they have to develop a data base system.
Then, they loose interest in XML(SGML). What they want is easy and versitle
data base system.

Do you have any good ideas to show the advantage of XML as the data base
format to this kind of people?
Are there any experienc or information regarding XML(SGML) data base?
Are there any good XML data base in commercial basis?
If there is such Database, how much will it cost? 

Thank you for your advice.

Kenji Chichii

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webmethods.com  Wed Sep 23 05:55:43 1998
From: jlapp at webmethods.com (Joe Lapp)
Date: Mon Jun  7 17:04:59 2004
Subject: XML IDL and XML RPC
In-Reply-To: <199809230247.TAA24023@transbay.net>
Message-ID: <199809230355.XAA19555@bag-2.mail.digex.net>

At 07:47 PM 9/22/1998 -0700, Rex Brooks wrote:
>I only recently subscribed to this mailing list in order to better
>understand xml and how it was developing in relation to IDL and distributed
>computing in general. 

I apologize if I sounded too upbeat about our product architecture.
I'm not sure how to sound otherwise about it.

Please let me know if I didn't clearly portray one effective way to
make XML and distributed computing work together.  You asked someone
to compare the webMethods, CORBA, and ILU architectures.  I don't
know anything about ILU, but I believe I gave pretty good coverage
for the remainder of your question.

Please also let me know how I might have reworded things so that it
sounds more informative and less salesmanlike.  It would be useful
for me to know whether the problem was with the information or with
the tone.

>I am beginning to be gravely disappointed at the
>extent to which this is a list mainly populated by self-serving vendors and
>authors. Is this really true, or am I just stepping in during a
>particularly ugly patch of competing offers?

Well, that extent is now one author who may have committed a faux pas.
I wouldn't grow worried too soon.  Beat on me a bit, but don't jump ship.

P.S. I have options with my company, but I don't make a single cent off
of any of the articles I've written.  I'll sign up for the vendor-selling-
wares error, but I pointed you to the book because it answers your question.
--
Joe Lapp             | Senior Engineer
jlapp@webmethods.com | webMethods, Inc.
jlapp@acm.org        | http://www.webMethods.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From grk at arlut.utexas.edu  Wed Sep 23 06:16:04 1998
From: grk at arlut.utexas.edu (Glenn Kronschnabl)
Date: Mon Jun  7 17:05:00 2004
Subject: Order of #PCDATA in dtd - relevant? or XP bug?
Message-ID: <36087406.C60F8AFE@arlut.utexas.edu>

A simple DTD that causes a syntax error in XP.
If you run the following dtd thru both jade
and XP, it runs fine thru jade but XP gives a
syntax error.  The only difference is the order
of the % inline and #PCDATA.

So, given this, does imply that the order of #PCDATA
is relevant or is this a genuine XP bug?

-- doc.dtd --
<?xml version="1.0"?>

<!-- the following line works in both jade and XP
<!ENTITY % inline "#PCDATA|emphasis"> -->

<!-- the following line works in jade but NOT in XP -->
<!ENTITY % inline "emphasis|#PCDATA">

<!ELEMENT doc (toc,div+)>
<!ELEMENT toc EMPTY>
<!ELEMENT div (title,(div|p)+)>
<!ATTLIST div
          id CDATA #IMPLIED
          name ID #IMPLIED>
<!ELEMENT title (%inline;)*>
<!ELEMENT p (%inline;)*>
<!ELEMENT emphasis (#PCDATA)>
<!-- end of DTD -->

-- a.xml --
<?xml version="1.0"?>
<!DOCTYPE doc PUBLIC "-//Kronschnabl//DTD DOC//EN" "doc.dtd">
<doc>
<toc/>
<div id="div1"><title>div 1 title</title>
<p>
div 1 paragraph
</p>
</div>
<div id="div2"><title>div 2 title</title>
<p>
div 2 paragraph
</p>
</div>
</doc>
<!-- end of a.xml -->

Example showing jade working:

% jade -c catalog -d hstyle.dsl xml.dcl a.xml
%

Example using XP:

% java com.jclark.xml.apps.Time a.xml
file:/home/grk/area51/doc.dtd:12:17: syntax error
0.361

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep 23 07:53:27 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:00 2004
Subject: Mixed Content Models
Message-ID: <3.0.32.19980922224713.00b043c0@pop.intergate.bc.ca>

At 11:47 AM 9/21/98 -0700, Jerome McDonough wrote:
>	<!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
>							  backward?, ivuInstr*))*
>
>Is this a legitimate content model under XML section 3.2.2?
>Msxml doesn't have a problem with it, and nsgmls using the -wxml flag
>also happily parses the DTD.  IBM's xml4j, however, complains:
>"Codebook.dtd: 1256, 33: This content model is not matched with the
>mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?, qstnLit?,
>postQTxt?,forward?,backward?,ivuInstr*))*".

This is *totally* illegal per the spec.  Is that the msxml with IE4 
or IE5?  If 4, no biggie, they were up-front about being behind.  If 
IE5, I'm flabbergasted and MS needs to hear about it now.

As for nsgmls, James has been very up-front about its lack of 
completeness as an XML processor.  xml4j is right; I expect you'd
also get complaints from XP and Lark. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 23 08:36:20 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:00 2004
Subject: URNs, FPIs, and RFC 1737
In-Reply-To: <003f01bde60e$4022e040$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.5.32.19980923012705.00948100@dns.isogen.com>

At 10:49 AM 9/22/98 +0100, Michael Kay wrote:
>Eliot Kimber:
>>...it is wrong to use names in a space you don't control.

>If this is "wrong", I think you need to distinguish whether
>you mean "it does not conform to standard XYZ", or "it is
>usually bad engineering practice", or "it is against
>European Law", or "it is contrary to ethical norms".
>
>In the case of XML Public Identifiers, I'm not sure the
>practice is intrinsically "wrong" on any of these counts; it
>is only wrong if it hurts someone.

I contend it's certainly contrary to ethical norms. It's probably
actionable in US civil courts if harm can be demonstrated, and I can
certainly imagine that as these issues become more widely understood, that
national or international law might be brought to bear.

The most the standards can do is provide a framework and set of conventions
that solve the technical aspecs of the problem. It is up to society to
define and enforce the ethical aspects of the problem.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tms at ansa.co.uk  Wed Sep 23 11:14:55 1998
From: tms at ansa.co.uk (Toby Speight)
Date: Mon Jun  7 17:05:00 2004
Subject: Order of #PCDATA in dtd - relevant? or XP bug?
In-Reply-To: Glenn Kronschnabl's message of "Tue, 22 Sep 1998 23:07:34 -0500"
References: <36087406.C60F8AFE@arlut.utexas.edu>
Message-ID: <uzpbryyq0.fsf@delivery.ansa.co.uk>

Glenn> Glenn R. Kronschnabl <URL:mailto:grk@arlut.utexas.edu>

0> In article <36087406.C60F8AFE@arlut.utexas.edu>, Glenn wrote:

Glenn> <!-- the following line works in both jade and XP
Glenn> <!ENTITY % inline "#PCDATA|emphasis"> -->
Glenn>
Glenn> <!-- the following line works in jade but NOT in XP -->
Glenn> <!ENTITY % inline "emphasis|#PCDATA">

The grammar in the spec only permits the first of these.  The two
cases appear equivalent, and there's no obvious reason why the latter
isn't permitted (but I suspect it makes life easier for parsers if
they know immediately that it's a mixed content model).

BTW, did you really expect Jade to spot an XML constraint when you
didn't use the -wxml argument?

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Wed Sep 23 11:57:54 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:00 2004
Subject: XML IDL and XML RPC
Message-ID: <199809230941.LAA00276@berlin.dvs1.tu-darmstadt.de>

Rex Brooks wrote:

> I only recently subscribed to this mailing list in order to better
> understand xml and how it was developing in relation to IDL and distributed
> computing in general. I am beginning to be gravely disappointed at the
> extent to which this is a list mainly populated by self-serving vendors and
> authors. Is this really true, or am I just stepping in during a
> particularly ugly patch of competing offers?
> 
> Soon to unsubscribe:
> Rex Brooks

I only briefly followed the IDL/RPC discussion, but the original question was 
whether a Corba operation could be expressed in XML.  What followed were 
references to or descriptions of a number products that already do this, 
including Joe Lapp's excellent description of his product's strategy.  All of it 
struck me as a pretty good way for somebody to learn about how to do IDL/RPC in 
XML, whether they wanted to use an existing solution or roll their own.

I don't understand what is self-serving about this.  People are simply talking 
about what they work on, which is what they know best.  The discussions were 
factual, which is within the stated bounds of this list (marketing hype draws 
you a stern slap from the list monitors).  And how were the book plugs different 
from Web-site references except that the resource isn't online?  They happened 
to come from the authors in this case, but they frequently come from readers and 
authors frequently mention competing books, which is hardly self-serving.

As to the general population of the list, it seems that most of the people who 
post contribute their opinions on a variety of questions as well as providing 
information about the specific projects/software they are working on.  In 
addition, the members of this list have developed/are developing at least three 
specifications (SAX, XSchema, and XML Logging) and provide regular input on 
many, many others.  I can't speak for others, but I know I do a lot of this on 
my own time.  I also suspect that for most of us, being able to occasionally 
puff up our chest and point to something we did is about as self-serving as it 
gets.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Wed Sep 23 14:52:09 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:05:00 2004
Subject: XML IDL and XML RPC
In-Reply-To: <199809230941.LAA00276@berlin.dvs1.tu-darmstadt.de>
Message-ID: <3.0.5.32.19980923055327.012fcad0@scripting.com>

>>I don't understand what is self-serving about this. People are simply
talking 
about what they work on, which is what they know best.

Thanks for the air cover. While we sell a commercial product that does
XML-RPC, and is an XML database, and we also have deployed XML apps and
services, I am also interested in XML because it is good for the software
industry. 

I've had a long career where things would have been possible if
compatibility could have been achieved, but good stuff didn't happen
because people didn't work together. I have high hopes for XML, but it can
only work if people cooperate and make their products compatible. This list
is invaluable to me *because* it connects competing commercial developers
so we can learn about what each other is doing and look for ways to be
compatible.

That said, to answer the question, I wrote a top-level piece in July,
suitable for investors or marketing people, that explains why XML-RPC is so
interesting:

http://www.scripting.com/davenet/98/07/xmlrpcfornewbies.html

Hope this helps.

Dave

PS: This description applies to the WebMethods stuff because we've been
working together. Cooperation. Coopetition. The good stuff.


--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From grk at arlut.utexas.edu  Wed Sep 23 15:41:14 1998
From: grk at arlut.utexas.edu (Glenn R. Kronschnabl)
Date: Mon Jun  7 17:05:00 2004
Subject: XP syntax error - including cals-tbl.dtd in simple xml dtd 
In-Reply-To: Your message of "Tue, 22 Sep 1998 12:05:50 PDT."
             <199809221905.MAA21144@mehitabel.eng.sun.com> 
Message-ID: <199809231340.IAA23216@ns1.arlut.utexas.edu>

In message <199809221905.MAA21144@mehitabel.eng.sun.com> you write:

[stuff deleted]

>If you are interested in using an XML version of CALS, I believe that 
>Norm Walsh's preliminary work [XMLDB] on an XML version of DocBook might
>provide a working (if nonstandard) CALS DTD.

I tried that and it almost worked.  Turns out that the problem is that the 
default entity definitions in the XML CALS table model has the two 
following lines in it that results in an invalid XML file.  Turns out that 
#PCDATA has to be first in the list in the tbl.entry.mdl entity definition.

--- from calstblx.dtd ---
<!ENTITY % paracon '#PCDATA'>
<!ENTITY % tbl.entry.mdl        "(para|warning|caution|note|legend|%paracon
;)*">

Once I reorder the tbl.entry.mdl line to put #PCDATA first, no errors.

Cheers,
Glenn                                  
--------------------
Glenn R. Kronschnabl
Applied Research Laboratories        | grk@arlut.utexas.edu (PGP/MIME ok)
The University of Texas at Austin    | http://www.arlut.utexas.edu/~grk
PO Box 8029, Austin, TX 78713-8029   | (Ph) 512.835.3642 (FAX) 512.835.3808
10,000 Burnet Road, Austin, TX 78758 | ... but an Aggie at heart!


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Wed Sep 23 15:45:53 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:05:00 2004
Subject: XML Database
In-Reply-To: <19980923034301254.AAA136@[192.168.1.1]>
Message-ID: <3.0.3.32.19980923092742.034a8670@pop.mindspring.com>

At 12:47 PM 9/23/98 +0900, Kenji Chichii wrote:
>Hi to all.
>
>We sometimes encounter a client who belives that making XML(SGML)Documents
>means making Database. What they believe is if they make their document in
>XML they will easily be able to find information in it, and to manage it. 
>
>We are having hard time to make them understand if they need data base,
>they have to develop a data base system.
>Then, they loose interest in XML(SGML). What they want is easy and versitle
>data base system.

There are, in fact, document management products that allow you to do
queries on document data, reuse components from one document in another,
provide a programming interface to the data, etc.

I would distinguish carefully between two approaches to XML databases. If
what you have is fundamentally objects, relational data, or other
traditional "database" data, then you are best off using a relational or
object database, and exposing it as XML for the purpose of data exchange.
If what you have is fundamentally document data, then a document management
system is probably what you want.

>Do you have any good ideas to show the advantage of XML as the data base
>format to this kind of people?
>Are there any experienc or information regarding XML(SGML) data base?
>Are there any good XML data base in commercial basis?
>If there is such Database, how much will it cost? 
 
What kind of data do they have? What do they want to do with it?

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Wed Sep 23 16:04:15 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:00 2004
Subject: Preference Files
References: <199809221813.OAA09232@hesketh.com> <13831.60777.133705.973152@localhost.localdomain>
Message-ID: <3608F9B7.781FFB14@technologist.com>

david@megginson.com wrote:
> 
> Simon St.Laurent writes:
> 
>  > Has anyone started work on a standard DTD for preference files, or
>  > is the expectation that everyone's going to do their own
>  > differently?  
> 
> It sounds like a good application of RDF, if you're a believer.

Whether or not you are a believer, it seems like a perfect testing ground
for RDF ideas. For instance: what would a generalized RDF editor look
like? Presumably, it would be suffice as your preference file editor for
"experts" (at computing in general, not XML/RDF in specific). It would
probably look vaguely like the Windows registry editor, but do something
more sophisticated with embedded strucutures and links.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

How many of the Congresspeople who voted for the CDA do you suppose
also voted to release the report that reads like a borderline por-
nographic dime-store romance written by a Texas preacher's son?
	- Keith Dawson, TBTF 
		http://www.tbtf.com/archive/09-14-98.html
		http://www.tbtf.com/resource/hypocrites.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Wed Sep 23 16:43:34 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:05:00 2004
Subject: Preference Files
In-Reply-To: <3608F9B7.781FFB14@technologist.com>
References: <199809221813.OAA09232@hesketh.com>
 <13831.60777.133705.973152@localhost.localdomain>
Message-ID: <3.0.3.32.19980923103018.00ba9610@pop.mindspring.com>

At 08:37 AM 9/23/98 -0500, Paul Prescod wrote:
  
>Whether or not you are a believer, it seems like a perfect testing ground
>for RDF ideas. For instance: what would a generalized RDF editor look
>like? Presumably, it would be suffice as your preference file editor for
>"experts" (at computing in general, not XML/RDF in specific). It would
>probably look vaguely like the Windows registry editor, but do something
>more sophisticated with embedded strucutures and links.
 
Is there any advantage in going beyond the simplified form of RDF that is
used in DCD? That would be enough to identify the data types, and it looks
like it allows very readable formats (like David's first example) while
still maintaining the information needed for generic programmatic handling
(like David's second example).

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jonathan at texcel.no  Wed Sep 23 16:45:27 1998
From: jonathan at texcel.no (Jonathan Robie)
Date: Mon Jun  7 17:05:00 2004
Subject: XML IDL and XML RPC
In-Reply-To: <199809230247.TAA24023@transbay.net>
Message-ID: <3.0.3.32.19980923104011.00ba9510@pop.mindspring.com>

At 07:47 PM 9/22/98 -0700, Rex Brooks wrote:
 
>I only recently subscribed to this mailing list in order to better
>understand xml and how it was developing in relation to IDL and distributed
>computing in general. I am beginning to be gravely disappointed at the
>extent to which this is a list mainly populated by self-serving vendors and
>authors. Is this really true, or am I just stepping in during a
>particularly ugly patch of competing offers?

You seem to be responding to a post that I found informative enough to
forward to two different people. It described the architecture for a
particular product in some detail, and seemed to be about the architecture,
not a sales plug for the product.
 
My impression is that a lot of the people who are involved in this are
either implementing things that use it, writing about it, or consulting,
which is why they know what they are talking about. If we eliminate the
"self-serving vendors and authors", and perhaps also the "self-serving
consultants", along with some "self-serving students" who are trying to get
their degrees, we're left with people who aren't doing anything, and
therefore have little to say. The people on this list are, by and large,
very good, and they are doing real things.

When it comes to IDL, I'm one of those people who isn't doing much with it
and has little to say, so I'm glad to read the comments of those who know
what they are talking about.

Jonathan
 
jonathan@texcel.no
Texcel Research
http://www.texcel.no

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep 23 16:49:10 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:00 2004
Subject: Preference Files
In-Reply-To: <3608F9B7.781FFB14@technologist.com>
References: <199809221813.OAA09232@hesketh.com>
 <13831.60777.133705.973152@localhost.localdomain>
Message-ID: <199809231448.KAA19534@hesketh.com>

At 08:37 AM 9/23/98 -0500, Paul Prescod wrote:
>david@megginson.com wrote: 
>> Simon St.Laurent writes:
>> 
>>  > Has anyone started work on a standard DTD for preference files, or
>>  > is the expectation that everyone's going to do their own
>>  > differently?  
>> 
>> It sounds like a good application of RDF, if you're a believer.
>
>Whether or not you are a believer, it seems like a perfect testing ground
>for RDF ideas. For instance: what would a generalized RDF editor look
>like? Presumably, it would be suffice as your preference file editor for
>"experts" (at computing in general, not XML/RDF in specific). It would
>probably look vaguely like the Windows registry editor, but do something
>more sophisticated with embedded strucutures and links.

I think I'm going to discuss RDF but not march right into it for the
example itself.  I'm not a 'believer' (sorry, guys) and I suspect the
attributes/elements/whatever-they're-just-properties is a bit too
cutting-edge for the extremely simple application I'm producing here.  

To be honest, I think RDF needs a book (or several) of its own, and I don't
think I'm the one to write it.


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep 23 16:54:59 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:00 2004
Subject: Preference Files
In-Reply-To: <3.0.3.32.19980923103018.00ba9610@pop.mindspring.com>
References: <3608F9B7.781FFB14@technologist.com>
 <199809221813.OAA09232@hesketh.com>
 <13831.60777.133705.973152@localhost.localdomain>
Message-ID: <199809231454.KAA19606@hesketh.com>

At 10:30 AM 9/23/98 -0400, Jonathan Robie wrote:
>Is there any advantage in going beyond the simplified form of RDF that is
>used in DCD? That would be enough to identify the data types, and it looks
>like it allows very readable formats (like David's first example) while
>still maintaining the information needed for generic programmatic handling
>(like David's second example).

Actually, reading the DCD proposal is what finally dropped me out of
'potential believer' status.  RDF seems to me too far, too fast, without
enough compelling reasons that I can see. If that's heresy, then so be it.
 DCD will receive discussion in this book, but no extended examples - it's
only a NOTE, after all.

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:01:46 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:00 2004
Subject: Mix encodings in a document?
References: <1305666774-174489322@tallent.com>
Message-ID: <36090DA3.F246F485@locke.ccil.org>

Deke Smith asked about Gavin Thomas Nicol's remark:

> >Remember: byte != character code != character != glyph

A character code may be more than one byte long, but is always
an integer.  A character is an abstract object which can be
represented by different character codes in different coded
character sets (ASCII, EBCDIC/US, JIS X 0208, etc.)

Glyphs are abstractions of *appearance*, whereas characters are
abstractions of *function*.
 
> ISO-10646-UCS-2
> ISO-10646-UCS-4
> ISO-10646-UTF-1
> ISO-10646-Unicode-Latin1
> ISO-10646-J-1
> UNICODE-1-1
> UNICODE-1-1-UTF-7
> UTF-7
> UTF-8

ISO-10646-UCS-2 is near enough UTF-16; UTF-16 only implies that
surrogates are correctly processed, and decent UCS-2 implementations
will at worst leave surrogates alone.
 
-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:06:13 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:00 2004
Subject: Public Identifiers
References: <199809221448.KAA03627@ruby.ora.com>
Message-ID: <36090EB5.EB9482E3@locke.ccil.org>

Chris Maden wrote:
> There is a need to refer to things
> that nobody has registered, yes.  But you then say that Joe User
> shouldn't register Sears, but that's exactly what he did in your
> -//Sears//... example.  If Sears has not created an FPI for their 1922
> catalog, then Joe User should say -//Joe User//NONSGML Sears Roebuck
> and Co. 1922 Catalog//EN.

I agree absolutely.
 
> This isn't true.  The URN spec, as John quoted, specifically allows
> the case of URNs that are not resolvable to a concrete electronic
> resource.  And since FPIs can be used as URNs,
> urn:fpi:-//Joe%20User//NONSGML%20Sears%20Roebuck%20and%20Co.%201922%20Catalog//EN
> is a perfectly legitimate URN that points to the lump of paper in my
> outhouse.

Right, though urn:x-joe-user:SearsCatalog1922 might be more perspicuous
as well as shorter.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:11:17 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:00 2004
Subject: Socat issues for XML
References: <3.0.32.19980922103419.01143a48@pophost.arbortext.com>
Message-ID: <36090FEE.9678496F@locke.ccil.org>

Paul Grosso wrote:

> I have not heard a convincing argument for not including OVERRIDE
> in your subset of TR9401.  There are many TR9401 catalogs in use,
> implementing OVERRIDE is trivial, and users are used to using it.
> If you don't include it, then the existing catalogs--and users
> who write new catalogs based on their understanding of TR9401
> catalogs as they exist--will get subtlely different results with
> no warning because you would be ignoring the OVERRIDE NO entries.

*grumble*

I concede your points.  I will add support for OVERRIDE YES/NO,
with standard semantics.

> I see your point.  You want something like a PUBLIC-CATALOG entry
> type with the same semantics as the CATALOG entry type except
> additionally with the semantic "ignore if the external identifier
> has no public identifier."  Note that the referenced catalog entry
> file could still have SYSTEM (and ENTITY and other) entry types, and
> if the catalog is ever processed, all those entry types are significant,
> it's just that no catalog referenced by a PUBLIC-CATALOG entry would
> be processed if the current external identifier being resolved has
> no public id.

More accurately, it would not be processed merely because of the
PUBLIC-CATALOG entry.  If it were also referred to indirectly or
directly via a CATALOG entry, it would be processed.

> Note, you can't say "looking for" (or "not looking for") a public
> id, because you are never looking for a match to a public id per se.
> You are always looking for a match for the set of info that you have
> for the current external identifier, and that set of info includes
> one or more of (1) public id, (2) system id, (3) entity name.

Granted.  But due to implementation restrictions in SAX, I never
have an entity name available: consequently, the NOTATION and ENTITY
entries are ignored.

> I think you'd also want the standard CATALOG entry type, therefore
> PUBLIC-CATALOG would be a new entry type.  The standard CATALOG
> entry would address my "compelling example" as well as give
> compatibility with TR9401 catalogs.

Yes.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:20:47 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:00 2004
Subject: Socat issues for XML
References: <199809210051.UAA10435@locke.ccil.org> <3607D4B3.415C1D1@eng.sun.com>
Message-ID: <36091206.FB6C953B@locke.ccil.org>

David Brownell wrote:

> As a networking guy I've got to wonder what the right number
> of caching mechanisms for URLs should be.  My gut reaction is
> that SYSTEM identifiers should be managed like any other part of
> the web, and proxy caches should be configured to know that
> some entries change virtually never.  That's in line with what
> John proposed (don't map URIs to URIs).

I think that the main benefit of URI->URI mappings (SYSTEM entries)
is for local (meta)stable caches informally created by users.  There is
no support for this in the HTTP world now, and if there were,
people often want to keep local copies of things accessible by
FTP (like RFCs).

> "Use the Web, Luke!"

Mostly yes. But most of us don't run proxy servers on our own
workstations.

> p.s. You'll notice that the com.sun.xml.parser.Resolver class in
>     http://developer.java.sun.com/developer/earlyAccess/xml/
> doesn't support any particular catalog syntax, and offers only a
> flat namespace for XML public IDs rather than expecting SGML's FPIs
> to apply.

Boo, hiss!

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:24:53 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: Corba data ->   XML
References: <01BDE624.4689F7A0@thillai>
Message-ID: <3609130F.DBF96771@locke.ccil.org>

Thillai wrote:

> I know only little bit about XML.  I like to know whether a CORBA operation
> can be expressed in XML document type definition?

You got lots of answers to the question "How to do Corba-style stuff
by exchanging XML documents", but that isn't really what you asked.

> For example if the IDL has something like
> 
> interface z
> {
>         typedef sequence<String, 5>  t1;
> 
>         void x(in int p1,  in t1 p2);
> }
> 
> and I want to store the parameters for the operation in a file.
> 
> I like to have a XML data file like
> <interface>
> z
> <operation>
> x
> <inparameter>
> <param>
> p1
> <value>10</value>
> </param>
> <param>
> p2
> <value><nelems>5</nelems><0>abc</0><1>efg</1><2>hij</2><3>
> klm</3><4>nop</4></value>
> </param>
> </inparameter>
> </operation>
> </interface>
> 
> If there is no maximum limit for the type t1 then no. of elements in the
> sequence might vary.

The tags <0>, <1> etc. are not valid XML, because an element name can't
be numeric.  You could use _0, _1, etc. but really all you need is
just "<value><component>abc</component><component>efg</component> ...",
with no need to specify an explicit count (XML processors are expected
to be able to count things).

> For this file is it possible to write DTD.  (I will read about DTD and find.
> Before that any expert comment will be helpful)

Yes, no problem.
 
> If it is possible then is it possible to write XSL for getting values from the
> user.  (no. of elements in the sequence might vary at runtime).

I don't really see how.  XSL is for styling XML for display.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From deke at tallent.com  Wed Sep 23 17:31:17 1998
From: deke at tallent.com (Deke Smith)
Date: Mon Jun  7 17:05:01 2004
Subject: Mix encodings in a document?
Message-ID: <1305575850-179956852@tallent.com>

John Cowan, cowan@locke.ccil.org said on 9/23/98 10:02 AM:

>ISO-10646-UCS-2 is near enough UTF-16; UTF-16 only implies that
>surrogates are correctly processed, and decent UCS-2 implementations
>will at worst leave surrogates alone.

And what is the implications of this (if any) for XML rendering? I'm not 
sure of what you mean by "surrogates are correctly processed."

Thanks for everyone's answers. They have helped.

-----------------------------------------------------------------
Deke Smith
Tallent Communications Group, Brentwood TN
deke@tallent.com, 615-661-9878
-----------------------------------------------------------------
" The best way to predict the future is to invent it. " 
       - Alan Kay 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From epalma at fsaa.ulaval.ca  Wed Sep 23 17:43:37 1998
From: epalma at fsaa.ulaval.ca (Eduardo Palma)
Date: Mon Jun  7 17:05:01 2004
Subject: attributes
Message-ID: <199809231605.MAA26290@cerberus.ulaval.ca>

Hi 

how you can know if an 
xml attribute
is present with JavaScript? 

I tried this:

		if (theElement.getAttribute("MANAGER") != null)
			aText += "&manager=" + theElement.getAttribute("MANAGER");

doesn't work!

Thanks
Eddie


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dalapeyre at mulberrytech.com  Wed Sep 23 17:52:30 1998
From: dalapeyre at mulberrytech.com (Deborah Aleyne Lapeyre)
Date: Mon Jun  7 17:05:01 2004
Subject: Public Identifiers
In-Reply-To: <Version.32.19980922190714.00f81a30@CraneSoftwrights.com>
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <v03020902b22ec9d03677@DialupEudora>

>At 98/09/18 06:49 -0500, W. Eliot Kimber wrote:
>>In hindsight, it's clear to me that we never should have allowed public IDs
>>in XML.

As Ken Holman has said, this was a long and bloody discussion.  But to sum
up ONE of the critical points "in-favor-of" fpis:

I need to get information to London, and you are taking away my quill pen
and paper and giving me a radio, but telling me there will be no radio
broadcasts until next year.

--Debbie

======================================================================
Deborah Aleyne Lapeyre               mailto:dalapeyre@mulberrytech.com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9633
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
----------------------------------------------------------------------
  Mulberry Technologies: A Consultancy Specializing in SGML and XML
======================================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 17:57:07 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: Public Identifiers
References: <199809220523.BAA23950@locke.ccil.org> <199809221814.NAA01159@bruno.techno.com>
Message-ID: <36091A54.E708787C@locke.ccil.org>

Steven R. Newcomb wrote:

> > ... your FPI looks a whole lot like a Sears FPI; it appears
> > to be in Sears space.  What is the source of your right to create a
> > name within Sears space?
> 
> I didn't create it.  Sears did, when it published its 1922 Farm
> Catalog.

Not so.  Sears created a catalog in 1922, but not an FPI.

In any event, I think I understand the source of our disagreements.
As I (and Chris Maden, apparently) read the owner-identifier in FPIs,
it is the creator of the name, not of the thing named, that
appears there.  Apparently you disagree:
 
> So, there must be many occasions when formal public identifiers, or
> something like them (and URNs, or something like them) will need to be
> created by persons who lack any special authority to create them.
> They must include the name of the authority that created, published,
> or is otherwise associated with the referenced information.

This seems to clearly indicate that for you an FPI is like a
bibliography entry: it holds the publisher, the format, the title,
and the language of the publication.  Anybody can create one as
long as they tell the truth.

For me, an FPI is a name assigned by a namer (which can be anyone),
and its parts are the namer, the format, the assigned name, and the
language.  I can name anything if I list myself as the namer, but
I cannot concoct names listing someone else as the namer, any more
than I can create an ISBN for a book I publish that has the wrong
publisher prefix.

> If FPIs and/or URNs should *not* be used for referencing offline
> information produced by unregistered authorities, then what should?

They should be so used, but not as you propose using them, I think.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jmcdonou at library.berkeley.edu  Wed Sep 23 18:00:33 1998
From: jmcdonou at library.berkeley.edu (Jerome McDonough)
Date: Mon Jun  7 17:05:01 2004
Subject: Mixed Content Models
In-Reply-To: <3.0.32.19980922224713.00b043c0@pop.intergate.bc.ca>
Message-ID: <3.0.5.32.19980923085744.009ead00@library.berkeley.edu>

At 10:53 PM 9/22/98 -0700, Tim Bray wrote:
>At 11:47 AM 9/21/98 -0700, Jerome McDonough wrote:
>>	<!ELEMENT qstn   (#PCDATA | (preQTxt?, qstnLit?, postQTxt?, forward?,
>>							  backward?, ivuInstr*))*
>>
>>Is this a legitimate content model under XML section 3.2.2?
>>Msxml doesn't have a problem with it, and nsgmls using the -wxml flag
>>also happily parses the DTD.  IBM's xml4j, however, complains:
>>"Codebook.dtd: 1256, 33: This content model is not matched with the
>>mixed model '(#PCDATA|FOO|BAR|. . .|BAZ)*': '(#PCDATA|(preQTxt?, qstnLit?,
>>postQTxt?,forward?,backward?,ivuInstr*))*".
>
>This is *totally* illegal per the spec.  Is that the msxml with IE4 
>or IE5?  If 4, no biggie, they were up-front about being behind.  If 
>IE5, I'm flabbergasted and MS needs to hear about it now.
>

It's the older version, available through Microsoft's site.  No need
for panic, although having just checked, I'm a little surprised
that Microsoft still makes this older code available off their site.
Now that the newer code is available, they should probably stop 
distributing the older version and just point at the DataChannel
site.


Jerome McDonough -- jmcdonou@library.Berkeley.EDU  |  (......)
Library Systems Office, 386 Doe, U.C. Berkeley     |  \ *  * /
Berkeley, CA 94720-6000    (510) 643-2058          |  \  <>  /
"Well, it looks easy enough...."                   |   \ -- /  SGNORMPF!!!
         -- From the Famous Last Words file        |    ||||

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Wed Sep 23 18:04:25 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:01 2004
Subject: Mix encodings in a document?
Message-ID: <001001bde70b$ef26dd00$1e09e391@mhklaptop.bra01.icl.co.uk>

Jerome McDonough wrote:
>
>ISO-10646-UCS-2 (the 2-octet Basic Multilingual Plane) is
the
>same as Unicode (which is a 16-bit chararacter encoding),
so
>that would be your "UTF-16." (I don't think that,
technically,
>the 16-bit encoding gets referred to as a UCS Transmission
Format).
>
No. UTF-16 is an encoding of ISO 10646 that uses 16 bits to
represent the characters in the Basic MultiLingual Plane
(BMP, equivalent to Unicode) and longer sequences to
represent characters outside the BMP. It is thus a pure
superset of UCS-2 or Unicode. See
http://osiris.dkuug.dk/jtc1/sc2/wg2/docs/N1334.html

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 18:26:11 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: Mix encodings in a document?
References: <3.0.5.32.19980922093407.00aac560@library.berkeley.edu> <3.0.5.32.19980922112823.00b6d4b0@library.berkeley.edu>
Message-ID: <36092162.911B7109@locke.ccil.org>

Jerome McDonough wrote:

> Under Unicode version 2.0,
> what I should've said is:
> 
>         Unicode == ISO-10646-UCS-2 != UTF-16
> 
> as Unicode and 10646 in UCS-2 format should be identical, but UTF-16
> differs from both of these in it allows the use of code surrogate
> pairs to enable encoding the BMP and next 16 planes of UCS-4.  From
> what I can see at Unicode's home page, it now looks like Unicode is
> dropping UCS-2 character encoding and now only endorses UTF-8 and
> UTF-16, so that the situation now is:
> 
>         Unicode != ISO-10646-UCS-2
> 
> and Unicode sometimes does/sometimes does not equal UTF-16.  Is that
> more or less the case at the moment?

"Unicode 2.0" and "Unicode 2.1" always mean UTF-16.  UCS-2 proper
(that is, the encoding that does not allow references to what
10646 calls Planes 1 to 10) has never been Unicode since the
distinction between UCS-2 and UTF-16 was invented.  Before that,
there was only UCS-2 and Unicode = UCS-2.

So Unicode = UTF-16 != UCS-2, but the distinction is usually
trivial: UCS-2 per se does not define any meaning for surrogate
characters.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 19:10:15 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: XSchema: Section 4
References: <199809222304.BAA27676@berlin.dvs1.tu-darmstadt.de>
Message-ID: <36092BD1.32FC3D03@locke.ccil.org>

Ron Bourret wrote:

> QUESTION: What happens to the following:
> 
> * Processing instructions

Discard.  PIs in the DTD (that is, in the external subset or
in external parameter entities) belong to the document,
not the schema.  They shouldn't be read by the XSchema processor
as part of the schema.

> * Text encoding declarations

Interpret and regenerate.  There is no requirement that the output
of a converter have the same text encoding as the input.

> The following XSchema structures may be converted to the corresponding DTD
> structures or discarded:
> 
> * Doc elements. These may be converted to comments. The position of resulting
> comments in the DTD is the choice of the converter.

It should be said generally that comment declarations may be
freely generated by converters in either direction ad libitum.
They may copy them, discard them, or invent them.  (In particular,
a converter may copy over the whole of its input as a comment,
or as many comments, for documentation's sake.)

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 19:12:46 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: XSchema: Sections 5.0 and 5.1
References: <199809222304.BAA27678@berlin.dvs1.tu-darmstadt.de>
Message-ID: <36092C4E.357487FB@locke.ccil.org>

Ron Bourret wrote:

> [1] XSchemaPI ::= '<?xschema' S XSchemaID S? '?>'

Why the funky lower case?  "XSchema" here as elsewhere.
Also, there can be S after the "<?".

> [2] XSchemaID ::= 'xschema' Eq SystemLiteral

Some provision needs to be made for a PubIdLiteral as well.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 19:19:11 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: Order of #PCDATA in dtd - relevant? or XP bug?
References: <36087406.C60F8AFE@arlut.utexas.edu>
Message-ID: <36092DF4.42FE6FA8@locke.ccil.org>

Glenn Kronschnabl wrote:

> A simple DTD that causes a syntax error in XP.
> If you run the following dtd thru both jade
> and XP, it runs fine thru jade but XP gives a
> syntax error.  The only difference is the order
> of the % inline and #PCDATA.

Jade is a general SGML tool, and doesn't enforce
XML restrictions on content models.  XP does.
 
> So, given this, does imply that the order of #PCDATA
> is relevant or is this a genuine XP bug?

No, it's an XML rule:  #PCDATA must be mentioned first,
because the minute a parser sees #PCDATA, it needs to enforce not
the general "element-content" rules but the different
"mixed-content" rules.
 
-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Wed Sep 23 19:27:22 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:05:01 2004
Subject: Public Identifiers
References: <199809221448.KAA03627@ruby.ora.com> <36090EB5.EB9482E3@locke.ccil.org>
Message-ID: <36093170.B9274B0C@mecomnet.de>

John Cowan wrote:
> 
> Chris Maden wrote:
> > There is a need to refer to things
> > that nobody has registered, yes.  But you then say that Joe User
> > shouldn't register Sears, but that's exactly what he did in your
> > -//Sears//... example.  If Sears has not created an FPI for their 1922
> > catalog, then Joe User should say -//Joe User//NONSGML Sears Roebuck
> > and Co. 1922 Catalog//EN.
> 
> I agree absolutely.

I almost agree.
There is also a need to refer to "things" which someone else has registered.
It shouldn't matter whether Sears has created an identifier or not. Any naming
authority should be able to generate any name within its space within the
terms of fair use. Otherwise, for example, there would be no way to catalogue
a library.

Is there's something special about FPI's which precludes this?

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 20:06:03 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:01 2004
Subject: Public Identifiers
References: <3.0.5.32.19980918094925.008e4e70@dns.isogen.com> <3.0.5.32.19980920231815.00963d00@dns.isogen.com>
Message-ID: <360938ED.E0E806D0@locke.ccil.org>

W. Eliot Kimber scripsit:

> [T]he PUBLIC/SYSTEM distinction made by
> SGML (and XML) is inappropriate as a matter of syntax.  A name is a name
> and there should be exactly one declared for each entity.

However, there is a difference between a name (FPI or URN) and
an address (URL), which does not quite correlate with PUBLIC
vs. SYSTEM.  IMHO this distinction is very much worth maintaining.

When we address postal mail, we generally
supply both a name and an address, and the postal system primarily
uses the address to deliver mail.  (There may be indirection within the
address system, as when mail is forwarded, or through P.O. boxes, but that's
another matter.)  If address-based delivery fails, the postal
tries to use name-based delivery (I have gotten mail addressed
to "John Cowan" at all sorts of random addresses in my city).

Resulting maxims:

	1) My name is not simply an indirect way of indicating
	my address!

	2) Names tell us "who", addresses tell us "where".

> URN resolution mechanisms should be independent of the syntax used for the
> binding--they should simply expect two arguments, a name-space name and a
> name in that name space. How the client that makes the resolution request
> gets those two arguments is its business.

Having a single syntax makes the interface between a client stub and
a general resolver (as opposed to namespace-specific resolvers)
universal.

> Given that my analysis is correct, here's what I'd like to see happen:
> 
> 1. A general recognition of the need for name-space/name bindings in data
> representation standards, regardless of the kind of data.  If these
> bindings are further standardized along the URN lines (its semantics, not
> its syntax, necessarily), so much the better.

Agreed.

> 2. Given item (1), data management systems (including operating systems and
> networking systems) providing generalized name-space-to-resolver services
> that reflect the general approach defined by item (1).  For Internet-based
> resources, the DNS proposal is probably appropriate and reasonable.

Agreed.
 
> 3. Web clients upgraded to accept "urn:url:" as a prefix to otherwise
> normal URLs.

I disagree, as that would blur an essential distinction.
 
> 4. People and enterprises providing non-URL name resolution servers.  These
> could be along the lines of the PURL services currently being provided (and
> could probably be implemented with the existing PURL software).  For
> example, Oasis could fund a couple of public identifier servers.

Three cheers!

> And now, having said that SGML formal public identifiers have no special
> properties, let me point out that the fact that registered formal public
> identifiers are registered means that you could use owner names to direct
> public ID resolution to servers maintained by the name owner, rather than
> relying on a central FPI resolution server (that is, "DNS for FPIs").

DELEGATE was born to make this work.

> If I
> understand the DNS-for-URN resolution proposal (which I very well may not,
> not being an Internet expert by any stretch), the ability to do this is
> inherent in the proposal.

The maxim here is: If you need a distributed database on the Internet,
try to use the DNS if you can, because it is robust, scaleable,
and --- most importantly --- already there.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Wed Sep 23 20:54:04 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:05:02 2004
Subject: attributes
Message-ID: <002901bde723$7ef578d0$2ee044c6@arcot-main>

Eddie,

>how you can know if an
>xml attribute
>is present with JavaScript?
>
>I tried this:
>
> if (theElement.getAttribute("MANAGER") != null)
> aText += "&manager=" + theElement.getAttribute("MANAGER");

getAttribute will return empty string if attribute is not present.  Use
getAttributeNode.

if (theElement.getAttributeNode("MANAGER") != null)

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Wed Sep 23 21:14:04 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:02 2004
Subject: XSchema: Sections 5.0 and 5.1
Message-ID: <199809231913.VAA16422@berlin.dvs1.tu-darmstadt.de>

John Cowan wrote:
> Ron Bourret wrote:
> 
> > [1] XSchemaPI ::= '<?xschema' S XSchemaID S? '?>'
> 
> Why the funky lower case?  "XSchema" here as elsewhere.

I started with XSchema and then thought xschema was easier to type (says he who 
proposed mixed case element names).  I'll change it.

> Also, there can be S after the "<?".

Not true.  See production [16] in the XML spec.

> > [2] XSchemaID ::= 'xschema' Eq SystemLiteral
>
> Some provision needs to be made for a PubIdLiteral as well.

I wondered about this.  I'll change the xschema in [2] to SYSTEM and add a 
production for PUBLIC.  Two questions:

1) Are there any conventions in PIs for use/no use of equals signs?

2) Do we always require a system identifier or are the choices system / public / 
both?

-- Ron

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Wed Sep 23 21:29:12 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:05:02 2004
Subject: Reusing schema vocabularies
References: <3.0.1.32.19980913155150.00de4308@ifi.uio.no>
Message-ID: <36094DFD.C0D520B0@mecomnet.de>

? why not just keep things simple:

use namespaces to keep names distinct.
use architectures to combine content models?

don't try to use one for a purpose for which it is not intended and for which
the other provides a better mechanism.

Lars Marius Garshol wrote:
> 
>   REUSING SCHEMA VOCABULARIES: THINKING OUT LOUD
> ================================================
> 
> INTRODUCTION
> ------------
> 
...
> EFFECTS ON OTHER STANDARDS
> --------------------------
> 
> NAMESPACES
> 
> Namespaces, while superficially simple, are really a profound change
> to the XML data model: one of the most basic concepts (the concept
> 'name') is changed from a string to a namespace identifier _and_ a
> string. The reuse of schema vocabularies is enabled by this modified
> concept of names, allowing processing software to pick out names
> belonging to a specific schema/namespace and operate on them.

xml has no data model. while the rec does specify the criteria for identity,
it does not specify that they be modeled as strings. in fact, the discussion
of "match", although loose, distinguishes between names and strings.

the namespace draft simply says that the name has two parts rather than one.
since there is no model specified this can't change what isn't there.

> 
> This is incompatible with the use of names in XML 1.0, which means
> that validation and attribute defaulting no longer work as before. In
> other words: both validating and non-validating parsers are affected,
> but only in the interpretation of the names used in DTDs. (XML 1.0
> documents will work with XML 1.1 parsers, but not vice versa for
> namespace-using documents.)

with certain data models, they work exactly as before. it is not the model for
names which causes the problems, it is the mechanism provided (or in the case
of the present draft <em>not provided</em>) for binding prefixes to uri's
which causes the problems.

> 
> To allow validation and attribute defaulting in XML 1.1 the schema
> syntax will have to change, whether the new syntax is a modified DTD
> syntax or some entirely new schema language. This means that XML 1.1
> documents that use namespaces will not be valid SGML documents.

while the last sentence may be true, it does not follow from the first.
...
> XML ARCHITECTURES
> 
...
> 
> MEETING THE NOTE-WEBARCH-EXTLANG REQUIREMENTS
> ---------------------------------------------
> 
> Requirement #1:
...
> Requirement #2:
>   "The syntax must unambiguously associate an identifier in a document
>    with the related schema without requiring inspection of that or
>    another schema."
> 
... 
> XML architecture names may also collide, but can be specified to
> shadow one another as with prefixes. To enable the unique
> identification of architectures (even in the case of collisions)
> architecture declaration PIs can be extended with a namespace
> attribute that contain an identifying URI.

which is to introduce the same mechanism suggested by namespaces, just with a
different syntax / encoding. why bother?

> 
> Requirement #3:
...> 
> SUMMARY
> -------
> 
> >From this discussion I emerge believing that XML architectures are a
> superior solution to the problem of reusing schema vocabularies. They

i concluded from your arguments more that they were superior for reusing
schema structure and that additions would necessary in order to handle the
issues which namespaces address.

...

> 
> The data model of XML architecures is also much simpler than that of
> namespaces,

you need to say more about the data model which you propose for the two before
i can believe this. in some data models it is not true.

...

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From crism at oreilly.com  Wed Sep 23 21:51:22 1998
From: crism at oreilly.com (Chris Maden)
Date: Mon Jun  7 17:05:02 2004
Subject: XSchema: Sections 5.0 and 5.1
In-Reply-To: <199809231913.VAA16422@berlin.dvs1.tu-darmstadt.de>
	(rbourret@dvs1.informatik.tu-darmstadt.de)
Message-ID: <199809231949.PAA06529@ruby.ora.com>

[Ron Bourret]
> I started with XSchema and then thought xschema was easier to type
> (says he who proposed mixed case element names).

[Ahem.]

[Ron Bourret]
> > > [2] XSchemaID ::= 'xschema' Eq SystemLiteral

[John Cowan]
> > Some provision needs to be made for a PubIdLiteral as well.

[Ron Bourret]
> I wondered about this.  I'll change the xschema in [2] to SYSTEM and
> add a production for PUBLIC.  Two questions:
> 
> 1) Are there any conventions in PIs for use/no use of equals signs?

Not formally, but the trend (in the XML declaration, the old PI-based
namespace proposal, and the experimental stylesheet PI) is towards
attribute-like syntax.  It makes processing a bit easier; your
expression language can retrieve information about a PI in the same
way it retrieves information about attributes.

> 2) Do we always require a system identifier or are the choices
> system / public / both?

That's really a goals question.  If the initial cut at XSchema is to
have the same (or a slight superset of) functionality of a DTD, then a
system identifier must be required.  If schemas are intended to go
beyond DTDs, then consideration should be given to not requiring
system IDs.  But I suspect the discussion will end up following the
same track that the XML WG/SIG took: there is no widespread mechanism
yet for resolving FPIs, and so a document will be less portable with
no system ID.  But that is a long and tiring debate, and it may be
better, for now, to adopt the decision of the XML WG and require
system IDs.

-Chris
-- 
<!NOTATION SGML.Geek PUBLIC "-//Anonymous//NOTATION SGML Geek//EN">
<!ENTITY crism PUBLIC "-//O'Reilly//NONSGML Christopher R. Maden//EN"
"<URL>http://www.oreilly.com/people/staff/crism/ <TEL>+1.617.499.7487
<USMAIL>90 Sherman Street, Cambridge, MA 02140 USA" NDATA SGML.Geek>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 22:20:30 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:02 2004
Subject: Mix encodings in a document?
References: <1305575850-179956852@tallent.com>
Message-ID: <3609585D.D9BC222@locke.ccil.org>

Deke Smith wrote:

> And what is the implications of this (if any) for XML rendering? I'm not
> sure of what you mean by "surrogates are correctly processed."

Essentially it means that the two 16-bit values that form a
surrogate-pair (representing a Unicode character on the Astral
Plane) is always treated as a single character.

In XML, surrogate-pairs can appear only in attribute values, #PCDATA
content, PIs, and comments; they are not allowed in element GIs,
attribute names, or the like.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 23 22:23:20 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:02 2004
Subject: Mix encodings in a document?
References: <001001bde70b$ef26dd00$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <360958AD.DEA2D6B3@locke.ccil.org>

Michael Kay wrote:

> No. UTF-16 is an encoding of ISO 10646 that uses 16 bits to
> represent the characters in the Basic MultiLingual Plane
> (BMP, equivalent to Unicode) and longer sequences to
> represent characters outside the BMP. It is thus a pure
> superset of UCS-2 or Unicode. See
> http://osiris.dkuug.dk/jtc1/sc2/wg2/docs/N1334.html

Almost.  Unicode = UTF-16; Unicode applications are not
allowed to support only the BMP, although there are no
characters on the Astral Planes yet.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eriblair at mediom.qc.ca  Wed Sep 23 22:34:14 1998
From: eriblair at mediom.qc.ca (Eric Riblair)
Date: Mon Jun  7 17:05:02 2004
Subject: How to sort the XMl element ...
Message-ID: <199809232033.QAA15036@netra.mediom.qc.ca>

Greetings,

In some HTML files I use applets (msxml) an XML file with element are not
sort.
How can I sort them before they appear in the screen ... with an internal
function or ...

Thanks for any help,
Eric


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep 23 22:34:24 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:02 2004
Subject: Mix encodings in a document?
Message-ID: <3.0.32.19980923133343.00aff680@pop.intergate.bc.ca>

At 04:23 PM 9/23/98 -0400, John Cowan wrote:
>Almost.  Unicode = UTF-16; Unicode applications are not
>allowed to support only the BMP, although there are no
>characters on the Astral Planes yet.

I've been told that the geniuses in charge have blessed a whole
bunch of language tagging characters on plane 14.  Anyone have
a confirmation of this? -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From colds at nwlink.com  Thu Sep 24 00:57:24 1998
From: colds at nwlink.com (Chris Olds)
Date: Mon Jun  7 17:05:02 2004
Subject: Mix encodings in a document?
Message-ID: <110501bde745$5da27710$dc59fcc6@albert.salsa.walldata.com>

Tim Bray said:
>At 04:23 PM 9/23/98 -0400, John Cowan wrote:
>>Almost.  Unicode = UTF-16; Unicode applications are not
>>allowed to support only the BMP, although there are no
>>characters on the Astral Planes yet.
>
>I've been told that the geniuses in charge have blessed a whole
>bunch of language tagging characters on plane 14.  Anyone have
>a confirmation of this? -Tim


Yes, this is true.  It is not (yet) part of the full standard, but it is
"provided as information and guidance to implementers".  Details at
http://www.unicode.org/unicode/reports/tr7.html

Additionally, there is a document that shows what characters and scripts are
"in the pipeline", which includes several scripts (Linear B, Etruscan,
Gothic, Western Musical Notation, etc.) that have or are expected to be
allocated space in Plane 1.  UCS-2 is history.

/cco


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Thu Sep 24 09:28:17 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:02 2004
Subject: Public Identifiers
In-Reply-To: <36091A54.E708787C@locke.ccil.org>
References: <199809220523.BAA23950@locke.ccil.org>
 <199809221814.NAA01159@bruno.techno.com>
Message-ID: <3.0.5.32.19980924022615.009374c0@dns.isogen.com>

At 11:57 AM 9/23/98 -0400, John Cowan wrote:
>Steven R. Newcomb wrote:

>In any event, I think I understand the source of our disagreements.
>As I (and Chris Maden, apparently) read the owner-identifier in FPIs,
>it is the creator of the name, not of the thing named, that
>appears there.  Apparently you disagree:

John is correct. The owner identifier identifies the owner of the *name*,
not the resource identified. It cannot be otherwise.  This issue was hashed
out in the discussions that lead to the completion of TC 2 to ISO 8879 (I'm
not sure if this discussion is archived in any public place).  It is
probably the combination of the ambiguity of the term "owner identifier"
and the original idea that public identifiers would be used for published
things (and thus provided by publishers  for others to use) that leads to
the invalid conclusion that "owner" in "owner identifier" means "resource
owner" and not "name owner".  It is a common misconception.

When Steve DeRose and David Durrand published their book on Hytime (Making
Hypermedia Work: An Author's Guide to HyTime), they included a set of
public identifiers for a variety of notations. They used the ISBN number of
their book as the owner identifiers for these FPIs.  At the time I thought
that it was inappropriate of them to create names for things they didn't
own. I now realize that it isn't a problem: by using their owner
identifier, they simply asserted control over the names, not the things
named. In particular, they made it clear that they were *not* trying to
somehow usurp the rights of the notation owners to define their own names. 

Steve and David were simply providing a service of cataloging notations in
exactly the same way that the Library of Congress assigns names to books:
the fact that the LoC owns the names in no way implies that they own the
books named. So it is with public identifiers (or URNs of any sort).

I don't care what you call me, just don't call me late for dinner.

Think of all the people who refer to you by names they prefer rather than
the name you'd like them to use.  You may find some of the names annoying
or even offensive, but you implicitly respect their right to use whatever
name they want. (Of course, you may also respond with a "304" message to
the effect of "I'd rather you not call me that".)  "Do you mind if we call
you 'Bruce' to keep it clear?"

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Thu Sep 24 09:28:20 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:02 2004
Subject: Public Identifiers
In-Reply-To: <v03020902b22ec9d03677@DialupEudora>
References: <Version.32.19980922190714.00f81a30@CraneSoftwrights.com>
 <3.0.5.32.19980918094925.008e4e70@dns.isogen.com>
 <019501bde30d$56009280$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.5.32.19980924020707.00979910@dns.isogen.com>

At 11:56 AM 9/23/98 -0400, Deborah Aleyne Lapeyre wrote:
>>At 98/09/18 06:49 -0500, W. Eliot Kimber wrote:
>>>In hindsight, it's clear to me that we never should have allowed public IDs
>>>in XML.
>
>As Ken Holman has said, this was a long and bloody discussion.  But to sum
>up ONE of the critical points "in-favor-of" fpis:
>
>I need to get information to London, and you are taking away my quill pen
>and paper and giving me a radio, but telling me there will be no radio
>broadcasts until next year.

I don't understand this statement. FPIs don't do anything that can't be
done in the context of "system" identifiers. In particular, the SOCAT
mechanism, which you would presumably use to define the mapping for FPIs,
will work just as well for this "system" identifier: "urn:is9070fpi:+//IDN
drmacro.com//DOCUMENT some doc//EN".

All that removing the distinction between PUBLIC and SYSTEM in entity
declarations does is remove the possiblity of ambiguous redundancy of two
identifiers for the same resource (remember that ISO 8879 doesn't define
which identifier takes precedence).  It doesn't remove the ability to use
names that have the characteristics of formal public identifiers.

The problem with SGML and XML with respect to public identifiers is that it
distinguishes the kind of pointer as a function of the referencing syntax,
not that it also provides a form of managed, explicitly system-independent
name.

If XML required *formal* public identifiers, then the argument that they
are useful would be more compelling because at least you'd know that the
value following a PUBLIC keyword conformed to some known and useful rules.
However, XML doesn't require formal public IDs, so you could put any valid
literal following the PUBLIC keyword, including normal URLs. Given that,
there's no useful difference between putting a resource identifier in the
"PUBLIC" slot or the "SYSTEM" slot, except that if you use the PUBLIC slot,
you'll still have to specify a value for the "system" identifier, which is
silly. [It's also silly for XML to not be consistent with notations, but
nobody listened to me on that one.]

I hear Debbie saying "public identifiers" are valuable because I know I
need to have different mappings for the same resource and SGML's public
identifier mechanism gives me that.  Reasonable enough, but it's not
compelling because you don't need public identifiers to get the result.

I hear Ken and Steve saying "*formal* public identifiers" are valuable
because they are managed name spaces that let manage my names and trust (or
at least evaluate) names I get from others in a way that is independent of
the facilities of any operating system.  I can't agree more, but as we've
seen, FPIs can be used in a URN or formal system identifier context, so
again, you can get the benefit without having to preserve the PUBLIC/SYSTEM
distinction at the entity and notation declaration level.

My observation, based in part on my own reactions in the past, is that SGML
practitioners have been using entity declarations with their PUBLIC/SYSTEM
distinction for so long that we have lost sight of what the different parts
of the system are.  We've taken particular implemenations as the definition
of what the standard and/or its intent is, which is not necessarily the
case.  I would urge everyone to revisit the wording of the standard. It is
very fuzzy.  The distinctions between PUBLIC and SYSTEM identifier are
highly semantic and subjective.  In particular, the term "system" is not
(and cannot be) crisply defined.  I could argue that public identifiers are
just as system specific as any other kind of identifier because they are
dependent on there being a system that knows how to resolve them, just that
this system may span individual computers and may have humans as necessary
components [get document, call sender of document, ask them what the
various public IDs map to, update local mapping tables].

NOTE: I am *not* arguing against well-managed, human-meaningful names. I am
*not* arguing against having lots of indirection between reference to
resource and data of resource.

All I am arguing against is the *syntax* of entity and notation
declarations that lets you specify two identifiers for a resource, that is,
the PUBLIC/SYSTEM keywords. The reason I make this argument is because
names *always* convey the name space to which they apply and it is the name
space that defines how direct or indirect its names are.  [Note: the name
space may be implicit in the processing context for the document and not
explicit in the syntax of the identifier itself, e.g., URLs used in a
Web-access context.]

Thus, given a syntax for fully qualifying names, there is no need for the
PUBLIC/SYSTEM distinction to be made outside the context of the name
specification itself.  We have at least two standardized schemes for fully
qualifying names: formal system identifiers (ISO/IEC 10744:1997, Annex A.6)
and URNs.

While the distinction SGML made was well intentioned and a reasonable
approach at the time, given that neither formal system identifiers nor URNs
had been invented, I still contend that there was no excluse for carrying
that mistake into XML.  The argument that "there is no URN resolution
facility" is incorrect. Certainly with respect to people who are today
using and want to continue using formal public identifiers it is not true
because implemented SOCAT-based systems can remap system identifiers and
can therefore remap system identifiers that are URNs, and in particular,
system identifiers that are FPI URNs.

Remember that SGML effectively requires that all general SGML processors
provide customizable entity managers. It is certainly the case for all the
SGML tools I use that the entity manager can be customized with more or
less effort, so that even if these tools don't support the latest SOCAT
specification (which, for example, ADEPT*Editor 7.x does not), you can
still modify them to resolve URNs of any sort (or formal system identifiers
of any sort, if you're not constrained by XML's URI-only requirement).

Because XML allows URIs and because public IDs can be used as URNs, the
argument that XML needed the PUBLIC keyword in order to allow the use of
FPIs is clearly bogus.  The most you can complain about is the need to
escape characters in URNs, which I grant is ugly, but not so ugly as to
compel the inclusion of the PUBLIC keyword in XML (especially if you agree
that tools should be handling the escaping at transmission time).

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Thu Sep 24 11:36:43 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:02 2004
Subject: How to sort the XMl element ...
Message-ID: <004201bde79f$6a17c180$1e09e391@mhklaptop.bra01.icl.co.uk>

>In some HTML files I use applets (msxml) an XML file with
element are not
>sort.
>How can I sort them before they appear in the screen ...
with an internal
>function or ...
>
I don't fully understand what you are trying to do.

But if you are prepared to do a little java programming you
might like to look at SAXON on
http://home.iclweb.com/icl2/mhkay/saxon.html which includes
some simple facilities for sorting XML elements.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From kenchi at utj.co.jp  Thu Sep 24 16:50:02 1998
From: kenchi at utj.co.jp (Kenji Chichii)
Date: Mon Jun  7 17:05:02 2004
Subject: XML Database
Message-ID: <19980924144113218.AAA127@[192.168.1.1]>

Jonathan,

Thank you for your kind advice.
As you said that it is important to distiguish object data from document
data.

I am thinking how we can build document data base with XML. These data base
would have capability of full text search(as you said)
and of managing parts of a document, and of configuring document parts into
a document. If it has API for XML(SGML) Editors it would be better.

But I could not find such a data base which we can use casually. (All are
very very expensive!!)

Kenji


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From bmhughes at ozemail.com.au  Thu Sep 24 17:26:53 1998
From: bmhughes at ozemail.com.au (Baden Hughes)
Date: Mon Jun  7 17:05:02 2004
Subject: text editing controls (ActiveX)
Message-ID: <000001bde7cf$7193cc80$e63570c2@bmhmobile>

Hi -

I have a feeling this is a long shot, but I'm asking anyway.

Does anyone know of a text editing control that:

* reads and writes XML
* allows people with very limited computer skills to apply
  mark up to a note in a way they are comfortable with
  (e.g. does not make them look at a structure tree)
* costs no more than a few hundred dollars
* is freely distributable as an executable
* is an activeX control
* runs under win95 and later

If so, or if you know of something reasonably close to this, can you
let me know ? Mail me privately and I'll summarise to the list.

TIA

Baden


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Thu Sep 24 20:58:41 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:03 2004
Subject: XSchema draft with sections 4 & 5
Message-ID: <199809241858.OAA01832@hesketh.com>

The latest XSchema draft, including the Sections 4 and 5 which Ron Bourret
posted here yesterday, is available at
http://www.simonstl.com/xschema/spec/xscspecv3.htm

As always, further information about XSchema is available at:
http://purl.oclc.org/NET/xschema.

Enjoy!

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Thu Sep 24 22:43:09 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:03 2004
Subject: Public Identifiers
In-Reply-To: <36091A54.E708787C@locke.ccil.org> (message from John Cowan on
	Wed, 23 Sep 1998 11:57:09 -0400)
References: <199809220523.BAA23950@locke.ccil.org> <199809221814.NAA01159@bruno.techno.com> <36091A54.E708787C@locke.ccil.org>
Message-ID: <199809241542.KAA01093@bruno.techno.com>

[John Cowan:]

> > If FPIs and/or URNs should *not* be used for referencing offline
> > information produced by unregistered authorities, then what should?
> 
> They should be so used, but not as you propose using them, I think.

You oppose the solution I have proposed.  OK, I understand that, and I
think I understand why, too.  (I also note with interest that my
approach is already supported by the URN draft, using the "x-"
prefix.)

But what is the correct solution, in your mind?  The only alternative
I know of is the HyTime "bibloc" architectural form.  In your opinion,
John, should there be a similar architectural form (in XML jargon,
"template") in XML?  Or do you have another proposal for indicating
bibliographic references?

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ffarahbo at informix.com  Fri Sep 25 02:14:29 1998
From: ffarahbo at informix.com (Farzad Farahbod)
Date: Mon Jun  7 17:05:03 2004
Subject: No subject
Message-ID: <199809250013.RAA28345@olympus.oak.informix.com>

Hi,
Sorry about bandwidth , I was wondering if   there is any tool to convert
from xml schema to DTD. 

Thanks
FF


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Fri Sep 25 06:38:47 1998
From: ricko at allette.com.au (Rick Jellife)
Date: Mon Jun  7 17:05:03 2004
Subject: XSchema: Sections 5.0 and 5.1
References: <199809231949.PAA06529@ruby.ora.com>
Message-ID: <360B1FE3.F35AC61A@allette.com.au>


Chris Maden �g�D�G

> [Ron Bourret]
>
> > 1) Are there any conventions in PIs for use/no use of equals signs?
>
> Not formally, but the trend (in the XML declaration, the old PI-based
> namespace proposal, and the experimental stylesheet PI) is towards
> attribute-like syntax.  It makes processing a bit easier; your
> expression language can retrieve information about a PI in the same
> way it retrieves information about attributes.

Because PIs are relatively under-defined (so you can plonk inany old thing you
want) it makes them difficult for DOM or XLL
to say much useful about: if you use an attribute-like syntax
then at least you may have a fighting chance that future common
tools will be able to examine them and use them.

But, of course, piggy-backing PIs onto elements using attributes
is probably always a preferable option, if your PI structure can
matches the start-tag locations.

Even if you use attribute-like syntax, PI values are still just
text strings, from the XML viewpoint.  Having a target (notation)
to indicate the syntax of the PI is a great headstart in labelling.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Fri Sep 25 10:18:25 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:03 2004
Subject: LISTRIVIA (was Re: How to sort the XML element ...)
In-Reply-To: <199809232033.QAA15036@netra.mediom.qc.ca>
Message-ID: <3.0.1.16.19980925090247.1b07c0f4@pop3.demon.co.uk>

At 16:35 23/09/98 -0400, Eric Riblair wrote:

[copied to at least FIVE other lists]

>Greetings,
>
>In some HTML files I use applets (msxml) an XML file with element are not
>sort.
>How can I sort them before they appear in the screen ... with an internal
>function or ...
>
>Thanks for any help,
>Eric

Please do not post general questions to XML-DEV and crosspost to other
lists. It makes the discussion very fragmented and is not normally useful
to anyone. 

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Fri Sep 25 10:33:27 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:03 2004
Subject: xml schema -> DTD
Message-ID: <199809250830.KAA21631@berlin.dvs1.tu-darmstadt.de>

Farzad Farahbod wrote:
> Sorry about bandwidth , I was wondering if   there is any tool to convert
> from xml schema to DTD. 

By "xml schema", do you mean XSchema?  

If so, I have Java classes that convert XSchema->DTD and DTD->XSchema. However, 
they have not been updated for the current spec and don't support namespaces or 
the AttGroup, Model, Enumeration, or UnparsedEntity elements.  I hope to have 
new versions in a few weeks, but these will save you some time right now if you 
don't mind cleaning up by hand.

You can download the converters from:

http://www.informatik.tu-darmstadt.de/DVS1/staff/bourret/xschema/convert.html

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Fri Sep 25 11:29:54 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:03 2004
Subject: XSchema: Sections 5.0 and 5.1
Message-ID: <199809250924.LAA22517@berlin.dvs1.tu-darmstadt.de>

Rick Jelliffe wrote:

> But, of course, piggy-backing PIs onto elements using attributes
> is probably always a preferable option, if your PI structure can
> matches the start-tag locations.

Do you mean that instead of using:

<?XSchema SYSTEM="foo.xsc" ?>
<foo>
...
</foo>

we should use something like:

<foo XSchemaSystem="foo.xsc">
...
</foo>

A good idea, but I don't think it will work for us.  We accept an arbitrary 
number of XSchema PI's, so there is no way to name the attributes.

> 
> Even if you use attribute-like syntax, PI values are still just
> text strings, from the XML viewpoint.  Having a target (notation)
> to indicate the syntax of the PI is a great headstart in labelling.

I don't understand what you mean.  A PI has a target by definition, but I don't 
know how this indicates syntax.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 25 11:39:47 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:03 2004
Subject: Processing instructions (was XSchema: Sections 5.0 and 5.1)
Message-ID: <002201bde863$5ff770a0$1e09e391@mhklaptop.bra01.icl.co.uk>

>if you use an attribute-like syntax [for Processing
Instructions]
>then at least you may have a fighting chance that future
common
>tools will be able to examine them and use them.

>But, of course, piggy-backing PIs onto elements using
attributes
>is probably always a preferable option...

That gives me the opportunity to ask a question that's been
bugging me for a while.

I'm designing a document type. Is there any circumstance
when you would advise me to use a Processing Instruction
rather than an empty element with attributes? I can't think
of one myself, but I'm sure the XML designers must have had
something in mind.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jamesr at steptwo.com.au  Fri Sep 25 13:35:08 1998
From: jamesr at steptwo.com.au (James Robertson)
Date: Mon Jun  7 17:05:03 2004
Subject: text editing controls (ActiveX)
In-Reply-To: <000001bde7cf$7193cc80$e63570c2@bmhmobile>
Message-ID: <199809251134.VAA16351@oznet11.ozemail.com.au>

At 01:24 25/09/1998 , you wrote:

  | I have a feeling this is a long shot, but I'm asking anyway.
  | 
  | Does anyone know of a text editing control that:
  | 
  | * reads and writes XML
  | * allows people with very limited computer skills to apply
  |   mark up to a note in a way they are comfortable with
  |   (e.g. does not make them look at a structure tree)
  | * costs no more than a few hundred dollars
  | * is freely distributable as an executable
  | * is an activeX control
  | * runs under win95 and later
  | 
  | If so, or if you know of something reasonably close to this, can you
  | let me know ? Mail me privately and I'll summarise to the list.

Make that a native Delphi control, and if necessary,
scrub the requirement for low purchase cost ...
and you've got a sale. Immediately.

Any solutions out there?

Cheers,

J

-------------------------
James Robertson
Step Two Designs Pty Ltd
SGML, XML & HTML Consultancy
http://www.steptwo.com.au/
jamesr@steptwo.com.au

"Beyond the Idea"
 ACN 081 019 623

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From SMUENCH at us.oracle.com  Fri Sep 25 14:22:33 1998
From: SMUENCH at us.oracle.com (Steve Muench)
Date: Mon Jun  7 17:05:03 2004
Subject: Unplugged & Install-Friendly XML
Message-ID: <199809251222.FAA14906@mailsun3>

Hello, 
 
If one were building a product which used XML to write out 
metadata about things, what is the recommended approach 
to referring to a product-specific DTD in your XML 
document instances given the fact that: 
 
    1. A user will want to work on his files when 
       not plugged into the Internet. 
 
         I'm assuming this means that referring 
         to DTD by http://company.com/xyz.dtd is 
         not gonna cut the mustard. 
 
    2. A user can install that product into any 
       directory on his hard drive. In particular 
       the product-specific DTD file could land 
       up in the .\LIB subdirectory of an arbitrary 
       "install home" that the user picks at 
       install time. 
 
Is there any way that I've missed in my read of the 
XML spec which would allow a DTD to use a syntax like: 
 
<?xml version="1.0"?> 
<!DOCTYPE TopLevelThingy SYSTEM "file:/&INSTALL_HOME;/lib/jbo.dtd"> 
 
Where the value of the environment variable INSTALL_HOME 
would be setup by the installer? 
 
Thanks for any ideas... 
 
_________________________________________________  
 Steve  | XML Technology | smuench@oracle.com  
 Muench |   Evangelist   | geocities.com/~smuench 
 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From richard at cogsci.ed.ac.uk  Fri Sep 25 14:41:14 1998
From: richard at cogsci.ed.ac.uk (Richard Tobin)
Date: Mon Jun  7 17:05:03 2004
Subject: Unplugged & Install-Friendly XML
In-Reply-To: Steve Muench's message of 25 Sep 98 05:15:23 -0700
Message-ID: <199809251240.NAA12479@cogsci.ed.ac.uk>

> If one were building a product which used XML to write out 
> metadata about things, what is the recommended approach 
> to referring to a product-specific DTD in your XML 
> document instances given the fact that [...]

You could refer to the DTD by means of a public id which is known to
the application.  Other XML processors that don't understand that
public id will use the URL instead.

For example:

 <!DOCTYPE mydoc PUBLIC "-//MyCompany//My product//EN" 
                        "http://my.company.com/myproduct.dtd">

Your application would recognise the id "-//MyCompany//My product//EN"
and supply a built-in DTD instead of fetching the URL.

Perhaps someone else could give better guidance on just how to construct
a suitable public id.

-- Richard

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 25 14:54:30 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:03 2004
Subject: Standard interface for DOM building
Message-ID: <007101bde884$38b11b60$1e09e391@mhklaptop.bra01.icl.co.uk>

There are a number of DOM implementations now appearing,
(for example a new one from SUN) and I have been trying to
add support for them to SAXON. The thing that's missing is a
standard interface to build the Document. Can XML-DEV step
in to fill the gap?

I'm using the following interface, as a starter for ten
(sorry, that's a UK game show phrase):

public interface DOMBuilder
{

    /**
    * Define the parser to be used when building the DOM.
The DOM implementation is free to ignore this and use its
    * own parser if it wishes.
    */

    public void setParser (Parser parser);

    /**
    * Build the DOM document from an input source.
    * @param source The InputSource to use.
    * @return The DOM Document object that results from
parsing the input.
    */

    public Document build (InputSource source)
        throws java.io.IOException,
org.xml.sax.SAXException;

}

I've got implementations of this interface working for the
Docuverse and SUN products; anyone see any difficulty in
supporting it for other DOM implementations? Are there any
other methods that could/should go in the interface?

Any ideas where this should belong: part of SAX 2.0?

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 25 15:17:48 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:03 2004
Subject: Standard interface for DOM building
References: <007101bde884$38b11b60$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <360B9838.7B8B18E7@infinet.com>

Michael Kay wrote:

> There are a number of DOM implementations now appearing,
> (for example a new one from SUN) and I have been trying to
> add support for them to SAXON. The thing that's missing is a
> standard interface to build the Document. Can XML-DEV step
> in to fill the gap?

This seems like a good idea.  One thing I do in the DOM implementation I have is
to pre-index all of the elements for each tag name into NodeLists and then store
them in a table.  The reason for this was that for some applications like XSL
Processors which need to be able to extract elements by name through
Element.getElementsByTagName(String name), this operation can be costly if done
repeatedly without any sort of indexing.  Even though the DOM interfaces are
standard, I think that some sort of context interface would help for application
developers so that they can make assumptions like: Are the values returned from
Node.getNodeName() internalised strings or not?  Other things application
programmers might want to know (that are not covered in the spec) are questions
like: can the DOM tree be indexed?

The two main solutions I have identified are:

(1) Have a DOM Document Factory and get initialization parameters via system
properties in a manner similiar to how SAX's Parser Factory looks up the value
returned from org.xml.sax.parser to get the class name for the SAX parser.

(2) Specify that particular DOM Document implementation look up certain
properties upon initialization and understand how to initialize themselves for
whatever environment they are configured for.

Right now the most standard way for DOM Document support that I can think of is
to make sure that you have at least one constructor be an empty constructor.

Regards,

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Fri Sep 25 15:48:45 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:03 2004
Subject: Sun XML early access
Message-ID: <199809251348.JAA10494@hesketh.com>

Has anyone had a chance to look at the early access Sun XML package yet?
I'm just starting my explorations, and (so far, to me) it looks pretty
intriguing.  The DOM interface is promising, and I'm very glad to see SAX
support.

If anyone's found any land mines, please let me know.  I'm used to setting
them off, but it's nice to know if other people have too.

The package is at:
http://developer.javasoft.com/developer/earlyAccess/xml/index.html


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Fri Sep 25 16:07:23 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:03 2004
Subject: Sun XML early access
Message-ID: <009a01bde88e$67570c40$1e09e391@mhklaptop.bra01.icl.co.uk>


>Has anyone had a chance to look at the early access Sun XML
package yet?
>I'm just starting my explorations, and (so far, to me) it
looks pretty
>intriguing.  The DOM interface is promising, and I'm very
glad to see SAX
>support.


I've been testing it with SAXON. Some hiccups, as yet
unresolved, but generally promising. Performance is in the
same league as xp and AElfred. They've done some interesting
things with the DOM, for example the ability to nominate
user-defined subclasses of Element, and a TreeWalker
interface. This is where SAXON started last December!

Actually I'm not sure subclassing Element with a "semantic"
subclass, e.g. a business object such as Invoice, is the
right approach, because you get a clumsy class heirarchy,
and you invite the user to override methods inappropriately.
I'd prefer to have the Element contain a "userObject"
pointer to the business object.

Is anyone from the SUN team listening?

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 25 16:25:24 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:03 2004
Subject: Sun XML early access
References: <199809251348.JAA10494@hesketh.com>
Message-ID: <360BA810.8595E905@infinet.com>

"Simon St.Laurent" wrote:

> Has anyone had a chance to look at the early access Sun XML package yet?
> I'm just starting my explorations, and (so far, to me) it looks pretty
> intriguing.  The DOM interface is promising, and I'm very glad to see SAX
> support.

The SAX support is solid and the only SAX Parser that is faster that I am aware
of is XP.  This is understandable since I have personally found that XP's I/O
routines are tons faster than the java.io classes which the SUN Parser uses.
Everything in the com.sun.xml.tree package (including the DOM support) is poorly
implemented (and that is a claim I make without even looking at the source).
When parsing Jon Bosak's ot.xml I get out of memory errors on the second try,
even after trying to reclaim memory from the first try (I had to write my own
benchmarks as the there were none supplied.).  Besides the memory problems,
building the DOM tree is incredibly slow.  Better off to stick with something
else like Docuverse for now.

> If anyone's found any land mines, please let me know.  I'm used to setting
> them off, but it's nice to know if other people have too.

The package is pretty much just SAX and DOM support and some experimental stuff
dealing with beans.  The good news is that this is not packaged as a Java
extension (something I feared SUN would do) but under the com.sun namespace.  I
guess the "open-process" with regards to SUN, Java, and XML may work out after
all.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Fri Sep 25 16:33:23 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:03 2004
Subject: Sun XML early access
References: <009a01bde88e$67570c40$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <360BA9ED.9192095@infinet.com>

Michael Kay wrote:

> >Has anyone had a chance to look at the early access Sun XML
> package yet?
> >I'm just starting my explorations, and (so far, to me) it
> looks pretty
> >intriguing.  The DOM interface is promising, and I'm very
> glad to see SAX
> >support.
>
> I've been testing it with SAXON. Some hiccups, as yet
> unresolved, but generally promising. Performance is in the
> same league as xp and AElfred. They've done some interesting
> things with the DOM, for example the ability to nominate
> user-defined subclasses of Element, and a TreeWalker
> interface. This is where SAXON started last December!
>
> Actually I'm not sure subclassing Element with a "semantic"
> subclass, e.g. a business object such as Invoice, is the
> right approach, because you get a clumsy class heirarchy,
> and you invite the user to override methods inappropriately.
> I'd prefer to have the Element contain a "userObject"
> pointer to the business object.

I pretty much never ever subclass anything for typing purposes except as a code
reuse policy.  I have already learned the hard way that this sort of approach
builds highly inflexible and highly inefficient systems.  It forces you to build
upon previous implementations of superclasses that may have crappy
implementations.  If you are strict in doing all your typing with interfaces, you
can rewrite entire parts of systems (if necessary) without having to change your
code in any of your other systems.  Every major Java ISV out there seems to get
this other than SUN who still seems to prefer the subclassing approach in their
designs.  Maybe this is for some potential purpose of vendor lock in.  Who really
knows.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gmessner at messners.com  Fri Sep 25 16:41:55 1998
From: gmessner at messners.com (Gregory M. Messner)
Date: Mon Jun  7 17:05:04 2004
Subject: XML Questions
Message-ID: <199809251441.HAA04535@websales.com>


 I have 2 questions/problems:

1) A DTD describes a document which contains content specified by a URL, a
local file name, or inline. Documents created using this DTD are assembled
and transported across a network. How do you include the content? The 2
ways we have discussed are:

    * Inline using Base64 encoding in a CDATA section
    * Wrap the document in a multipart/related MIME message
      and include the content as attachments

I am leaning towards multipart/related, but would like to know of others
experience in this area.


2) We desire to provide an API on the client side which exposes a simple
mechanism for creating and modifying objects. These objects are serialized
using XML and then transported to a server for further processing. The
server then responds with another XML document that we then de-serialize
into an object and present it to the API user. Here are some basic
requirements:

    * Support for both Java and C++
    * API must be similar for both Java and C++
    * Object members are accessed via get/set methods
    * Adhere to JavaBean method naming patterns

We are thinking of developing an application which takes a DTD and then
generates Java and/or C++ code for each object. We would use a XML helper
file to give more control over the generation process. Are we out in left
field here? What are some of the other ways to do this? What are you
experiences doing something like this?


Gregory M. Messner
gmessner@vsi.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Fri Sep 25 16:44:54 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:04 2004
Subject: Processing instructions (was XSchema: Sections 5.0 and 5.1)
References: <002201bde863$5ff770a0$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <360BA172.8DDCC3E4@technologist.com>

Michael Kay wrote:
> 
> That gives me the opportunity to ask a question that's been
> bugging me for a while.
> 
> I'm designing a document type. Is there any circumstance
> when you would advise me to use a Processing Instruction
> rather than an empty element with attributes? I can't think
> of one myself, but I'm sure the XML designers must have had
> something in mind.

Processing instructions are more for the use of people who are NOT
document type designers. Let's say your XML-smart HTTP server has a
replacement function for doing server-side variable includes (e.g. time,
date, last modified). Obviously that function cannot be tied to any
particular document type, because they can't force one document type on
all of their users. So they could specify it as a processing instruction
instead. The PI is "invisible" to DTD validation and thus doesn't
interfere with the doctype.

That was why namespaces originally used processing instructions. They
weren't supposed to interfere with document types (beyond the problems
with the prefixes, etc.). That's also why XML's own declarations are often
processing instructions.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

How many of the Congresspeople who voted for the CDA do you suppose
also voted to release the report that reads like a borderline por-
nographic dime-store romance written by a Texas preacher's son?
	- Keith Dawson, TBTF 
		http://www.tbtf.com/archive/09-14-98.html
		http://www.tbtf.com/resource/hypocrites.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 25 17:10:54 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:04 2004
Subject: Processing instructions (was XSchema: Sections 5.0 and 5.1)
References: <002201bde863$5ff770a0$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <360BB25C.A4D0F045@locke.ccil.org>

Michael Kay scripsit:

> I'm designing a document type. Is there any circumstance
> when you would advise me to use a Processing Instruction
> rather than an empty element with attributes? I can't think
> of one myself, but I'm sure the XML designers must have had
> something in mind.

Probably not.

PIs are extremely useful when adding information to documents with
frozen DTDs.  For example, the proposed "xml:stylesheet" PI allows you
to attach a stylesheet to any XML document without having to
tamper with the document's DTD (as by adding a STYLE element
or attribute).

I can't see designing a new document type and using PIs, though.
As you say, elements or attributes do the work.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Fri Sep 25 17:38:19 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:04 2004
Subject: Processing instructions (was XSchema: Sections 5.0 and 5.1)
In-Reply-To: <360BB25C.A4D0F045@locke.ccil.org>
References: <002201bde863$5ff770a0$1e09e391@mhklaptop.bra01.icl.co.uk>
	<360BB25C.A4D0F045@locke.ccil.org>
Message-ID: <13835.47263.586408.125770@localhost.localdomain>

John Cowan writes:

 > PIs are extremely useful when adding information to documents with
 > frozen DTDs.  For example, the proposed "xml:stylesheet" PI allows you
 > to attach a stylesheet to any XML document without having to
 > tamper with the document's DTD (as by adding a STYLE element
 > or attribute).
 > 
 > I can't see designing a new document type and using PIs, though.
 > As you say, elements or attributes do the work.

PI's provide a (slightly clumsy) method for adding new declaration
types to XML without breaking existing parsers.  In general, I don't
think that it's a good idea to use elements and attributes for
declarations, since they represent the logical structure of the
document itself.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Fri Sep 25 17:43:57 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:04 2004
Subject: Processing instructions (was XSchema: Sections 5.0 and 5.1)
References: <002201bde863$5ff770a0$1e09e391@mhklaptop.bra01.icl.co.uk>
		<360BB25C.A4D0F045@locke.ccil.org> <13835.47263.586408.125770@localhost.localdomain>
Message-ID: <360BBA21.2D6A4AC9@locke.ccil.org>

david@megginson.com wrote:

> PI's provide a (slightly clumsy) method for adding new declaration
> types to XML without breaking existing parsers.  In general, I don't
> think that it's a good idea to use elements and attributes for
> declarations, since they represent the logical structure of the
> document itself.

An excellent point, which I was going to include but couldn't think
how to phrase.  Thank you.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jcw at equi4.com  Fri Sep 25 18:01:35 1998
From: jcw at equi4.com (Jean-Claude Wippler)
Date: Mon Jun  7 17:05:04 2004
Subject: XML Database
Message-ID: <360BBDB0.CA1D0257@equi4.com>

Kenji,

> We sometimes encounter a client who belives that making
> XML(SGML)Documents means making Database. What they believe is if
> they make their document in XML they will easily be able to find
> information in it, and to manage it. 
> 
> We are having hard time to make them understand if they need data
> base, they have to develop a data base system.
> Then, they loose interest in XML(SGML). What they want is easy and
> versitle data base system.

As others have said, XML is an interchange format - i.e. the stuff that
matters when you either transport information, or when you wish to file
it without knowing/caring how it is going to be used in the future.

It's also a "markup language", of course.

> Do you have any good ideas to show the advantage of XML as the data
> base format to this kind of people?

Raw speed.

> Are there any experience or information regarding XML(SGML) data base?

Very, very tentatively, I'd say: yes, I've been doing some work in this
area the explore the field.  I'm the author of a cross-platform storage
manager for structured data, called "MetaKit".  Right now, there are
interfaces for C++, Tcl, and Python (the library is written in C++).

I wrote a small utility called "mk4xml", which reads any XML document
into a flattened tree-structure (using "expat") and saves that as a
MetaKit datafile.  As an experiment, I wrote a small Tcl script called
"ot_conf.tcl" which takes such a MetaKit datafile, generated by mk4xml
from the "ot.xml" (Old Testament) document, and converts it into a
nested datastructure that matches this specific document's DTD.

You can see some of this in motion at:
	http://www.equi4.com/metakit/xml/
but as you'll see this is proof-of-concept stuff at this point...
there's not even a decent index page there yet.  There's a "summary.tcl"
script which collects some stats on files generated by "mk4xml".

The results are interesting:

    File sizes:
	ot.xml			3.9 Mb
	mk4xml ot.xml result:	4.1 Mb
	ot_conv.tcl result:	3.1 Mb

    Access speeds:
	ot.xml		(take your pick, parse in seconds up to minutes)
	mk4xml ot.xml result:	opens in 1.4 sec  (on a fast PII/400)
	ot_conv.tcl result:	opens in 60 mSec  (on same system)

    Access method:
	ot.xml			SAX/DOM, usually linear scan
	mk4xml ot.xml result:	random access by element/subelement/...
	ot_conv.tcl result:	random access by book, chapter, verse
	
> Are there any good XML data base in commercial basis?

Being such a general question, I do not feel qualified to answer this.

> If there is such Database, how much will it cost? 

My first reaction would be: could be anything.  The current market for
XML software runs from free to 5-digit dollar amounts, and commercial
businesses being what they are, the rule in a new field like XML is very
likely to be "anything you can get away with"... oops, that's not a nice
thing to say, let me rephrase that as "what the market will bear".

MetaKit is free for non-commercial use, with binaries available for
Unixes, Windows, Mac, VMS, and royalty free source code licenses for
commercial use, at a price level which seems to cause some people to not
take it seriously... see the website for details.  Be your own judge.

Feel free to contact me for further details.  At this stage, suggestions
and comments are most welcome - as I said, it's just a sneak preview...

Regards,
Jean-Claude

________________________________________________________________________
Jean-Claude Wippler    MetaKit home page - http://www.equi4.com/metakit/
Equi4 Software         "Portable database software for a changing world"

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ken at bitsko.slc.ut.us  Fri Sep 25 18:20:07 1998
From: ken at bitsko.slc.ut.us (Ken MacLeod)
Date: Mon Jun  7 17:05:04 2004
Subject: Corba data -> XML
Message-ID: <199809251610.LAA14518@bitsko.slc.ut.us>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 2069 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980925/eb60f6b3/attachment.bat
From cowan at locke.ccil.org  Fri Sep 25 23:21:43 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:04 2004
Subject: Public identifiers and topic maps
Message-ID: <360C0929.A4DC3C69@locke.ccil.org>

The current draft of ISO/IEC 13250, Topic Navigation Maps (a
standard architectural form) explicitly takes Steven Newcomb's
view of the meaning of owner identifiers in FPIs: scarcely surprising,
since he is one of the editors.

In particular, clasue 6.1.1 reads in part:

# [T]he registration indicator, public text class, and language fields
# are as specified for formal public identifiers in ISO/IEC 8879:1986.
# The 'topic authority' [the string after the "+//" or "-//"] is
# the owner of <em>the information resource</em> that defines the
# concept.  [Emphasis added.]

SGML tribal elders (a set which I am not a member of) may wish to
protest this language and similar language elsewhere in the draft
before it becomes hard-coded in an ISO standard.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Billow.Danny at emeryworld.com  Fri Sep 25 23:42:59 1998
From: Billow.Danny at emeryworld.com (Billow, Danny J)
Date: Mon Jun  7 17:05:04 2004
Subject: XML parsing from within a VB server component
Message-ID: <0165C354AE8ED11182E0006094519AD8013AB8A4@MWABS021>

Can it be done?
Is there any documentation for msxml.dll?
Is this module used by the browser only or can I use it in my server
component?
If had trouble finding information on this module. I've only found what
looks like an incomplete API.
Any direction would be greatly appreciated...

Danny J. Billow


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Sat Sep 26 01:15:16 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:04 2004
Subject: Public identifiers and topic maps
In-Reply-To: <360C0929.A4DC3C69@locke.ccil.org> (message from John Cowan on
	Fri, 25 Sep 1998 17:20:41 -0400)
References: <360C0929.A4DC3C69@locke.ccil.org>
Message-ID: <199809252319.SAA01814@bruno.techno.com>

> Date: Fri, 25 Sep 1998 17:20:41 -0400
> From: John Cowan <cowan@locke.ccil.org>
> Organization: Lojban Peripheral
> 
> The current draft of ISO/IEC 13250, Topic Navigation Maps (a
> standard architectural form) explicitly takes Steven Newcomb's
> view of the meaning of owner identifiers in FPIs: scarcely surprising,
> since he is one of the editors.
> 
> In particular, clasue 6.1.1 reads in part:
> 
> # [T]he registration indicator, public text class, and language fields
> # are as specified for formal public identifiers in ISO/IEC 8879:1986.
> # The 'topic authority' [the string after the "+//" or "-//"] is
> # the owner of <em>the information resource</em> that defines the
> # concept.  [Emphasis added.]
> 
> SGML tribal elders (a set which I am not a member of) may wish to
> protest this language and similar language elsewhere in the draft
> before it becomes hard-coded in an ISO standard.

Yes.  John has done some homework and he has discovered why I brought
this matter up in this forum: in general I think it's better to get
consensus *before* a standard is published.  Several luminaries have
now argued in this forum against the FPI-based methodology we have
proposed in the Topic Navigation Map draft.  Now is the time to fix
this.  All reasonable suggestions are welcome; please suggest now.

What's needed is a way to reference authoritative materials as a way
of identifying "public topics".  A "public topic" is a concept or
subject that has a specifiable unique name in a specifiable namespace
created and/or managed by a specifiable authority on the topic, and
that is referenced as a public topic by anybody who wants to regard
the authority as an authority and the topic as a public topic,
regardless of whether the authority's namespace is online.  Really,
it's a bibliographic reference with certain very broad constraints and
used for a particular purpose.  I repeat my example:

Authority: Sears, Roebuck & Company
Namespace: 1922 Farm Catalog
     Name: [catalog number] R204

Should we be using a (subtype of?) bibloc for this purpose, perhaps
using the bibloc as a location source for a (subtype of?) namespace
location address?

[John, you are already on your way to becoming an SGML tribal elder,
yourself.  If you don't want that to happen, you'll have to try harder
to avoid being quite so helpful to the cause! (:^) ]

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From donpark at quake.net  Sat Sep 26 01:41:13 1998
From: donpark at quake.net (Don Park)
Date: Mon Jun  7 17:05:04 2004
Subject: Sun XML early access
Message-ID: <026501bde8dd$e7132550$2ee044c6@arcot-main>

>Has anyone had a chance to look at the early access Sun XML package yet?
>I'm just starting my explorations, and (so far, to me) it looks pretty
>intriguing.  The DOM interface is promising, and I'm very glad to see SAX
>support.

I have looked at the DOM implementation only and found that:

1. sibling-based navigation is considerablly slower than index-based
navigation.

Actually, this can be fixed rather easily by caching last-accessed index.

2. getElementsByTagName is not efficient.

getElementsByTagName is best implemented using lazy evaluation techniques
but Sun XML package does cache the evaluation result.  For example, the
getLength() method of the returned NodeList walks the whole subtree
everytime.  item() method also walks the subtree until the desired number of
nodes have been seen.

3. Output needs more design work.

One needs to subclass to override output and there is no built in support
for conversion to other formats.  For example, you can not write out a DOM
Document as HTML without some serious patching or overriding.

4. XML Bean concept is a little disappointing for what it does.

I am afraid I can not go into details about this due to Docuverse's own work
in this area but Sun needs to broaden their view of what a bean is.

I am quite sure that efficiency issues will all be resolved over time but
design issues are quite serious and deserves more attention.

Best,

Don Park
Docuverse


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Sat Sep 26 05:51:32 1998
From: ricko at allette.com.au (Rick Jellife)
Date: Mon Jun  7 17:05:04 2004
Subject: Public identifiers and topic maps
References: <360C0929.A4DC3C69@locke.ccil.org> <199809252319.SAA01814@bruno.techno.com>
Message-ID: <360C664A.2483A421@allette.com.au>

Steven R. Newcomb �g�D�G

>  ,.. Several luminaries have
> now argued in this forum against the FPI-based methodology we have
> proposed in the Topic Navigation Map draft.  Now is the time to fix
> this.

This problem I also came up against when writing my book, because Ineeded to make
up a lot of FPIs for other people's info. In the end
I took it to ISO WG8, and they definitely decided that ISO 9070 was
correct and ISO 8879 was ambiguous: it was fixed for WebSGML.

> What's needed is a way to reference authoritative materials as a way
> of identifying "public topics".

Do we need a public text type of TOPIC? You could have something like this

"+//IDN techno.com//TOPIC
Sears, Roebuck & Company:: 1922 Farm Catalog:: [catalog number] R204//EN//
www.techno.com/topics/sr1922r204"

In other words: the owner uses IDN, a public text type TOPIC is
defined, ISO 9070 "::" syntax is used in the name, and the
display version uses (part of a) URL.  I wonder if 9070 should be upgraded
to allow full URLs in the display text field?

I have a different problem with FPIs or URNs for topic names:
I think it is important that I should be able to assign a topic, like
the farm catalog above, even if I cannot locate the canonical form,
and there should be a chance my systems will work.

In other words, I hope any FPI convention for TNM would  not
preclude using fuzzy matching on names: in fact, I tend to
think the owner name should be the last thing looked at for
matching topics (in many situations). Of course, some users
need exact matches only, but as the web search engines prove,
there is a lot of usefulness in searching.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Mike_Spreitzer.PARC at xerox.com  Sat Sep 26 06:48:42 1998
From: Mike_Spreitzer.PARC at xerox.com (Mike_Spreitzer.PARC@xerox.com)
Date: Mon Jun  7 17:05:04 2004
Subject: Public identifiers and topic maps
In-Reply-To: <199809252319.SAA01814@bruno.techno.com>
Message-ID: <98Sep25.214825pdt."55474(2)"@alpha.xerox.com>

Am I right in understanding that, among perhaps other things, you're
challenging the idea (which I think is in the XML spec --- please correct me if
I'm wrong) that every (formal or otherwise) Public Identifier must be
resolvable to a URI?

Thanks,
Mike

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From h.rzepa at ic.ac.uk  Sat Sep 26 09:15:24 1998
From: h.rzepa at ic.ac.uk (Rzepa, Henry)
Date: Mon Jun  7 17:05:04 2004
Subject: Fwd: Why can't I post to XML-Dev?
Message-ID: <v04011700b2324644b0be@[155.198.8.3]>

Forwarded from From: David Brownell <db@Eng.Sun.COM>

>[ My posts to this list generally get swallowed somewhere, so
>I expect to need to forward this manually ... ]
>
>Tyler Baker wrote:
>> 
>> Michael Kay wrote:
>> 
>> > Actually I'm not sure subclassing Element with a "semantic"
>> > subclass, e.g. a business object such as Invoice, is the
>> > right approach, because you get a clumsy class heirarchy,
>> > and you invite the user to override methods inappropriately.
>
>The API to that feature is subject to change, more than most
>other parts of the library.  For example, it's not yet aware
>of namespaces.  We see such subclassing as only one of several
>tools that need to exist.  (Any tool can be misused!)
>
>One thing that comes from subclassing is the ability to modify
>the tree construction dynamically.  You can optimize in-memory
>representations easily -- e.g. removing ignorable whitespace and
>things like redundant representations of data (cutting memory
>use by quite a bit!).  Also, anyone who's really manipulating
>text will need more than DOM should even think of supporting.
>
>I think it'd be generally true that the semantic model that
>is used by an application would not always conform to the XML
>structure, exposed by DOM.  An example we've used is that of
>a 3D spreadsheet.  The internal representation will need to be
>highly optimized for most things; tables will be either dense
>or sparse arrays, with "slice" operations, for starters.  It
>could be fine to have such an optimized object "boxed" inside
>a DOM document, minimizing the need for document-specific
>navigation APIs and maximizing code reuse.  No need to have an
>XML-oriented representation of that core data also lingering
>around -- that's needed for externalization, period.
>
>
>> > I'd prefer to have the Element contain a "userObject"
>> > pointer to the business object.
>
>Such a delegation approach is necessary in any case, since
>it's important to integrate into frameworks that already
>define their own base class.  Is it sufficient?  Hmmm ...
>almost certainly not, IMHO.  Even so, it's on the roadmap.
>
>
>> I pretty much never ever subclass anything for typing purposes
>> except as a code reuse policy.  I have already learned the hard
>> way that this sort of approach builds highly inflexible and
>> highly inefficient systems.
>
>It also builds flexible and efficient ones.  One uses the right
>tool for the problem, and sometimes subclassing is that tool.
>
>
>>	  Every major Java ISV out there seems to get
>> this other than SUN who still seems to prefer the subclassing
>> approach in their  designs.
>
>Remember that this functionality is documented as "experimental"
>and subject to change ... also, that quite a few systems have
>had real success with this particular style of subclassing.
>
>Many XML related ones too!  COINS, IBM's XML4J, and more.  The
>Raven editor uses this (and other techniques), and a number of
>HTML display packages too.  I was intrigued by the statistics
>Steve Withall presented at the XML Developer's conference,
>showing that by far the bulk of the customization code was
>associated with elements.  (See his XML Testbed, on "xml.com"!)
>
>
>>	  Maybe this is for some potential
>> purpose of vendor lock in.  Who really knows.
>
>I do, I do!!  There are no such dark motives lurking here.
>
>- Dave
>

Henry Rzepa. +44 171 594 5774 (Office) +44 171 594 5804 (Fax)
http://www.ch.ic.ac.uk/rzepa/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Sat Sep 26 09:54:42 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:05 2004
Subject: Sun XML early access
References: <009a01bde88e$67570c40$1e09e391@mhklaptop.bra01.icl.co.uk> <360BA9ED.9192095@infinet.com>
Message-ID: <360C9CEB.7A1BA231@eng.sun.com>

In answer to Michael Kay's question, yes some folk
from Sun are listening on this list!

See one followup on this topic that's been forwarded
by Henry Rzepa under the subject "Why can't I post to
XML-DEV?".  (Answer:  Sun's internal mail system
changed a while back, and majordomo wanted me to use
a different e-mail address!)

More responses to come later!

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 26 10:21:46 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:05 2004
Subject: LISTRIVIA: posting problems
Message-ID: <3.0.1.16.19980926081324.0a6fdfec@pop3.demon.co.uk>

Occasionally some XML-DEV members appear to be unable to post to the list.
The process is controlled by the list software and depends critically on
the exact machine address from which the request was made. The software is
only partially under our control and we cannot always diagnose the problem. 

	If you are prevented from posting please mail Henry Rzepa
(h.rzepa@ic.ac.uk) or me and we will forward it.

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 26 10:21:49 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:05 2004
Subject: Request for Resources [Liora Alschuler]
Message-ID: <3.0.1.16.19980926080938.0a6f0514@pop3.demon.co.uk>

>From: Liora Alschuler <Liora@the-word-electric.com>


I would like to post a query regarding the list of authoring tools that I am
responsible for on XML.com
(http://www.xml.com/xml/pub/98/09/authortoolsintro.html). Specifically, I
would like to ask developers on the list if they have any additional
resources to list since I have included projects that are in beta, alpha,
and pre-alpha alongside those that are in double-digit release--many of
these I know of only through posts to xml-dev. I feel this is as much a
service to the developer community as it is to buyers and established
vendors -- perhaps more so since it costs nothing to be listed and listings
are extensive for small projects as well as large ones. It is also a method
for developers to get some initial feedback on their work since we have
given users the ability to post their comments.

	[Liora Alschuler]

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 26 10:21:54 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:05 2004
Subject: 10 September 1998 version of XML spec DTD and documentation
In-Reply-To: <199809142030.QAA06791@doctools.com>
Message-ID: <3.0.1.16.19980926082019.4b877446@pop3.demon.co.uk>

At 16:27 14/09/98 -0400, Eve L. Maler wrote:
>Hello folks-- You can now get access to the latest version of the W3C XML
>specification DTD and its documentation, at the following locations:
>
>DTD:		http://www.w3.org/XML/1998/06/xmlspec-19980910.dtd
>Documentation:	http://www.w3.org/XML/1998/06/xmlspec-report-19980910.htm
>
>Although the previous version was technically available in a public
>location, I'd be surprised if anyone unearthed it...  Please send any
>comments or questions to elm@arbortext.com.

Many thanks for this Eve. DTDs in XML are very valuable resources. 

Is anyone collecting XML DTDs? I seem to remember there was but forget whom...

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 26 10:22:04 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:05 2004
Subject: XML-DEV motives (was Re: XML IDL and XML RPC)
In-Reply-To: <199809230247.TAA24023@transbay.net>
Message-ID: <3.0.1.16.19980926092022.57af0ac8@pop3.demon.co.uk>

At 19:47 22/09/98 -0700, Rex Brooks wrote:
>Hi all,
>
>I only recently subscribed to this mailing list in order to better
>understand xml and how it was developing in relation to IDL and distributed
>computing in general. I am beginning to be gravely disappointed at the
>extent to which this is a list mainly populated by self-serving vendors and
>authors. Is this really true, or am I just stepping in during a
>particularly ugly patch of competing offers?

I act as 'moderator' of this list - I've been away for a few days and
haven't yet read the thread that you refer to. It's extremely uncommon for
competitive commercial postings on this list. We do (gently) encourage
factual product announcements when they seem to bring new functionality. 

XML-DEV was created as a list for developers [small 'd' - i.e. not
necessarily commercial] of XML applications and other resources. It has
managed to make considerable contributions in that way. Much of the
software and resources announced have had an OpenSource-like license (and
some contributors have changed their license in response to public pressure
from the list.)

XML-DEV has shown itself to be a virtual community which is capable of
extremely high-quality work. The SAX interface (read the history on
http://www.megginson.com/SAX) was developed in open process over a very
short timescale with ca. 100 contributors. SAX (IMO) avoided the potential
Babel of competing XML parser APIs and is universally adopted by commercial
and non-commercial developers. XSchema is currently going through the final
stages of a similar process. On many occasions discussion has resolved
confusion and identified valuable resources.

Historically XML started from the W3C which is a vendor-led consortium.
Mots of the original creators of XML (which included a 100-strong SIG) were
from commercial orgs - there are a handful of us (ca 5) who are academics.
It's not surprising that the early traffic on XML-DEV is from those people.
Many of them have made enormous personal contributions from their spare
time (XML-geeks don't have much of a life...) and very few are required to
post to XML-DEV by their employers.

What I think we really need for XML is an enthusiast community beyond the
commercial developers. There should be much more academic involvement
(especially grad students). There are wonderful projects simply crying out
to be hacked. I've urged this several times. How can we reach them? 

So, I think you may have hit a misleading patch :-). 

>
>Soon to unsubscribe:
>Rex Brooks

Please don't - and get some students involved !

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Sat Sep 26 10:39:52 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:05:05 2004
Subject: 10 September 1998 version of XML spec DTD and documentation
Message-ID: <04e301bde929$89a58920$d86118cb@caleb>

-----Original Message-----
From: Peter Murray-Rust <peter@ursus.demon.co.uk>
>Is anyone collecting XML DTDs? I seem to remember there was but forget
whom...

I am, for schema.net

Submissions invited.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Sat Sep 26 11:25:29 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:05:05 2004
Subject: XML-DEV motives (was Re: XML IDL and XML RPC)
In-Reply-To: <3.0.1.16.19980926092022.57af0ac8@pop3.demon.co.uk>
References: <3.0.1.16.19980926092022.57af0ac8@pop3.demon.co.uk>
Message-ID: <wk4stvw7k6.fsf@ifi.uio.no>


* Peter Murray-Rust
|=20
| What I think we really need for XML is an enthusiast community
| beyond the commercial developers. There should be much more academic
| involvement (especially grad students). There are wonderful projects
| simply crying out to be hacked. I've urged this several times. How
| can we reach them?

It may comfort you to know that at least in Norway this seems to be
happening now. Both the University of Troms=F8 and NTNU in Trondheim
have started academic projects where students investigate XML in
various ways.

>From what I can gather, both projects are for undergraduates, but
hopefully this is just the beginning.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Sat Sep 26 12:22:22 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:05:05 2004
Subject: XML-DEV motives (was Re: XML IDL and XML RPC)
Message-ID: <050501bde937$d222a1c0$d86118cb@caleb>

-----Original Message-----
From: Lars Marius Garshol <larsga@ifi.uio.no>
>* Peter Murray-Rust
>| What I think we really need for XML is an enthusiast community
>| beyond the commercial developers. There should be much more academic
>| involvement (especially grad students).

>It may comfort you to know that at least in Norway this seems to be
>happening now. Both the University of Troms? and NTNU in Trondheim
>have started academic projects where students investigate XML in
>various ways.

Here at Curtin University, I already teach XML as part of my Web Site
Management course and many students are picking XML for their project in
this unit. One of my students is going to be focusing on XML as part of his
Masters next year and I'm just about to embark on my PhD in this area too.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Sat Sep 26 13:50:11 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:05 2004
Subject: [Fwd: Sun XML early access]
Message-ID: <199809261149.HAA20553@hesketh.com>

Forwarded for David Brownell:

[ My posts to this list generally get swallowed somewhere, so
I expect to need to forward this manually ... ]

Tyler Baker wrote:
> 
> Michael Kay wrote:
> 
> > Actually I'm not sure subclassing Element with a "semantic"
> > subclass, e.g. a business object such as Invoice, is the
> > right approach, because you get a clumsy class heirarchy,
> > and you invite the user to override methods inappropriately.

The API to that feature is subject to change, more than most
other parts of the library.  For example, it's not yet aware
of namespaces.  We see such subclassing as only one of several
tools that need to exist.  (Any tool can be misused!)

One thing that comes from subclassing is the ability to modify
the tree construction dynamically.  You can optimize in-memory
representations easily -- e.g. removing ignorable whitespace and
things like redundant representations of data (cutting memory
use by quite a bit!).  Also, anyone who's really manipulating
text will need more than DOM should even think of supporting.

I think it'd be generally true that the semantic model that
is used by an application would not always conform to the XML
structure, exposed by DOM.  An example we've used is that of
a 3D spreadsheet.  The internal representation will need to be
highly optimized for most things; tables will be either dense
or sparse arrays, with "slice" operations, for starters.  It
could be fine to have such an optimized object "boxed" inside
a DOM document, minimizing the need for document-specific
navigation APIs and maximizing code reuse.  No need to have an
XML-oriented representation of that core data also lingering
around -- that's needed for externalization, period.


> > I'd prefer to have the Element contain a "userObject"
> > pointer to the business object.

Such a delegation approach is necessary in any case, since
it's important to integrate into frameworks that already
define their own base class.  Is it sufficient?  Hmmm ...
almost certainly not, IMHO.  Even so, it's on the roadmap.


> I pretty much never ever subclass anything for typing purposes
> except as a code reuse policy.  I have already learned the hard
> way that this sort of approach builds highly inflexible and
> highly inefficient systems.

It also builds flexible and efficient ones.  One uses the right
tool for the problem, and sometimes subclassing is that tool.


>	  Every major Java ISV out there seems to get
> this other than SUN who still seems to prefer the subclassing
> approach in their  designs.

Remember that this functionality is documented as "experimental"
and subject to change ... also, that quite a few systems have
had real success with this particular style of subclassing.

Many XML related ones too!  COINS, IBM's XML4J, and more.  The
Raven editor uses this (and other techniques), and a number of
HTML display packages too.  I was intrigued by the statistics
Steve Withall presented at the XML Developer's conference,
showing that by far the bulk of the customization code was
associated with elements.  (See his XML Testbed, on "xml.com"!)


>	  Maybe this is for some potential
> purpose of vendor lock in.  Who really knows.

I do, I do!!  There are no such dark motives lurking here.

- Dave


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Sat Sep 26 13:50:14 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:05 2004
Subject: [Fwd: Sun XML early access]
Message-ID: <199809261149.HAA20556@hesketh.com>

Forwarded for David Brownell:

Tyler Baker wrote:
> 
> The SAX support is solid and the only SAX Parser that is faster that I am
aware
> of is XP.  This is understandable since I have personally found that XP's
I/O
> routines are tons faster than the java.io classes which the SUN Parser uses.

I/O is only one factor.  I think you'll find that diagnostics are much more
useful from Sun's parser.  If we hear performance is an issue, we can put
more work into that, too.

Comparing validating parsers is also interesting.  I don't think there's a
faster validating parser, in 100% Pure Java, generally available today.


> Everything in the com.sun.xml.tree package (including the DOM support) is
poorly
> implemented (and that is a claim I make without even looking at the source).
> When parsing Jon Bosak's ot.xml I get out of memory errors on the second
try,
> even after trying to reclaim memory from the first try (I had to write my
own
> benchmarks as the there were none supplied.).  Besides the memory problems,
> building the DOM tree is incredibly slow.  Better off to stick with
something
> else like Docuverse for now.

The release notes point out that object model support hasn't been tuned
for space utilization; that's different from "poorly implemented"!  And
along the same lines, this is "early access" -- as in, alpha test, so
honest bugs are to be expected.  (Report them to the feedback alias.)

Since it doesn't maintain any static state, I'd suspect that your benchmark
is saving something which causes that error.  If not, we'd certainly fix
such a bug in our code!


> The package is pretty much just SAX and DOM support and some experimental
stuff
> dealing with beans.

"Pretty much" -- but do look at the rest.  It supports the current draft
of XML namespaces, and lets you write DOM objects out as XML text (which
DOM doesn't provide).  DOM doesn't give access to the ID attribute of a
node (unless you know what DTD it's using!), so you can't implement full
XSL or XPointer with just DOM.  And there's more.

- Dave


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Sat Sep 26 13:58:46 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:05 2004
Subject: [Fwd: Sun XML early access]
In-Reply-To: <199809261149.HAA20553@hesketh.com>
Message-ID: <199809261158.HAA20607@hesketh.com>

At 07:50 AM 9/26/98 -0400, Simon St.Laurent wrote:
>Forwarded for David Brownell:

Apologies to all; one of these messages was already forwarded by Henry
Rzepa under a different subject header (Fwd: Why can't I post to XML-Dev?).
 The other one's new, though.

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Sat Sep 26 14:21:38 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:05 2004
Subject: Java/XML/SUN (was: Re: Fwd: Why can't I post to XML-Dev?)
In-Reply-To: <v04011700b2324644b0be@[155.198.8.3]>
Message-ID: <3.0.1.16.19980926131604.2ff7e1b0@pop3.demon.co.uk>

At 08:21 26/09/98 +0100, Rzepa, Henry wrote:
>Forwarded from From: David Brownell <db@Eng.Sun.COM>

Thanks very much David - haven't yet had time to look at the classes. Some
simple comments on your comments:

>>
>>One thing that comes from subclassing is the ability to modify
>>the tree construction dynamically.  You can optimize in-memory
>>representations easily -- e.g. removing ignorable whitespace and
>>things like redundant representations of data (cutting memory
>>use by quite a bit!).  Also, anyone who's really manipulating
>>text will need more than DOM should even think of supporting.

I certainly find this an important point. Some objects (like HTML/IBTWSH)
can contain a large number of nodes. Tree-building (using
com.sun.java.swing.tree.DefaultMutableTreeNode) seems expensive when there
are lots of nodes and I now find myself writing subclasses to manage these.
For example jumbo.xml.data.HTMLNode is subclassed from DMTN and will hold
the HTML element as a serialised String. When it needs display it can be
unpacked. 

>>I think it'd be generally true that the semantic model that
>>is used by an application would not always conform to the XML
>>structure, exposed by DOM.  An example we've used is that of
>>a 3D spreadsheet.  The internal representation will need to be
>>highly optimized for most things; tables will be either dense
>>or sparse arrays, with "slice" operations, for starters.  It
>>could be fine to have such an optimized object "boxed" inside
>>a DOM document, minimizing the need for document-specific
>>navigation APIs and maximizing code reuse.  No need to have an
>>XML-oriented representation of that core data also lingering
>>around -- that's needed for externalization, period.

Agreed. And for this sort of thing (e.g. Molecules) I pack the XML event
stream into a more suitable representation. It means that for these type of
classes I use a set of routines such as:
	processXML()	- called in SAX to pack (and verify) elements
	processEventStream()	- outputs the internal representation as an XML
stream (variations such as prettyprinting, whitespace, etc. can be controlled)
	getDisplayComponent()	- returns an editable JComponent

It would be great if we could standardise on the API for this sort of
thing. Then element-oriented programming could become really attractive.
The domain-specific classes could use a standard core facility.


	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Sat Sep 26 17:10:29 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:05 2004
Subject: [Fwd: Sun XML early access]
Message-ID: <003801bde960$093bfb20$02000003@thing1.camb.opengroup.org>

>> > I'd prefer to have the Element contain a "userObject"
>> > pointer to the business object.
>
>Such a delegation approach is necessary in any case, since
>it's important to integrate into frameworks that already
>define their own base class.  Is it sufficient?  Hmmm ...
>almost certainly not, IMHO.  Even so, it's on the roadmap.


This is part of the capability being added to Coins release 2 
(Early access now, general release end of this month).

Coins will wrap user objects, with support for things like 
EventListener registration between user objects specified in
the CSchema as a kind of refId attribute.

What I'm trying to do with it is demonstrate the use of XML to
compose an awt program from a mix of standard awt 
components and user application code.

Bill


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From lisarein at finetuning.com  Sat Sep 26 20:55:02 1998
From: lisarein at finetuning.com (Lisa Rein)
Date: Mon Jun  7 17:05:05 2004
Subject: resources on xml.com
Message-ID: <360D426B.107BAB7F@finetuning.com>

hey everybody -- i just wanted to let everyone know that if you don't
see something listed under the resources at XML.com, it is most surely
an oversight, not a slighting in any way (for example, i just realized
that i didn't have ibm's rdf for xml in there under rdf parsers -- what
i myself would consider a HUGE oversight....)

so it would be a great help to me when you casually notice stuff like
that to shoot me an email ok?  even for little stuff like the Lotus/LDO
thing (which really isn't THAT little) or the fact that i somehow missed
that new xml DTD until today -- which i think is quite the big deal!

Also --  if you do searches for subjects where certain resources don't
come up that "should"  or do come up that "shouldn't" -- that really
helps because i have been working quite extensively on finetuning the
search engine so it will be....ya know....useful :-)

thanks,

lisa rein

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Sat Sep 26 21:04:10 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
In-Reply-To: <199809252319.SAA01814@bruno.techno.com>
References: <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
Message-ID: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>

At 06:19 PM 9/25/98 -0500, Steven R. Newcomb wrote:

>What's needed is a way to reference authoritative materials as a way
>of identifying "public topics".  A "public topic" is a concept or
>subject that has a specifiable unique name in a specifiable namespace
>created and/or managed by a specifiable authority on the topic, and
>that is referenced as a public topic by anybody who wants to regard
>the authority as an authority and the topic as a public topic,
>regardless of whether the authority's namespace is online.  Really,
>it's a bibliographic reference with certain very broad constraints and
>used for a particular purpose.  I repeat my example:

I think there's two different things being talked about here:

1. Topics that are "published resources", that is, a thing that the creator
of the thing has made public in some way.  One way to do this is to
announce to the world "I have defined a topic called '+//...//EN' which
refers to the idea of blah blah blah".  Note that "the idea of blah blah
blah" is the primary form of the resource that the name "+//.../EN" is
mapped to as indicated by the message from the publisher ("which refers to").

2. Names that are known to map to topics.  

Note that the owner of the resource (the idea that was published) need not
be the owner of the name or the name space within which the name occurs.  

For example, say Steve has decided to provide the service of cataloging
public topics and provides a registration service by which publishers of
topics can request that Steve catalog their topics.  Steve has registered
the owner name "technoteacher.com", so he owns that name space and all
names within it.  I call Steve and ask to register my topic.  I give to
Steve my authoritative description of the topic ("the idea blah blah
blah"). Steve assigns a name and creates an entry in his catalog that looks
like this:

+//IDN technoteacher.com//DOCUMENT ABCD.1234-466 QZ2//EN := 
   "The idea blah blah blah"

Steve owns the name but I own the resource.  Nothing in the name indicates
who owns the resource, in this case. (It could, but that would be up to
Steve and his design for a cataloging scheme).

Note also that there is no meaningful, namable, thing that is an "abstract
concept" as soon as the description of that concept gets recorded in some
reasonably permanent and retrievable form.  Thus, it's not meaningful to
have a name for an "abstract topic" without having some authoritative
definition of what that topic is. Because there must always be a
description (even if it's "call Eliot and ask him what this topic is all
about") there will always be at least one resource for the name of that
topic to map to.  Of course, it is the responsibility of the owner of the
idea to declare and publicize what that resource is.  

In the example above, the right-hand side of the catalog *is* the resource,
that is, a textual description of the topic. But what if I have a Web page
that I consider to be the authoritative definition of the topic? I would
have given that to Steve instead, making his catalog:

+//IDN technoteacher.com//DOCUMENT ABCD.1234-466 QZ2//EN := 
   "http://www.drmacro.com/topics/blah-blah-blah/"

In this case, the name for the authoritative definition of the topic is one
I own (because I own drmacro.com). But I could have also let someone else
serve my definition, so I don't necessarily have to own that name either.

So, ownership of the name of a topic (or any other resource) *cannot
predict* ownership of the resource.  Ownership of resources is managed by
means other than names within computer-addressible data spaces. It is
managed by contracts and property law and lawyers and burly guys named
Guido who will break your legs if you touch the definition of my topic.

Thus, while it is possible for me to own the name of a topic I own, it is
not necessary for me to own the name of a topic I own.  

Let's say that the world accepts Steve's catalog as the authoritative
source for finding published topics (just as we accept the Library of
Congress as an authoritative source for finding published books).  When I
announce to the world that I've published a topic, I can provide the name
from Steve's catalog as a way to refer to it. But that can't prevent anyone
else from assigning their own name to my topic.

I don't think its reasonable to argue that the topic *is* the name because
names, even formal public identifiers, can't practically contain enough
information to usefully communicate the idea of the topic in all (or even
most) cases.  But even then, ownership of the name still wouldn't be
predicable from the name itself because, as objects, names can be bought
and sold.  If anyone wants a URL on the drmacro.com site, I'll sell it to
them for a very reasonable price. Of course, if they want to have a
resource at the end of the URL, that's extra. And persistence is an
additional fee...

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Sat Sep 26 21:50:52 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
In-Reply-To: <98Sep25.214825pdt."55474(2)"@alpha.xerox.com>
	(Mike_Spreitzer.PARC@xerox.com)
References: <98Sep25.214825pdt."55474(2)"@alpha.xerox.com>
Message-ID: <199809301835.NAA02112@bruno.techno.com>

> From: Mike_Spreitzer.PARC@xerox.com
> X-NS-Transport-ID: 0000AA0089EAEAF539CF
> Date: Fri, 25 Sep 1998 21:47:51 PDT
> cc: xml-dev@ic.ac.UK
> 
> Am I right in understanding that, among perhaps other things, you're
> challenging the idea (which I think is in the XML spec --- please correct me if
> I'm wrong) that every (formal or otherwise) Public Identifier must be
> resolvable to a URI?

Well, I've been *asking* more than I've been *challenging*, but yes,
I think that's one of the things I've been asking.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From thillai at ix.netcom.com  Sun Sep 27 00:48:03 1998
From: thillai at ix.netcom.com (Thillai)
Date: Mon Jun  7 17:05:06 2004
Subject: XML editor
Message-ID: <01BDE97D.2D64B380@thillai>

Hi,

I tried MS XML Notepad with IE5 installed in my computer.  When I create
the XML document it is not doing any validation.  Only after storing it and
loading it,  the XML document is validated based on DTD.

Is there any XML editor which does validation based on DTD when I create
the XML document. (after assoicating a DTD)   For e.g if I try to add a 
invalid child it should give warning or based on DTD it should list what are all 
the valid children for a particular node.  (Is it possible??)

Thillai
AT&T
Middletown, NJ


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Sun Sep 27 02:11:25 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
In-Reply-To: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	(eliot@dns.isogen.com)
References: <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org> <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
Message-ID: <199809302215.RAA02155@bruno.techno.com>

[Eliot Kimber:]

> I think there's two different things being talked about here:

[ ... and lots of other good stuff with which I agree.]

But, Eliot, your note does not address the problem we're trying to 
solve here.

Consider the whole universe of information.  My example, a description
of an obsolete farming implement in an obsolete farm catalog of which
no single copy may even exist any more, was intended to get you to
think "outside the box."  Strong-minded person that you are, it didn't
work.

[Even so, in response to what you say, I feel compelled to point out,
perhaps irrelevantly, that names cannot be owned in any meaningful
sense.  If it were true that *names* were ownable, then there would be
an awful lot of names that we wouldn't be allowed to mention or use.
Only the *meaning* or *referent* of a name can be owned.  Namespaces
can be owned: the names inside them are the meaning of the name of the
namespace.  The most important namespaces are not owned.  But I
digress, and I think you agree with that anyway.  I am reminded of the
millions spent by Xerox Corporation (with only limited success) to
prevent "xerox" from becoming a synonym for "photocopy".]

So here's another example: Lake Geneva.  What namespace does the name
"Lake Geneva" exist in?  Who owns that namespace?  If, for Joe Author,
Lake Geneva (the lake itself, not just its name) is a topic, how
should Joe Author refer to it?  (In fact, the Lake Geneva example
points up another interesting aspect of the problem.  In France, the
very same lake is called "Lac Leman".  Two names, one lake.)  Joe
Author needs to point at the Lake itself as a topic, and he needs to
do it in a way that will be maximally useful to unknown others for
figuring out what it is that he's regarding as this topic.  Nobody is
ever going to "resolve" this pointer; if somehow they did resolve the
pointer, a flood of living water would come pouring out of the CRT, or
the user would be teleported into the lake and be drowned.  That's not
what we're trying to accomplish here.  We're merely trying to find a
way to tell others enough information to allow them to have a prayer
of figuring out whether or not two citations of the topic in question
are really about the same topic, and THAT'S ALL.

Anybody who regards Lake Geneva as a topic must use some authoritative
reference work that uses some sort of cataloguing system that endows
this lake with a unique name.  It can't matter at all whether the
authority has created an FPI, URN, or any other "standard" way of
referencing Lake Geneva.  Typically, the authority will not have done
any such thing, and there is absolutely no way to compel any authority
to do so!  It's only necessary to identify the authority, the
namespace in which this unique identifier exists, and the unique
identifier.  It's not important, at least for the next few centuries,
that there be only one authority, namespace, or name that everyone
must use; this is obviously an impossible (not to mention hopelessly
naive) goal.

> 1. Topics that are "published resources", that is, a thing that the
> creator of the thing has made public in some way.  One way to do
> this is to announce to the world "I have defined a topic called
> '+//...//EN' which refers to the idea of blah blah blah".  Note that
> "the idea of blah blah blah" is the primary form of the resource
> that the name "+//.../EN" is mapped to as indicated by the message
> from the publisher ("which refers to").

Yes, but normally the authority will not be so cooperative, and it
will neither know nor care that Joe Author uses or needs to use one of
its catalog numbers to refer to Lake Geneva.  This is neither a
copyright issue nor any other kind of legal problem.  The authority
gave Lake Geneva that catalog number; Joe didn't.  Joe just needs to
use it, and there is no reason why he should not be permitted to do
so.  Moroever, there's no reason why other people should not be able
to understand what Joe had in mind when he used it.

Everybody who creates a topic map needs to decide for themselves whose
names and namespaces they choose to regard as authoritative.  Joe
Author must, in all cases, be the ultimate meta-authority who decides
what authority he will regard as authoritative for the purpose of
helping him to refer to a topic.  Joe Author's choice of authority
will normally be made on the basis of his assessment of what is most
likely to be meaningful to the topic map's intended audience(s).

> For example, say Steve has decided to provide the service of
> cataloging public topics and provides a registration service by
> which publishers of topics can request that Steve catalog their
> topics.

In the general case this is much too hopeless a cause to base a
business on.  For most practical cases, there are already numerous
authorities.  Nobody will use my catalog number for Lake Geneva when
there are so many cartographers, water resource catalogers, almanacs,
government agencies, travel agencies, etc. etc. whose published
materials are far more accessible and far more authoritative than
anything I could ever do.  Michelin springs to mind, as does the US
Defense Mapping Agency and the World Almanac.  All are perfectly good
authorities.

> Steve has registered the owner name "technoteacher.com", so he owns
> that name space and all names within it.  I call Steve and ask to
> register my topic.  I give to Steve my authoritative description of
> the topic ("the idea blah blah blah"). Steve assigns a name and
> creates an entry in his catalog that looks like this:

> +//IDN technoteacher.com//DOCUMENT ABCD.1234-466 QZ2//EN := 
>    "The idea blah blah blah"

> Steve owns the name but I own the resource.  Nothing in the name
> indicates who owns the resource, in this case. (It could, but that
> would be up to Steve and his design for a cataloging scheme).

As a practical matter, I'm not gonna do this, and neither is anybody
else.  Anyway, I'm not interested in cataloging a document, per se.
I'm interested in a *topic*, and the only reason I'm interested in
documents is that I may choose to use a document that authoritatively
provides that topic with a unique identifier in order to refer
unambiguously to that topic.

> Note also that there is no meaningful, namable, thing that is an
> "abstract concept" as soon as the description of that concept gets
> recorded in some reasonably permanent and retrievable form.

I don't understand this statement at all.  Abstract concepts don't
cease to exist whenever someone defines or describes them.

> Thus, it's not meaningful to have a name for an "abstract topic"
> without having some authoritative definition of what that topic is.

I disagree.  A topic can exist regardless of whether it has a name and
regardless of whether it has been described.  Take away all of Lake
Geneva's names and descriptions, and you still have a lake -- the very
same lake, in fact.  (I admit that, under such circumstances, you
can't reference it without standing in front of it and pointing at it
with your finger.  Still, it exists.)

> Because there must always be a description (even if it's "call Eliot
> and ask him what this topic is all about") there will always be at
> least one resource for the name of that topic to map to.

Most topics are not owned by anyone.

> Of course, it is the responsibility of the owner of the
> idea to declare and publicize what that resource is.  

Which Sears, Roebuck & Company in fact already did in their 1922 Farm
Catalog.  If Joe Author chooses to regard that publication as the
authoritative disambiguator of what he's talking about, how should
he do it?  *That* is the question I'm trying to pose here.

If your answer is that Joe Author should declare his own namespace in
which "Sears, Roebuck & Company 1922 Farm Catalog Number R204" is a
meaningful name, I reply to you that that all makes perfect sense,
except for the part about Joe Author having to declare his own
namespace.  There is no point in comparing the name "Sears, Roebuck &
Company 1922 Farm Catalog Number R204" in Joe's namespace with any
name in any other author's namespace -- they're different namespaces,
after all, and any similarity in any two names in the two different
namespaces is, by definition, coincidental.  The whole value of
mentioning Sears at all stems from the fact that "Sears, Roebuck &
Company" is a name in the namespace that all of us who breathe oxygen
and spend money in North America hold in common.  Nobody owns this
namespace.  "Sears, Roebuck & Company" is a name whose *meaning* or
*referent* (a certain retail merchandising company) belongs to a
certain group of stockholders, but the name itself belongs to all of
us and it appears in a namespace that belongs to nobody (or
everybody).  The name "Sears, Roebuck & Company" is meaningful only in
that common culturally-determined namespace.

In other words, ISO standard numbers, W3C Recommendations, Internet
domain names, ISBNs, ISSNs, and Library of Congress Catalog Numbers
are all special cases -- namespaces that happen to be specially
recognized by existing formalisms for referencing documents.  (And
even they themselves are meaningless except by virtue of the fact that
we all share a common culture in which they are meaningful.)  The
overwhelming majority of topics don't have unique identifiers in any
of those specially-recognized namespaces, and topics aren't document,
anyway.  What is needed is a much more generalized capability -- one
that begins its location ladder in the common unnamed namespace of the
culture from which the topic map springs, and which can identify any
namespace "commonly" used in that culture.  This requirement echoes
and extends a similar thought that appeared earlier in this same
thread:

[Eliot Kimber:]

> ...here's what I'd like to see happen:
> 
> 1. A general recognition of the need for name-space/name bindings in
> data representation standards, regardless of the kind of data.  If
> these bindings are further standardized along the URN lines (its
> semantics, not its syntax, necessarily), so much the better.

I've been thinking that Joe Author should just create his own FPIs for
public topics.  If, as several have said on this list, Joe Author
cannot be trusted to create his own FPIs, or if the philosophy of FPIs
would be undermined by such a practice, what should Joe Author do?  We
need to fulfill this requirement, and if FPIs can't or shouldn't do
the job, we need to create something that will.

BTW, I like all of Rick Jelliffe's suggestions, which are not very
different from what is now in the Topic Navigation Map draft.  If I
understand them, collectively they amount to an enhanced FPI syntax.
Given a new public text type, "TOPIC", and copious use of "::" as a
field separator, can y'all countenance the use of FPIs to refer to
public topics?

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tln at insect.sd.monash.edu.au  Sun Sep 27 08:45:56 1998
From: tln at insect.sd.monash.edu.au (Thuy-Linh Nguyen)
Date: Mon Jun  7 17:05:06 2004
Subject: Help on XML4J1.0.4
Message-ID: <Pine.GSO.3.96.980927144340.6306F-100000@insect.sd.monash.edu.au>

Hi !

Could someone give me a clue on how to solve this:

I'm writing a servlet which imports com.ibm.xml.parser.*. I got a
"NoClassDefFoundError" as soon as I try to create a parser:

Parser ps = new Parser("myfile.xml");

Writing similar thing as an application seems to work ok.

Thanks !
TL


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From abcoates at ozemail.com.au  Sun Sep 27 09:05:08 1998
From: abcoates at ozemail.com.au (Anthony B. Coates)
Date: Mon Jun  7 17:05:06 2004
Subject: [ANN] Literate Programming & XML site has moved.
Message-ID: <199809270704.RAA20242@fep6.mail.ozemail.net>

"xml-litprog-l" has moved
=====================
Some of you may be aware of the "Literate Programming & XML" mailing list that
I set up.  I have recently had to change cities and jobs, and that has required
the mailing list to move.  The new Web site for the mailing list is

<http://www.allette.com.au/xml-litprog/>

and from here you can find out how to subscribe and/or post to the new list. 
Send any queries directly to "abcoates@ozemail.com.au".

	Cheers,
			Tony.

** Anthony B. Coates
** Software Engineer (Java).  This is a 100% Pure Java e-mail.
** <mailto:abcoates@ozemail.com.au>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mtbryan at sgml.u-net.com  Sun Sep 27 12:32:53 1998
From: mtbryan at sgml.u-net.com (Martin Bryan)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
Message-ID: <033001bdea01$f0fb2660$bcbc77c1@sgml.u-net.com>

Steve

>I've been a bit worried about this so I've been floating trial
>balloons about it in the xml-dev mailing list.  There is considerable
>sentiment to the effect that we're making a mistake here.

I've been trying to track this work while I have been away this week. It has
been very frustrating because most of the comments seem to be ill-informed
in that they were unaware of the main intention of the public attribute,
which you explained to me as being to allow the meaning of topic navigation
maps to be documented. We must bear this foremost in our mind when
considering the role of this attribute and the FPIs it references.

The second thing we need to bear in mind is the definitions of FPIs. The
following points need to be borne in mind:

What we are referring to are basically problems related to owner
identifiers, which are defined in ISO 8879 as "The portion of a public
identifier that identifies the owner or originator of public text", public
text being defined as "Test that is known beyond the context of a single
document or system environment, and which can be accessed with a public
identifier". No there are two questions that need discussion here:
i) what is the difference between an owner and an originator?
ii) what does "can be accessed with a public identifier mean?

I will come back to these questions shortly.

Eliot wrote:

>3. That existing naming schemes such as SGML formal public IDs can be used
>within a URN context if you're willing to escape lots of special characters
>(but we're used to that with URLs anyway).
>
>4. That URNs cannot be generally used today because there is no
>generally-available resolution service.  There is a "Real Soon Now" promise
>of a service, at least experimentally, but no indication from what I found
>that one is available for general use.


However, the existence of a trial resolution service for Digital Object
Identifiers (DOIs) shows us the way to go. This service, which has been put
together by CNRI, who seem to 'own' the whole process for URL
identification,  suggests that we should be able to use urn:fpi, urn:idn or
just fpi: or, better still, idn:, to provide a resolution service for XML
users. If we amend the ISO 9070 spec so that one of these is used in place
of/addition to the +//IDN currently used to identify the use of internet
domain names as owner identifiers we should be able to get an easily
automatable service up and running soon after identifying a sponsor for the
service (e.g. GCA)

Paul Prescod wrote
>An FPI is persistent because ISO legally constracts to not reassign them.
>This may mean nothing technically, but neither does the fact that the
>American government legally asserts that e-commerce transactions based on
>USD have value. If either ISO or the American government goes out of
>business, their promises are worthless, but by then we'll have other,
>bigger problems than our links breaking (which, I guess, is the real
>point).

FPIs have no legal status. Registered ones must have the names of their
ownders declared via the GCA, but unregistered ones have no such
constraints, and there in no guarantee that they will be unique. This does
not, however, stop them being useful.

Eliot riposted:
>SGML Formal public identifiers are not necessarily persistent names because
>there is nothing in ISO 8879 or ISO 9070 that requires them to be (nor
>could such a requirement be enforced or validated).  All that ISO 9070
>provides is a process for registering *owner identifiers*, which are,
>presumably, persistent (at least as defined by the assigning body).
>However, the name owner is responsible for managing the names within their
>slice of the FPI name space and can do whatever they want with them,
>including reassigning them without regard for persistence at all.


The FPIs used in public attributes in topic navigation maps do not need to
be persistent: they do need even need to be resolvable. They do need to be
"researchable": you should be able to find a copy of the original definition
somewhere to be able to ensure that you are using the topic correctly.

>According to the current Topic Navigation Map draft (soon to be CD
>13250), this would appear as the following FPI:
>
>-//Sears, Roebuck & Co.//NONSGML TOPIC 1922 Farm Catalog Number : R205//EN

Part of the porblem with this whole set of correspondence is the ineptness
of this example FPI (sorry Steve). Let me suggest what it should have been:


-//TechnoTeacher//NONSGML TOPIC Sears, Roebuck & Co 1922 Farm Catalog
Number: R205//EN

If this form had been used originally the discussion would never have
started. The point is that the owner of the FPI is not the owner of the
referenced data, but the owner of the public identifier. (See 8879
definition given above). Claiming to represent Saer Roebuck is not only
morally illegal, it is illegal in terms of 8879.

Steven Newcomb wrote:
>> Will URNs permit pointing to things that aren't now and may never be
>> on the web? I mean, things that their owners never intended to be on
>> the web and either that their owners do not want to appear on the web,
>> or that their owners may not (currently) see any interest in putting
>> on the web?


URNs may not permit pointing to thins that are not on the web, but Digital
Object Identifiers will. They will even allow you to point to information
resources that will exsit in the future (one of their biggest advantages
from the marketeer's viewpoint). Resolving a DOI can send you to web
resources that a) allow you to order a copy of the document when it becomes
available b) provide you with name and address of someone who can tell you
where to find the data you need or, if really necessary, c) point you to a
set of electronic copies of the document and ask you which format you would
like to purchase it in. They do not need to resolve to the actual data.

Eliots claim that:
> Public identifiers, and formal public identifiers in particular, are just
>a special case of URN.
seems wrong to me. URNs are designed for electronic access. Now, depending
on how you interpet the 8879 definition of public text (see above) I am not
certain that FPIs need to reference electronically accessible text in the
way URNs (or at least URLs) are designed to do. To support this argument let
me point out that a special class of FPIs was created to reference data
within ISO standards, yet ISO does not allow electronic access to its
standards. As the Internet did not exist in 1986 I would argue that the term
"accessed" as used in 8879 should not be read as "electronically accessed
via a network". At best it can be interpreted as being "electronically
accessed in some local system dependent way" (e.g. via a catalog).

 Eliot also said
> Therefore, the PUBLIC/SYSTEM distinction made by
>SGML (and XML) is inappropriate as a matter of syntax.  A name is a name
>and there should be exactly one declared for each entity.

While this may be true in an XML context, where FPIs only apply to entity
declarations, it does not make sense in the sense they are being used in the
topic navigation mao specification, where they are *only* being used to
indirectly document the meaning of a topic. Now whether we should claim that
the names of the public identifiers used for this purpose should be
implicitly similar to those used for identifying entities containing markup
declarations or replacement text is another question. Some form of
structured name is required. Basing this structure on the existing structure
for FPIs makes sense to me.

Steve Newcomb wrote:
>But what is the correct solution, in your mind?  The only alternative
>I know of is the HyTime "bibloc" architectural form.  In your opinion,
>John, should there be a similar architectural form (in XML jargon,
>"template") in XML?  Or do you have another proposal for indicating
>bibliographic references?


This just would not be acceptable as a methodology for documenting topic
maps, which, I repeat, is what the public attribute is all about.

The comment that you originally quoted (from Paul Prescod I believe, though
it might have been John Cowan) was;
>> # [T]he registration indicator, public text class, and language fields
>> # are as specified for formal public identifiers in ISO/IEC 8879:1986.
>> # The 'topic authority' [the string after the "+//" or "-//"] is
>> # the owner of <em>the information resource</em> that defines the
>> # concept.  [Emphasis added.]


Note that what is bieng complained about here is the definition of topic
authority. This term is currently undefined in ISO 13250. To my mind adding
the following definition to Clause 4 would solve this misunderstanding:

Topic Authority
The person or organization responsible for the maintenance of the topic map.

The emphasised part of the definition is wrong and should be corrected in
ISO 13250. It should be replaced by "the public identifier used to reference
the information resource".

If we make this small change then we do not need to take in Ricks suggestion
at all (though it is a good one).


Rick Jelliffe wrote:
>Do we need a public text type of TOPIC?

You most definitely do.

>You could have something like this
>
>"+//IDN techno.com//TOPIC
>Sears, Roebuck & Company:: 1922 Farm Catalog:: [catalog number] R204//EN//
>www.techno.com/topics/sr1922r204"


You do not need the part after the EN. As shown above, it is simply enough
to identify the Topic Authority at the start: you do not need to identify
where that authority has stored an electronic copy of the definition, which
is all Rick's useful extension adds to what is suggested above.

As Rick said:
>I think it is important that I should be able to assign a topic, like
>the farm catalog above, even if I cannot locate the canonical form,
>and there should be a chance my systems will work.


This is what the public topic references in clause 6.1.1 of ISO 13250 are
designed to do. Mixing this up with other useers of FPIs has muddied the
water so much that everybody seems to have missed the illumination that this
vital option adds to topic navigation maps.

Martin Bryan


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mtbryan at sgml.u-net.com  Sun Sep 27 13:59:24 1998
From: mtbryan at sgml.u-net.com (Martin Bryan)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
Message-ID: <034501bdea0e$09851400$bcbc77c1@sgml.u-net.com>

Steve wrote:
>Everybody who creates a topic map needs to decide for themselves whose
>names and namespaces they choose to regard as authoritative.  Joe
>Author must, in all cases, be the ultimate meta-authority who decides
>what authority he will regard as authoritative for the purpose of
>helping him to refer to a topic.  Joe Author's choice of authority
>will normally be made on the basis of his assessment of what is most
>likely to be meaningful to the topic map's intended audience(s).


But it is Joe Author who is the owner of the public text. He's the one with
access to the copy of the document that is being referenced. It is what is
said in his copy of the source that matters.

Let me ask you this. If Joe Author defines to make the subject of his topic
something in the Book of Kells, who should he cite as the owner: the monk
who wrote the book, Eadfrith, Bishop of Lindersfarne at the time it was
written, Lindersfarne Abbey (abandoned following Danish raids in 875),
Trinity College Dublin who I believe hold the original when it is not out on
loan, the local library where Joe Author borrowed a facsimile from to look
up the topic, or the TEI initiative's electronic encoding of the Book of
Kells?

>> Steve owns the name but I own the resource.  Nothing in the name
>> indicates who owns the resource, in this case. (It could, but that
>> would be up to Steve and his design for a cataloging scheme).

And nothing needs to. What the public text should do is resolve who you
should go to for the official definition, and how he will recognize one such
definition from another in his local catalogue of meanings. This is what
FPIs do.


Martin Bryan


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mtbryan at sgml.u-net.com  Sun Sep 27 14:31:38 1998
From: mtbryan at sgml.u-net.com (Martin Bryan)
Date: Mon Jun  7 17:05:06 2004
Subject: Public identifiers and topic maps
Message-ID: <035301bdea12$8cbf4c60$bcbc77c1@sgml.u-net.com>

Whoops, must stop trying to cook Sunday lunch, watch a Grand Prix, look
after the kids and answer e-mail at the same time:-( Of course the Book of
Kells was not prepared at Lindisfarne alongside the Lindesfarne Gospel, but
you get my point as to attribution.

Martin Bryan


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Philippe.Le_Hegaret at sophia.inria.fr  Sun Sep 27 18:44:47 1998
From: Philippe.Le_Hegaret at sophia.inria.fr (Philippe Le H�garet)
Date: Mon Jun  7 17:05:06 2004
Subject: KOML : XML Serialization for java
Message-ID: <360E6B73.4E84A366@sophia.inria.fr>

The Koala XML serialization provides an easy way to serialize
and deserialize any Java Objets in an XML document. This
application is called KOML for Koala Object Markup Language. 
This is a 100% pure Java solution. 

 Documentation, technical note and packages can be found here :
http://www.inria.fr/koala/XML/serialization/

Regards,
Philippe.
---------
Philippe Le Hegaret
Philippe.Le_Hegaret@sophia.inria.fr -- http://www.inria.fr/koala/plh/
KOALA/DYADE/BULL @ INRIA (Stagiaire) - Sophia Antipolis

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srnm at yahoo.com  Sun Sep 27 19:23:18 1998
From: srnm at yahoo.com (Steven Marcus)
Date: Mon Jun  7 17:05:06 2004
Subject: DCD v DTD?
Message-ID: <19980927172029.22131.rocketmail@send1e.yahoomail.com>


Hello all,

Can someone provide some perspective on the use of DCDs v DTDs?

This is what I know:
1) DTDs are standardized and ready now.
2) DCDs use XML to describe that which you would use DTDs for (?)

Is that it? Am I missing something?

tia!
Steven


_________________________________________________________
DO YOU YAHOO!?
Get your free @yahoo.com address at http://mail.yahoo.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Sun Sep 27 22:34:46 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:05:06 2004
Subject: XML editor
In-Reply-To: <01BDE97D.2D64B380@thillai>
References: <01BDE97D.2D64B380@thillai>
Message-ID: <wkd88hdyw0.fsf@ifi.uio.no>


* thillai@ix.netcom.com
| 
| Is there any XML editor which does validation based on DTD when I
| create the XML document. (after assoicating a DTD) For e.g if I try
| to add a invalid child it should give warning or based on DTD it
| should list what are all the valid children for a particular node.
| (Is it possible??)

Yes, this is possible, and SGML editors have been doing this for
years already.

One editor that does this is the Emacs PSGML-mode, but I'm not aware
of any other free editors that do this. XED does let you run a parser
and step through the validation errors while editing.

There are many commercial programs that let you do this.

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gmessner at messners.com  Mon Sep 28 05:45:05 1998
From: gmessner at messners.com (Gregory M. Messner)
Date: Mon Jun  7 17:05:06 2004
Subject: XML and Objects
Message-ID: <199809280344.UAA13557@websales.com>


I have 2 questions/problems:

1) A DTD describes a document which contains content specified by a URL, a
local file name, or inline. Documents created using this DTD are assembled
and transported across a network. How do you include the content? The 2
ways we have discussed are:

    * Inline using Base64 encoding in a CDATA section
    * Wrap the document in a multipart/related MIME message
      and include the content as attachments

I am leaning towards multipart/related, but would like to know of others
experience in this area.


2) We desire to provide an API on the client side which exposes a simple
mechanism for creating and modifying objects. These objects are serialized
using XML and then transported to a server for further processing. The
server then responds with another XML document that we then de-serialize
into an object and present it to the API user. Here are some basic
requirements:

    * Support for both Java and C++
    * API must be similar for both Java and C++
    * Object members are accessed via get/set methods
    * Adhere to JavaBean method naming patterns

We are thinking of developing an application which takes a DTD and then
generates Java and/or C++ code for each object. We would use a XML helper
file to give more control over the generation process. Are we out in left
field here? What are some of the other ways to do this? What are you
experiences doing something like this?


Gregory M. Messner
gmessner@vsi.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Mon Sep 28 07:41:19 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:06 2004
Subject: XML and Objects
In-Reply-To: <199809280344.UAA13557@websales.com>
Message-ID: <3.0.1.16.19980928064046.098fc852@pop3.demon.co.uk>

At 20:47 27/09/98 -0700, Gregory M. Messner wrote:
[...]
>
>2) We desire to provide an API on the client side which exposes a simple
>mechanism for creating and modifying objects. These objects are serialized
>using XML and then transported to a server for further processing. The
>server then responds with another XML document that we then de-serialize
>into an object and present it to the API user. Here are some basic
>requirements:
>
>    * Support for both Java and C++
>    * API must be similar for both Java and C++
>    * Object members are accessed via get/set methods
>    * Adhere to JavaBean method naming patterns
>
>We are thinking of developing an application which takes a DTD and then
>generates Java and/or C++ code for each object. We would use a XML helper
>file to give more control over the generation process. Are we out in left
>field here? What are some of the other ways to do this? What are you
>experiences doing something like this?

Does 'left-field == bleeding edge'?

I think a number of people on XML-DEV have a very similar requirement: The
Coins approach, the SUN early release of XML, XXX (Steve Withall) and
JUMBO. We all want object functionality client-side. The balance between
client and server may differ, but we need an element-object API.

I have raised this sporadically on XML-DEV and there is no doubt that we
will benefit enormously from a shared API (created and developed in the
same way as SAX). It seems that the time is right. We need some champions
:-) Volunteers?

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Jon.Bosak at eng.Sun.COM  Mon Sep 28 08:18:40 1998
From: Jon.Bosak at eng.Sun.COM (Jon Bosak)
Date: Mon Jun  7 17:05:06 2004
Subject: Prerelease of religion.200 available
Message-ID: <199809280614.XAA14660@boethius.eng.sun.com>

I've put a prerelease of a major revision to the religion set in a
temporary location:

   http://sunsite.unc.edu/pub/sun-info/standards/xfer/rel200-1.zip

There are lots of changes from the old release (1.10).  The biggest
change is that I've removed the verse numbers; these are now to be
generated by style sheets.  I've included sample DSSSL style sheets as
proof of concept.  Directions are included for generating RTF versions
of the XML files using Jade.

Not part of the release, but provided for reference purposes at the
same temporary location, are the RTF files you will get if you run
Jade using the style sheets included in the package:

   http://sunsite.unc.edu/pub/sun-info/standards/xfer/rel1rtf.zip

I invite members of the dssslist and xml-dev to check out the new
release before I announce it to a wider audience.  I have tried hard
to make these files absolutely compliant XML, but previous experience
makes me doubt that I've gotten everything just right.  I would also
welcome constructive criticism of the DSSSL files.

Jon

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Mon Sep 28 11:09:51 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:06 2004
Subject: DCD v DTD?
Message-ID: <199809280823.KAA09405@berlin.dvs1.tu-darmstadt.de>

> Can someone provide some perspective on the use of DCDs v DTDs?
> 
> This is what I know:
> 1) DTDs are standardized and ready now.

Correct.

> 2) DCDs use XML to describe that which you would use DTDs for (?)

Yes.  (XSchema, too.)

There are a two major advantages to using XML syntax for schema information 
rather than DTDs.  The first is the availability of tools -- while there are 
plenty of tools around for manipulating XML files, there are few available for 
manipulating DTDs.

The second advantage is extensibility.  Because of the stated goal that XML be 
compatible with SGML, XML DTDs are (in theory) extensible only if the same 
extension is simultaneously made to SGML DTDs.  This makes it difficult to add a 
lot of interesting schema information, such as data types, to XML DTDs.  Because 
there is no such requirement when schema information is represented in XML, the 
field is open.

The only drawback (besides short-term availability) is if your application 
requires a DTD.  This is not always the case, as it is often possible to 
separate authoring (which validates against schema information) from use (which 
assumes validity).  And even if your application requires a DTD, you might be 
able to convert part of your schema information into DTD form while still using 
the additional XML-based schema information (such as data types) in your 
authoring tools.

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Mon Sep 28 12:15:59 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:07 2004
Subject: XML parsing from within a VB server component
Message-ID: <003601bdeac9$9ad1aac0$1e09e391@mhklaptop.bra01.icl.co.uk>


-----Original Message-----
From: Billow, Danny J <Billow.Danny@emeryworld.com>
To: 'xml-dev@ic.ac.uk' <xml-dev@ic.ac.uk>
Date: 25 September 1998 22:45
Subject: XML parsing from within a VB server component


>Can it be done?
Yes.

>Is there any documentation for msxml.dll?
I wouldn't advise using msxml, it's out of date and
Microsoft don't appear to have any plans for it. Use any of
the excellent Java-based SAX parsers available.

So how to access a Java parser from a VB (COM) environment?
I haven't tried doing this directly at the SAX level, though
it's probably possible if you understand the intricacies of
COM. What I did was to write a Java wrapper class around the
SAXON library (http://home.iclweb.com/icl2/mhkay/saxon.html)
and register this as a COM object using Microsoft's javareg
utility. I designed the wrapper class to have a very simple
interface: no exceptions, no callbacks, only strings and
integers as parameters; I'm sure I restricted it
unnecessarily but I had had problems bridging more complex
interfaces so I was overcautious. The COM wrapper class is
part of the SAXON distribution.

As an alternative to Microsoft's javareg you ought to be
able to use Sun's ActiveX Bridge but I spent a week trying
to get it to work, while javareg worked for me first time.

>Is this module used by the browser only or can I use it in
my server
>component?

I've only tried using this server-side, generally from
VBScript on ASP pages. I haven't had time to explore the
complexities of client-side operation.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Mon Sep 28 12:27:57 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:07 2004
Subject: Sun XML early access
Message-ID: <004b01bdeacb$44bdec00$1e09e391@mhklaptop.bra01.icl.co.uk>

>> They've done some interesting
>> things with the DOM, for example the ability to nominate
>> user-defined subclasses of Element, and a TreeWalker
>> interface. This is where SAXON started last December!
>
>Hmmm ... did I ask you to elaborate on this one?  SAXON
>started there, and that's not what it's got now; it's
>effectively got a dispatch framework rather than an
>in-memory data structure.  Was that because you didn't
>want the in-memory stuff?


Yes, it's an interesting history. I started off by building
the document tree with MSXML, and wrote a TreeWalker class
to make it easier to process the tree serially (unlike SUN's
TreeWalker which supports a getNext() interface, mine did a
callback at each node); then I realised most of my
applications were single-pass, so I changed it to do
essentially the same thing on top of an event-based
interface like SAX. Later other people asked for a
non-serial interface, so I put the ability to walk the DOM
back in.

(I've now got it working to be independent of the DOM
implementation, with drivers for SUN and Docuverse: not yet
released though).

Mike K


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tln at insect.sd.monash.edu.au  Mon Sep 28 12:57:17 1998
From: tln at insect.sd.monash.edu.au (Thuy-Linh Nguyen)
Date: Mon Jun  7 17:05:07 2004
Subject: Help on XML4J1.0.4 [solved]
Message-ID: <Pine.GSO.3.96.980928205528.19749E-100000@insect.sd.monash.edu.au>

Hi !

Thank you to all who replied me. I did look at the classpath, but didn't
look at the servlet configuration, thinking it would use the same
classpath as the JDK. That's where the problem is !

Thanks !
TL


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 28 13:32:57 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
In-Reply-To: <199809280344.UAA13557@websales.com>
References: <199809280344.UAA13557@websales.com>
Message-ID: <13839.29384.260466.681001@localhost.localdomain>

Gregory M. Messner writes:

 > 1) A DTD describes a document which contains content specified by a
 > URL, a local file name, or inline. Documents created using this DTD
 > are assembled and transported across a network. How do you include
 > the content? The 2 ways we have discussed are:
 > 
 >     * Inline using Base64 encoding in a CDATA section

I don't think that you need a CDATA section with Base64 (can someone
confirm that Base64 excludes '<' and '&'?).

 >     * Wrap the document in a multipart/related MIME message
 >       and include the content as attachments

This sounds like a wise choice.  XML packaging is a problem that the
W3C XML Activity has not yet addressed -- experimentation and
implementation experience will be very helpful to them when the time
comes (after all, that's one of XML-DEV's greatest strengths).

 > I am leaning towards multipart/related, but would like to know of
 > others experience in this area.

The key is to use a streaming protocol so that you can start
processing the first files while the rest are arriving.  ZIP is
useless for this purpose, since it keeps the directory information at
the end; TAR is good, as (I think) is CPIO.

 > 2) We desire to provide an API on the client side which exposes a
 > simple mechanism for creating and modifying objects. These objects
 > are serialized using XML and then transported to a server for
 > further processing. The server then responds with another XML
 > document that we then de-serialize into an object and present it to
 > the API user. Here are some basic requirements:
 > 
 >     * Support for both Java and C++
 >     * API must be similar for both Java and C++
 >     * Object members are accessed via get/set methods
 >     * Adhere to JavaBean method naming patterns

The DOM would be a pretty close (and obvious) fit, and has the
advantage of being very close to W3C Recommendation.

 > We are thinking of developing an application which takes a DTD and
 > then generates Java and/or C++ code for each object. We would use a
 > XML helper file to give more control over the generation
 > process. Are we out in left field here? What are some of the other
 > ways to do this? What are you experiences doing something like
 > this?

No, this is a common approach.  Are you going to build the templates
from the DTD itself (I'm not certain that I understand the references
to 'DTD')?


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Mon Sep 28 14:20:50 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
Message-ID: <001601bdeada$9f805600$02000003@thing1.camb.opengroup.org>

> > 1) A DTD describes a document which contains content specified by a
> > URL, a local file name, or inline. Documents created using this DTD
> > are assembled and transported across a network. How do you include
> > the content? The 2 ways we have discussed are:
> > 
> >     * Inline using Base64 encoding in a CDATA section
>
>I don't think that you need a CDATA section with Base64 (can someone
>confirm that Base64 excludes '<' and '&'?).


I've used Base64 extensively in coins version 0. It excludes < and &.
It includes only A-Z, a-z, 0-9, +, /.

Bill


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Mon Sep 28 15:31:30 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:07 2004
Subject: XML editor
References: <01BDE97D.2D64B380@thillai>
Message-ID: <360F8B21.181209D1@technologist.com>

> Is there any XML editor which does validation based on DTD when I create
> the XML document. (after assoicating a DTD)   For e.g if I try to add a
> invalid child it should give warning or based on DTD it should list what are all
> the valid children for a particular node.  (Is it possible??)

Yes it is possible. As far as I know, the only tools that do it are tools
that were formerly SGML tools. Check out Steve Pepper's "Whirlwind guide":
http://www.infotek.no/sgmltool/editetc.htm#cat-edit

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

How many of the Congresspeople who voted for the CDA do you suppose
also voted to release the report that reads like a borderline por-
nographic dime-store romance written by a Texas preacher's son?
	- Keith Dawson, TBTF 
		http://www.tbtf.com/archive/09-14-98.html
		http://www.tbtf.com/resource/hypocrites.html

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Mon Sep 28 16:13:29 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
Message-ID: <3.0.32.19980928071027.00af0c00@pop.intergate.bc.ca>

At 06:40 AM 9/28/98, Peter Murray-Rust wrote:
>At 20:47 27/09/98 -0700, Gregory M. Messner wrote:
>>2) We desire to provide an API on the client side which exposes a simple
>>mechanism for creating and modifying objects. 
...
>I think a number of people on XML-DEV have a very similar requirement: The
>Coins approach, the SUN early release of XML, XXX (Steve Withall) and
>JUMBO. We all want object functionality client-side. The balance between
>client and server may differ, but we need an element-object API.

This can/should be built on top of the DOM, right?  -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Mon Sep 28 16:34:19 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
References: <3.0.32.19980928071027.00af0c00@pop.intergate.bc.ca>
Message-ID: <360F9ECC.C43C2816@infinet.com>

Tim Bray wrote:

> At 06:40 AM 9/28/98, Peter Murray-Rust wrote:
> >At 20:47 27/09/98 -0700, Gregory M. Messner wrote:
> >>2) We desire to provide an API on the client side which exposes a simple
> >>mechanism for creating and modifying objects.
> ...
> >I think a number of people on XML-DEV have a very similar requirement: The
> >Coins approach, the SUN early release of XML, XXX (Steve Withall) and
> >JUMBO. We all want object functionality client-side. The balance between
> >client and server may differ, but we need an element-object API.
>
> This can/should be built on top of the DOM, right?  -Tim

Not sure exactly what Peter means by an element-object API, but the DOM has done
quite well for my needs to date.  I still believe there are many API changes I
think are necessary to the DOM to make it optimally efficient for applications
(for example CharacterData.getData() should return a character array or provide a
filler routine so that underlying DOM implementations can decide how they want to
deal with data storage), but as a whole it does a pretty good job.

I have an element-object API which I would call more of a data-driven XML
framework from the parser on up that allows the application developer to easily
map elements to objects dynamically (does not work anything like coins does).
This sort of framework is done natively in our parser, but this sort of framework
could easily be built on top of SAX as well.  I have found that for the DOM
implementation and an XSL implementation I am working on for a client, that this
sort of framework makes writing these sort of tools a cinch.  The only problems I
have had with XSL is understanding the latest draft in the first place (sorry I
don't have 10 years of document software experience on my resume).

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From creitzel at mediaone.net  Mon Sep 28 16:34:23 1998
From: creitzel at mediaone.net (Charles Reitzel)
Date: Mon Jun  7 17:05:07 2004
Subject: Element oriented programming
Message-ID: <199809281433.KAA18537@chmls05.mediaone.net>

Peter Murray-Rust wrote:
>It would be great if we could standardise on the API for this sort of
>thing. Then element-oriented programming could become really attractive.
>The domain-specific classes could use a standard core facility.

Agree also.  How would the Netscape proposal for "behavior sheets" fit in?
The basic idea of using style sheet-like pattern matching (XEvent?) to map
to method invocation (w/ parameter lists) was viable and would build on the
work already done for style sheets (which presumably builds on XPointer).

For java work, it would seem natural to use XEvent statements to register a
bean with the "behavior processor".  XEvent statements could map elements
matching a pattern with a bean event to "fire".  Further XEvent statements
could map specific events to specific event listeners available in
registered beans.  As each element appears on the input, it is checked
against the list of patterns.  For each pattern matched, the corresponding
event is created and all registered listeners executed.  

Element and attribute data must be mapped to event properties.  Likewise, it
must be possible to fire both when the event when the element is first
encountered (pre-XXX) and after all of its contents have been read in are
available to the event (post-XXX).

Limiting the event input to the current element and its contents seems
reasonable.  If the application needs references to other elements, it can
save the data for later reference as needed.

Something similar can be done for other languages like C++.  It isn't
necessary to recreate the equivalent of Java beans entirely - just event
definition.  Is this an instance of the publisher-subscriber pattern?
Linkage issues for C++ will be platform specific, but not too bad.

To my mind behavior sheets, per se, are not the hardest part.  What is
lacking is better cohesion among the various methods of pattern matching.
Xml, XPointer, XSL, et al need to share a unified view of specifying sets of
elements.

My $0.02 worth,
Charlie Reitzel


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 28 16:58:24 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
References: <199809280344.UAA13557@websales.com> <13839.29384.260466.681001@localhost.localdomain>
Message-ID: <360FA3F2.C6D02FD3@locke.ccil.org>

David Megginson scripsit:

> I don't think that you need a CDATA section with Base64 (can someone
> confirm that Base64 excludes '<' and '&'?).

I confirm it.  Base64 has 65 characters: A-Z, a-z, 0-9, +, -, and =,
the last of which is used for padding purposes when the binary
is not a multiple of 3 bytes (Base64 is basically a 3-byte binary
to/from 4-byte ASCII conversion).

Authoritative information is available in RFC 2045, clause 6.8.
 
>  >     * Wrap the document in a multipart/related MIME message
>  >       and include the content as attachments
> 
> This sounds like a wise choice.  XML packaging is a problem that the
> W3C XML Activity has not yet addressed -- experimentation and
> implementation experience will be very helpful to them when the time
> comes (after all, that's one of XML-DEV's greatest strengths).

One advantage of multipart/related is that there is a standard
URL format for referring from one part of a multipart message to
another.  Each part can have a Content-ID: header, analogous to
the overall Message-ID: header, and then an URL of the form
"cid:xx7f9weURV@whoever.com" can refer to the part with that
header.  See RFC 2111.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Mon Sep 28 17:36:48 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:07 2004
Subject: Element oriented programming
Message-ID: <00f601bdeaf6$50776be0$1e09e391@mhklaptop.bra01.icl.co.uk>

>Likewise, it
>must be possible to fire both when the event when the
element is first
>encountered (pre-XXX) and after all of its contents have
been read in are
>available to the event (post-XXX).


In my GedML (genealogy) app, which makes extensive use of
ID/IDREF relationships, I do the semantic validation of an
element in four phases:
1 - when the start element tag is encountered (check that
the context is OK and the attributes are valid)
2 - when the end element tag is encountered (check element
content, and consistency of subelements)
3 - when all elements have been read (check that all
referenced elements exist and are of the right type, and fix
up IDREF pointers)
4 - when all elements have been through stage 3 (check
inter-element consistency, e.g. that the family tree is
non-cyclic.)

So I define four standard events for each element type.

I don't know whether this represents any kind of general
model or whether it is peculiar to my application. I suspect
it applies to any application that handles data structures
that are not purely sequential or hierarchic.

Note: some of the validation I do duplicates that done by a
validating XML processor, but this is such a small part of
the whole that I decided to put it in anyway. E.g. in phase
3 I have to check that the target element is of the right
type, so it's little extra effort to check first that it
exists, even though a validating parser will have done this
already. (Anyway, I can't be sure that the user chose a
validating parser - a SAX limitation?)

I think there's something to be said for separating
"validity checking" events from any other behaviour. The
above discussion all relates to the problem of ensuring that
the XML document conforms to application-defined validity
rules. Of course, it would be much nicer if most of this
could be done declaratively though XSchema extensions.

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 28 18:08:23 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:07 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
In-Reply-To: <199809302215.RAA02155@bruno.techno.com>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
Message-ID: <3.0.5.32.19980928103117.00966620@dns.isogen.com>

At 05:15 PM 9/30/98 -0500, Steven R. Newcomb wrote:

>[Even so, in response to what you say, I feel compelled to point out,
>perhaps irrelevantly, that names cannot be owned in any meaningful
>sense.  

I own drmacro.com. This ownership is asserted through paying my bill to
InterNIC, which acts as both a registrar of ownership (just like when you
register the deed to your house at the courthouse) and a manager of access
to the names (by controlling the DNS system that maps names to machines).

Thus, I own the name space. I can argue that I also own the names within
that name space because I also control the machine that has the resources
that those names will map to [that's actually not true--Steve owns the
machine and I use it only through his largess and kindness, but let's
pretend I did own it.].  I can sell names in name space, just as I can sell
space on my Web pages (or the roof of my house, which is right in the
landing flight path of Austin's airport, and therefore a good candidate for
signage that will reach a largely affluent audience--I'll even trim my tree
if the price is right :-)).  If I can see it, I must own it.

Maintaining and protecting ownership is another thing, as you point out.
But that's true for anything we can own--that's why we have property laws.
If someone else "takes" one of my names by using it in some way that I
didn't approve (like registering it and providing a mapping for it that
doesn't end up on drmacro.com), I can call the Sheriff and sue them.  Xerox
may have failed to maintain Xerox, but I know that Coke has succeeded in
maintaining Coke as a trade name.  So names can be protected given
sufficient vigalance and the right breed of attack lawyer.

So I maintain my assertion that names, not just name spaces, are ownable
things.  If this were not the case, Compaq would not have spent 3.something
million dollars to buy the name "www.altavista.com".  QED.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Mon Sep 28 18:25:06 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:07 2004
Subject: XML and Objects
Message-ID: <002601bdeafb$4fcda510$d3228018@jabr.ne.mediaone.net>

First, as Tim Bray suggests, the DOM goes a long way to represent objects in
XML.

We can add to this by

1) a DTD for a Simple Object Definition Language (SODL) this DTD is included
in a post to follow.
2) Use of both Base64 and multipart/related techniques.

	The advantage of SODL is that the object's data is represented in XML
itself. This works for most objects except those whose state is comprised of
large amounts of binary data. For moderate amounts of binary data base64
encoding is fine and certainly works. For large amounts e.g. video clips and
pictures, multipart/related MIME messages with Content-ID tagged binary
parts is more efficient. When representing an object by a multipart/related
compound document, the first part is an XML metadata header which contains
internal "cid:xxx" links.

	We are using these techniques in production for our XML/internet based
telemedicine system. I have developed a DOM for MIME which we use for this.
I would be happy to share more of these details if there is interest.

	Now to your specific question, if objects are represented in this fashion,
you can access members through interfaces (i.e. Java/C++) through get/set
pairs. A language XML interface layer is needed. This layer is identical to
COM's dispatch layer which allows COM objects to be used from within
Javascript and VBScript. COM uses a binary typelibrary as input. Our
technique takes the SODL document and

a) generates a typelibrary from it
b) employs a custom interface which is driven by the SODL document

	The advantage of (a) is that it is compatible with existing software
however the software is limited to Windows.
	XML-DEV would be an excellent place to develop an independent (b) layer
specification. This spec would certainly need to interface with DOM.

Jonathan Borden
JABR Technology
jborden@mediaone.net
>
>
>
> I have 2 questions/problems:
>
> 1) A DTD describes a document which contains content specified by a URL, a
> local file name, or inline. Documents created using this DTD are assembled
> and transported across a network. How do you include the content? The 2
> ways we have discussed are:
>
> * Inline using Base64 encoding in a CDATA section
> * Wrap the document in a multipart/related MIME message
> and include the content as attachments
>
> I am leaning towards multipart/related, but would like to know of others
> experience in this area.
>
>
> 2) We desire to provide an API on the client side which exposes a simple
> mechanism for creating and modifying objects. These objects are serialized
> using XML and then transported to a server for further processing. The
> server then responds with another XML document that we then de-serialize
> into an object and present it to the API user. Here are some basic
> requirements:
>
> * Support for both Java and C++
> * API must be similar for both Java and C++
> * Object members are accessed via get/set methods
> * Adhere to JavaBean method naming patterns
>
> We are thinking of developing an application which takes a DTD and then
> generates Java and/or C++ code for each object. We would use a XML helper
> file to give more control over the generation process. Are we out in left
> field here? What are some of the other ways to do this? What are you
> experiences doing something like this?
>
>
> Gregory M. Messner
> gmessner@vsi.com
>

Jonathan Borden
JABR Technology Corporation
617-557-5151
(fax) 617-557-5160
mailto:jborden@mediaone.net


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Mon Sep 28 18:26:57 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:07 2004
Subject: A Simple Object Definition Language (SODL)
Message-ID: <002701bdeafb$58a8f2c0$d3228018@jabr.ne.mediaone.net>

A Simple Object Description Language (SODL)

This is a very simple way to reprasent objects in XML. This representation
is compatible with Microsoft's typelibrary which is a binary format. The
element "type" was intended to use definitions from XML-Data but if XML-Data
isn't going anywhere we may need to include a "mini-XML-Data" within the
SODL DTD.

SODL is related to XML-RPC in that objects are defined as being composed of
interfaces (i.e. interfaceDef's). This approach to object definition is
taken because it is compatible with several of the XML-RPC efforts including
John Tigue's as well as being compatible with Microsoft's COM.
First an example, then the DTD:

<objectDef uid="
" name="JABR.DataObject">
		<interfaceDef uid="
" name="IJABRDataInterface">
			<property>
			<name>Y</name>
			<value><i4>345667</i4></value>
		</property>
		<property id="1">
		<value><string>An unnamed property</string></value>
		</property>
		</interfaceDef>
	</objectDef>

----DTD-Part-Here:-)

<!ELEMENT interfaceDef (name,derivedFrom,(property|method)*)>
<!ATTLIST interfaceDef
	uid CDATA #required
	version CDATA "1.0"
>
<!ELEMENT property (name,params?)>
<!ATTLIST property id CDATA ""
	access (get|put|getput) "getput"
	description CDATA ""
>
<!ELEMENT method (name,params?>
<!ATTLIST method id CDATA ""
	description CDATA ""
>
<!ELEMENT params (param*)
<!ELEMENT param (name, type)>
<!ATTLIST param
	type (in|out|inout|retval) #required
	id CDATA ""
>
<!ELEMENT name #PCDATA>
<!ELEMENT derivedFrom #PCDATA>
<!ELEMENT type (long|short|string|bool ..)>
		<!- types from XML-data to be used here ->
<!ELEMENT objectDef (interfaceDef+,other)>
<!ATTLIST objectDef
	uid CDATA #required
	name CDATA #required
	transacted (not|supports|required|new) "not">

Jonathan Borden
JABR Technology Corp.
jborden@mediaone.net


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 28 18:35:36 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:07 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
  maps)
In-Reply-To: <3.0.5.32.19980928103117.00966620@dns.isogen.com>
References: <199809302215.RAA02155@bruno.techno.com>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
Message-ID: <3.0.5.32.19980928113448.0098c100@dns.isogen.com>

At 10:31 AM 9/28/98 -0500, W. Eliot Kimber wrote:
>if the price is right :-)).  If I can see it, I must own it.

I meant "if I can *sell* it, I must own it." 

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 28 19:19:46 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:08 2004
Subject: Public identifiers and topic maps
In-Reply-To: <199809302215.RAA02155@bruno.techno.com>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
Message-ID: <3.0.5.32.19980928121846.00978810@dns.isogen.com>

At 05:15 PM 9/30/98 -0500, Steven R. Newcomb wrote:
>[Eliot Kimber:]
>
>> I think there's two different things being talked about here:
>
>[ ... and lots of other good stuff with which I agree.]
>
>But, Eliot, your note does not address the problem we're trying to 
>solve here.

In trying to respond in depth to Steve's note, I realized what the
fundamental problem is. Steve is talking about topics as though the topics
were the things (i.e., the topic "Lake Geneva" *is* the actual lake).  But
topics are not the things, they are descriptions of and opinions about
things.  That's why I say that a topic is a document.

We can prove that there exists in the alps of Europe a body of water that
has some measurable position on the globe.  That is a fact.

As soon as we start saying things like "this body of water is a lake" or
"this lake is called 'Lake Geneva'" we have asserted opinions about this
pool of water.  The opinion is not the thing.  The opinion points to the
thing.  Topics are just formalized forms of these types of statements.
They are abstract ideas that have to be documented if communication about
them over any useful time scale is to occur.

Why do we know that "Lake Geneva" is called "Lake Geneva"? Because someone
somewhere wrote it down the assertion: this body of water to which I refer
is called, by this group of people, "Lake Geneva".  They created the topic
"Lake Geneva" by writing down the assertion that the body of water is
called "Lake Geneva" by at least one person.  The topic is the idea that
this body of water is called Lake Geneva, not the body of water. The
document that says this is one member resource of the topic.

The names of topics are not and cannot be distinguishing, in the general
case, such that you can tell from two topic names whether or not the topics
are the same or different. I can create a topic called "Lake Geneva" by
which I mean all lakes called "Geneva" anywhere in the world. The only way
you can distinguish my topic from Steve's topic is to find all the members
of each topic and compare them for identity.  By same token, I can create a
topic with the name "That Lake in Switzerland" that is identical to Steve's
topic named "Lake Geneva" (identical because it includes the same member
resources).

It has always been and will always be the case that if two names are the
same (within the same name space, of course) then they must refer to the
same resource. But the converse can never be proved: if two names are
different, there's no guarantee, in the general case, that they don't refer
to the same thing.

You could define a name space in which you impose the rule that every
resource shall have exactly one name, but then you have the problem of
defining identity of resources.  For things like printed books or human
beings, it's relatively easy because they have lots of inherently
distinguishing properties, like author, title, publisher's ISBN number,
fingerprints, unique location in space and time, etc., that make it easier
to inspect names to see if they might actually refer to the same thing.
But topics, being more abstract (they're just ideas and opinions with no
well-defined physical or electronic representation), don't have inherent
distinguishing properties, so you can't use them when constructing names.
It would be up to some topic cataloging service to determine when two names
really referred to the same topic and disallow the cataloging of the second
name.  But this is a function of the catalogers, not the naming mechanism.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Mon Sep 28 19:26:55 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:08 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
In-Reply-To: <3.0.5.32.19980928103117.00966620@dns.isogen.com>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	<360C0929.A4DC3C69@locke.ccil.org>
	<199809302215.RAA02155@bruno.techno.com>
	<3.0.5.32.19980928103117.00966620@dns.isogen.com>
Message-ID: <13839.50688.801484.233333@localhost.localdomain>

W. Eliot Kimber writes:

 > So I maintain my assertion that names, not just name spaces, are
 > ownable things.  If this were not the case, Compaq would not have
 > spent 3.something million dollars to buy the name
 > "www.altavista.com".  QED.

If you wanted to express the point formally, you could say that each
unique combination of

  { namespace, time }

or

  { namespace, time, name }

is ownable.  Eliot owns drmacro.com at 00:00:01 GMT 28 September 1998
(and for a while before and after).

Vanity of vanities, dust to dust, and all that.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 28 19:37:52 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:08 2004
Subject: Public identifiers and topic maps
In-Reply-To: <033001bdea01$f0fb2660$bcbc77c1@sgml.u-net.com>
Message-ID: <3.0.5.32.19980928123556.00956c50@dns.isogen.com>

At 11:30 AM 9/27/98 +0100, Martin Bryan wrote:

>The FPIs used in public attributes in topic navigation maps do not need to
>be persistent: they do need even need to be resolvable. They do need to be
>"researchable": you should be able to find a copy of the original definition
>somewhere to be able to ensure that you are using the topic correctly.

If you can find the definition of something by an FPI, then you have
resolved the thing. Thus the FPI is resolvable. It may not be resolvable
electronically (I called you up and said, hey, what does this FPI mean),
but so what?  If you need electronic resolution, then you create a bibloc
that provides the electronic proxy representation for the thing itself:
that's what a bibliographic location is for (see my signature at the bottom
of this note for an example).

If your argument is that it doesn't need to be *electronically* resolvable,
I'd argue that, in today's world, it would take more effort to make
something researchable but not resolvable than it would be to make it
resolvable.  That's because whatever you publish will probably start as an
electronic data set anyway, so why not simply publish it to the Web?  If I
do that and then tell you "FPI 'x' maps to URL 'y', the FPI is
electronically resolvable.

If I can't figure out what an FPI maps to, whether the resource is
electronic or not, then the FPI is meaningless because it doesn't get me to
anything. Thus, if an FPI is researchable, it's just as easy to make it
resolvable.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From maillist at chris.hubick.com  Mon Sep 28 20:03:15 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun  7 17:05:08 2004
Subject: XML and Objects
In-Reply-To: <360FA3F2.C6D02FD3@locke.ccil.org>
Message-ID: <Pine.LNX.3.96.980928105349.23341A-100000@chris.hubick.com>

On Mon, 28 Sep 1998, John Cowan wrote:

> I confirm it.  Base64 has 65 characters: A-Z, a-z, 0-9, +, -, and =,
> the last of which is used for padding purposes when the binary
> is not a multiple of 3 bytes (Base64 is basically a 3-byte binary
> to/from 4-byte ASCII conversion).
> 
> Authoritative information is available in RFC 2045, clause 6.8.

And if anyone wants an implementation with source, thank the W3C Jigsaw
team, Base 64 Encoding comes as part of the Jigsaw package :-)

import org.w3c.tools.codec.Base64Encoder;

---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Mike_Spreitzer.PARC at xerox.com  Mon Sep 28 20:17:26 1998
From: Mike_Spreitzer.PARC at xerox.com (Mike_Spreitzer.PARC@xerox.com)
Date: Mon Jun  7 17:05:08 2004
Subject: Public Identifiers and their association with URIs
Message-ID: <98Sep28.111702pdt."53869(1)"@alpha.xerox.com>

I'm a bit puzzled about the exact intended relationship between Public
Identifiers and URIs.

In an ExternalID (which says where to find an external entity), a System
Identifier (URI) must always be given, and a Public Identifier may also be
given (XML 1.0 section 4.2.2).  The spec includes text suggesting that an XML
processor may map the Public Identifier to a different URI than the one given
alongside the Public Identifier in the ExternalID, and use this different URI
to actually fetch the external entity's content.  Can this alternate URI vary
in any way?  From installation to installation of a given XML application?
>From on-line to off-line operation?  Over time in general?  Note that these
things can vary independently of the content of the referring XML document.
Can the content of an external entity use relative URIs?  In the face of
whatever variation is allowed in the resolved URI of the external entity, what
guarantees does the author of the external entity have about whether and what a
relative URI in the external entity content resolves to (i.e., because the base
varies, base+rel varies, leading to some uncertainty about nested content).

In the section on external entities, Tim Bray's Annotated XML Spec has a note
that says (among other things):

``if you use public identifiers within your own organization, that's perfectly
OK, but if you want to interchange XML documents with anybody external, they
have the right to demand, and you have the obligation to provide, a working
system identifier (URI) for each external entity.''

Um, doesn't the (non-annotated) spec already say that every ExternalID has to
include a working URI?  Are non-working URIs allowed in ExternalIDs?  Or is
this particular comment only meaningful for NotationDecl (the only place I've
noticed where it's allowed to have Public Identifier without an accompanying
System Identifier)?

In a NotationDecl, it is allowable to give only a Public Identifier without an
accompanying System Identifier.  Why does it make sense to offer this option
for NotationDecl but not ExternalID?

It seems to me that for all the reasons people want URNs instead of URLs, we'd
also like to have Public Identifiers that are simply not connected with those
damn URLs (there are no really effective URNs available right now, so URI
effectively equals URL at the present time).

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Curt.Arnold at hyprotech.com  Mon Sep 28 20:26:07 1998
From: Curt.Arnold at hyprotech.com (Arnold, Curt)
Date: Mon Jun  7 17:05:08 2004
Subject: XML and Objects
Message-ID: <E0zNhzJ-0004Q2-00@punch.ic.ac.uk>


At 06:40 AM 9/28/98, Peter Murray-Rust wrote:
>At 20:47 27/09/98 -0700, Gregory M. Messner wrote:
>>2) We desire to provide an API on the client side which exposes a
simple
>>mechanism for creating and modifying objects. 
...
>I think a number of people on XML-DEV have a very similar requirement:
The
>Coins approach, the SUN early release of XML, XXX (Steve Withall) and
>JUMBO. We all want object functionality client-side. The balance
between
>client and server may differ, but we need an element-object API.

Tim Bray wrote:
>This can/should be built on top of the DOM, right?  -Tim

If on top of the DOM you mean that you would would completely populate
the DOM, then build the corresponding objects, then I would tend to
disagree.  I have had good luck and performance restoring objects from a
large (>2MB) XML file responding to events from Expat.  If I first built
an in-memory representation and then processed the information, I don't
think that I could get nearly the same performance.  I would think an
object creation and link resolution layer on top of SAX would be
preferable.

p.s. I've downloaded the Sun XML Early Access, but I can only find
passing references to XML Beans.  Is there a specific document and/or
source file that clarify what they mean by XML Beans.  The two
alternative interpretation of the term that I have contemplated are:

1. Java Beans that modify the behavior of the parser
2. A serialization mechanism for Java Beans
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/ms-tnef
Size: 2752 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980928/02cee4f8/attachment.bin
From b.laforge at jxml.com  Mon Sep 28 20:34:03 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:08 2004
Subject: XML and Objects
Message-ID: <006b01bdeb0e$ae594f40$ab026982@thing1.camb.opengroup.org>

>> >>2) We desire to provide an API on the client side which exposes a
simple
>> >>mechanism for creating and modifying objects.
>> ...
>> >I think a number of people on XML-DEV have a very similar requirement:
The
>> >Coins approach, the SUN early release of XML, XXX (Steve Withall) and
>> >JUMBO. We all want object functionality client-side. The balance between
>> >client and server may differ, but we need an element-object API.
>>
>> This can/should be built on top of the DOM, right?  -Tim


In coins v2, I've got 4 different kinds of objects/elements:

    1. Plain old DOM objects. Passive data holders.

    2. Objects which implement Element and the SAX DocumentHandler
        interfaces. Active and passive roles.

    3. Wrappers which hold beans. The wrappers implement Element and
        DocumentHandler. The Beans receive attribute values as bean
        properties and interact with other beans.

    4. Wrappers which hold CoinApplication beans. Same as 3, except
        that they are passed a reference to their wrapper when constructed,
        allowing them to access the entire DOM.

The Mint Utility has been rewritten using 2 and 4 (above).

The code for release 2 is available today... Documentation to follow.
 :-)

Bill
http://www.jxml.com/coins/download.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 28 20:45:41 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:08 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com> <3.0.5.32.19980928103117.00966620@dns.isogen.com>
Message-ID: <360FD92B.2F72ECB3@locke.ccil.org>

W. Eliot Kimber wrote:

> So I maintain my assertion that names, not just name spaces, are ownable
> things.

The difficulty is that there are so many names that are public domain;
their meaning is settled by tacit agreement among the users, not by
registration.  (This does not mean that the *referent* is necessarily
in the public domain.)

For example, the name "Spencertown, New York" is not registered anywhere.
Spencertown is a part of the Town of Austerlitz ("Towns" in New York
State and New England are roughly what is called "townships" elsewhere
in the U.S.:  registered land units larger than a county).  But it
is custom alone that says what is, and what is not, Spencertown.

Nevertheless, it makes sense as a topic of conversation.  "I am going
to Spencertown" is intelligible even though Spencertown is not
subject to precise definition.  How shall we handle names of this sort?

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 28 20:51:06 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:08 2004
Subject: Mix encodings in a document?
References: <1305575850-179956852@tallent.com>
		<3609585D.D9BC222@locke.ccil.org> <f98092810525400AD@inu.menteith.com>
Message-ID: <360FDA75.E530F70B@locke.ccil.org>

Tony Graham scripsit:

> Surrogate pairs are not allowed in parsed entities.  The production
> for Char excludes the surrogate blocks:
> 
> [2] Char::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD]
>             | [#x10000-#x10FFFF]

On the contrary.  UTF-16 is a standard representation that XML
systems must accept (clause 4.3.3), and the representation of the
characters #x10000-#x10FFFF in UTF-16 (which is the same as
Unicode 2.x) is precisely a surrogate pair.

Individual surrogate characters are excluded, but they have no meaning
in UTF-16 anyway.

> You can include non-BMP/non-UCS-2 characters by making numeric
> references to their Unicode Scalar Value (or by using UCS-4).

That works too.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Mon Sep 28 20:57:03 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:08 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
  maps)
In-Reply-To: <360FD92B.2F72ECB3@locke.ccil.org>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
Message-ID: <3.0.5.32.19980928135613.00993d70@dns.isogen.com>

At 02:44 PM 9/28/98 -0400, John Cowan wrote:
>W. Eliot Kimber wrote:
>
>> So I maintain my assertion that names, not just name spaces, are ownable
>> things.
>
>The difficulty is that there are so many names that are public domain;
>their meaning is settled by tacit agreement among the users, not by
>registration.  (This does not mean that the *referent* is necessarily
>in the public domain.)

What's your point? Much software is in the public domain, yet software is
ownable.  Just because names are ownable doesn't mean that all names are
owned.

>For example, the name "Spencertown, New York" is not registered anywhere.
>Spencertown is a part of the Town of Austerlitz ("Towns" in New York
>State and New England are roughly what is called "townships" elsewhere
>in the U.S.:  registered land units larger than a county).  But it
>is custom alone that says what is, and what is not, Spencertown.
>
>Nevertheless, it makes sense as a topic of conversation.  "I am going
>to Spencertown" is intelligible even though Spencertown is not
>subject to precise definition.  How shall we handle names of this sort?

It's intelligible if you know one thing:

1. What the name space context is (towns and townships in New York)

How do we handle that? By establishing a name space context and then
providing services for resolving names in it:

John: I'm going to "Spencertown" today.
Eliot: Oh, what or where is "Spencertown"?
John: It's a little town in New York.
Eliot: Never heard of it. Can you show me where it is on a map?
John: Sure. {Gets out map, shows which Spencertown he means}
Eliot: {Having received resource referenced by John's use of 
       the name "Spencertown"}. Cool, have fun.

This is no different from any other name resolution we do today. There are
no unique problems here.  There are no unique solutions.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 28 21:00:48 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:08 2004
Subject: Public Identifiers and their association with URIs
References: <98Sep28.111702pdt."53869(1)"@alpha.xerox.com>
Message-ID: <360FDCAA.B41D4B88@locke.ccil.org>

Mike_Spreitzer.PARC@xerox.com wrote:

> Can this alternate URI vary
> in any way?  From installation to installation of a given XML application?
> From on-line to off-line operation?  Over time in general?

Yes.  Yes.  Yes.  Yes.

> In the face of
> whatever variation is allowed in the resolved URI of the external entity, what
> guarantees does the author of the external entity have about whether and what a
> relative URI in the external entity content resolves to (i.e., because the base
> varies, base+rel varies, leading to some uncertainty about nested content).

No guarantees whatever.  This is a problem whenever a document can be retrieved
by more than one URL: it's not specific to public IDs.

> [D]oesn't the (non-annotated) spec already say that every ExternalID has to
> include a working URI?

Yes, except for ...

> In a NotationDecl, it is allowable to give only a Public Identifier without an
> accompanying System Identifier.  Why does it make sense to offer this option
> for NotationDecl but not ExternalID?

Because the referent of the external id for a notation declaration is just an
explanation of the notation (in English or whatever), and it's not necessary to
fetch it to make use of the notation.  It's most probably either ignored or
compared for equality, and public ids do either job just fine.

> It seems to me that for all the reasons people want URNs instead of URLs, we'd
> also like to have Public Identifiers that are simply not connected with those
> damn URLs (there are no really effective URNs available right now, so URI
> effectively equals URL at the present time).

URNs and FPIs (formal public ids) are the same thing, invented by two different
communities.  In principle they could contain each other: there can be an fpi:
URN namespace and an +//URN FPI prefix.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Mon Sep 28 21:19:38 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:08 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
	  maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com> <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
Message-ID: <360FE124.E2FCF5C6@locke.ccil.org>

Blunderingly I wrote:

> > [...] registered land units larger than a county. [...]

For "larger" read "smaller".

Eliot wrote:

> This is no different from any other name resolution we do today. There are
> no unique problems here.  There are no unique solutions.

The problem is not resolving such names, but fitting them into our
existing URN/FPI name architecture.

How should I refer to Spencertown via an FPI?  The standard solution
is "-//John Cowan//TOPIC Spencertown, N.Y.", but that suggests
that *my* Spencertown is meant, and I do not mean *my* Spencertown,
but *the* Spencertown, the one that appears on the maps.
Note that the maps are not *defining* here: they merely report common
usage.

The current usage of the ISO 13wawa draft is something like
"-//US::New York//NONSGML TOPIC Spencertown", but most XML-DEViants
( :-) ) have complained that that FPI steps on New York State's
proprietary name space.

Both these solutions being unsatisfactory, what should we do instead?
Again, it is simply not the case that someone executed an act,
written or oral, whose illocutionary force was "I name this place
'Spencertown'".  The name simply evolved among a community of users,
and was eventually recorded in various maps and gazeteers.

Similar arguments apply to things like words of languages (there is
a registrar for the *names* of languages, but not for the words in them),
persons (how would you refer to Simon de Montfort, or Henry VIII,
with an FPI?), and many other important matters.

Even if we confine ourselves to documents as topics (which 13wawa
by no means insists on, on my reading of it), we have problems.
Consider John 3:14 (in the KJV version, to be concrete). 
What is an FPI I can use for it?  I have the same unpalatable
alternatives: "-//John Cowan//NONSGML KJV John 3:14//EN", which
is a name I own but which is embarrassingly non-public, or
"-//King James I of England//NONSGML John 3:14", which belongs
to a man who is unlikely to register any names.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tgraham at mulberrytech.com  Mon Sep 28 21:32:04 1998
From: tgraham at mulberrytech.com (Tony Graham)
Date: Mon Jun  7 17:05:08 2004
Subject: Mix encodings in a document?
In-Reply-To: <3609585D.D9BC222@locke.ccil.org>
References: <1305575850-179956852@tallent.com>
	<3609585D.D9BC222@locke.ccil.org>
Message-ID: <f98092814155200E9@inu.menteith.com>

At 23 Sep 1998 16:21 -0400, John Cowan wrote:
 > Deke Smith wrote:
 > > And what is the implications of this (if any) for XML rendering? I'm not
 > > sure of what you mean by "surrogates are correctly processed."
 > 
 > Essentially it means that the two 16-bit values that form a
 > surrogate-pair (representing a Unicode character on the Astral
 > Plane) is always treated as a single character.
 > 
 > In XML, surrogate-pairs can appear only in attribute values, #PCDATA
 > content, PIs, and comments; they are not allowed in element GIs,
 > attribute names, or the like.

Surrogate pairs are not allowed in parsed entities.  The production
for Char excludes the surrogate blocks:

[2] Char::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD]
            | [#x10000-#x10FFFF]

You can include non-BMP/non-UCS-2 characters by making numeric
references to their Unicode Scalar Value (or by using UCS-4).

Regards,


Tony Graham
======================================================================
Tony Graham                            mailto:tgraham@mulberrytech.com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9632
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
----------------------------------------------------------------------
  Mulberry Technologies: A Consultancy Specializing in SGML and XML
======================================================================


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 00:17:48 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:08 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
  maps)
In-Reply-To: <360FE124.E2FCF5C6@locke.ccil.org>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
Message-ID: <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>

At 03:19 PM 9/28/98 -0400, John Cowan wrote:
>Eliot wrote:
>
>> This is no different from any other name resolution we do today. There are
>> no unique problems here.  There are no unique solutions.
>
>The problem is not resolving such names, but fitting them into our
>existing URN/FPI name architecture.
>
>How should I refer to Spencertown via an FPI?  The standard solution
>is "-//John Cowan//TOPIC Spencertown, N.Y.", but that suggests
>that *my* Spencertown is meant, and I do not mean *my* Spencertown,
>but *the* Spencertown, the one that appears on the maps.
>Note that the maps are not *defining* here: they merely report common
>usage.

No no no. If by "-//John Cowan//TOPIC Spencertown, N.Y." you mean a small
town in the state of New York (United States of America) commonly known as
"Spencertown", then that is fine (except that a town is not a topic, so the
public text class is incorrect--it should be NONSGML). There is nothing in
that FPI that suggests that you are claiming ownership of Spencertown, any
more than the Library of Congress issuing a catalog numbers suggests
ownership of the books cataloged.

Now, if by "-//John Cowan//TOPIC Spencertown, N.Y." you mean "the idea of
place called 'Spencertown' as expressed by John Cowan", then the FPI refers
to the topic that you happen to own (by having expressed your ideas about
this town (and the public text class is correct).

If what is wanted is a way to refer to places by FPI in a way that is
authoritative, then I suggest asking the U.S. Geological Survey or the CIA
or some UN agency to register a public owner identifier and define an
algorithm for getting from their published (on paper) identifiers for
places to syntactically valid FPIs (or URNs of any sort).

For example, I might expect something like this:

+//IDN us.gov::Geological Survey::places//NONSGML
municipality::Spencertown::New York::USA//EN
+//IDN us.gov::Geological Survey::places//NONSGML bodyofwater::Lake
Geneva::Switzerland//EN

But lacking a cataloging agency and either assigned names or a
deterministic algorithm for generating names from some other classification
scheme, there's not much you can do.

Cheers,

E.


--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From gmr at nextpath.com  Tue Sep 29 02:32:53 1998
From: gmr at nextpath.com (Gavin M. Roy)
Date: Mon Jun  7 17:05:08 2004
Subject: XMLTP.Org
Message-ID: <3637C5A5.D188E007@nextpath.com>

I am currently putting together a project to provide a common protocol
for sending and executing upon XML data.  This concept is different then
embedding XML in HTML, or other traditional mechanisms.  By creating a
common protocol, server daemon, and client/server architecture, we can,
in essence, create a system that by using a modular plug in technology,
similar to Apache's, that will provide a system that is: platform
independent, reliable, scalable, and multi-functional.  If you would
like to help, find this idea questionable, or are curious as to my
sanity, stop by http://www.xmltp.org.  

Thanks,

Gavin M. Roy
gmr@xmltp.org
-------------- next part --------------
A non-text attachment was scrubbed...
Name: gmr.vcf
Type: text/x-vcard
Size: 341 bytes
Desc: Card for Gavin M. Roy
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980929/2bdec7ab/gmr.vcf
From srn at techno.com  Tue Sep 29 02:53:49 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
	  maps)
In-Reply-To: <360FE124.E2FCF5C6@locke.ccil.org> (message from John Cowan on
	Mon, 28 Sep 1998 15:19:00 -0400)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com> <3.0.5.32.19980928135613.00993d70@dns.isogen.com> <360FE124.E2FCF5C6@locke.ccil.org>
Message-ID: <199809290010.TAA02038@bruno.techno.com>

> From: John Cowan <cowan@locke.ccil.org>
> Consider John 3:14 (in the KJV version, to be concrete). 
> What is an FPI I can use for it?  I have the same unpalatable
> alternatives: "-//John Cowan//NONSGML KJV John 3:14//EN", which
> is a name I own but which is embarrassingly non-public, or
> "-//King James I of England//NONSGML John 3:14", which belongs
> to a man who is unlikely to register any names.

What John said.  This is the problem we're trying to address with
"public topics".

I think maybe what we're looking for here is some expressiveness that
even Library Science hasn't bothered with very much, due to the focus
on documents rather than on the things that documents talk about,
which is what really ties them together.  If you don't believe this is
a real issue, consider what kinds of listings you find in indexes.  Do
you find documents there?  No, you find the names of things, and only
some of those things are documents.  What we're looking for is a canon
for referencing things in general.

[Eliot Kimber:]
> Steve is talking about topics as though the topics were the things
> (i.e., the topic "Lake Geneva" *is* the actual lake).

Yes, that's exactly what I meant.

> But topics are not the things, they are descriptions of and opinions
> about things.  That's why I say that a topic is a document.

No, not at all; this is infinite regression.  What is the topic of an
airplane manual?  An airplane manual is not about documents.  It's
about airplanes.  We need a maximally canonical way to refer to
airplanes, snails, puppy-dog's tails, shoes, ships, sealing wax,
cabbages, and St. James's Bible: to the topics themselves, and not
just to more and more words about them and pictures of them.

<explanation>In topic map jargon, a "topic" is an information
construct that has names, occurrences, and roles played in
relationships with other topics.  But in common English usage, a
"topic" is anything that is regarded as a subject about which
communication occurs -- "a topic of conversation".  In topic map
jargon, the latter concept is called "public topic", because it is the
referent to which many topic-map-topics may commonly refer, thus
allowing users of various topic maps to determine when two
topic-map-topics, which may be in different topic maps, are really
about the same English-topic.  The fact that different
topic-map-topics may have many different names in many different
scopes is already well understood and well handled in the topic map
formalism.  The question we're facing here is how best to refer to
public topics (in English: topics).</explanation>

[Perhaps irrelevantly, I'm reminded of the fact that it's a provably
undecidable proposition that there is any connection, in reality,
between mathematics and reality.  The proof boils down to the
undecidability of the proposition that the catalog of all things
necessarily does or does not contain a listing for itself.  At that
level, this is a purely philosophical question.  However, I'm
concerned about more practical matters.  I just want a way to point to
the subject that I'm discussing so that other people can have a prayer
of attaining certainty as to what it is that I'm talking about.  I
can't afford to care whether there's any absolute value in the fact of
my pointing, any more than I care whether the number "4" is really a
valid class of which the number of wheels on my car is a valid
instance.  It's good enough for me if it works.]

It seems to me that this could be done as a chain of namespace/name
pairs in a very wide variety of equally compelling ways.  The main
problem is "Where to start?"  In other words, which of the "common"
namespaces in our world culture should be the first namespace of the
chain?  I think this is always going to be an artistic, political,
sociological, legal, and/or other-technical decision, and there is
plenty of room for differences between people.  It turns out that it
just doesn't matter; the real world of human knowledge is by
definition totally out of control, and we're just doing what we can to
leverage the technology of language to help us manage our fast-growing
wealth of knowledge.  (And what's a liberal education for, anyway, if
not to gain a working knowledge of the common namespaces?)

Back to John's example.  Here's a possible location chain for you:

Step 0: Namespace: (the most common of all namespaces)
        Name: "The Bible"  

Step 1: Namespace: "Editions"
        Name: "King James Version"

Step 2: Namespace: "Books"
        Name: "The Gospel According to John"

Step 3: Namespace: "Chapters"
        Name: "3"

Step 4: Namespace: "Verses"
        Name: "14"


And here's an equally serviceable example, for a somewhat different
audience:

Step 0: Namespace: (the most common of all namespaces)
        Name: "English literature"

Step 1: Namespace: "Reigning monarchs"
        Name: "James I"

Step 2: Namespace: "Titles"
        Name: "Holy Bible"

Step 3: Namespace: "Books"
        Name: "The Gospel According to John"

Step 4: Namespace: "Chapters"
        Name: "3"

Step 5: Namespace: "Verses"
        Name: "14"


-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 29 02:54:57 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
  maps)
In-Reply-To: <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	(eliot@dns.isogen.com)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com> <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
Message-ID: <199809290048.TAA02047@bruno.techno.com>


> This is no different from any other name resolution we do
> today. There are no unique problems here.  There are no unique
> solutions.

I think so, too, but my attempt to do it with FPIs met with
unhappiness and consternation.  So, what is this non-unique solution?
How do I point to Spencertown, New York (not to a book about it, or a
picture of it, but to the town itself)?

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 29 02:55:46 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
In-Reply-To: <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	(eliot@dns.isogen.com)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com> <3.0.5.32.19980928103117.00966620@dns.isogen.com>
Message-ID: <199809282235.RAA02033@bruno.techno.com>

> >[Even so, in response to what you say, I feel compelled to point out,
> >perhaps irrelevantly, that names cannot be owned in any meaningful
> >sense.  
> 
> I own drmacro.com. This ownership is asserted through paying my bill
> to InterNIC, which acts as both a registrar of ownership (just like
> when you register the deed to your house at the courthouse) and a
> manager of access to the names (by controlling the DNS system that
> maps names to machines).

You're missing my point, which is a subtle nuance: namespaces are
ownable; the names in them are owned by the owner of the namespace.
Nobody can really "own" (permanently and absolutely control) a name in
a namespace without also owning the namespace.  

Your own example demonstrates the very point I'm trying to make.  By
definition, you can't possibly be the "owner" of "drmacro.com".
InterNIC owns the name "drmacro.com" because InterNIC owns the
namespace in which it exists.  What InterNIC has granted to you, under
a lease agreement, is the privilege of determining, temporarily, what
"drmacro.com" *means*.  The only difference between your relationship
to "drmacro.com" and everyone else's relationship to "drmacro.com" is
that nobody but you can change what "drmacro.com" *means*, in the
context of certain namespaces (Internet domain names being one of
them; I don't know if you've registered "drmacro.com" in the "US
Trademark" namespace, or in any other namespaces).  But this is *only*
because InterNIC restricts the ability of anyone but you to determine
what "drmacro.com" means.  Since InterNIC makes all the rules
regarding "drmacro.com", InterNIC owns "drmacro.com".  You don't.  You
rent it from InterNIC, under InterNIC's terms.

> Thus, I own the name space.

If you owned the Internet Domain Namespace, you'd be collecting the
money, not InterNIC.  So you must mean that you own the hierarchy of
namespaces of which the namespace whose own name is "drmacro.com" is
the root.  I agree with that!  You do in fact own the namespace that
temporarily happens to have the name "drmacro.com" in the Internet
Domain Name namespace.  Some of those names may themselves be the
names of namespaces, and those namespaces are also owned by you.

> I can argue that I also own the names within that name space because
> I also control the machine that has the resources that those names
> will map.

Exactly right, except for the first "also".

> So I maintain my assertion that names, not just name spaces, are
> ownable things.

If you own the namespace, you own the names that are in it.  If not,
not.  The fact that you own a namespace says nothing about whether or
not you own a namespace containing a name of the namespace that you
own.

> If this were not the case, Compaq would not have spent 3.something
> million dollars to buy the name "www.altavista.com".  QED.

Phooey.  Compaq did that because it wanted to control what
"www.altavista.com" means.  It had to pay that money to the previous
leaseholder because of certain rights that are granted by InterNIC to
leaseholders of names in the Internet Domain Name Namespace.  One of
those rights is the right to continue to lease a name that one has
leased uninterruptedly from InterNIC; InterNIC simply doesn't allow
domain names to be hijacked.  But that's just an InterNIC rule, and
InterNIC writes the rules, because InterNIC (or Network Solutions
Inc. or whoever makes the rules) is the real owner.  Consider also the
fact that if you fail to pay the rent on "drmacro.com", it reverts
entirely to InterNIC, whereupon InterNIC will be happy to lease it to
anyone on a first-come, first-served basis.  That's not ownership:
there's only limited control, and there's no permanence.

The US trademark namespace is similar, except that ownership of this
particular namespace belongs to the American people.  If you don't use
a US trademark, you lose it.  Similarly, if you don't defend it
against infringement, you lose it.  Staying in business and paying for
legal defense is not the same kind of rent that you pay to InterNIC
for "drmacro.com", but it's still an ongoing upkeep expense, and you
don't control the rules under which you are allowed to continue
enjoying the privileges of trademark "ownership".  Those rules are
kind of peculiar, in fact.  As I understand them (and I'm no lawyer so
don't count on this information) tomorrow, for example, I could open a
business under the trademark "Micropolis", and I wouldn't have to pay
a dime to the liquidators of the now-defunct disk drive maker.
Micropolis is out of business, so the name "Micropolis" is up for
grabs.  Therefore, the name "Micropolis" cannot be considered to have
been "owned" by Micropolis in any ordinary sense.  (Don't try this at
home, BTW.  You'd be asking for all kinds of trouble if you created a
new business under the Micropolis trademark, but that's another
question.)

"Ownership" is a very strong term -- way too strong for the
arrangement under which you control "drmacro.com" in the Internet
Domain Name namespace.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 04:27:38 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
  maps)
In-Reply-To: <199809282235.RAA02033@bruno.techno.com>
References: <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
Message-ID: <3.0.5.32.19980928212636.00958100@dns.isogen.com>

At 05:35 PM 9/28/98 -0500, Steven R. Newcomb wrote:
[...]

>> Thus, I own the name space.
>
>If you owned the Internet Domain Namespace, you'd be collecting the
>money, not InterNIC.  So you must mean that you own the hierarchy of
>namespaces of which the namespace whose own name is "drmacro.com" is
>the root.  

Yes.

>> I can argue that I also own the names within that name space because
>> I also control the machine that has the resources that those names
>> will map.
>
>Exactly right, except for the first "also".

I own the name space "drmacro.com" and the names within it. That's what I
meant.

>> So I maintain my assertion that names, not just name spaces, are
>> ownable things.
>
>If you own the namespace, you own the names that are in it.  If not,
>not.  The fact that you own a namespace says nothing about whether or
>not you own a namespace containing a name of the namespace that you
>own.

Sure. This is true despite the fact that one particular name space granting
authority doesn't grant full ownership in the names it manages (just as the
government can sell you land or lease it). Other authorities do grant full
rights in names (e.g., the ISO 9070 registration authority, the ISBN, etc.).

>"Ownership" is a very strong term -- way too strong for the
>arrangement under which you control "drmacro.com" in the Internet
>Domain Name namespace.

I think we agree that there are at least two forms of control over a thing:
outright ownership and leasing/licensing.  Both afford you control, but
ownership affords more control.  But for this discussion, I think they come
to the same thing.

Whether the InterNIC should lease or sell names is a matter for society to
work out, I think.  Lawyers, start your engines....

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 04:37:34 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
  maps)
In-Reply-To: <199809290010.TAA02038@bruno.techno.com>
References: <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <360FE124.E2FCF5C6@locke.ccil.org>
Message-ID: <3.0.5.32.19980928213649.0096e210@dns.isogen.com>

At 07:10 PM 9/28/98 -0500, Steven R. Newcomb wrote:
>> From: John Cowan <cowan@locke.ccil.org>
>> Consider John 3:14 (in the KJV version, to be concrete). 
>> What is an FPI I can use for it?  I have the same unpalatable
>> alternatives: "-//John Cowan//NONSGML KJV John 3:14//EN", which
>> is a name I own but which is embarrassingly non-public, or
>> "-//King James I of England//NONSGML John 3:14", which belongs
>> to a man who is unlikely to register any names.
>
>What John said.  This is the problem we're trying to address with
>"public topics".

Again, I don't agree. How is "-//John Cowan//NONSGML KJV John 3:14//EN" any
different from "-//Some Name//NONSGML 12345ABCD//EN" if they both happen to
be mapped to the Bible verse John 3:14?  They're just arbitrary names. The
fact that your arbitrary name happens to contain a string that one might
guess, in the abscence of an explicit mapping, refers to a Bible verse, is
irrelevant.  I can't *know* it refers to a Bible verse until you provide a
mapping.  It could just as easily be a reference to a memo from Ken
Jeramiah Verhoven to John Cowan sent at 3:14.

Assigning your own names to things is just cataloging, nothing more.  If
the Dewey Decimal system had conformed to ISO 9070, all our library catalog
entries would be of the form:

-//Dewey::Catalog//DOCUMENT 301 Title, Author//EN

But Dewey doesn't own the books, just the cataloging system for them.

So why should you be denied the same opportunity to define a classification
scheme as Dewey?

Cheers,

E.

--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From amitr at abinfosys.com  Tue Sep 29 05:48:34 1998
From: amitr at abinfosys.com (Amit Rekhi)
Date: Mon Jun  7 17:05:09 2004
Subject: Repesention of table in an XML-DTD
Message-ID: <009c01bdeb5c$af7b8bb0$0101a8c0@server.abinfosys.com>

Hello,
            I was wondering what would be the best way to be repesenting a
large table (with 500+ entries) in an external subset of an XML file's  DTD
and selecting a row of the table in the XML file's internal DTD subset.

SCENARIO

            I have a table say TABLE CODE with 2 columns (Code Identifier
and Code Description) :-

                                                                    TABLE
CODE
                Code Identifier
Code Description

                        1
This represents the first code value in list
                        2
This represents second code value in list
                        .
                        .
                        (There are around 500-700 entries in this table)

            I have a requirement wherein :-

1)  I want to represent this large table above in an XML - DTD (preferbably
in the DTD's external subset since this table is to be accessed by many XML
files)

2) I have a set of XML files (say 10 XML files) whose external subsets point
to the file containing this table. Now in each of the XML files of the set,
I want to select a row of the table, in their internal subsets  i.e.

// table.dtd


From db at Eng.Sun.COM  Tue Sep 29 06:01:54 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:09 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
		<360C0929.A4DC3C69@locke.ccil.org>
		<199809302215.RAA02155@bruno.techno.com>
		<3.0.5.32.19980928103117.00966620@dns.isogen.com> <13839.50688.801484.233333@localhost.localdomain>
Message-ID: <36105ACE.995AACFB@eng.sun.com>

david@megginson.com wrote:
> 
> W. Eliot Kimber writes:
> 
>  > So I maintain my assertion that names, not just name spaces, are
>  > ownable things.  If this were not the case, Compaq would not have
>  > spent 3.something million dollars to buy the name
>  > "www.altavista.com".  QED.
> 
> If you wanted to express the point formally, you could say that each
> unique combination of
> 
>   { namespace, time }
> or
>   { namespace, time, name }
>
> is ownable.  Eliot owns drmacro.com at 00:00:01 GMT 28 September 1998
> (and for a while before and after).

Of course, I hope everyone understands that we're being
sloppy here when we say "namespace" by assuming we all
mean the same thing.  Eliot certainly doesn't own the
name (context) "drmacro.com" in any table I maintain;
while it may be true in a table NIC maintains, there's
no sense in attempting to keep people from defining their
own naming context ("namespace") and the names within
them ("drmacro.com" etc).

This is one of the things that makes naming discussions
go on, and on ... there are an infinite number of contexts
and it's more or less impossible to say things that can
be true for every possible context.  Time is only one
of the variables!

- Dave


> Vanity of vanities, dust to dust, and all that.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Tue Sep 29 06:10:09 1998
From: ricko at allette.com.au (Rick Jellife)
Date: Mon Jun  7 17:05:09 2004
Subject: XML and Objects
References: <199809280344.UAA13557@websales.com> <13839.29384.260466.681001@localhost.localdomain>
Message-ID: <36105F27.5193168E@allette.com.au>


david@megginson.com �g�D�G

>  This sounds like a wise choice.  XML packaging is a problem that the
> W3C XML Activity has not yet addressed -- experimentation and
> implementation experience will be very helpful to them when the time
> comes (after all, that's one of XML-DEV's greatest strengths).

The XML WG made the definite choice to avoid specifying a packaging solution for
MIME, when Murata and Whitehead put together the RFC for text/xml and
application/xml.

I think this was because:

* they wanted to provide the base-level MIME types first and fast: HTML has gone
far with just text/html;

* the URLs locate resources (i.e. entities) not documents per se: so just a MIME
type for entities is not inappropriate;

* there was considerable fuss with text/sgml (one side says that the document
should be parsed first, and only the entities that are actually referenced should
be sent, in the order they are required; the other side says the server should be
able to bundle anything it wants into the package, and that no transitive closure
is required) and the WG needed to sidestep it;

* multipart XML documents can be sent using text/sgml (oops I am doing this from
distant memory...) anyway, if you send the appropriate SGML declaration with it;
so there is already something available, even if it is not optimal;  text/sgml
lets you ship SOCATs too I think;

* in any case, packaging is more appropriate for email than browser delivery, and
XML was "SGML on the Web" not "SGML over Email", so perhaps there is no strong
requirement for the WG to provide a solution, even if there is a gap;

* because SGML documents may be shipped with just public identifiers on entity
declarations, SGML documents may require a SOCAT file, so text/sgml needed to
look at the multipart issue--XML documents must have system identifiers on entity
declarations, hence they need not require a SOCAT file, hence the multipart issue
is peripheral to basic text/xml on MIME on HTTP.  If an XML document does use and
require a SOCAT, then the developer of the document system has to figure out how
to arrange it to work with MIME HTTP.

There has been some ISO discussion about this issue. I would be very interested
if anyone on XML-DEV has any fresh perspective on this.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 06:52:32 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:09 2004
Subject: DCD v DTD?
References: <199809280823.KAA09405@berlin.dvs1.tu-darmstadt.de>
Message-ID: <361066AE.BD443BFF@eng.sun.com>

Ron Bourret wrote:
> 
> There are a two major advantages to using XML syntax for schema information
> rather than DTDs.  The first is the availability of tools -- while there are
> plenty of tools around for manipulating XML files, there are few available for
> manipulating DTDs.

This applies similarly to APIs ... any of the "schema in XML"
proposals work with something like DOM Level 1 Core, but none
of the DTD support works with such an API.  If you want to be
writing an editor, you could define your schema framework and
get going right away (unless you wanted to wait for something
more standard) using standard APIs.  Not so with DTDs.

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 09:36:10 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:09 2004
Subject: XML and Objects
References: <199809280344.UAA13557@websales.com> <13839.29384.260466.681001@localhost.localdomain>
Message-ID: <36108C1F.DC6F482E@eng.sun.com>

david@megginson.com wrote:
> 
> Gregory M. Messner writes:
> 
>  > I am leaning towards multipart/related, but would like to know of
>  > others experience in this area.
> 
> The key is to use a streaming protocol so that you can start
> processing the first files while the rest are arriving.  ZIP is
> useless for this purpose, since it keeps the directory information at
> the end; TAR is good, as (I think) is CPIO.

The Java ARchive (JAR) format has manifests at the very beginning,
so that the correct digital signatures can be computed during download.
This can be used with non-Java systems, but nowadays I have the worst
time understanding why anyone would work with any other system!  ;-)


>  > 2) We desire to provide an API on the client side which exposes a
>  > simple mechanism for creating and modifying objects. These objects
>  > are serialized using XML and then transported to a server for
>  > further processing. The server then responds with another XML
>  > document that we then de-serialize into an object and present it to
>  > the API user. Here are some basic requirements:
>  >
>  >     * Support for both Java and C++
>  >     * API must be similar for both Java and C++
>  >     * Object members are accessed via get/set methods
>  >     * Adhere to JavaBean method naming patterns
> 
> The DOM would be a pretty close (and obvious) fit, and has the
> advantage of being very close to W3C Recommendation.

Yes, but ... it depends on what's meant by serialize/deserialize
though.  There are quite a few options there (in just XML systems,
four come quickly to mind!) so more info may be helpful.  DOM does
not say how to read or write DOM objects, and nodes don't really
have get/set method naming for "data" (just DOM info).  Other sorts
of solution may be closer to what Gregory wanted.

For example, in Java it's (way) easy to put together something that
can "serialize" (not in the "java.io" sense though) beans like:

	<BEAN CLASS="com.example.foo.SimpleBean">
	    <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
	    <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
	    ...
	    </BEAN>

Then reading it back in Java is a case of taking the "CLASS" tag
and instantiating, then assigning properties.  In C++ it'd need a
table associating that class with some custom generated C++ stuff.
Plus of course there are corner cases like wanting to emit strings
containing characters that are not legal XML -- formfeed, BEL, and
so on.  (That'd be one reason why when I did such stuff, I didn't
use DCD.)  Reflection makes stuff like that rather simple to do;
you can use custom generated code, but don't need to.

That particular solution doesn't require DOM at all. 

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From stevew at access.com.au  Tue Sep 29 10:02:46 1998
From: stevew at access.com.au (Steve Withall)
Date: Mon Jun  7 17:05:09 2004
Subject: XML and Objects
Message-ID: <3.0.32.19980929180436.00a94db8@pop.access.com.au>

At 00:28 29/9/98 -0700, David Brownell wrote:
>For example, in Java it's (way) easy to put together something that
>can "serialize" (not in the "java.io" sense though) beans like:
>
>	<BEAN CLASS="com.example.foo.SimpleBean">
>	    <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
>	    <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
>	    ...
>	    </BEAN>
>
>Then reading it back in Java is a case of taking the "CLASS" tag
>and instantiating, then assigning properties.  In C++ it'd need a
>table associating that class with some custom generated C++ stuff.
>Plus of course there are corner cases like wanting to emit strings
>containing characters that are not legal XML -- formfeed, BEL, and
>so on.  (That'd be one reason why when I did such stuff, I didn't
>use DCD.)  Reflection makes stuff like that rather simple to do;
>you can use custom generated code, but don't need to.
>
>That particular solution doesn't require DOM at all. 
>
>- Dave
>
Dave,

The problem I have with this approach is that it limits you to specifying just a single class. Surely in the general case one wants to be able to use an XML element to represent some sort of 'thing' (avoiding the word object), and it should be possible for multiple applications to use this XML document, each one possibly wishing to instantiate the 'thing' using a different class.

I'd prefer the identification of which class a particular application should use for a particular type of element to be external (using DCD, for example). The document itself then remains 'purer', uncluttered with this application-specific information.

Also, I assume (hope!) you're using the element name 'BEAN' just as an example, and that in practice you'd use 'meaningful' element names. This would, however, make the use of 'PROPERTY' attributes more problematical.

Cheers, Steve.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From maillist at chris.hubick.com  Tue Sep 29 10:29:20 1998
From: maillist at chris.hubick.com (Chris Hubick)
Date: Mon Jun  7 17:05:10 2004
Subject: XML and Objects
In-Reply-To: <36108C1F.DC6F482E@eng.sun.com>
Message-ID: <Pine.LNX.3.96.980929011549.23341B-100000@chris.hubick.com>


On Tue, 29 Sep 1998, David Brownell wrote:

> david@megginson.com wrote:
> > processing the first files while the rest are arriving.  ZIP is
> > useless for this purpose, since it keeps the directory information at
> > the end; TAR is good, as (I think) is CPIO.
> 
> The Java ARchive (JAR) format has manifests at the very beginning,
> so that the correct digital signatures can be computed during download.

	A JAR file is a Zip file.  You can change .jar to .zip and load
it on up in WinZip, or any other zip program.  The manifest is placed in a
special directory in the zip file.  I actually mapped the .jar extension
to WinZip.  I just jar up all my package directories with source, and
Winzip allows me to easily do a wildcard delete on all *.java files :-)

---
Chris Hubick
mailto:chris@hubick.com
http://www.hubick.com/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Tue Sep 29 11:47:42 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
Message-ID: <003c01bdeb8e$d0c69300$1e09e391@mhklaptop.bra01.icl.co.uk>


>Eliot owns drmacro.com at 00:00:01 GMT 28 September 1998
>(and for a while before and after).
>
To be a little more precise, he owns the internet domain
name drmacro.com. He doesn't necessarily own the UK trade
mark drmacro.com, or the Spanish vehicle registration plate
drmacro.com, etc...

**There is no global namespace**

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Marc.Sacoolas at Eng.Sun.COM  Tue Sep 29 15:21:36 1998
From: Marc.Sacoolas at Eng.Sun.COM (Marc Sacoolas)
Date: Mon Jun  7 17:05:10 2004
Subject: Delete if you are not administrating major domo
Message-ID: <199809291319.GAA26321@loa.eng.sun.com>

Please take me off.  I tried many times.  Says 'Marc Sacoolas' not subscribed.

Thanks.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mark at conveyor.com  Tue Sep 29 15:30:15 1998
From: mark at conveyor.com (Mark Baker)
Date: Mon Jun  7 17:05:10 2004
Subject: XML and Objects
References: <199809280344.UAA13557@websales.com>
	 <13839.29384.260466.681001@localhost.localdomain> <36108C1F.DC6F482E@eng.sun.com>
Message-ID: <3610E0F4.C873DBD7@acm.org>

David Brownell wrote:

>         <BEAN CLASS="com.example.foo.SimpleBean">
>             <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
>             <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
>             ...
>             </BEAN>
>
> Then reading it back in Java is a case of taking the "CLASS" tag
> and instantiating, then assigning properties.  In C++ it'd need a
> table associating that class with some custom generated C++ stuff.
> Plus of course there are corner cases like wanting to emit strings
> containing characters that are not legal XML -- formfeed, BEL, and
> so on.  (That'd be one reason why when I did such stuff, I didn't
> use DCD.)  Reflection makes stuff like that rather simple to do;
> you can use custom generated code, but don't need to.
>
> That particular solution doesn't require DOM at all.

I'd personally like to see Java packages mapped to namespaces in some manner,
thereby allowing us to do away with Java-specific structures, and just stick to
the content, ala (ignoring the namespace stuff for the moment - I haven't looked
at them recently);

<SimpleBean>
  <prop1 DCD:i4>49</prop1>
  <prop2 DCD:string>hello world</prop2>
</SimpleBean>

I think the most important goal of bidirectional Java/XML interop is in going
*from* XML *to* Java, not the other way around.  As such, asking document authors
to follow a Bean-specific DTD isn't such a good idea.  Network effects are your
friend! 8-)

MB


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 15:43:48 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic
  maps)
In-Reply-To: <003c01bdeb8e$d0c69300$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.5.32.19980929084259.0087f1a0@dns.isogen.com>

At 10:52 AM 9/29/98 +0100, Michael Kay wrote:
>
>>Eliot owns drmacro.com at 00:00:01 GMT 28 September 1998
>>(and for a while before and after).
>>
>To be a little more precise, he owns the internet domain
>name drmacro.com. He doesn't necessarily own the UK trade
>mark drmacro.com, or the Spanish vehicle registration plate
>drmacro.com, etc...

I think that to be most accurate, I must say that I have some set of
property rights in the Internet domain name "drmacro.com".  Because it is
leased or rented to me, I don't own it in the strict legal sense (to the
degree that I understand that, which is basically to the degree my wife (an
ex attorney) explained it to me while we were walking the dog).

Of course I don't own the string "drmacro.com" in all possible contexts,
any more than the Coca Cola company owns the string "Coke" in all possible
contexts (as much as it might want to).

[Domain names have the interesting effect of flattening or unifying what is
otherwise a set of more or less distinct name spaces, namely trade names
used for particular types of product.  If domain names reflected the
different trademark name spaces, you'd have domains like
"coke.softdrinks.com" and "coke.steelmills.com".]

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From rbourret at dvs1.informatik.tu-darmstadt.de  Tue Sep 29 16:19:25 1998
From: rbourret at dvs1.informatik.tu-darmstadt.de (Ron Bourret)
Date: Mon Jun  7 17:05:10 2004
Subject: DCD: namespaces in element and attribute references
Message-ID: <199809291407.QAA09286@berlin.dvs1.tu-darmstadt.de>

When referring to attributes and elements in another namespace, DCD uses a 
prefixed element or attribute name. The prefix identifies the namespace, which 
corresponds to the value of the Namespace element in some external DCD. For 
example:

<ElementDef Type="foo">
   <Group RDF:Order="Seq">
      <Element>bar</Element>      <!-- bar is defined in this DCD -->
      <Element>baz:blah</Element> <!-- blah is defined in a separate DCD -->
   </Group>
</ElementDef>

How is the prefix associated with a namespace? With an xmlns:xxx attribute? If 
so, are there any adverse consequences to doing this? (I can't think of any 
right off.)

-- Ron Bourret

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep 29 16:38:30 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:10 2004
Subject: XML and Objects
Message-ID: <000f01bdebb7$0370ab60$ab026982@thing1.camb.opengroup.org>

Shouldn't we be trying, instead, for a way to convert XML documents 
which conform to any pre-existing markup language to a tree
of application-specific objects?

Bill

>>         <BEAN CLASS="com.example.foo.SimpleBean">
>>             <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
>>             <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
>>             ...
>>             </BEAN>


><SimpleBean>
>  <prop1 DCD:i4>49</prop1>
>  <prop2 DCD:string>hello world</prop2>
></SimpleBean>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 29 16:51:50 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
	  maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com> <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
Message-ID: <3610F3E4.E18B29A6@locke.ccil.org>

W. Eliot Kimber scripsit:

> >"-//John Cowan//TOPIC Spencertown, N.Y." [...] suggests
> >that *my* Spencertown is meant, and I do not mean *my* Spencertown,
> >but *the* Spencertown, the one that appears on the maps.
> 
> No no no. If by "-//John Cowan//TOPIC Spencertown, N.Y." you mean a small
> town in the state of New York (United States of America) commonly known as
> "Spencertown", then that is fine . There is nothing in
> that FPI that suggests that you are claiming ownership of Spencertown, any
> more than the Library of Congress issuing a catalog numbers suggests
> ownership of the books cataloged.

I was too elliptical.  By "*my* Spencertown" I meant, not the Spencertown
I own, but rather the thing that I (idiosyncratically) call
"Spencertown".  In other words, "-//John Cowan/DOCUMENT RDF Made Easy//EN"
refers to the thing that *I* call "RDF Made Easy", but when I use
the name "Spencertown" I do not merely mean "whatever *I* call
'Spencertown'" but rather the town that is *commonly*, *customarily*,
so called.

> (except that a town is not a topic, so the
> public text class is incorrect--it should be NONSGML)

Ah, then I don't know what a TOPIC is.  By "topic" I mean "subject
of discussion", as in "our current topic is the proper use of FPIs."
If we were talking of Spencertown --- for example, if I told you
that the country store there was owned by Tom Reamer --- then
Spencertown would be our current topic.  In the non-hypothetical world,
"Spencertown" is one of our topics (i.e. the name of the town), but
Spencertown is not.

> Now, if by "-//John Cowan//TOPIC Spencertown, N.Y." you mean "the idea of
> place called 'Spencertown' as expressed by John Cowan", then the FPI refers
> to the topic that you happen to own (by having expressed your ideas about
> this town (and the public text class is correct).

This looks like a use/mention distinction, but I do not grasp its
applicability here.  Also, I do not know what an "idea" is in formal
language.

> If what is wanted is a way to refer to places by FPI in a way that is
> authoritative, then I suggest asking the U.S. Geological Survey or the CIA
> or some UN agency to register a public owner identifier and define an
> algorithm for getting from their published (on paper) identifiers for
> places to syntactically valid FPIs (or URNs of any sort).

But that's the trouble.  Spencertown, I repeat, has no official existence:
it is not defined by any registry, but by common acceptation.

> For example, I might expect something like this:
> 
> +//IDN us.gov::Geological Survey::places//NONSGML
> municipality::Spencertown::New York::USA//EN

Spencertown is not a "municipality": it is neither a Town, a Village,
a City, or an Indian Reservation, which classes exhaustively specify
New York State local entities.  It is simply a region, part of the
(official) Town of Austerlitz, that people have agreed to desginate
by that name.

> But lacking a cataloging agency and either assigned names or a
> deterministic algorithm for generating names from some other classification
> scheme, there's not much you can do.

That does not mean that there *should* not be anything you can do.
In such contexts as this, we need a way to reify the concept of
a "public domain name".

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Alex.Thomas at dresdner-bank.com  Tue Sep 29 17:06:03 1998
From: Alex.Thomas at dresdner-bank.com (Thomas, Alex)
Date: Mon Jun  7 17:05:10 2004
Subject: XML and Objects
Message-ID: <D1E51D8F695BD111979E00805FFE2753BC7F78@DRKBLONC0102>

I'd also like to know if there's a reason a bean couldn't use specific
property names, 

>	    <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>

becoming

	    <PROP2>hello world</PROP2>

(I guess an implicit DCD is OK too). 

I don't have a problem with bean XML data specifying a specific class -
after all, it's just another layer and it's quite reasonable for people to
choose to conform to that layer (the bean interface) rather than the XML
data level. Even if there are multiple implementations of that bean, a
single 'class name' should be sufficient - it could either be a pointer to a
spec. for multiple alternative classes (e.g. one C++, one Java) or a spec.
unifying them at some higher level (using IDL for instance).

cheers
Alex


-----Original Message-----
From: Steve Withall [mailto:stevew@access.com.au]
Sent: 29 September 1998 09:05
To: XML Developers' List
Subject: Re: XML and Objects


At 00:28 29/9/98 -0700, David Brownell wrote:
>For example, in Java it's (way) easy to put together something that
>can "serialize" (not in the "java.io" sense though) beans like:
>
>	<BEAN CLASS="com.example.foo.SimpleBean">
>	    <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
>	    <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
>	    ...
>	    </BEAN>
>
>Then reading it back in Java is a case of taking the "CLASS" tag
>and instantiating, then assigning properties.  In C++ it'd need a
>table associating that class with some custom generated C++ stuff.
>Plus of course there are corner cases like wanting to emit strings
>containing characters that are not legal XML -- formfeed, BEL, and
>so on.  (That'd be one reason why when I did such stuff, I didn't
>use DCD.)  Reflection makes stuff like that rather simple to do;
>you can use custom generated code, but don't need to.
>
>That particular solution doesn't require DOM at all. 
>
>- Dave
>
Dave,

The problem I have with this approach is that it limits you to specifying
just a single class. Surely in the general case one wants to be able to use
an XML element to represent some sort of 'thing' (avoiding the word object),
and it should be possible for multiple applications to use this XML
document, each one possibly wishing to instantiate the 'thing' using a
different class.

I'd prefer the identification of which class a particular application should
use for a particular type of element to be external (using DCD, for
example). The document itself then remains 'purer', uncluttered with this
application-specific information.

Also, I assume (hope!) you're using the element name 'BEAN' just as an
example, and that in practice you'd use 'meaningful' element names. This
would, however, make the use of 'PROPERTY' attributes more problematical.

Cheers, Steve.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


Alex Thomas
Dresdner Kleinwort Benson
London EC3P 3DB

Alex.Thomas@Dresdner-Bank.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ken at bitsko.slc.ut.us  Tue Sep 29 17:13:47 1998
From: ken at bitsko.slc.ut.us (Ken MacLeod)
Date: Mon Jun  7 17:05:10 2004
Subject: XML and Objects
In-Reply-To: <3610E0F4.C873DBD7@acm.org> from "Mark Baker" at Sep 29, 98 09:30:28 am
Message-ID: <199809291503.KAA13902@bitsko.slc.ut.us>

A non-text attachment was scrubbed...
Name: not available
Type: text
Size: 1949 bytes
Desc: not available
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980929/96fefb00/attachment.bat
From cowan at locke.ccil.org  Tue Sep 29 17:27:57 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
	  maps)
References: <360FE124.E2FCF5C6@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	 <360FE124.E2FCF5C6@locke.ccil.org> <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
Message-ID: <3610FC5E.FAB1599F@locke.ccil.org>

W. Eliot Kimber wrote:

> Again, I don't agree. How is "-//John Cowan//NONSGML KJV John 3:14//EN" any
> different from "-//Some Name//NONSGML 12345ABCD//EN" if they both happen to
> be mapped to the Bible verse John 3:14?  They're just arbitrary names.

True in principle, not true in fact.  The man who uses the word
"glory" in "There's glory for you!" to mean "There's a nice knock-down
[i.e. compelling] argument for you!" is likely to be called, or even
to *be*, a Humpty Dumpty.

Names have *content*, contrary to theory.  To use an example I have used
elsewhere, you may not know which dog the name "Fido" refers to, but you
know (if you understand English onomastics at all) that it refers to some
dog.  Likewise, "Jane" refers to some female human.

> But Dewey doesn't own the books, just the cataloging system for them.
> 
> So why should you be denied the same opportunity to define a classification
> scheme as Dewey?

I shouldn't.  But often I don't *want* to use idiosyncratic names
for things, as if I were speaking Chinese (in Chinese: "as if my
words were a Buddha, twelve feet high, that cannot be understood").
What shall we do in order to specify the names that no one controls?

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 17:48:26 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
  maps)
In-Reply-To: <3610F3E4.E18B29A6@locke.ccil.org>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
Message-ID: <3.0.5.32.19980929104734.008fd660@dns.isogen.com>

At 10:51 AM 9/29/98 -0400, John Cowan wrote:

>Spencertown is not a "municipality": it is neither a Town, a Village,
>a City, or an Indian Reservation, which classes exhaustively specify
>New York State local entities.  It is simply a region, part of the
>(official) Town of Austerlitz, that people have agreed to desginate
>by that name.
>
>> But lacking a cataloging agency and either assigned names or a
>> deterministic algorithm for generating names from some other classification
>> scheme, there's not much you can do.
>
>That does not mean that there *should* not be anything you can do.
>In such contexts as this, we need a way to reify the concept of
>a "public domain name".

I don't buy it. How do you know that there's this part of Austerlitz called
"Spencertown"? The fact that people call it that must be written down
somewhere reasonably authoritative (I could even use your posts on the
subject at the authority--they're certainly reliably addressible by
reference to the XML Dev archive) or else there is some person who is that
authority (it could be John himself).  There must be a map somewhere that
describes what Spencertown is, or at least what the concensus of it is.  If
there's not, and you need to refer to it, then you would need to create the
authority: create a Web page titled "Spencertown, an unofficial part of the
Town of Austerlitz", with a map and a description of the place, then refer
to it.  If there's no existing authority, then any authority will do.

This type of thing, a thing for which there is no well-defined authority
(because the boundaries of the thing are defined only by common usage and
opinion, not by some governing authority) is an interesting case because,
in a very real sense, every person may have a different definition of what
they think the thing is. That's why I stress that the topic is the *idea*
of Spencertown. You have your understanding of what it is, other people
have theirs. Because the thing is not defined, there can be no single
definition of it. Therefore, the topic "Spencertown" is *your opinion* (or
someone else's opinion) about what Spencertown is.  At best, your authority
is the list of people who share your definition of Spencertown ("everybody
just knows it--well who's 'everybody'?").

Another example would be the topic "Baby Boomer".  The designation "Baby
Boomer" has no authoritative definition (although it may have many formal
definitions). In my experience, no two people I've talked about share the
same definition of what a Baby Boomer is, although we all agree that there
is a Baby Boom and there are Baby Boomers.  So what is the topic "Baby
Boom"? Is it the name "Baby Boom"? Is it the idea of a group of people born
after World War II but before some other point in time? Is it the set of
all people who are baby boomers? Is it some particular statistician's
definition of what the Baby Boom is?

So if you, in the creation of a topic, want to point to the topic "Baby
Boomers", you're going to have to define what that topic means to you, if
only by writing a paragraph or two outlining *your definition* of what the
Baby Boom is. We do this commonly in writing, e.g. "By the term 'baby boom'
*I mean* people born between 1945 and 1962". This writer has assigned the
topic name "baby boom" to the set of all people born between 1945 and 1962.
Another writer might define baby boom as "The set of all people born to
parents old enough to fight in World War II".  Do we have one topic or two?
I can see arguments for both: if unqualified, the term "baby boom" has to
refer to all definitions of the term. If you want to refer to a single
definition, you have to qualify it: "baby boom as defined by author A".

Maybe this is what Steve and John are looking for: an algorithm for saying
"This name is a query over all topics that include this name".  So if I say:

-//All Possible Topics//TOPIC baby boom//EN

It's a query against all things named as topics whose object identifier
includes the words "baby boom" (remembering that FPIs are normalized into
word tokens for comparison).  If I say:

+//IDN drmacro.com::topics//TOPIC baby boom//EN

I must mean my personal definition of "baby boom".

Note that this approach still provides a resolution for the first FPI,
which is the list of FPIs that match the search criteria, which is really
the list of resources those FPIs address, which better be some resolvable
(or "researchable") definition of what each topic is.

Of course, one problem with this approach is how to know when a reference
to a topic is a query and when it resolves to a single resource. I suppose
that could be a function of the resolution mechanism provided by the FPI
name owner. For example, we could again imagine a topic cataloging service
with the registered owner name "topics.com":

+//IDN topics.com

Using normal URN resolution services, uses of this FPI would be directed to
the topics.com server, which could then perform the search listed above.
When the query returned the drmacro.com FPI, that FPI would be directed to
the drmacro.com server for resolution, which would then return whatever it
maps to, say a document defining what I mean by baby boom.

But there's still a problem, I think. Because if I just point to the
"public topic" named "baby boomer" where did I get that name from to know
to use it? There must be something I can point to that is the place or
places I came to understand both that there is a thing called "baby boomer"
and that most people at least recognize the term, if not agree on what it
means.  This might be a magazine article, a news report, or whatever, but
there has to be something. Which means that there is always something I can
point at, even if it's only as a bibliographic reference ("the term 'baby
boom' was first used in an article by blah blah blah").  Which suggests
that no matter how fuzzily defined your topic, there is always some form of
"authority" that you can point to to serve as some form of definition.  It
could even be "call anybody in the Austerlitz phone book and ask them what
'Spencertown' is, they'll tell you."

So let me stress my key point again: there is no such thing as a "public
topic" with no resource. If authors of topic maps need to refer to things
as topics that are outside of their maps, there must be a mapping from the
name of the topic to its definition. If this mapping doesn't already exist,
then the topic map author must provide it, in the ways I've shown in this
post and in others.

If the topic map standard wants to define conventions for forming names of
public topics such that their resolution can be by application of a
deterministic algorithm rather than through an explicit mapping, that's
fine too. There are any number of existing classification schemes that such
a convention could taken advantage of.

For example, it would probably make sense to use Library of Congress
numbers as a primary form of classification, so that an FPI like:

-//doesn't matter//TOPIC LOC::TZ345//EN

Is a reference to whatever subject 'TZ345' is within the LoC classfication
scheme.

The Topic Map standard could even provide a subportion of its FPI name
space for such names by defining what classification schemes are allowed,
thus ensuring that blind creation of names will not result in clashes. For
example, it might say 'within the FPI name space idenfied by the registered
owner identifier "ISO/IEC 13250", and the object class "TOPIC", topics can
be identified by object identifiers of the form
"classification_scheme::identifier" where "classification_scheme" is the
name of a classification scheme as defined in this standard and
"identifier" is a scheme-defined subject or classification identifier,
e.g., a Library of Congress subject code, a DSM3 disease code, etc.'.

Of course, one problem here is the need to fix the names of classification
schemes in the standard (I suppose the standard could be ammended any time
a useful new classification scheme is established). What's really needed is
a name space of registered classification scheme names.  But it would be a
start.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 29 17:52:26 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
References: <003c01bdeb8e$d0c69300$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3611021C.2423E21E@locke.ccil.org>

Michael Kay wrote:

> **There is no global namespace**

Maybe not.  But the set of namespaces had better be strongly
connected, or we're going to have trouble referring from over
*here* to over *there*.

(What would a namespace not strongly connected to the rest
be like?  Sort of like colors of the ultraviolet?)

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 18:07:02 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:10 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
  maps)
In-Reply-To: <3610FC5E.FAB1599F@locke.ccil.org>
References: <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
Message-ID: <3.0.5.32.19980929110530.008d8d30@dns.isogen.com>

At 11:27 AM 9/29/98 -0400, John Cowan wrote:
>W. Eliot Kimber wrote:
>
>> Again, I don't agree. How is "-//John Cowan//NONSGML KJV John 3:14//EN" any
>> different from "-//Some Name//NONSGML 12345ABCD//EN" if they both happen to
>> be mapped to the Bible verse John 3:14?  They're just arbitrary names.
>
>True in principle, not true in fact.  The man who uses the word
>"glory" in "There's glory for you!" to mean "There's a nice knock-down
>[i.e. compelling] argument for you!" is likely to be called, or even
>to *be*, a Humpty Dumpty.

>Names have *content*, contrary to theory.  To use an example I have used
>elsewhere, you may not know which dog the name "Fido" refers to, but you
>know (if you understand English onomastics at all) that it refers to some
>dog.  Likewise, "Jane" refers to some female human.

I think you're inappropriately conflating names for beings with names for
objects. Names we, as humans, give to things, have content because we use
words for them that have meaning.  But that meaning is added value--it
doesn't nothing to make the name more or less useful as an indirect pointer
to an object.  The best it can do is provide clues about what the name
might refer to, but those will be, at best, clues and they can't be
dependended upon in the general case.

Because of my cultural background, I know that "fido" is a name often used
for dogs and Jane is a name often used for female humans. I also know that
"fido" derives from "fidelis" or "fidelity", meaning "loyal", which is a
species traight of dogs, which makes "fido" an appropriate name for dogs.
But I also know that "fido" is not restricted to the naming of dogs:

[Telephone search
<http://people.yahoo.com/py/psPhoneSearch.py?Pyt=Tps&YY=627>, last name
"fido":

A Fido 
Madison Hts, MI 48071-5908 

Beata S Fido 
Chicago, IL 60607 

C Fido 
Las Vegas, NV 89125 

...] (I also got 200 hits on "smith, fido")

So if you tell me: "you'll meet fido tonight", I might expect to meet your
dog, but I could just as easily meet you friend C. Fido.

So though we might embue names with meaning, we can't, in the general case,
*depend* on that meaning to predict with any certainty what the name refers
to [pop quiz: when I refer to "Manifest Destiny", do I mean the
19th-century American attitude toward the West or do I mean my sister's
hermit crab?].

So again, I don't buy it.  But see my other post for a solution to the
requirement that doesn't depend on cultural knowledge of the typical
meaning of the names used.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 29 18:23:56 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
	  maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com> <3.0.5.32.19980929104734.008fd660@dns.isogen.com>
Message-ID: <36110955.C02C6FA3@locke.ccil.org>

W. Eliot Kimber wrote:

> I don't buy it. How do you know that there's this part of Austerlitz called
> "Spencertown"?

By consensus.

> The fact that people call it that must be written down
> somewhere reasonably authoritative (I could even use your posts on the
> subject at the authority--they're certainly reliably addressible by
> reference to the XML Dev archive) or else there is some person who is that
> authority (it could be John himself).

I am certainly no authority on this subject.  I know what other people
mean to refer to when they refer to Spencertown, that's all.  But this
is not even a definition: it's circular, like "By 'Socrates', I mean
the person I intend to call 'Socrates'".

I could *make* myself into a fiat authority, but the actions of a 
pure fiat authority are arbitrary, and I do not wish to be arbitrary.

> There must be a map somewhere that
> describes what Spencertown is, or at least what the concensus of it is.

The various maps and so on are not authoritative either.  A map *describes*,
it does not constitute an authority.

> If there's no existing authority, then any authority will do.

It is certainly true that an authority can arise by consensus: the
Internet Assigned Numbers Authority has only the support of consensus,
at least outside the U.S., but is a genuine authority.

> This type of thing, a thing for which there is no well-defined authority
> (because the boundaries of the thing are defined only by common usage and
> opinion, not by some governing authority) is an interesting case because,
> in a very real sense, every person may have a different definition of what
> they think the thing is.

In Humpty-Dumpty principle, yes.  In fact, those who talk of Spencertown
know what they are talking about, and those who don't, don't care.

> That's why I stress that the topic is the *idea*
> of Spencertown.

Then I still don't understand what an "idea" is.  When I assert:
"The Spencertown Country Store has a bulletin-board describing events
happening in Spencertown and nearby areas", I am talking about *Spencertown*,
not about some mental construct in someone's head.  My reference may be
vague, but it is a reference to something in the physical world.

> You have your understanding of what it is, other people
> have theirs. Because the thing is not defined, there can be no single
> definition of it.

But I can discuss, contra Socrates, what I cannot define.

> Therefore, the topic "Spencertown" is *your opinion* (or
> someone else's opinion) about what Spencertown is.  At best, your authority
> is the list of people who share your definition of Spencertown ("everybody
> just knows it--well who's 'everybody'?").

But I have no *definition* of Spencertown, and neither do they.  That does
not mean that it (the referent of "Spencertown") exists only in our minds,
like the concept "justice".
 
> So if you, in the creation of a topic, want to point to the topic "Baby
> Boomers", you're going to have to define what that topic means to you, if
> only by writing a paragraph or two outlining *your definition* of what the
> Baby Boom is.

Biologists can study living organisms without being able to define "life".

<anecdote>Sir Peter Medawar, the English biologist, once attended a
discussion on that very topic.  After several hours of wrangling, he
closed the discussion as follows:  <mot>I think we can all tell the
difference between a live horse and a dead one, and I suggest that
we cease to beat the latter.</mot></anecdote>

> Maybe this is what Steve and John are looking for: an algorithm for saying
> "This name is a query over all topics that include this name".  So if I say:
> 
> -//All Possible Topics//TOPIC baby boom//EN
 
No, that would be too mechanical.  After all, in the namespace
"-//Imp of the Perverse, Inc.", the topic "baby boom" could refer to
the contents of used diapers.  Despite the coincidence of names,
these two topics just don't belong together.

> But there's still a problem, I think. Because if I just point to the
> "public topic" named "baby boomer" where did I get that name from to know
> to use it? There must be something I can point to that is the place or
> places I came to understand both that there is a thing called "baby boomer"
> and that most people at least recognize the term, if not agree on what it
> means.

Not necessarily.  The man in the street, to repeat an example of Saul
Kripke's, may know that Feynman is (was) a great physicist, without knowing
a single parameter which would distinguish him from Gell-Mann.  Some people,
in fact, "know" Einstein as the inventor of the atomic bomb.  Does that
mean that when they refer to Einstein, they refer to nobody, since the
atomic bomb had no single "inventor"?

> So let me stress my key point again: there is no such thing as a "public
> topic" with no resource.

Perhaps not.  But there may not be an authoritative report.  If you
want to know how to spell "Freund", you can look in an authoritative
German dictionary, since the German language has such things, or you
can consult common acceptation.  But if you want to know how to spell
"friend", you have a problem.  You can go to an English dictionary, but the
makers of the dictionary explicitly disclaim authoritativeness, and tell
you that they report the common practice of writers and publishers.
They, if asked, will tell you that they spell words according to one
or more dictionaries!  Repeat loop.  It's a good thing for English
readers and writers that the loops generally converge quickly
(a so-called "Hartree-Fock solution").

In short:

	Some names have no authoritative definitions;
	Those names nevertheless have referents, defined by common
		acceptation;
	It is not adequate to say that if there is no authoritative
		authority, that a fiat authority will do.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep 29 18:27:00 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:11 2004
Subject: XML and Objects
Message-ID: <001c01bdebc6$1cd64ba0$ab026982@thing1.camb.opengroup.org>

>The second example above is ``internal'', the serialization uses
>class-specific elements, where class and member information are
>represented as XML elements.  Internal serialization is generally done
>according to the class definition (reflection or IDL), and often
>requires a stub or class-specific behavior.  Coins (if I understand
>correctly) is an example of the ``internal'' form.


I think there is a third possibility, where the objects in the DOM tree
are wrappers for application objects. These wrapper objects, then,
are responsible for connecting the data held by the DOM with the
processing capabilities of the application objects.

The wrappers serve as the glue that lets us preserve the reusability
of application components, allows them an independent inheritance,
and maintains their independence from the XML markup.

Creation of such wrappers could be driven by the XSchema for the
markup language and reflection on the application classes, but should
accept input to support non-obvious mappings.

This is what I'm trying to drive towards with coins.

Bill


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From avirr at LanMinds.Com  Tue Sep 29 18:37:24 1998
From: avirr at LanMinds.Com (Avi Rappoport)
Date: Mon Jun  7 17:05:11 2004
Subject: 10 September 1998 version of XML spec DTD and documentation
In-Reply-To: <3.0.1.16.19980926082019.4b877446@pop3.demon.co.uk>
References: <199809142030.QAA06791@doctools.com>
Message-ID: <v04011702b236bb745751@[207.33.50.55]>

At 8:20 AM -0700 9/26/98, Peter Murray-Rust wrote:

> Is anyone collecting XML DTDs? I seem to remember there was but forget
>whom...
>

CommerceNet is trying to do so at the XML Exchange: <htpp://www.xmlx.com>.

Avi
________________________________________________________________
Avi Rappoport, Web Site Search Tools Maven <mailto:avirr@lanminds.com>
Guide to Site Indexing and Local Search Engines: <http://www.searchtools.com>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Tue Sep 29 19:10:23 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
	  maps)
References: <360FE124.E2FCF5C6@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	 <360FE124.E2FCF5C6@locke.ccil.org>
	 <3.0.5.32.19980928213649.0096e210@dns.isogen.com> <3.0.5.32.19980929110530.008d8d30@dns.isogen.com>
Message-ID: <3611145F.58967A01@locke.ccil.org>

W. Eliot Kimber scripsit:

> But I also know that "fido" is not restricted to the naming of dogs:
> 
> [Telephone search snipped]

In re "Fido": concedo.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 29 19:15:36 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
  maps)
In-Reply-To: <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
	(eliot@dns.isogen.com)
References: <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <360FE124.E2FCF5C6@locke.ccil.org> <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
Message-ID: <199809291709.MAA03967@bruno.techno.com>

> Assigning your own names to things is just cataloging, nothing more.  If
> the Dewey Decimal system had conformed to ISO 9070, all our library catalog
> entries would be of the form:
> 
> -//Dewey::Catalog//DOCUMENT 301 Title, Author//EN
> 
> But Dewey doesn't own the books, just the cataloging system for them.
> 
> So why should you be denied the same opportunity to define a classification
> scheme as Dewey?

This is not the point.  Dewey doesn't conform to 9070, and yet, silly
me, I may still want to use Dewey.  Specifically, how can I use Dewey?
More generally, how do I use any arbitrary catalog to point to one of
the things that it catalogs?  That is the question I'm concerned
about.

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 29 19:16:29 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic 
	  maps)
In-Reply-To: <3610F3E4.E18B29A6@locke.ccil.org> (message from John Cowan on
	Tue, 29 Sep 1998 10:51:16 -0400)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com> <3.0.5.32.19980928171700.00958ba0@dns.isogen.com> <3610F3E4.E18B29A6@locke.ccil.org>
Message-ID: <199809291714.MAA03970@bruno.techno.com>

[John Cowan:]

> ...we need a way to reify the concept of
> a "public domain name".

If I understand what you're saying, Yes!

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 19:19:22 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
   maps)
In-Reply-To: <36110955.C02C6FA3@locke.ccil.org>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
 <3.0.5.32.19980929104734.008fd660@dns.isogen.com>
Message-ID: <3.0.5.32.19980929121602.0096b100@dns.isogen.com>

At 12:22 PM 9/29/98 -0400, John Cowan wrote:
[NOTE: this will be last post on this thread, as it's clear to me that John
and I are not communicating, probably because we have divergent definitions
of some key concepts. However, I feel I must answer John's note, at least
in an attempt to make my side of the argument clear.]

>W. Eliot Kimber wrote:
>
>> I don't buy it. How do you know that there's this part of Austerlitz called
>> "Spencertown"?
>
>By consensus.

Consensus of whom? If you can define the set of people who share an
understanding of what "Spencertown" is then you have served to define it,
because I can interrogate those people and get them to tell me what they
think the boundaries of Spencertown are.  From that I will have *a*
definition of what Spencertown is that is *as authoritative as it is
possible for such a definition to be* given that there is no formal
authority providing a definition. I don't see the problem with this. 

You seem to be saying "I want an authority, but there can be no authority,
so the problem is unsolvable, but I must have solution" and I'm saying
"there is an authority, so there is no problem."  The choice seems obvious
to me.

>> The fact that people call it that must be written down
>> somewhere reasonably authoritative (I could even use your posts on the
>> subject at the authority--they're certainly reliably addressible by
>> reference to the XML Dev archive) or else there is some person who is that
>> authority (it could be John himself).
>
>I am certainly no authority on this subject.  I know what other people
>mean to refer to when they refer to Spencertown, that's all.  

HOW DO YOU FRIGGIN' KNOW IT? If you know it, you must get the knowledge
from somewhere. The source of that knowledge is the authority. If you can't
name the authority (even if that authority is your neighbor or what you
read in the paper or the intersection of all the opinions you've gotten),
then you are lying when you say you know what other people mean. You may
know what you *think* other people mean.  I got into a big argument with my
friends over whether any of us were or were not baby boomers. It became
clear that none of us shared a common definition of what a baby boomer was.
 The phrase "I know what other people mean by..." is either a lie or a
dangerous assumption, because opinion on most things is usually much less
consistent than one might think.

                                                            But this
>is not even a definition: it's circular, like "By 'Socrates', I mean
>the person I intend to call 'Socrates'".

It's not circular at all.  If I ask you to define Spencertown for me, you
can define its boundaries on a map, possibly with the caveat that there is
difference of opinion about the exact borders, but you can define it.
Therefore, saying that "Spencertown is what John defines it to be" is just
an indirection that is resolved by making you tell us your definition.  If
you said "Spencertown is Spencertown", it's nonsense.  You may not know how
you got your notion of what Spencertown is, but you have a notion and can
communicate it in absolute terms by reference to an authority, such as a
map of Austerlitz within which you can address the region you call
Spencertown.

Or maybe "Spencertown" is a state of mind, or a way of life. You can still
enumerate the characteristics of those that distinquish Spencertown from
other such things.

If a thing can be named it can be defined, therefore all names resolve to
definitions of some sort. It is the job of the resolver to judge the
authoritativeness of the definition they get and the job of the namer to
choose mappings that are appropriately authoritative (or as authoritative
as they're capable of making them at the time).  There cannot be an
absolute authority for everything, or even for most things. 

>I could *make* myself into a fiat authority, but the actions of a 
>pure fiat authority are arbitrary, and I do not wish to be arbitrary.
>
>> There must be a map somewhere that
>> describes what Spencertown is, or at least what the concensus of it is.
>
>The various maps and so on are not authoritative either.  A map *describes*,
>it does not constitute an authority.

But a given map will either match or not match *your* understanding of what
Spencertown is, and therefore you can use it a fixed reference point to
define what Spencertown is, for you.

>> This type of thing, a thing for which there is no well-defined authority
>> (because the boundaries of the thing are defined only by common usage and
>> opinion, not by some governing authority) is an interesting case because,
>> in a very real sense, every person may have a different definition of what
>> they think the thing is.
>
>In Humpty-Dumpty principle, yes.  In fact, those who talk of Spencertown
>know what they are talking about, and those who don't, don't care.

Nonsense. If I move to Austerlitz and someone tells me "you should really
live in Spencertown", the first thing I have to do is ask them what
Spencertown is, at which point I'll get their definition of it.  I might
accept their definition at face value or I might say "by what authority to
do you use that definition?" and they might say "everybody knows that's
what Spencertown is", at which point I say fine.  But the next time I ask
somebody what Spencertown is, I'll compare their answer to the previous one
to see if they're similar. If they're not, I have to start choosing.

Thus, every person who has an idea of what Spencertown is is an
authority--the question is, how much weight do you give them?  

>> That's why I stress that the topic is the *idea*
>> of Spencertown.
>
>Then I still don't understand what an "idea" is.  When I assert:
>"The Spencertown Country Store has a bulletin-board describing events
>happening in Spencertown and nearby areas", I am talking about *Spencertown*,
>not about some mental construct in someone's head.  My reference may be
>vague, but it is a reference to something in the physical world.

[...]

>Biologists can study living organisms without being able to define "life".

No they cannot. There is a difference between defining what something *is*
and defining what a name *refers to*.  That's what this whole discussion
has been about. They must define the term "life" to mean something so that
they can distinguish things that are living from things that are not
living. That doesn't mean that they have an *absolute* definition of what
life is, it simply means that they've said "when I use the term 'life', I
mean things that exhibit the following properties...".  

[...]

>In short:
>
>	Some names have no authoritative definitions;
>	Those names nevertheless have referents, defined by common
>		acceptation;
>	It is not adequate to say that if there is no authoritative
>		authority, that a fiat authority will do.

Look, if there's an accepted authority, then you use it. If there's not,
then you say what your opinion is and why, and let readers judge for
themselves whether your opinion is a good one.  I don't see how it can be
otherwise.  If there is no existing authority for something then there is
nothing to be pointed to as an absolute authority so there's no point
worrying about the lack of one. Either your opinion will become the
authority (because other people agree with you and say so) or a better
authority will emerge from common usage.

Cheers,

E.

--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 19:22:18 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:11 2004
Subject: XML and Objects
References: <E0zNhzJ-0004Q2-00@punch.ic.ac.uk>
Message-ID: <36111325.B89413EE@eng.sun.com>

Arnold, Curt wrote:
> 
> Tim Bray wrote:
> >This can/should be built on top of the DOM, right?  -Tim
> 
> If on top of the DOM you mean that you would would completely populate
> the DOM, then build the corresponding objects, then I would tend to
> disagree.  I have had good luck and performance restoring objects from a
> large (>2MB) XML file responding to events from Expat.  If I first built
> an in-memory representation and then processed the information, I don't
> think that I could get nearly the same performance.  I would think an
> object creation and link resolution layer on top of SAX would be
> preferable.

That's one of the reasons that the early/experimental support for such
"XML Beans" in Sun's library features the ability to transform a DOM
tree as it's being built.  See:

    http://www.lists.ic.ac.uk/hypermail/xml-dev/9809/0642.html
    http://www.lists.ic.ac.uk/hypermail/xml-dev/9809/0645.html

DOM (enhanced) is one of the ways to do this sort of stuff.  I've
got a bias towards it since it means One Less API, but there are
those who would rather have a more flexible and lighter weight base
API.  (Committee designs are rarely flexible or light weight!)


> p.s. I've downloaded the Sun XML Early Access, but I can only find
> passing references to XML Beans.  Is there a specific document and/or
> source file that clarify what they mean by XML Beans. 

There's not a lot of stuff written about it; it's a simple (powerful)
idea, so quantity of info isn't the goal.  Also, it's not positioned
as being the "One True Way" -- just one of many ways to use JavaBeans
with XML.  I've seen similar mechanisms used in (un)marshaling data
for at least the last decade.  Many XML systems use similar ideas. 

I'd look at the "XML Beans" section of the package overview (javadoc)
for "com.sun.xml.tree", the "XmlDocumentBuilder" and "ElementNode" 
class docs (in that package), and the "GUI Demo", which uses two
different kinds of beans on the same document (well, same DTD anyway)
to display differently using the Swing JTree facility.  (And yet they
still write themselves out as normal XML text, without losing anything
except the DTD info which SAX discards.)

Of course, there are other ways to use this facility than to model data
for a rendering engine.  The javadocs for the DOM classes were done
with such beans (from XML sources).  And it's a good way to get started
on an XSL implementation.
 

>	 The two
> alternative interpretation of the term that I have contemplated are:
> 
> 1. Java Beans that modify the behavior of the parser
> 2. A serialization mechanism for Java Beans

I actually think the _parser_ should stick exclusively to XML 1.0
behaviour.  But you may be thinking of more than what, say, SAX does;
in my book, building a DOM tree is not a function of the parser.

There are quite a few ways of using beans and XML.  I talked with one
company that had identified seven different ways!  On XML-Dev I've
counted at least four (and wasn't trying hard ;-) and they mostly
come down to different ways to have the parser interact with code
to produce in-memory data structures, and then have those in-memory
data structures generate XML text.

In a separate note I sketched a "serialization" mechanism that didn't
rely on DOM at all ... but uses a specific DTD.  The current XML Beans
stuff can use an arbitrary DTD/Schema ... but relies on DOM for its
in-memory representation.  (Perhaps these two correspond to the options
you listed above.)

Other options involve dispatching some of the parsing events to the
beans (both SAXON and COINS do this now) rather than expecting the
beans to interpose on appendChild.  And there are many more options.

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 19:27:50 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:11 2004
Subject: XML and Objects
Message-ID: <36111763.8EC60A9A@eng.sun.com>

Alex Thomas asked:
> I'd also like to know if there's a reason a bean couldn't use specific
> property names, 
> 
> > <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
> 
> becoming
> 
> <PROP2>hello world</PROP2>

The version I sketched has a fixed DTD and you could validate
against it, while still sending arbitrary data.

If you have different element names for each property, you're
either saying that each bean has a different DTD, or that you
are not validating.

Tradeoffs!  That's three different ways to "serialize" in two
sentences ...

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 19:48:56 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:11 2004
Subject: XML [~serialization] and Objects
References: <199809280344.UAA13557@websales.com>
		 <13839.29384.260466.681001@localhost.localdomain> <36108C1F.DC6F482E@eng.sun.com> <3610E0F4.C873DBD7@acm.org>
Message-ID: <36111C97.50323015@eng.sun.com>

Mark Baker wrote:
> 
> David Brownell wrote:
> 
> >         <BEAN CLASS="com.example.foo.SimpleBean">
> >             <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
> >             <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
> >             ...
> >             </BEAN>
> 
> <SimpleBean>
>   <prop1 DCD:i4>49</prop1>
>   <prop2 DCD:string>hello world</prop2>
> </SimpleBean>

(A different way to use java.beans.Introspector ... not using
a predefined DTD to ensure validatability ...)


> I think the most important goal of bidirectional Java/XML
> interop is in going *from* XML *to* Java, not the other
> way around.

But then where would the XML content come from?  I don't think
you can solve one problem without the other.  If one assumes Java,
then DCD tagging isn't needed:

	<com.example.foo.SimpleBean>
		<prop1>49</prop1>
		<prop2>hello world</prop1>
		</com.sun.example.foo.SimpleBean>

since all the type info is in the class for SimpleBean!  Not
particularly validatable unless you generate a DTD from the
class file, which may be an issue.  (I'm assuming there that
most Java names are legal XML names, but there are surely
some exceptions.)


>	  As such, asking document authors
> to follow a Bean-specific DTD isn't such a good idea.  Network
> effects are your friend! 8-)

The question was originally about serialization, so I was
assuming that such documents wouldn't generally be written
by a document author so much as by a tool of some kind.
Also, which particular network effects to leverage?  ;-)


One more comment:  I really see "serialization" approaches
as being in a different direction than document based ones.

That is, if I'm working with an OBI/OTP/OFX DTD for some
sort of E-Commerce system, the external format is defined
already.  There are elements in the DTD that won't match
the model of a property.  If we associate elements with
beans in at least some cases, they'll need to be able to
choose which elements correspond to properties.  Also, they
need to arrange "appropriate" handling for non-property
elements (whatever that may be).

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Tue Sep 29 19:54:09 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
  maps)
In-Reply-To: <199809291709.MAA03967@bruno.techno.com>
References: <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
 <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <360FE124.E2FCF5C6@locke.ccil.org>
 <3.0.5.32.19980928213649.0096e210@dns.isogen.com>
Message-ID: <3.0.5.32.19980929125216.007e52f0@dns.isogen.com>

At 12:09 PM 9/29/98 -0500, Steven R. Newcomb wrote:

>> But Dewey doesn't own the books, just the cataloging system for them.
>> 
>> So why should you be denied the same opportunity to define a classification
>> scheme as Dewey?
>
>This is not the point.  Dewey doesn't conform to 9070, and yet, silly
>me, I may still want to use Dewey.  Specifically, how can I use Dewey?
>More generally, how do I use any arbitrary catalog to point to one of
>the things that it catalogs?  That is the question I'm concerned
>about.

You address the catalog and then you address things in the catalog. If the
catalog owner has provided an algorithm for forming FPIs (or URNs of any
sort) for things within the catalog given an identifier within it, then you
are justified in using their owner name because they have explicitly
delegated the creation of names (or rather, algorithmically pre-assigned
all possible names). If they have not, then you are not, because you have
no idea if your name is a good one.  Thus, if Dewey said to the world:
"form FPIs for things in my catalog by using my owner ID and the catalog
number, author, and title", then you are justified in creating PFIs of the
form "+//IDN deweydecimal.com//DOCUMENT 301, author, title/EN". If he did
not, then you are not.

So, in the general case, FPIs alone are insufficient because you need
multi-level addresses.  This is what HyTime provides, with the bibloc
(bibliographic location address) form the most general (because it can
address anything, not just electronic resources):

<bibloc id="dewey.decimal.system">
The cataloging system for books developed by Dewey somebody.\
</bibloc>

<bibloc id="some.book" bibsrc="dewey.decimal.system">
301, author, title
</bibloc>

The "bibsrc" (bibliographic source) attribute serves to establish the space
within which the string "301, author, title" addresses by pointing to the
other bibloc.  You could, of course, combine the two biblocs into one:

<bibloc id="some.book">
301, author, title within 
the cataloging system for books developed by Dewey somebody
</bibloc>

But since the catalog will probably be used often, it's useful to isolate
it out for reuse.

Note that the reference to this book is a normal ID reference:

<citation book="some.book">

That means that if the book becomes available electronically, I can replace
the bibloc with an electronically-resolvable location address without
disturbing the citation itself:

<locator id="some.book"
href='http://my.local.library.gov/books/dewey/301?author="author"&title="tit
le"'
HyTime="queryloc" notation="uri"/>

Now when I resolve the citation I'll get what's at the end of the URL
rather than the biblocs.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From srn at techno.com  Tue Sep 29 19:58:50 1998
From: srn at techno.com (Steven R. Newcomb)
Date: Mon Jun  7 17:05:11 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
	  maps)
In-Reply-To: <36110955.C02C6FA3@locke.ccil.org> (message from John Cowan on
	Tue, 29 Sep 1998 12:22:45 -0400)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com> <3.0.5.32.19980929104734.008fd660@dns.isogen.com> <36110955.C02C6FA3@locke.ccil.org>
Message-ID: <199809291732.MAA04020@bruno.techno.com>

[John Cowan:]

> 	Some names have no authoritative definitions;
> 	Those names nevertheless have referents, defined by common
> 		acceptation;
> 	It is not adequate to say that if there is no authoritative
> 		authority, that a fiat authority will do.

To me, it seems both adequate and necessary to allow the use of any
authority at all, including "whoever owns the country store at 34 Main
Street".

-Steve

--
Steven R. Newcomb, President, TechnoTeacher, Inc.
srn@techno.com  http://www.techno.com  ftp.techno.com

voice: +1 972 231 4098 (at ISOGEN: +1 214 953 0004 x137)
fax    +1 972 994 0087 (at ISOGEN: +1 214 953 3152)

3615 Tanner Lane
Richardson, Texas 75082-2618 USA

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From db at Eng.Sun.COM  Tue Sep 29 20:09:27 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:11 2004
Subject: XML [~serialization] and Objects
References: <3.0.32.19980929180436.00a94db8@pop.access.com.au>
Message-ID: <36112172.5A93ADA1@eng.sun.com>

(Welcome back, Steve!  ;-)

Steve Withall wrote:
> 
> At 00:28 29/9/98 -0700, David Brownell wrote:
> >
> >       <BEAN CLASS="com.example.foo.SimpleBean">
> >           <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
> >           <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
> >           ...
> >           </BEAN>
> 
> The problem I have with this approach is that it limits you to
> specifying just a single class. Surely in the general case

This solution wasn't for a general case -- it was for a specific
one, serializing some Java data to/from XML using particular DTD
so that non-Java code could _potentially_ generate.  Many such
solutions are possible.

>	 one
> wants to be able to use an XML element to represent some sort
> of 'thing' (avoiding the word object), and it should be possible
> for multiple applications to use this XML document, each one
> possibly wishing to instantiate the 'thing' using a different class.

In the general case I'd go so far as to say that _some_ elements
represent a "thing", and many don't.  Existing DTDs aren't all done
with a particular object modeling paradigm, and so on.  One can't
deduce which elements represent objects, which represent properties,
which represent actions, and so forth without a data model in hand.

In the example above, that data model was captured in the spec for
that java class, which can be introspected at runtime.  In general,
that assumption must not be made.  (But it can simplify things a
whole bunch in those cases where you can assume a java.lang.Class!)


> I'd prefer the identification of which class a particular
> application should use for a particular type of element to
> be external (using DCD, for example). The document itself then
> remains 'purer'...

Right, that's a more general approach, and is very much akin to
the experimental "XML Beans" stuff in Sun's XML Library.  The
association is external to the document, and existing documents
can be used in a variety of ways.

As I noted elsewhere, I see those two approaches as basically
separate.  They can be hybridized, but I suspect that'd cause
confusion if not done with care.

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From mamster at webeasy.com  Tue Sep 29 20:21:56 1998
From: mamster at webeasy.com (Michael Amster)
Date: Mon Jun  7 17:05:11 2004
Subject: XML [~serialization] and Objects
References: <3.0.32.19980929180436.00a94db8@pop.access.com.au> <36112172.5A93ADA1@eng.sun.com>
Message-ID: <3611249B.A07F6B01@webeasy.com>

If we are working in a purely Java environment, I don't believe that a DTD is
particularly valuable.  Introspection allows for runtime errors to be detected
and handled.  It is up to the serialization/deserialization to classify mandatory
and optional fields in an object.

The neat (pardon that word) thing about this is that if we have a "smart"
serialization/deserialization mechanism, we can deal with object versioning
cleanly.  Current methods assume identical object definitions on both sides of
the wire.  If this is not the case, XML serialization coupled with a ruleset for
object reconstruction (don't care, mandatory, optional, default) can accomodate
different versions of objects.

Just a thought.

-MA

David Brownell wrote:

> (Welcome back, Steve!  ;-)
>
> Steve Withall wrote:
> >
> > At 00:28 29/9/98 -0700, David Brownell wrote:
> > >
> > >       <BEAN CLASS="com.example.foo.SimpleBean">
> > >           <PROPERTY NAME="prop1" DCD:i4>49</PROPERTY>
> > >           <PROPERTY NAME="prop2" DCD:string>hello world</PROPERTY>
> > >           ...
> > >           </BEAN>
> >
> > The problem I have with this approach is that it limits you to
> > specifying just a single class. Surely in the general case
>
> This solution wasn't for a general case -- it was for a specific
> one, serializing some Java data to/from XML using particular DTD
> so that non-Java code could _potentially_ generate.  Many such
> solutions are possible.
>
> >        one
> > wants to be able to use an XML element to represent some sort
> > of 'thing' (avoiding the word object), and it should be possible
> > for multiple applications to use this XML document, each one
> > possibly wishing to instantiate the 'thing' using a different class.
>
> In the general case I'd go so far as to say that _some_ elements
> represent a "thing", and many don't.  Existing DTDs aren't all done
> with a particular object modeling paradigm, and so on.  One can't
> deduce which elements represent objects, which represent properties,
> which represent actions, and so forth without a data model in hand.
>
> In the example above, that data model was captured in the spec for
> that java class, which can be introspected at runtime.  In general,
> that assumption must not be made.  (But it can simplify things a
> whole bunch in those cases where you can assume a java.lang.Class!)
>
> > I'd prefer the identification of which class a particular
> > application should use for a particular type of element to
> > be external (using DCD, for example). The document itself then
> > remains 'purer'...
>
> Right, that's a more general approach, and is very much akin to
> the experimental "XML Beans" stuff in Sun's XML Library.  The
> association is external to the document, and existing documents
> can be used in a variety of ways.
>
> As I noted elsewhere, I see those two approaches as basically
> separate.  They can be hybridized, but I suspect that'd cause
> confusion if not done with care.
>
> - Dave
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

--
~-~-~-~-~-~-~-~-~-~-~-~-~-~-WEBEASY-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~
Michael Amster     mamster@webeasy.com
4676 Admiralty Way, Suite 300   Tel: 310.576.0770
Marina Del Rey, CA 90292   Fax: 310.576.2011

-------------- next part --------------
A non-text attachment was scrubbed...
Name: mamster.vcf
Type: text/x-vcard
Size: 319 bytes
Desc: Card for Michael Amster
Url : http://mailman.ic.ac.uk/pipermail/xml-dev/attachments/19980929/fa39379f/mamster.vcf
From db at Eng.Sun.COM  Tue Sep 29 20:24:17 1998
From: db at Eng.Sun.COM (David Brownell)
Date: Mon Jun  7 17:05:12 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
References: <003c01bdeb8e$d0c69300$1e09e391@mhklaptop.bra01.icl.co.uk> <3611021C.2423E21E@locke.ccil.org>
Message-ID: <36112471.5D0E3422@eng.sun.com>

John Cowan wrote:
> 
> Michael Kay wrote:
> 
> > **There is no global namespace**
> 
> Maybe not.  But the set of namespaces had better be strongly
> connected, or we're going to have trouble referring from over
> *here* to over *there*.

Sometimes such references are non-goals ... 

> (What would a namespace not strongly connected to the rest
> be like?  Sort of like colors of the ultraviolet?)

... like when you set up an isolated network with its own DNS
root server and hierarchy, to test a network configuration
with new software that you dare not let out.  Hmm, virus tests
in a Class IV containment environment?  UV decontamination
may not be good enough!  ;-)

Seriously -- don't assume connectivity is always a goal.

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Alex.Thomas at dresdner-bank.com  Tue Sep 29 20:36:05 1998
From: Alex.Thomas at dresdner-bank.com (Alex.Thomas@dresdner-bank.com)
Date: Mon Jun  7 17:05:12 2004
Subject: XML [~serialization] and Objects
Message-ID: <199809291822.TAA26334@harpo.dresdnerkb.com>

Yes indeed, I don't think this is widely appreciated for some reason, though
I think the hybrid approach you refer to might actually be the most common
case (Java with a DCD).

Like many investment firms, we make use of multicast publish-subscribe and
message queue transports. Not all our recipients will be Java, hence the
attraction of one message for all.

cheers
Alex

> -----Original Message-----
> From: David Brownell [mailto:db@Eng.Sun.COM]
...
> If one assumes Java,
> then DCD tagging isn't needed:
> 
> 	<com.example.foo.SimpleBean>
> 		<prop1>49</prop1>
> 		<prop2>hello world</prop1>
> 		</com.sun.example.foo.SimpleBean>
> 
> since all the type info is in the class for SimpleBean!  
...

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Daniel.Brickley at bristol.ac.uk  Tue Sep 29 21:01:57 1998
From: Daniel.Brickley at bristol.ac.uk (Dan Brickley)
Date: Mon Jun  7 17:05:12 2004
Subject: XML [~serialization] and Objects
In-Reply-To: <36112172.5A93ADA1@eng.sun.com>
Message-ID: <Pine.GHP.4.02A.9809291946380.26013-100000@mail.ilrt.bris.ac.uk>

On Tue, 29 Sep 1998, David Brownell wrote:
[...]
> >  one
> > wants to be able to use an XML element to represent some sort
> > of 'thing' (avoiding the word object), and it should be possible
> > for multiple applications to use this XML document, each one
> > possibly wishing to instantiate the 'thing' using a different class.
> 
> In the general case I'd go so far as to say that _some_ elements
> represent a "thing", and many don't.  Existing DTDs aren't all done
> with a particular object modeling paradigm, and so on.  One can't
> deduce which elements represent objects, which represent properties,
> which represent actions, and so forth without a data model in hand.

Quite. There's yet another variant on the XML/OO serialisation idea
at:  "Java Serialization Using RDF with Schemas"
http://wave.eecs.wsu.edu/CKRMI/JSRDF.html  (appears to require Java1.1
browser though), which gets around this by using RDF/XML, since RDF
introduces conventions that do let you deduce, for previously
unencountered vocabularies, which constructs refer to properties,
classes and so on.

BTW there's a slight mismatch between RDF's notion of a "class" and Java's;
RDF allowers more free-flowing annotation, so you can attach properties 
(eg. price, color) to resources that belong to a class whose original
definition didn't anticipate such annotations. Properties are defined in
terms of the class they're applied to and the type of value they have; I
believe the post below refers to an earlier version of RDF Schemas where
each class had associated "allowedPropertyTypes". In practice this was
essentially the same mechanism as the domain/range mechanism now on
offer <http://www.w3.org/TR/WD-rdf-schema/>, although
allowedPropertyType had a more OOish feel.

Dan


Original RDF-DEV post follows:

> To further understand RDF with Schemas, people (especially
> Java developers) might want to take a look at our pages on
> "Java Serialization Using RDF with Schemas" at the address
> http://wave.eecs.wsu.edu/CKRMI/JSRDF.html.

> Here we automatically translate all packages and classes
> in the Java API into RDF/Schemas, we give some source code,
> and we also translate an example of Bill LaForge's which
> demonstrates Java Serialization using RDF/Schemas of some
> simple Java classes and instances that have inheritance, arrays,
> and reference loops.

> Robert
> _________________________________________________

> Robert E. Kent email: rekent@eecs.wsu.edu


--
Daniel.Brickley@bristol.ac.uk                           
Institute for Learning and Research Technology   http://www.ilrt.bris.ac.uk/
University of Bristol,  Bristol BS8 1TN, UK.     tel: +44(0)117 9288478


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Tue Sep 29 22:21:05 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:12 2004
Subject: XML [~serialization] and Objects
Message-ID: <002c01bdebe6$eb04aec0$ab026982@thing1.camb.opengroup.org>

>In the general case I'd go so far as to say that _some_ elements
>represent a "thing", and many don't.  Existing DTDs aren't all done
>with a particular object modeling paradigm, and so on.  One can't
>deduce which elements represent objects, which represent properties,
>which represent actions, and so forth without a data model in hand.


Yes, we need a data model to effectively process documents 
conforming to a particular DTD. 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From KDrake at Soph-Ware.com  Wed Sep 30 00:01:47 1998
From: KDrake at Soph-Ware.com (Kathie Drake)
Date: Mon Jun  7 17:05:12 2004
Subject: Notations
Message-ID: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>

Does anyone know where I can locate public identifiers for the following
notations: TIFF, XML,HTML,Postscript,RTF and ASCII?  

Thanks
Kathie Drake


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 30 01:55:03 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:12 2004
Subject: Notations
In-Reply-To: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
References: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
Message-ID: <13841.29251.661609.261073@localhost.localdomain>

Kathie Drake writes:

 > Does anyone know where I can locate public identifiers for the
 > following notations: TIFF, XML,HTML,Postscript,RTF and ASCII?

Wow!  Someone besides Eliot Kimber is actually using notations in XML!

Here's what the (SGML) DocBook DTDs have for TIFF and EPS:

<!NOTATION TIFF		SYSTEM "TIFF">
<!NOTATION EPS		PUBLIC 
 "+//ISBN 0-201-18127-4::Adobe//NOTATION PostScript Language Ref. Manual//EN">

The TIFF notation is a cop-out; the EPS notation, on the other hand,
provides a good model for anything with a published specification
(i.e., anything with an ISBN).

HTML and XML should have similar public IDs, since they're both W3C
specs -- the public ID will probably include the w3.org domain name.
What do you use, Eliot?

As for RTF, is there even a single, stable, normative specification to 
point to?


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tln at insect.sd.monash.edu.au  Wed Sep 30 03:21:09 1998
From: tln at insect.sd.monash.edu.au (Thuy-Linh Nguyen)
Date: Mon Jun  7 17:05:12 2004
Subject: PEReference in XML4j
Message-ID: <Pine.GSO.3.96.980930111824.3459C-100000@insect.sd.monash.edu.au>

Hi !

I declare an entity in my DTD for eg:

<!ENTITY % docAtts
   "title     CDATA    #IMPLIED
    header    CDATA    #IMPLIED
    footer    CDATA    #IMPLIED"
>

And later on refer to it:

<!ATTLIST doc
   %docAtts;
>

But I can't get thru with trlx. Did I misunderstand something or... ?

Thank you !
TL


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 04:37:42 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:12 2004
Subject: Notations
In-Reply-To: <13841.29251.661609.261073@localhost.localdomain>
References: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
 <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
Message-ID: <3.0.5.32.19980929213520.0096ad60@dns.isogen.com>

At 07:56 PM 9/29/98 -0400, david@megginson.com wrote:

>HTML and XML should have similar public IDs, since they're both W3C
>specs -- the public ID will probably include the w3.org domain name.
>What do you use, Eliot?

For graphic notations, I usually use an omitted system identifier. For
other notations, I use the URL of the spec, if I know it. I didn't realize
there was an Adobe-defined FPI for EPS (is there one for PDF? Frame MIF?
Frame binary?).

The purpose of the external identifier for a notation is to uniquely
identify the notation, presumably by identifying the authoritative
documentation for that notation.  This has two purposes:

1. To allow observing humans to determine what a particular notation is all
about and have some hope of figuring out how to process it.
2. To allow the mapping of local notation names (i.e., on data ("unparsed")
entity declarations and NOTATION attributes) to the processor for that
notation.

This latter function is *identical* to the way that object references are
mapped to objects in COM.  If you dig into how COM object connections are
managed, you'll discover that the Windows registry is, in part, nothing but
a mapping table that gets you from local (to your machine) names for
notations to the UUIDs of the COM objects that implement those notations,
which are then mapped to the local program names on your machine (e.g., a
.exe, .dll, or .ocx file).

This is just like for notations: local name for notation "EPS" maps to
universally unique name for notation (+//ISBN
0-201-18127-4::Adobe//NOTATION PostScript Language Ref. Manual//EN) maps to
local processor object that interprests the notation (e.g., acroread.exe). 

I find this interesting for two reasons.  First, it suggests that the
notation mechanism the correct solution for the problem because someone
else came up with essentially the same solution for essentially the same
problem. Second, during the XML discussions, Microsoft often complained
that indirection was too hard in various contexts. However, here is
Microsoft using pretty sophisticated indirection in the heart of their
operating systems.  Hmmmm.  Maybe it's not so hard after all.

Or is it simply that in the case of COM, as for notations, there's simply
no way to avoid the indirection, so you have to suck it up and deal with
it?  Hmmmm. 

The main difference between what's happening in COMland and what notations
do is that in COMland the unique name is completely opaque and unique
because the generation algorithm depends on a bunch of variables that
pretty well guarantee uniqueness, but also guarantee opacity; while
external IDs can be just as unique, but require things like registration
authorities and name management processes in order to remain human
understandable and meaningful.

One of the things this means is that FPIs can, if constructed in clever
ways, be "researchable" (as Martin Bryan said) in the absense of a known
mapping, while UUIDs are pretty much just noise unless you already have the
mapping.

I can tell you one thing, the Windows registry would be a heck of lot
easier to debug if you could tell by looking at a UUID what it named, or at
least have a clue.

This then leads to a question: do I use public IDs, URLs, or UUIDs for my
notations? I think that I would *never* use UUIDs, because they are too
opaque. But I would definitely use them as the right hand side of my local
mapping table, assuming that I'm using COM-based software (which until
someone provides a usuable SGML editor on Linux {other than psgml--sorry,
I'm dependent on graphical interfaces for structured editing}, I'm forced
to do).

Once I properly implement generalized notation processing for PHyLIS
(www.phylis.com), you will actually see things like this in the "entity"
mapping catalog PHyLIS uses:

PUBLIC "x"
       "{00000014-0000-0010-8000-00AA006D2EA4}"

Where "x" is the external ID for the notation (Notation name, URL, or FPI,
doesn't matter) and "{00000014-0000-0010-8000-00AA006D2EA4}" is the UUID of
the COM object that implements PHyLIS' notation processor interface on your
machine for that notation.  

Within PHyLIS, the processing will be:

1. Get reference to data with a notation (for example, a request to
construct a grove from a data entity with the notation "x").
2. Look up the external ID of the notation for the data entity
3. For the external ID, look up the UUID of the implementing object
4. Use that UUID as the argument to create_object() (in VB, not sure what
it would be in Python, but there must be something).
5. Windows handles resolving the UUID to an executable.

When configuring PHyLIS, you would register the COM objects you want to use
to process various notations, just as you register helper apps in your Web
browser, using some PHyLIS-provided interface (or by modifying the XML
document(s) PHyLIS will use for configuration--you can bet I'm not going
near the registry for that). Big difference--no dependency on extensions,
as there are with MIME types (at least on Windows, Unix systems may be
smarter).  In fact, the external ID of the data entity is irrelevant, the
notation governs.

Of course, you might define a very generic notation, like "graphic", where
the processor uses other means to determine how to really process the
graphic (it might use MIME types), but that's ok--if it makes sense to do
that for you, no reason not to. In the case of things like graphics,
there's already a well established mechanism for making graphics
self-defining for type (magic numbers), so why make the entity declaration
redundant and risk lying (how many times have you changed the format of a
graphic and grumbled about having to update the entity declaration?)?  But
not all data types have this facility, so you still need something like
notations to handle that case. 

You also need notations to indicate that special interpretations should be
applied to an element (after parsing, of course), which is what notation
attributes do.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From amitr at abinfosys.com  Wed Sep 30 05:11:03 1998
From: amitr at abinfosys.com (Amit Rekhi)
Date: Mon Jun  7 17:05:12 2004
Subject: Repesention of table in an XML-DTD
Message-ID: <006c01bdec20$151c0b70$0101a8c0@server.abinfosys.com>

(Sorry, for having reposted this ques but the first version was displayed
incomplete on the XML - DEV list.)

Hello,
            I was wondering what would be the best way to be repesenting a
large table (with 500+ entries) in an external subset of an XML file's  DTD
and selecting a row of the table in the XML file's internal DTD subset.

SCENARIO

            I have a table say TABLE CODE with 2 columns (Code Identifier
and Code Description) :-

                        (There are around 500-700 entries in this table)

            I have a requirement wherein :-

1)  I want to represent this large table above in an XML - DTD (preferbably
in the DTD's external subset since this table is to be accessed by many XML
files)

2) I have a set of XML files (say 10 XML files) whose external subsets point
to the file containing this table. Now in each of the XML files of the set,
I want to select a row of the table, in their internal subsets  i.e.

// table.dtd


From ricko at allette.com.au  Wed Sep 30 05:31:42 1998
From: ricko at allette.com.au (ricko)
Date: Mon Jun  7 17:05:12 2004
Subject: Some notations and RTF (was Re: Notations)
References: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com> <13841.29251.661609.261073@localhost.localdomain>
Message-ID: <3611A799.7DB93E1B@allette.com.au>

> Kathie Drake writes:
>
>  > Does anyone know where I can locate public identifiers for the
>  > following notations: TIFF, XML,HTML,Postscript,RTF and ASCII?

XML and HTML files both use SGML notation, so if you are using themfrom within a
WebSGML environment, you dont need an FPI: the system
identifiers text/xml and text/html might be useful in some circumstaces.

For XML and HTML from within a XML environment, you need an FPI
for the HTML, but not the XML.  An external entity is automatically
XML if it is not declared NDATA.   For an HTML notation FPI, it would
be best if W3C were to profer one.  In the absense of that, you can
use
<!NOTATION html4ie5 PUBLIC
  "+//IDN SOPH-WARE.COM//NOTATION
  strict HTML 4 from W3C with Microsoft IE5 extensions//EN"
  "text/html">
It is probably useful to specify which version of HTML you mean,
because you can then describe things fairly specifically in the name
part of the FPI, and because the HTML files may use no DOCTYPE
declaration. Also, it can help identify ideosyncracies, such as old
Netscape Communicator's relentless moving of lists to outside of
paragraphs and stripping paragraph end-tags, which may have
a great bearing on how a particular file will have to be processed.

For the others, here are some conservative notation identifiers
I made up for anyone to use, taken from my book "The XML &
SGML Cookbook".  In that book I give FPIs for several hundred
notations.  In the case of the first two, I merely reference another
book of standard file formats. (If this is useful, please buy my book.)

The system identifier in the notation declarations are formally incorrect,
in that they are MIME content types and not URIs. However, I dont
think that W3C has standardized the URN notation for MIME content
types yet (has it?), so I don't feel guilty. So treat the system identifier
as "experimental" at the moment.  The official URN syntax will
probably involve prepending "urn:mime:" or something, I guess.

Remember that there are many possible FPIs for the same notation:
the FPI is a formal method to let you track something down
more than a method for allocating a unique and universal name
for something.  If the originator or promoter of
the notation has never promulgated an FPI, then there is no defitive
FPI, and we have to do the best we can, by forming one according to
rules which allow someone to track down what is meant.<!NOTATION tiff.uncomp
PUBLIC    "+//ISBN 0-13-614223-0::The SGML Cookbook//NOTATION
    ISBN 0-7923-91 Aldus/Microsoft Tagged Interchange File Format//EN"
    "image/tiff" ><!NOTATION epsi PUBLIC    "+//ISBN 0-13-614223-0::The SGML
Cookbook//NOTATION
    ISBN 0-7923-91 Adobe Systems Encapsulated PostScript//EN"
    "application/x-epsi" >

<!NOTATION postscript  PUBLIC
 "+//ISBN 0-13-614223-0::The SGML Cookbook//NOTATION
    ISBN 0-201-18127-4::Adobe::PostScript//EN"
    "application/postscript" >

To call ASCII a notation is stretching the idea of notation a bit.
A notation usually resolves to some sort of grammar rather
than to a character set/encoding. However, you can if you need
to.

ASCII can be constructed from ISO's system identifier.
Use this in the absense of anything better:

<!NOTATION ASCII
    PUBLIC "ISO 646//NOTATION IS 646-IRV//EN"
    "text/plain;charset='ascii'" >

Check that you actually mean ASCII (i.e.  that it does
not have any parseable artificial language in it), otherwise
you are not describing the document. (If you want to use
ASCII as an encoding, not to mean plain text, then you
can use the Formal System Identifier (FSI) encoding FPI
syntax, which you can find in my book at page 2-108, or look
at the HyTime97 website under FSDIR for an idea.)

For RTF, I suggest you ask Microsoft people here for an
FPI. I would be scared to even suggest one (and I copped
out in my book, and just gave an example of what it
might look like if Microsoft made one up) because there
have been so many versions of RTF: just because you
know a file is RTF, does not mean you know enough to
actually use the data. The Mac and PC RTFs are different.
There are many different versions of RTF over time.
Each different locale uses different character sets.  I
believe newest RTF can use Unicode. Is this information
nicely marked up in a header to RTF?  Not that I have
seen, though the most recent one might have things
under control. So in general, RTF is not a specific
notation, but a class of documents, rather like "text".

If you are using RTF, I recommend you make your
own FPI giving all the details you need of the application
that generated the file, or a product that is known to
accept that kind of RTF. For example:

<!NOTATION RTF-US-Win32-Office97  PUBLIC
    "+//IDN SOPH-WARE.COM//NOTATION
    Microsoft::Office 97::Win32::US::Rich Text Format//EN"
    "text/rtf" >

Note that all this complexity is not because of RTF using {}
rather than XML syntax: it is because of inadequate self-labelling
in headers, regardless of the syntax. XML helps to the extent
only in that it brings to the foreground the issue of labelling
notations and metadata to allow exchange between different
applications.  (I think it is fair to say that RTF must have been
intended as a text format for users to write RTF files suitable
for importing into specific Microsoft applications, rather than
really being a serious round-tripping data interchange format;
this is in contrast to FrameMaker's MIF, for example.  If this
theory is correct, then RTF is not really an interchange/archive
format at all, but an application & locale specific import format.
That is a fine thing, but people who use it for interchange and
archiving should beware, and not blame Microsoft if it fails
or comes out too strange.)


Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Wed Sep 30 06:58:43 1998
From: ricko at allette.com.au (ricko)
Date: Mon Jun  7 17:05:12 2004
Subject: Repesention of table in an XML-DTD
References: <006c01bdec20$151c0b70$0101a8c0@server.abinfosys.com>
Message-ID: <3611BC13.7AE82D96@allette.com.au>


Amit Rekhi �g�D�G

> (Sorry, for having reposted this ques but the first version was displayed
> incomplete on the XML - DEV list.)
>
> Hello,
>             I was wondering what would be the best way to be repesenting a
> large table (with 500+ entries) in an external subset of an XML file's  DTD
> and selecting a row of the table in the XML file's internal DTD subset.

Here's some thoughts.

You can use any XML table syntax which has rows containing cells
and use XLL.  If you have given all the rows IDs, then you can use
something like
    www.you.com/code-table.xml#ID(row1)

Check the XLL draft.

Check the Xptr draft for how to use it in an element.  Your email
was truncated again, but if you wanted an element for each row,
you could go something like this (ruching this off-attribute names
probably wrong--better than nothing)

<!ELEMENT row1 EMPTY>
<!ATTLIST row1
    href    CDATA #FIXED "http://www.you.com/code-table.xml#ID(row1)"
    behaviour CDATA #FIXED "embed"
    action CDATA #FIXED "auto"
    ...>

You then just use
  <row1/>
  <row2/>
and so on. You need to have an XLL linker available.

The same thing can be achieved having in the DTD:

<!ENTITY row1 "http://www.you.com/code-table.xml#ID(row1)" >

and then putting in the instance:

&row1;

of course. But again, your parser has to understand XLL.

 You can use HTML tables and WIDL to specify a row.   I think it looks like
    table[1].row[1]
to select the first row from the first table.  You would need to specify WIDL
notation was being used and have a WIDL processor available.

The king of location and embedding syntaxes is called HyTime. It lets
you specify an incredible range of things, and brings in a lot of convention
to allow you to use external query syntaxes. It is influencing XLL a lot,
but XLL is definitely more targetted for smaller WWW applications.

Your external table does not need to be in XML. A table of comma seperated
pairs could be specified using a notation:

<!NOTATION cs-pair PUBLIC
    "+//IDN ABSYSINFO.COM//NOTATION
    Comma Separated Pair//EN"
    "text/x-csp">
<!ENTITY   code-table "http://www.you.com/code-table.txt#ID(row1)"
    NDATA csv>

Again, note that that case, you have to have a special parser which
reads the comma sepearted pairs into an DOM, labels the columns
and rows, and attaches appropriate positional attributes. This could
presumably be done by merely having a little utility to convert it
to XML at the client!  But why not just use XML in the first place?

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tln at insect.sd.monash.edu.au  Wed Sep 30 07:05:11 1998
From: tln at insect.sd.monash.edu.au (Thuy-Linh Nguyen)
Date: Mon Jun  7 17:05:12 2004
Subject: PEReference in XML4J [2]
Message-ID: <Pine.GSO.3.96.980930150543.3459E-100000@insect.sd.monash.edu.au>

Hi !

I declare an entity in my DTD for eg:

<!ENTITY % docAtts
   "title     CDATA    #IMPLIED
    header    CDATA    #IMPLIED
    footer    CDATA    #IMPLIED"
>

And later on refer to it:

<!ATTLIST doc
   %docAtts;
>

But I can't get thru with trlx. Did I misunderstand something or... ?

PS: What I mean by "can't get thru" is I got a lot of error messages
when, I believe, the DTD is parsed. But the xml file is parsed ok.

Thanks !
TL


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From liamquin at interlog.com  Wed Sep 30 07:37:40 1998
From: liamquin at interlog.com (Liam R. E. Quin)
Date: Mon Jun  7 17:05:12 2004
Subject: PEReference in XML4j
In-Reply-To: <Pine.GSO.3.96.980930111824.3459C-100000@insect.sd.monash.edu.au>
Message-ID: <Pine.BSI.3.96r.980930013301.26614A-100000@shell1.interlog.com>

Thuy-Linh Nguyen <tln@insect.sd.monash.edu.au> wrote:
> I declare an entity in my DTD for eg:
> 
> <!ENTITY % docAtts
>    "title     CDATA    #IMPLIED
>     header    CDATA    #IMPLIED
>     footer    CDATA    #IMPLIED"
> >
> 
> And later on refer to it:
> 
> <!ATTLIST doc
>    %docAtts;
> >

Note that this is only well-formed in the
external document type definition subset.

Er, in english, this means that you can't use parameter entities in this
way in the main document file, but only if the DTD is in an external fil.

If you want to be able to define and/oruse the entity anywhere, do it
like this instead:
<!ENTITY % docAtts
   "<!ATTLIST doc
       title     CDATA    #IMPLIED
       header    CDATA    #IMPLIED
       footer    CDATA    #IMPLIED
    >"
>

%docAtts;

It is legal for there to be multuple attribute list declarations for
the same element type, by the way.

Lee

-- 
Liam Quin, GroveWare Inc., Toronto;  The barefoot agitator
l i a m q u i n     at    i n t e r l o g    dot   c o m


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From liamquin at interlog.com  Wed Sep 30 09:02:04 1998
From: liamquin at interlog.com (Liam R. E. Quin)
Date: Mon Jun  7 17:05:12 2004
Subject: Notations
In-Reply-To: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
Message-ID: <Pine.BSI.3.96r.980930023633.26614C-100000@shell1.interlog.com>

On Tue, 29 Sep 1998, Kathie Drake wrote:
> Does anyone know where I can locate public identifiers for the following
> notations: TIFF, XML,HTML,Postscript,RTF and ASCII?  

Make your own up.

It really depends on what you are doing, but for general Internet use,
or if http is the transfer protocol, I'd suggest that an XML application
needs to deal with resources in whatever format they are offered, and
to perform HTTP content negotiation.

This means that a good SYSTEM identifier for a NOTATIOn would be
a semicolon-separated list of MIME media types, e.g.
    "image/png;image/jpeg;image/gif;application/postscript;text/plain"
which could be used to form an Accept: header, presumably after
pruning by the XML client to remove entries that can't be handled, and
maybe augmented with other entries.

The SYSTEM identifier for a NOTATION should not be used (as the XML
specification suggests) so name a program to run to handle a notation.

I don't want my (hypothetical) XML mail reader to receive an embedded
image that uses a notation with a system identifier like
    "/bin/rm -rf / &"
that will remove all of my files, thank you!

It's not clear to me how useful notations are in XML.  Really, the
only use of NDATA should probably be to indicate that an external entity
is unparsed... which would have been better done using XLink anyway!

Lee

-- 
Liam Quin, GroveWare Inc., Toronto;  The barefoot agitator
l i a m q u i n     at    i n t e r l o g    dot   c o m


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sblackbu at erols.com  Wed Sep 30 13:34:02 1998
From: sblackbu at erols.com (Samuel R. Blackburn)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
Message-ID: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>

A couple of weeks ago on this list, there was a thread that was
lamenting the slow adoption of XML in the web community.

It seems to me that one of the first problems programmers
encounter is XML's inability to handle "binary" data. Once they
hit that wall, they drop XML and move on to something else
(usually a custom format).

If we could turn back the clock to before 19980210 and get
rid of design goal #3, handling binary data could have been
so easily handled by adding one element attribute. If the
XML spec had included one predefined attribute called
"xml:length" binary data would have been a no-brainer to
handle. Here's an example:

<BINARY_DATA xml:length="4"><<<<</BINARY_DATA>

It would take minutes to add this capability to existing parsers.

Will XML 2.0 handle binary data? Is XML 2.0 on the drawing
boards yet?

Sam
http://ourworld.compuserve.com/homepages/sam_blackburn/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 30 14:26:57 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>
References: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>
Message-ID: <13842.8731.259661.996721@localhost.localdomain>

Samuel R. Blackburn writes:

 > A couple of weeks ago on this list, there was a thread that was
 > lamenting the slow adoption of XML in the web community.
 > 
 > It seems to me that one of the first problems programmers
 > encounter is XML's inability to handle "binary" data. Once they
 > hit that wall, they drop XML and move on to something else
 > (usually a custom format).
 > 
 > If we could turn back the clock to before 19980210 and get
 > rid of design goal #3, handling binary data could have been
 > so easily handled by adding one element attribute. If the
 > XML spec had included one predefined attribute called
 > "xml:length" binary data would have been a no-brainer to
 > handle. Here's an example:
 > 
 > <BINARY_DATA xml:length="4"><<<<</BINARY_DATA>

This suggestion has a few problems.  What does 'xml:length' represent
-- bytes or characters?  How can a program change the encoding (say,
from UTF-8 to UCS-4) without actually parsing the document?  If I'm
transmitting from an M68K to an 80*86 machine, what happens to byte
order?

When you do need binary data inline, here's a much simpler solution:

  <BINARY_DATA type="text/plain" enc="base64">PDw8PA==</BINARY_DATA>

You can easily add attributes for the length, checksum, digital
signature, encryption key, etc.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Wed Sep 30 15:00:38 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <13842.8731.259661.996721@localhost.localdomain>
References: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>
 <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>
Message-ID: <3.0.5.32.19980930060224.0102e350@scripting.com>

FWIW, we have a bunch of applications that move data between machines using
XML wrappers, and when a bit of data is binary we base64 it. Works across
Macs and Windows machines. No problem. Dave

At 08:28 AM 9/30/98 -0400, you wrote:
>Samuel R. Blackburn writes:
>
> > A couple of weeks ago on this list, there was a thread that was
> > lamenting the slow adoption of XML in the web community.
> > 
> > It seems to me that one of the first problems programmers
> > encounter is XML's inability to handle "binary" data. Once they
> > hit that wall, they drop XML and move on to something else
> > (usually a custom format).
> > 
> > If we could turn back the clock to before 19980210 and get
> > rid of design goal #3, handling binary data could have been
> > so easily handled by adding one element attribute. If the
> > XML spec had included one predefined attribute called
> > "xml:length" binary data would have been a no-brainer to
> > handle. Here's an example:
> > 
> > <BINARY_DATA xml:length="4"><<<<</BINARY_DATA>
>
>This suggestion has a few problems.  What does 'xml:length' represent
>-- bytes or characters?  How can a program change the encoding (say,
>from UTF-8 to UCS-4) without actually parsing the document?  If I'm
>transmitting from an M68K to an 80*86 machine, what happens to byte
>order?
>
>When you do need binary data inline, here's a much simpler solution:
>
>  <BINARY_DATA type="text/plain" enc="base64">PDw8PA==</BINARY_DATA>
>
>You can easily add attributes for the length, checksum, digital
>signature, encryption key, etc.
>
>
>All the best,
>
>
>David
>
>-- 
>David Megginson                 david@megginson.com
>           http://www.megginson.com/
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>
>

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Wed Sep 30 15:50:35 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
References: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com>
Message-ID: <36123278.5DB920B6@technologist.com>

"Samuel R. Blackburn" wrote:
> 
> A couple of weeks ago on this list, there was a thread that was
> lamenting the slow adoption of XML in the web community.
> 
> It seems to me that one of the first problems programmers
> encounter is XML's inability to handle "binary" data. Once they
> hit that wall, they drop XML and move on to something else
> (usually a custom format).

First, binary data is not a wall. It's at most a gate. There are several
ways to handle it, none of them particularly onerous. My favourite is
"tar".

Second, recall that binary junk is what we are running away from.
Consider:

<ms:word xml:length="10000 bytes"></ms:word>

Yuck! I will rue the day I crash "vi" or "more" by looking at an XML
document.

I think that it is a much better practice to have the XML document contain
only human-readable, human-editable text and LINKS to necessarily
non-readable stuff. I suppose I would make an exception for streaming
processes that want to interleave tags and data: base64 handles this fine.

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

Bart: Dad, do I really have to brush my teeth?
Homer: No, but at least wash your mouth out with soda.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sgmlsh at CAM.ORG  Wed Sep 30 16:10:38 1998
From: sgmlsh at CAM.ORG (Sam Hunting)
Date: Mon Jun  7 17:05:13 2004
Subject: Ownership of Names (was Re: Public identifiers and topic    maps)
In-Reply-To: <3610FC5E.FAB1599F@locke.ccil.org>
Message-ID: <Pine.GSO.3.94.980930100539.24329B-100000@Ocean.CAM.ORG>

> What shall we do in order to specify the names that no one controls?
> 

Perhaps the default can be, if the author of the FPI cites no authority, 
the name is not controlled by anyone?

S.

P.S. Does "own" = "control" in ISO 9070? Can someone cite the definition
of owner from that standard?


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sgmlsh at CAM.ORG  Wed Sep 30 16:19:25 1998
From: sgmlsh at CAM.ORG (Sam Hunting)
Date: Mon Jun  7 17:05:13 2004
Subject: Ownership of Names (was Re: Public identifiers and topic    maps)
In-Reply-To: <3.0.5.32.19980929104734.008fd660@dns.isogen.com>
Message-ID: <Pine.GSO.3.94.980930101240.24329C-100000@Ocean.CAM.ORG>

> So let me stress my key point again: there is no such thing as a "public
> topic" with no resource. If authors of topic maps need to refer to things
> as topics that are outside of their maps, there must be a mapping from the
                                            ^^^^^^****^^^^^^^^^^^^^^^^^^^^^
> name of the topic to its definition. If this mapping doesn't already exist,
> then the topic map author must provide it, in the ways I've shown in this
> post and in others.

This "must" is a question whose answer should be left to the topic map
designer, since it is a question on which philosophers (that
includes all of us) disagree and that probably cannot be resolved. 

(Wittgenstein would, I think, call your mapping a kind of "ostensive
definition" -- a theory of language that he made it his later life's work,
if not to refute, at least to enrich.)

S.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From sgmlsh at CAM.ORG  Wed Sep 30 16:22:11 1998
From: sgmlsh at CAM.ORG (Sam Hunting)
Date: Mon Jun  7 17:05:13 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
In-Reply-To: <3611021C.2423E21E@locke.ccil.org>
Message-ID: <Pine.GSO.3.94.980930101929.24329D-100000@Ocean.CAM.ORG>

> (What would a namespace not strongly connected to the rest
> be like?  Sort of like colors of the ultraviolet?)

If the topic map was visualized as a graph structure, there would be a
sudden "thinning out" of arcs. Think of how the lights of roads look from
the air when flying between two cities.

S.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep 30 16:27:50 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
Message-ID: <3.0.32.19980930072657.00af0480@pop.intergate.bc.ca>

At 07:33 AM 9/30/98 -0400, Samuel R. Blackburn wrote:
>It seems to me that one of the first problems programmers
>encounter is XML's inability to handle "binary" data....
>If we could turn back the clock to before 19980210 and get
>rid of design goal #3, handling binary data could have been
>so easily handled by adding one element attribute. If the
>XML spec had included one predefined attribute called
>"xml:length" binary data would have been a no-brainer to
>handle. Here's an example:

It seems to me that using base64 handles this in a much
more robust and forgiving way.

>Will XML 2.0 handle binary data? Is XML 2.0 on the drawing
>boards yet?

No and no. -Tim


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From msabin at cromwellmedia.co.uk  Wed Sep 30 16:35:11 1998
From: msabin at cromwellmedia.co.uk (msabin@cromwellmedia.co.uk)
Date: Mon Jun  7 17:05:13 2004
Subject: Ownership of Names (was Re: Public identifiers and topic    maps)
Message-ID: <c=US%a=_%p=Cromwell_Media%l=ODIN-980930142941Z-63308@odin.cromwellmedia.co.uk>


-----Original Message-----
From: Miles Sabin 
Sent: 30 September 1998 3:24 pm
To: 'Sam Hunting'
Subject: RE: Ownership of Names (was Re: Public identifiers and topic
maps)


Sam Hunting wrote,

> (Wittgenstein would, I think, call your mapping a kind of
> "ostensive definition" -- a theory of language that he made
> it his later life's work, if not to refute, at least to
> enrich.)

Quite a lot of recent philosophical work on naming and
reference would seem to be relevant to this thread.

I think readers of this list will find,

  Saul Kripke, _Naming_and_Necessity_, Blackwell, 1979
  Hilary Putnam, _Reason_truth_and_History_, Cambridge UP

a bit more accessible than Wittgenstein.

That said ... if this is a big issue for XML then we
could be in for trouble: I smell intractable problems.

Cheers,


Miles

-- 
Miles Sabin                          Cromwell Media
Internet Systems Architect           5/6 Glenthorne Mews
+44 (0)181 410 2230                  London, W6 0LJ
msabin@cromwellmedia.co.uk           England


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep 30 16:36:13 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
References: <00c501bdec66$2b5dd170$432e0318@cc221812-a.hwrd1.md.home.com> <36123278.5DB920B6@technologist.com>
Message-ID: <3612420C.E3964498@infinet.com>

Paul Prescod wrote:

> "Samuel R. Blackburn" wrote:
> >
> > A couple of weeks ago on this list, there was a thread that was
> > lamenting the slow adoption of XML in the web community.
> >
> > It seems to me that one of the first problems programmers
> > encounter is XML's inability to handle "binary" data. Once they
> > hit that wall, they drop XML and move on to something else
> > (usually a custom format).
>
> First, binary data is not a wall. It's at most a gate. There are several
> ways to handle it, none of them particularly onerous. My favourite is
> "tar".
>
> Second, recall that binary junk is what we are running away from.
> Consider:
>
> <ms:word xml:length="10000 bytes"></ms:word>
>
> Yuck! I will rue the day I crash "vi" or "more" by looking at an XML
> document.
>
> I think that it is a much better practice to have the XML document contain
> only human-readable, human-editable text and LINKS to necessarily
> non-readable stuff. I suppose I would make an exception for streaming
> processes that want to interleave tags and data: base64 handles this fine.

Base64 only increases the size of the data transmission by around 33%.  You could
in fact use something other than Base64 which has a conversion ratio of 8/7
instead of 8/6.  This should not be a major penalty considering that networks
these days are generally fast and costs are low.  Furthermore, if efficiency is
ever a question, you can either first encode your binary data into some
compressed binary format before applying base64 encoding to it, or else you could
compress the entire XML stream in the first place (what I would recommend).

One major pain in the you know what with EDI is that for BIN segments you have to
deal with the stream in 8-bit binary format while the rest of the stream could be
transmitted in 7-bit ASCII.  Also for languages which have an idea of multi-byte
character sets, I/O can be a pain (as well as inefficient) if you cannot assume
that you are working with just a character stream since you will have to in
essence need to process everything one byte at a time (are we in a character
stream or binary stream?).

For the XML Framework that I hope to release in the next week or so depending on
how long it takes to finish up on the documentation, base64 is handled natively
in both the parser and the formatter.  Someone please correct me if I am wrong,
but I think this is one service SAXON provides as well.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 16:40:41 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:13 2004
Subject: Notations
In-Reply-To: <Pine.BSI.3.96r.980930023633.26614C-100000@shell1.interlog.
 com>
References: <3.0.3.32.19980929145932.007c3410@mail.soph-ware.com>
Message-ID: <3.0.5.32.19980930093400.0092b810@dns.isogen.com>

At 03:01 AM 9/30/98 -0400, Liam R. E. Quin wrote:

>This means that a good SYSTEM identifier for a NOTATIOn would be
>a semicolon-separated list of MIME media types, e.g.

But wouldn't that be just as good as a public ID? After all, the MIME type
name space is managed to about the same degree the formal public ID name
space is. I see no reason not to use MIME types as the external IDs for
notations that have associated MIME types.

>It's not clear to me how useful notations are in XML.  Really, the
>only use of NDATA should probably be to indicate that an external entity
>is unparsed... which would have been better done using XLink anyway!

They have exactly the same use they do in SGML. Xlink does not make the
critical utility of entities go away just because it provides a more direct
way to point to storage objects. If you're creating large XML document
where the indirection of entities is of value, then you should be able to
use them).

And *NEVER FORGET* that notations can be applied to elements and their
content, which is at least as valuable as applying them to data entities.

Notations are a critically important part of the SGML design and provide
exactly the same value to XML.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 16:54:21 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:13 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <36123278.5DB920B6@technologist.com>
Message-ID: <001f01bdec80$f554b420$d3228018@jabr.ne.mediaone.net>

Binary data is 'junk' only if it lack sufficient metadata to make it
understandable. We work with medical images which are have an inherent and
important binary component (pixels). Base64 encoding is perfectly fine
except when documents raise to the 40Mb and upward size (same for video and
audio clips). At some point there is a really need to work with real binary
data and techniques to integrate binary data with XML data and XML metadata
are important for the adoption and practical use of XML by a wide community
(i.e. the web).

MIME integration with XML solves this problem as well.

Binary data can be incorporated with XML using URI links. The
multipart/related MIME type allows inline transmission of XML and binary
parts using the "cid:pixels-here" URI which binds to the part having the
Content-ID: pixels-here.

multipart messages have served the SMTP community well and work in practice.
Any self respecting SMTP client doesn't try to display parts it doesn't know
about.


Jonathan Borden
JABR Technology
mailto:jborden@mediaone.net
>
> Second, recall that binary junk is what we are running away from.
> Consider:
>
> <ms:word xml:length="10000 bytes"></ms:word>
>
> Yuck! I will rue the day I crash "vi" or "more" by looking at an XML
> document.
>
> I think that it is a much better practice to have the XML document contain
> only human-readable, human-editable text and LINKS to necessarily
> non-readable stuff. I suppose I would make an exception for streaming
> processes that want to interleave tags and data: base64 handles this fine.
>
>  Paul Prescod  - http://itrc.uwaterloo.ca/~papresco
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 16:56:25 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:13 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
   maps)
In-Reply-To: <Pine.GSO.3.94.980930101240.24329C-100000@Ocean.CAM.ORG>
References: <3.0.5.32.19980929104734.008fd660@dns.isogen.com>
Message-ID: <3.0.5.32.19980930095243.008ec100@dns.isogen.com>

At 10:18 AM 9/30/98 -0400, Sam Hunting wrote:
>> So let me stress my key point again: there is no such thing as a "public
>> topic" with no resource. If authors of topic maps need to refer to things
>> as topics that are outside of their maps, there must be a mapping from the
>                                            ^^^^^^****^^^^^^^^^^^^^^^^^^^^^
>> name of the topic to its definition. If this mapping doesn't already exist,
>> then the topic map author must provide it, in the ways I've shown in this
>> post and in others.
>
>This "must" is a question whose answer should be left to the topic map
>designer, since it is a question on which philosophers (that
>includes all of us) disagree and that probably cannot be resolved.

I mean "must" in the sense of "it must be the case that". It must be the
case that for each reference to a topic there is some for of definition for
that topic. If there's not, then the reference to topic is not
communicating anything. 

But by "definition" I mean "anything that serves to communicate what the
referencor means by the topic they've named", so the definition could be
very vague.

As an aside, I note that much of this discussion revolves around issues of
the definitions of key terms in the discussion, which is, of course, one of
the purposes of topic navigation maps: to define things.  Reminds me of a
certain policital and legal problem faced just now by a certain prominent
head of state.  Hmmmm.

I think I'll chase down those suggested philosophical readings now....

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From PDoonan at filenet.com  Wed Sep 30 17:07:27 1998
From: PDoonan at filenet.com (Doonan, Patrick)
Date: Mon Jun  7 17:05:13 2004
Subject: XML Parser
Message-ID: <98Sep30.075612pdt.55729@firewall.saros.com>

What are the recommendations for a good, well tested, production, and
freely distributable XML parser (C++ based).  I was going to use MSXML
but that forces my customers to install IE 4.0.
Thanks
--Patrick
-------------------------------------
FileNET Corporation
3565 Harbor Blvd.
Costa Mesa, CA 92626-1420
Phone:	(714) 966- 3220
Fax:	(714) 966-3288
E-Mail: 	pdoonan@filenet.com


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From larsga at ifi.uio.no  Wed Sep 30 17:14:48 1998
From: larsga at ifi.uio.no (Lars Marius Garshol)
Date: Mon Jun  7 17:05:14 2004
Subject: XML Parser
In-Reply-To: <98Sep30.075612pdt.55729@firewall.saros.com>
References: <98Sep30.075612pdt.55729@firewall.saros.com>
Message-ID: <wkzpbho9y8.fsf@ifi.uio.no>


* Patrick Doonan
|
| What are the recommendations for a good, well tested, production,
| and freely distributable XML parser (C++ based).  I was going to use
| MSXML but that forces my customers to install IE 4.0.

You can find a list of free parsers here:

<URL:http://www.stud.ifi.uio.no/~larsga/linker/XMLtools.html#SC_XML>


I don't think any of them are written in C++ (except SP, which is an
SGML parser that can handle XML), but some are written in C. 

--Lars M.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From Michel.Biezunski at wanadoo.fr  Wed Sep 30 17:19:50 1998
From: Michel.Biezunski at wanadoo.fr (Michel Biezunski)
Date: Mon Jun  7 17:05:14 2004
Subject: Ownership of Names (was Re: Public identifiers and topic    maps)
Message-ID: <002201bdec85$75f15e40$e2d3fcc1@none.wanadoo.fr>


[Eliot:]

>As an aside, I note that much of this discussion revolves around issues of
>the definitions of key terms in the discussion, which is, of course, one of
>the purposes of topic navigation maps: to define things.


Topic Navigation Maps do ***_NOT_**** define things. They (only!) offer a means
of interchanging whatever somebody says about something, including definitions,
if any. They can be used even when no definitions are involved.

Michel.


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From nwilson at programmar.com  Wed Sep 30 17:20:39 1998
From: nwilson at programmar.com (Norman E. Wilson)
Date: Mon Jun  7 17:05:14 2004
Subject: Greetings
Message-ID: <36124C1F.7C75@programmar.com>

I?m new to the group and wanted to (briefly) introduce myself.  I have
a software development background (C, C++, JAVA) and have been in the
industry for about 13 years.  I recently started a company that develops
visual tools and integrated development environments related to data
analysis, data conversion, and parsing.  I?m particulary interested in
knowledge representation, domain modeling, and in exploring how
information can be distributed across the Internet in ways which are
both useful and usable.  I?m excited about the implications of a common
standard for metadata and look forward to participating in these
discussions.

Regards,

Norm Wilson
nwilson@programmar.com
http://www.programmar.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From PDoonan at filenet.com  Wed Sep 30 17:26:33 1998
From: PDoonan at filenet.com (Doonan, Patrick)
Date: Mon Jun  7 17:05:14 2004
Subject: XML Parser
Message-ID: <C3AF5E329E21D2119C4C00805F6FF58F0933A2@hq-expo2.filenet.com>

A 'C' based parser is OK too.
--Patrick
-------------------------------------
FileNET Corporation
3565 Harbor Blvd.
Costa Mesa, CA 92626-1420
Phone:	(714) 966- 3220
Fax:	(714) 966-3288
E-Mail: 	pdoonan@filenet.com


		-----Original Message-----
		From:	Doonan, Patrick [mailto:PDoonan@filenet.com]
		Sent:	Wednesday, September 30, 1998 8:04 AM
		To:	xml-dev@ic.ac.uk
		Subject:	XML Parser

		What are the recommendations for a good, well tested,
production, and
		freely distributable XML parser (C++ based).  I was
going to use MSXML
		but that forces my customers to install IE 4.0.
		Thanks
		--Patrick
		-------------------------------------
		FileNET Corporation
		3565 Harbor Blvd.
		Costa Mesa, CA 92626-1420
		Phone:	(714) 966- 3220
		Fax:	(714) 966-3288
		E-Mail: 	pdoonan@filenet.com


		xml-dev: A list for W3C XML Developers. To post,
mailto:xml-dev@ic.ac.uk
		Archived as:
http://www.lists.ic.ac.uk/hypermail/xml-dev/
		To (un)subscribe, mailto:majordomo@ic.ac.uk the
following message;
		(un)subscribe xml-dev
		To subscribe to the digests, mailto:majordomo@ic.ac.uk
the following message;
		subscribe xml-dev-digest
		List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Wed Sep 30 17:49:07 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:14 2004
Subject: Binary Data in XML : Turning back the clock
Message-ID: <00a101bdec8a$76031f40$1e09e391@mhklaptop.bra01.icl.co.uk>

>For the XML Framework that I hope to release in the next
week or so depending on
>how long it takes to finish up on the documentation, base64
is handled natively
>in both the parser and the formatter.

I agree with all the respondents who have said Base64 is the
best way to carry binary data; I also agree with the
original comment that it would be nice if XML had made
explicit provision for this rather than leaving it to the
application.

> Someone please correct me if I am wrong, but I think this
is one service SAXON provides as well.
You are wrong. I'll think about it as a future facility!

Mike Kay


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep 30 17:53:03 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:14 2004
Subject: XSL: Why?
Message-ID: <199809301552.LAA27368@hesketh.com>

I'm writing a chapter on styles - just a brief overview, since the book
doesn't deal with generic presentation issues very much - and I've come to
something of an impasse.

I can't really see where XSL fits usefully into the XML developer's tool
kit.  I thought it was more capable than CSS, until I read the CSS2 spec in
depth and figured it had moved from covering 70% of design needs to
something more like 90-95%.  I'm finding it very hard to justify using XSL
rather than CSS for most of the situations I'm describing.

This may be the result of my background in Web development, rather than
SGML, but I can't see what's so intrinsically interesting about using a
transformative rather than a descriptive style language that it rates a
competing spec and has many people (notably Peter Flynn on XML-L a while
back) waiting for XSL rather than working with CSS now.

Would anyone care to evangelize XSL to a rather confused and somewhat
dispirited XML evangelist?  (I wish I had Frank Boumphrey's book now...)

Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 17:53:41 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:14 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
   maps)
In-Reply-To: <002201bdec85$75f15e40$e2d3fcc1@none.wanadoo.fr>
Message-ID: <3.0.5.32.19980930102801.008859a0@dns.isogen.com>

At 05:16 PM 9/30/98 +0200, Michel Biezunski wrote:
>
>[Eliot:]
>
>>As an aside, I note that much of this discussion revolves around issues of
>>the definitions of key terms in the discussion, which is, of course, one of
>>the purposes of topic navigation maps: to define things.
>
>
>Topic Navigation Maps do ***_NOT_**** define things. They (only!) offer a
means
>of interchanging whatever somebody says about something, including
definitions,
>if any. They can be used even when no definitions are involved.

But doesn't the act of associating a name with mentions of that name (a
topic to its members) serve to define what the topic means to the creator
of the topic?

As I have always understood topic maps, they serve to relate topics. A
topic is a named idea that is an opinion about some thing.  I can't talk
about a thing until I define what I mean by that thing, which is what a
topic does.

For example, if I create the topic "Lake Geneva" and connect it to the
latitude and longitude of a body of water in Switzerland, I have defined
what I mean by "Lake Geneva" in the context of this particular topic map.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 17:57:29 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:14 2004
Subject: Ownership of Names (was Re: Public identifiers and topic maps)
References: <Pine.GSO.3.94.980930101929.24329D-100000@Ocean.CAM.ORG>
Message-ID: <36125436.834A1127@locke.ccil.org>

Sam Hunting wrote:

> > (What would a namespace not strongly connected to the rest
> > be like?  Sort of like colors of the ultraviolet?)

I shouldn't have said "strongly".  Sorry.  I meant "not connected
at all."

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep 30 18:09:41 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:14 2004
Subject: Binary Data in XML
Message-ID: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>

Suppose I wrote up a NOTE, should occupy less than one page, proposing
a reserved attribute xml:packed with, for the moment, only two
allowed values, "none" and "base64".  The default value is "none".
If an element has xml:packed="base64" this means that

(a) the content of the element to which this is attached must be
    pure #PCDATA, no child elements and no references, and
(b) the content is encoded in base64, leading and trailing spaces allowed

This obviously couldn't retroactively become part of XML 1.0, but
if it went through a process and became a W3C recommendation, I bet
every parser author in the world would support it in about 15 minutes.

Base64 (a 4-for-3 encoding) wastes 33%, so I thought about perhaps
inventing Base128 (8-for-7) or maybe even a higher level to cut down
wasteage, but Base64 has the advantage that it avoids UTF8/ISO-8859 
confusion and I bet Mr. LZW will eat that 33% anyhow...

I also thought about xml:encoding=, but that conflicts with
encoding= in the XML declaration in a confusing way.

Are there any gotchas I'm missing?  Don't know if I could persuade
one of the WGs to take it up, but it seems pretty obvious that there
is not only industry demand but in fact people doing this already, so
the case is pretty strong I think. -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 18:21:25 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:14 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <00a101bdec8a$76031f40$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <000101bdec8d$0074a430$d3228018@jabr.ne.mediaone.net>

The problem with base64 encoding of binary data in and of itself is that it
says NOTHING about the format of the data, so despite the fact that it can
be included within an XML document from a syntactical point of view, if
people are complaining about 'ugly binary data' this does not solve the
problem. Not that MIME has this problem completely fixed, but at least there
is a Content-Type header which says at least something about the format of
the data. MIME (and hence the cid: URI) has a standard mechanism for typing
binary data.

The Web has exploded in use not because of HTML itself, rather this in
conjunction with HTTP's MIME variant -- to be realistic what would the Web
be like without pictures i.e. gif and jpeg). If base64 is used within XML, a
similar typing mechanism is also required.

Jonathan Borden
JABR Technology Corporation
mailto:jborden@mediaone.net


>
>
> >For the XML Framework that I hope to release in the next
> week or so depending on
> >how long it takes to finish up on the documentation, base64
> is handled natively
> >in both the parser and the formatter.
>
> I agree with all the respondents who have said Base64 is the
> best way to carry binary data; I also agree with the
> original comment that it would be nice if XML had made
> explicit provision for this rather than leaving it to the
> application.
>
> > Someone please correct me if I am wrong, but I think this
> is one service SAXON provides as well.
> You are wrong. I'll think about it as a future facility!
>
> Mike Kay
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep 30 18:25:10 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:15 2004
Subject: XSL: Why?
References: <199809301552.LAA27368@hesketh.com>
Message-ID: <36125B89.BB2A452C@infinet.com>

"Simon St.Laurent" wrote:

> I'm writing a chapter on styles - just a brief overview, since the book
> doesn't deal with generic presentation issues very much - and I've come to
> something of an impasse.
>
> I can't really see where XSL fits usefully into the XML developer's tool
> kit.  I thought it was more capable than CSS, until I read the CSS2 spec in
> depth and figured it had moved from covering 70% of design needs to
> something more like 90-95%.  I'm finding it very hard to justify using XSL
> rather than CSS for most of the situations I'm describing.
>
> This may be the result of my background in Web development, rather than
> SGML, but I can't see what's so intrinsically interesting about using a
> transformative rather than a descriptive style language that it rates a
> competing spec and has many people (notably Peter Flynn on XML-L a while
> back) waiting for XSL rather than working with CSS now.
>
> Would anyone care to evangelize XSL to a rather confused and somewhat
> dispirited XML evangelist?  (I wish I had Frank Boumphrey's book now...)

If you think that the entire future of the internet rests upon the lowest common
denominator technology of a web browser, then you are probably right that there
is no reason why XSL would ever be more useful than CSS.  I in particular have a
product I have been working on that I cannot speak too much of at the moment
which could be considered as a technology replacement for much of the
functionality of a web browser and does much much more.  XML plays a very large
role in the application as it is currently written in Java and we do not want our
content bound to something as inflexible as Java object serialization in case we
want to write our application in a different language at a different date.  XSL
at the moment does not play any currently implemented role, but I forsee that it
will be something we actively support as separating abstract content from
presentation content I believe will become a mainstay of application frameworks
for the web.  The best thing we have right now that I have seen is Cold Fusion.
This primarily is only a server-side solution and costs a lot of money.  XSL's
strength I feel will be on the client side as all that a web server will need to
do is present easy to construct XML content at the server level, and then fetch a
stylesheet for the particular user (which could be customized via some sort of
profile).  The content viewer which may be an HTML browser then can do all of
this processing on the client machine rather than bog down the server with
complicated content presentation processing.

The original idea of Java (which I still fervently believe in) is that its
strength is on the client since you can transfer a lot of the processing from the
server to the client in a distributed fashion (means you need to spend less on
servers and programmer salaries to manage those servers).  XSL I believe also
will fit into this concept as it will reduce overall business costs of running an
up to date, dynamic, and attractive web site by transferring a lot of the
processing to the client (without the needs of JavaScript) as well as open the
door to many new kinds of internet content viewing software.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 18:29:39 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML
In-Reply-To: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
Message-ID: <000201bdec8d$f3fcb250$d3228018@jabr.ne.mediaone.net>

Yeah, but please include an

xml:content-type=

as a required attribute ... err or perhaps the default is 
xml:content-type="application/octet-stream"

another alternative to xml:packed might be
xml:content-encoding="base64" ??

Jonathan Borden
JABR Technology Corporation
mailto:jborden@mediaone.net

> 
> 
> Suppose I wrote up a NOTE, should occupy less than one page, proposing
> a reserved attribute xml:packed with, for the moment, only two
> allowed values, "none" and "base64".  The default value is "none".
> If an element has xml:packed="base64" this means that
> 
> (a) the content of the element to which this is attached must be
>     pure #PCDATA, no child elements and no references, and
> (b) the content is encoded in base64, leading and trailing spaces allowed
> 
> This obviously couldn't retroactively become part of XML 1.0, but
> if it went through a process and became a W3C recommendation, I bet
> every parser author in the world would support it in about 15 minutes.
> 
> Base64 (a 4-for-3 encoding) wastes 33%, so I thought about perhaps
> inventing Base128 (8-for-7) or maybe even a higher level to cut down
> wasteage, but Base64 has the advantage that it avoids UTF8/ISO-8859 
> confusion and I bet Mr. LZW will eat that 33% anyhow...
> 
> I also thought about xml:encoding=, but that conflicts with
> encoding= in the XML declaration in a confusing way.
> 
> Are there any gotchas I'm missing?  Don't know if I could persuade
> one of the WGs to take it up, but it seems pretty obvious that there
> is not only industry demand but in fact people doing this already, so
> the case is pretty strong I think. -Tim
> 
> 
> 

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ora.lassila at research.nokia.com  Wed Sep 30 18:31:26 1998
From: ora.lassila at research.nokia.com (Ora Lassila)
Date: Mon Jun  7 17:05:15 2004
Subject: Looking for an XML parser in Common Lisp
Message-ID: <36125BED.1003A9F2@research.nokia.com>

I am looking for an XML parser written in Common Lisp.
Does anybody know if such a thing exists?

    - Ora

--
Ora Lassila, <ora.lassila@research.nokia.com>
Agent Technology, Nokia Research Center / Boston
phone: +1 (781) 238-4908


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 30 18:35:31 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML
In-Reply-To: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
References: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
Message-ID: <13842.23387.450663.757919@localhost.localdomain>

Tim Bray writes:

 > Suppose I wrote up a NOTE, should occupy less than one page,
 > proposing a reserved attribute xml:packed with, for the moment,
 > only two allowed values, "none" and "base64".  The default value is
 > "none".  If an element has xml:packed="base64" this means that

This sounds reasonably straight-forward, but I'd like a noun rather
than the adjective "packed".  Time rightly points out that
"xml:encoding" could cause confusion: are there any better suggestions 
out there?

 > This obviously couldn't retroactively become part of XML 1.0, but
 > if it went through a process and became a W3C recommendation, I bet
 > every parser author in the world would support it in about 15
 > minutes.

I don't know if the parser authors should worry about it -- how would
one deliver the binary information in the DOM or SAX, for example?  It
seems more likely that people would build support into the higher
interface layers like SAXON.

 > Base64 (a 4-for-3 encoding) wastes 33%, so I thought about perhaps
 > inventing Base128 (8-for-7) or maybe even a higher level to cut
 > down wasteage, but Base64 has the advantage that it avoids
 > UTF8/ISO-8859 confusion and I bet Mr. LZW will eat that 33%
 > anyhow...

Simplicity and ubiquity always win -- stick with Base64, since
everyone can already work with it.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Wed Sep 30 18:37:26 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:05:15 2004
Subject: Web server logs in XML?
Message-ID: <3.0.5.32.19980930093514.01036100@scripting.com>

Is anyone working on a format for doing HTTP server logs in XML? If so, is
there a spec somewhere on the web? I'm working on logging this week, and
just realized that I should be doing the logs in XML. Thanks. Dave

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep 30 18:37:45 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:15 2004
Subject: XSL: Why?
In-Reply-To: <36125B89.BB2A452C@infinet.com>
References: <199809301552.LAA27368@hesketh.com>
Message-ID: <199809301633.MAA27951@hesketh.com>

At 12:25 PM 9/30/98 -0400, Tyler Baker wrote:
>XSL
>at the moment does not play any currently implemented role, but I forsee
that it
>will be something we actively support as separating abstract content from
>presentation content I believe will become a mainstay of application
frameworks
>for the web.  The best thing we have right now that I have seen is Cold
Fusion.
>This primarily is only a server-side solution and costs a lot of money.
XSL's
>strength I feel will be on the client side as all that a web server will
need to
>do is present easy to construct XML content at the server level, and then
fetch a
>stylesheet for the particular user (which could be customized via some
sort of
>profile).  The content viewer which may be an HTML browser then can do all of
>this processing on the client machine rather than bog down the server with
>complicated content presentation processing.

Once again, CSS can separate abstract content from presentation quite
neatly.  In what circumstances is this separation so drastic as to require
a transformation?  I'm sure they must be out there.  Database tables ->
neatly formatted pages?  Documents that can change their structure at a
user's whim?

I'm having a hard time coming up with practical uses for XSL that don't
remind me of nuclear missiles homing in on a gnat, hell bent on blasting
that little gnat to smithereens.


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 30 18:37:41 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <000101bdec8d$0074a430$d3228018@jabr.ne.mediaone.net>
References: <00a101bdec8a$76031f40$1e09e391@mhklaptop.bra01.icl.co.uk>
	<000101bdec8d$0074a430$d3228018@jabr.ne.mediaone.net>
Message-ID: <13842.23875.111667.926535@localhost.localdomain>

Jonathan  A. Borden writes:

 > The problem with base64 encoding of binary data in and of itself is that it
 > says NOTHING about the format of the data, so despite the fact that it can
 > be included within an XML document from a syntactical point of view, if
 > people are complaining about 'ugly binary data' this does not solve the
 > problem. Not that MIME has this problem completely fixed, but at least there
 > is a Content-Type header which says at least something about the format of
 > the data. MIME (and hence the cid: URI) has a standard mechanism for typing
 > binary data.
 > 
 > The Web has exploded in use not because of HTML itself, rather this in
 > conjunction with HTTP's MIME variant -- to be realistic what would the Web
 > be like without pictures i.e. gif and jpeg). If base64 is used within XML, a
 > similar typing mechanism is also required.

Quite right.  Right now, XML 1.0 has notations for this purpose, as
Eliot keeps reminding us; it would also be possible to invent a
standard attribute like 'xml:content', for use whether or not an
element's content was Base64-encoded:

 <data xml:content="application/pdf" xml:packed="Base64">...</data>


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jlapp at webMethods.com  Wed Sep 30 18:44:51 1998
From: jlapp at webMethods.com (Joe Lapp)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML : Turning back the clock
Message-ID: <3.0.32.19980930123918.00b35390@gw1.webmethods.com>

At 12:11 PM 9/30/98 -0400, Jonathan  A. Borden wrote:
>The problem with base64 encoding of binary data in and of itself is that it
>says NOTHING about the format of the data[...]

I suspect that this is a schema issue.  We already have this problem
with the following elements:

    <date>9/30/98</date>
    <date>Sept 30 1998</date>

Seems to me that the base64 issue is orthogonal to the data type issue.
I can represent the same gif in three different base encodings.

Now the real purpose in defining the xml:package attribute (or whatever)
is to ensure that the binary data is the same binary data in both the
XML producer and the XML consumer.

I don't know whether anyone will do so, but one might argue that if we
don't nail the exact format (gif, jpeg, whatever), then there is no
point in having the encoding.  One might say that you still can't do
anything with it.

But the counter-argument is the same argument for why XML has value
despite the fact that it doesn't define the semantics of all tag names.
We can use XML as the transport format and move application-specific
synactic issues (date format, image format) and semantic issues (what
the tags mean) completely into the applications themselves.  Without
saying how one puts binary data in an XML document, we cannot markup
binary data and shuttle it around in a portable way.
--
Joe Lapp, Senior Engineer | jlapp@webMethods.com
webMethods, Inc.          | Voice: 703-267-1726
http://www.webMethods.com |   Fax: 703-352-0370

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep 30 18:46:36 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML
References: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
Message-ID: <36125FA2.733F77B6@infinet.com>

Tim Bray wrote:

> Suppose I wrote up a NOTE, should occupy less than one page, proposing
> a reserved attribute xml:packed with, for the moment, only two
> allowed values, "none" and "base64".  The default value is "none".
> If an element has xml:packed="base64" this means that
>
> (a) the content of the element to which this is attached must be
>     pure #PCDATA, no child elements and no references, and
> (b) the content is encoded in base64, leading and trailing spaces allowed

Why would the content have to have no child elements or references?  I can see
how this constraint would make things simpler for the parser and avoid some of
the obvious confusion with mixed content models.

> This obviously couldn't retroactively become part of XML 1.0, but
> if it went through a process and became a W3C recommendation, I bet
> every parser author in the world would support it in about 15 minutes.
>
> Base64 (a 4-for-3 encoding) wastes 33%, so I thought about perhaps
> inventing Base128 (8-for-7) or maybe even a higher level to cut down
> wasteage, but Base64 has the advantage that it avoids UTF8/ISO-8859
> confusion and I bet Mr. LZW will eat that 33% anyhow...

Something I still wonder about is whether UNISYS is still playing patent rights
games with LZW.

> I also thought about xml:encoding=, but that conflicts with
> encoding= in the XML declaration in a confusing way.

You could have something like xml:type="base64" which opens the door for more
efficient processing by the XML parser for data primitives before presenting them
to the application.  So you could have something like:

xml:type="int".  It would be up to the parser to marshal the data in big-endian,
little-endian or whatever format the host operating system uses.  Right now, most
parsers and parser interfaces provide attribute values and character content only
in String form.  If Mathematica were to do anything useful with XML and a Java
interface (with a lower-level native kernel of course) it would first have to
parse the data as a String.  This is inefficient as Mathematica would only care
to have the data as a number in the first place.  If the character content cannot
be parsed into a number, throw an exception or return an error code.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 18:51:06 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML
References: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
Message-ID: <3612605A.622D9698@locke.ccil.org>

Tim Bray scripsit:

> Suppose I wrote up a NOTE, should occupy less than one page, proposing
> a reserved attribute xml:packed with, for the moment, only two
> allowed values, "none" and "base64".  The default value is "none".
> If an element has xml:packed="base64" this means that
> 
> (a) the content of the element to which this is attached must be
>     pure #PCDATA, no child elements and no references, and
> (b) the content is encoded in base64, leading and trailing spaces allowed

Excellent.  I have the following suggestions:

1) Any S characters should be allowed in base64 content, but ignored.
That agrees with the normal use of Base64, where line end characters
are typically added every 64 characters or so.

2) xml:packed should be a notation attribute, and W3C should define
a public ID for Base 64 notation, based on RFC 2045.

3) It should be pointed out that this is a typical example of
notation-governed elements.  At the moment, the use of notation
attributes in XML is not well motivated by the Recommendation.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tyler at infinet.com  Wed Sep 30 18:54:48 1998
From: tyler at infinet.com (Tyler Baker)
Date: Mon Jun  7 17:05:15 2004
Subject: XSL: Why?
References: <199809301552.LAA27368@hesketh.com> <199809301633.MAA27951@hesketh.com>
Message-ID: <361261CA.9CEE35EA@infinet.com>

"Simon St.Laurent" wrote:

> At 12:25 PM 9/30/98 -0400, Tyler Baker wrote:
> >XSL
> >at the moment does not play any currently implemented role, but I forsee
> that it
> >will be something we actively support as separating abstract content from
> >presentation content I believe will become a mainstay of application
> frameworks
> >for the web.  The best thing we have right now that I have seen is Cold
> Fusion.
> >This primarily is only a server-side solution and costs a lot of money.
> XSL's
> >strength I feel will be on the client side as all that a web server will
> need to
> >do is present easy to construct XML content at the server level, and then
> fetch a
> >stylesheet for the particular user (which could be customized via some
> sort of
> >profile).  The content viewer which may be an HTML browser then can do all of
> >this processing on the client machine rather than bog down the server with
> >complicated content presentation processing.
>
> Once again, CSS can separate abstract content from presentation quite
> neatly.  In what circumstances is this separation so drastic as to require
> a transformation?  I'm sure they must be out there.  Database tables ->
> neatly formatted pages?  Documents that can change their structure at a
> user's whim?
>
> I'm having a hard time coming up with practical uses for XSL that don't
> remind me of nuclear missiles homing in on a gnat, hell bent on blasting
> that little gnat to smithereens.

CSS more or less from my understanding is pretty much completely bound to the web
browser.  If you think that Netscape Navigator and Microsoft Internet Explorer are
the best things that will ever become of the internet for end users, then I guess
there is no need for XSL.  XSL I think has much more important future
ramifications outside of the web browser arena.  It would be sad to see the
internet stall on web browser technology as the web browser wars for the time
being are over as little has happened from a technology standpoint that is
innovative and creative in what we now know as the web browser.  This is
completely understandable as neither Netscape or Microsoft has any financial
incentive to improve their products since they both give them away for free.  In
other words, we get what we pay for.

Tyler


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From tbray at textuality.com  Wed Sep 30 19:00:32 1998
From: tbray at textuality.com (Tim Bray)
Date: Mon Jun  7 17:05:15 2004
Subject: Binary Data in XML : Turning back the clock
Message-ID: <3.0.32.19980930095042.00b68100@pop.intergate.bc.ca>

At 12:34 PM 9/30/98 -0400, david@megginson.com wrote:
>Quite right.  Right now, XML 1.0 has notations for this purpose, as
>Eliot keeps reminding us; it would also be possible to invent a
>standard attribute like 'xml:content', for use whether or not an
>element's content was Base64-encoded:

Yeah, but NOTATIONs require the use of a validating processor, and
lots of non-validating apps would like to use base64.  Having said
that, I think that your proposed xml:content is more or less exactly
what NOTATION is for? -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 19:01:21 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:15 2004
Subject: XSL: Why?
References: <199809301552.LAA27368@hesketh.com>
Message-ID: <36126292.1BEAD074@locke.ccil.org>

Simon St.Laurent wrote:

> This may be the result of my background in Web development, rather than
> SGML, but I can't see what's so intrinsically interesting about using a
> transformative rather than a descriptive style language that it rates a
> competing spec and has many people (notably Peter Flynn on XML-L a while
> back) waiting for XSL rather than working with CSS now.

Well, I think XSL should probably be split into two parts (and the
current state of the spec suggests that it has been, de facto):
the transformation language, which is useful in contexts having
nothing to do with styling, and an XML encoding of CSSn, for n >= 2.

For example, CSS2 alone cannot take a document with TITLE elements
in various divisions and generate a properly numbered and indented
TOC at the beginning of the document.  XSL, IIRC, can do that
by processing the tree twice.

OTOH, I see no reason why XSL should specify formatting objects
that are other than those of CSS.  A standard way to encode the
CSS language as XML, so that the transformation language can
generate it, would do all that is necessary.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 19:04:51 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML
References: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca> <13842.23387.450663.757919@localhost.localdomain>
Message-ID: <36126376.1864A898@locke.ccil.org>

david@megginson.com wrote:

> I don't know if the parser authors should worry about it -- how would
> one deliver the binary information in the DOM or SAX, for example?  It
> seems more likely that people would build support into the higher
> interface layers like SAXON.

As soon as the details settle down, I will produce a ParserFilter
that allows SAX clients to register a BLOBHandler implementing
"BLOB(byte[] b, int start, int length".

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From david at megginson.com  Wed Sep 30 19:08:50 1998
From: david at megginson.com (david@megginson.com)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <3.0.32.19980930095042.00b68100@pop.intergate.bc.ca>
References: <3.0.32.19980930095042.00b68100@pop.intergate.bc.ca>
Message-ID: <13842.25598.983064.317813@localhost.localdomain>

Tim Bray writes:

 > At 12:34 PM 9/30/98 -0400, david@megginson.com wrote:

 > >Quite right.  Right now, XML 1.0 has notations for this purpose, as
 > >Eliot keeps reminding us; it would also be possible to invent a
 > >standard attribute like 'xml:content', for use whether or not an
 > >element's content was Base64-encoded:
 > 
 > Yeah, but NOTATIONs require the use of a validating processor, and
 > lots of non-validating apps would like to use base64.  Having said
 > that, I think that your proposed xml:content is more or less exactly
 > what NOTATION is for? -Tim

Yes, it was intended as a lighter-weight alternative to the use of
NOTATION attributes.


All the best,


David

-- 
David Megginson                 david@megginson.com
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From b.laforge at jxml.com  Wed Sep 30 19:16:46 1998
From: b.laforge at jxml.com (Bill la Forge)
Date: Mon Jun  7 17:05:16 2004
Subject: Web server logs in XML?
Message-ID: <000601bdec95$7b815e40$ab026982@thing1.camb.opengroup.org>

Look at the info and archive links for XLF at
        http://www.jxml.com

Bill

-----Original Message-----
From: Dave Winer <dave@userland.com>
To: xml-dev@ic.ac.uk <xml-dev@ic.ac.uk>
Date: Wednesday, September 30, 1998 12:48 PM
Subject: Web server logs in XML?


>Is anyone working on a format for doing HTTP server logs in XML? If so, is
>there a spec somewhere on the web? I'm working on logging this week, and
>just realized that I should be doing the logs in XML. Thanks. Dave
>
>--------------------------------------
>http://www.userland.com/directory.html
>
>
>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>(un)subscribe xml-dev
>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
message;
>subscribe xml-dev-digest
>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep 30 19:19:54 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML
Message-ID: <006f01bdec95$231c9580$e86118cb@caleb>

-----Original Message-----
From: Tim Bray <tbray@textuality.com>
>Suppose I wrote up a NOTE, should occupy less than one page, proposing
>a reserved attribute xml:packed with, for the moment, only two
>allowed values, "none" and "base64".

Could this be viewed as merely a predefined notation attribute and notation?
In other words, would it be true to say that what you are suggesting could
be achieved right now with notation attributes and notations, the only
problem is you'd need a DTD and a validating (or "DTD-aware") parser and you
are wanting to avoid this requirement?

James Tauber


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From andrewl at microsoft.com  Wed Sep 30 19:25:01 1998
From: andrewl at microsoft.com (Andrew Layman)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML
Message-ID: <5BF896CAFE8DD111812400805F1991F7038CA87F@RED-MSG-08>

Tim Bray wrote

"Base64 (a 4-for-3 encoding) wastes 33%, so I thought about perhaps
inventing Base128 (8-for-7) or maybe even a higher level to cut down
wasteage, but Base64 has the advantage that it avoids UTF8/ISO-8859 
confusion and I bet Mr. LZW will eat that 33% anyhow..."

The thing to consider here is that, although XML is Unicode, the typical
encoding of it is predominantly ANSI octets, in which case Base64 works
pretty well.  But run some tests...

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 19:31:15 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:16 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
	   maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <360C0929.A4DC3C69@locke.ccil.org>
	 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
	 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
	 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
	 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
	 <3.0.5.32.19980929104734.008fd660@dns.isogen.com> <3.0.5.32.19980929121602.0096b100@dns.isogen.com>
Message-ID: <361269E1.6069227E@locke.ccil.org>

W. Eliot Kimber wrote:

> [NOTE: this will be last post on this thread, as it's clear to me that John
> and I are not communicating, probably because we have divergent definitions
> of some key concepts. However, I feel I must answer John's note, at least
> in an attempt to make my side of the argument clear.]

Okay, I'll try to stick to I-sentences (e.g. "I think ...", "I believe
...", etc.), and I'll shut up after this too.

> Consensus of whom? If you can define the set of people who share an
> understanding of what "Spencertown" is then you have served to define it,
> because I can interrogate those people and get them to tell me what they
> think the boundaries of Spencertown are.

I doubt I can do that either.  Common acceptation is, to my mind,
sui generis: it cannot be reduced to talk of specific individuals.

> You seem to be saying "I want an authority, but there can be no authority,
> so the problem is unsolvable, but I must have solution" and I'm saying
> "there is an authority, so there is no problem."  The choice seems obvious
> to me.

No, I am saying "I want a universally intelligible FPI, but there is no
useful authority to catalog the entry, so I am forced to use an ad-hoc
authority (such as myself) or to forgo FPIs."

> HOW DO YOU FRIGGIN' KNOW IT?

I don't know *how* I know it, I just do.  As a Theodore Sturgeon
character once said in a related connection, "How does your head
know your arm isn't dead?"  :-)

> If you can't
> name the authority (even if that authority is your neighbor or what you
> read in the paper or the intersection of all the opinions you've gotten),
> then you are lying when you say you know what other people mean. You may
> know what you *think* other people mean.

This is always a possibility: that what I mean by "red" is what you
mean by "green".  Trying to use terms in the Real World eventually cures all
such problems, although not necessarily immediately.

> The phrase "I know what other people mean by..." is either a lie or a
> dangerous assumption, because opinion on most things is usually much less
> consistent than one might think.

In particular cases it may be dangerous.  But in fact we acquire
almost all of our language (barring specialized vocabulary) before
we know what a definition is, never mind how to apply one.

> You may not know how
> you got your notion of what Spencertown is, but you have a notion and can
> communicate it in absolute terms by reference to an authority, such as a
> map of Austerlitz within which you can address the region you call
> Spencertown.

I think we have incurably different definitions of what "authority"
and "authoritative" mean.  For me, an authority is something that
is *generally* agreed upon as authoritative: what the authority
says is so (within its scope) is so *because* it says so. The Duden
dictionary is an authority for (pre-1997) German orthography; there is
no authority for English orthography.

> If a thing can be named it can be defined, therefore all names resolve to
> definitions of some sort.

I don't believe this is true.  "Socrates" is a name with a well-understood
referent, but *defining* Socrates, in the sense of specifying
sufficient properties to distinguish him from Isocrates, is a
non-trivial job, and (IMHO) you need not be able to define
Socrates (in this sense) to know who he is.  Kripke's "Naming and
Necessity" is heavy sledding, but puts this point with far
greater thoroughness.

> >The various maps and so on are not authoritative either.  A map *describes*,
> >it does not constitute an authority.
> 
> But a given map will either match or not match *your* understanding of what
> Spencertown is, and therefore you can use it a fixed reference point to
> define what Spencertown is, for you.

But it is not, under my definitions, authoritative.  At most it says
what other maps say and what other people say, none of whom are
authoritative either.
 
> If I move to Austerlitz and someone tells me "you should really
> live in Spencertown", the first thing I have to do is ask them what
> Spencertown is, at which point I'll get their definition of it.  I might
> accept their definition at face value or I might say "by what authority to
> do you use that definition?" and they might say "everybody knows that's
> what Spencertown is", at which point I say fine.

Just so.  What I would like is an FPI which encodes that understanding.

> Thus, every person who has an idea of what Spencertown is is an
> authority--the question is, how much weight do you give them?

Under the theory of ad-hoc authority (every man his own FPI-maker),
it seems we have to give them all equal weight, and the notion of
*the* Spencertown gets lost in a welter of random opinions, informed
or otherwise.

> That doesn't mean that they have an *absolute* definition of what
> life is, it simply means that they've said "when I use the term 'life', I
> mean things that exhibit the following properties...".

It seems to me that you can be a perfectly good molecular biologist
without being able to define "life", or equally a good snail
biologist.  I cannot define "information" or "computing" in a
perfectly satisfactory way, but I think I understand something
about those subjects.

Is there a useful tutorial about biblocs?  I would like to
understand them further.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From M.H.Kay at eng.icl.co.uk  Wed Sep 30 19:34:30 1998
From: M.H.Kay at eng.icl.co.uk (Michael Kay)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML
Message-ID: <010101bdec98$a890fbe0$1e09e391@mhklaptop.bra01.icl.co.uk>

>Tim Bray scripsit:
>
>> Suppose I wrote up a NOTE, should occupy less than one
page, proposing
>> a reserved attribute xml:packed with, for the moment,
only two
>> allowed values, "none" and "base64".

A legalistic niggle, which I am sure Tim is aware of more
than anyone: "Names beginning with the string 'xml'
[case-blind] are reserved for standardization in this or
future versions of this specification".

XML-related standards such as Namespaces, XLink have already
broken the letter of this rule; but it doesn't seem
appropriate that anyone who thinks of a nice XML idea and
wants to propose it for standardisation should automatically
have rights to this namespace, even if he is a co-author of
the standard!

Some W3C process needed here...

Mike


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From dave at userland.com  Wed Sep 30 19:39:00 1998
From: dave at userland.com (Dave Winer)
Date: Mon Jun  7 17:05:16 2004
Subject: Web server logs in XML?
In-Reply-To: <000601bdec95$7b815e40$ab026982@thing1.camb.opengroup.org>
Message-ID: <3.0.5.32.19980930103642.010803c0@scripting.com>

Bill, thanks for the link, here's where I found the spec:

http://www.docuverse.com/xlf/NOTE-XLF-19980721-all.html

But it's not much of a spec. Certainly not ready to be implemented from.

I'd love to see an example XML log file. Or a pointer to one.

Dave

At 01:12 PM 9/30/98 -0400, you wrote:
>Look at the info and archive links for XLF at
>        http://www.jxml.com
>
>Bill
>
>-----Original Message-----
>From: Dave Winer <dave@userland.com>
>To: xml-dev@ic.ac.uk <xml-dev@ic.ac.uk>
>Date: Wednesday, September 30, 1998 12:48 PM
>Subject: Web server logs in XML?
>
>
>>Is anyone working on a format for doing HTTP server logs in XML? If so, is
>>there a spec somewhere on the web? I'm working on logging this week, and
>>just realized that I should be doing the logs in XML. Thanks. Dave
>>
>>--------------------------------------
>>http://www.userland.com/directory.html
>>
>>
>>xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
>>Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
>>To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
>>(un)subscribe xml-dev
>>To subscribe to the digests, mailto:majordomo@ic.ac.uk the following
>message;
>>subscribe xml-dev-digest
>>List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)
>>
>
>

--------------------------------------
http://www.userland.com/directory.html


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From simonstl at simonstl.com  Wed Sep 30 19:44:11 1998
From: simonstl at simonstl.com (Simon St.Laurent)
Date: Mon Jun  7 17:05:16 2004
Subject: XSL: Why?
In-Reply-To: <361261CA.9CEE35EA@infinet.com>
References: <199809301552.LAA27368@hesketh.com>
 <199809301633.MAA27951@hesketh.com>
Message-ID: <199809301740.NAA28908@hesketh.com>

At 12:52 PM 9/30/98 -0400, Tyler Baker wrote:
>CSS more or less from my understanding is pretty much completely bound to
the web
>browser.  If you think that Netscape Navigator and Microsoft Internet
Explorer are
>the best things that will ever become of the internet for end users, then
I guess
>there is no need for XSL.  XSL I think has much more important future
>ramifications outside of the web browser arena.  It would be sad to see the
>internet stall on web browser technology as the web browser wars for the time
>being are over as little has happened from a technology standpoint that is
>innovative and creative in what we now know as the web browser.  This is
>completely understandable as neither Netscape or Microsoft has any financial
>incentive to improve their products since they both give them away for
free.  In
>other words, we get what we pay for.

CSS isn't bound to the Web browser any more than XSL is bound to the few
implementations that already exist for it.  I don't have any trouble
imagining a word processor that editing XML and used CSS to apply
formatting, nor do I have trouble seeing a page layout tool that used CSS
to create pages for many different kinds of documents.

The question remains: why are transformations necessary to styles?  The
browser has to do an internal elements->presentation transformation anyway,
so why put in the extra elements->transformation->presentation step?  There
are plenty of transformation tools that can already do transformations.
Why is this so important to presentations?

Maybe it's time to subscribe to the xsl list. (I did for a while, and
couldn't figure out why it was supposed to be interesting.)  This shouldn't
sound any dumber there than my earliest questions sounded on XML-Dev.
Still, this seems like a fairly significant XML software architecture
issue, so I was hoping to find answers here.


Simon St.Laurent
Dynamic HTML: A Primer / XML: A Primer
Cookies / Sharing Bandwidth (November)
Building XML Applications (December)
http://www.simonstl.com

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From fblau at nina.snohomish.wa.gov  Wed Sep 30 19:47:45 1998
From: fblau at nina.snohomish.wa.gov (Frank Blau)
Date: Mon Jun  7 17:05:16 2004
Subject: Ownership of Names (was Re: Public identifiers and topic  
		   maps)
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
		 <360C0929.A4DC3C69@locke.ccil.org>
		 <360C0929.A4DC3C69@locke.ccil.org>
		 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
		 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
		 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
		 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
		 <3.0.5.32.19980929104734.008fd660@dns.isogen.com> <3.0.5.32.19980929121602.0096b100@dns.isogen.com> <361269E1.6069227E@locke.ccil.org>
Message-ID: <36126C05.271C6471@nina.snohomish.wa.gov>

I've lurked long on this list...

When I talk to people about XML, one of the "problems" I identify is the
overabundance of navel-gazing academic esoterica in the standards committees...
This list provides more examples of that than anything else (short of Dilbert)...

Can we stop arguing about what are essentially personal, political and/or
philosophical positions and get back to xml development issues? I can't believe I'm
reading about whether or not "acceptation" is "sui generis"...

Frank

John Cowan wrote:

> W. Eliot Kimber wrote:
>
> > [NOTE: this will be last post on this thread, as it's clear to me that John
> > and I are not communicating, probably because we have divergent definitions
> > of some key concepts. However, I feel I must answer John's note, at least
> > in an attempt to make my side of the argument clear.]
>
> Okay, I'll try to stick to I-sentences (e.g. "I think ...", "I believe
> ...", etc.), and I'll shut up after this too.
>
> > Consensus of whom? If you can define the set of people who share an
> > understanding of what "Spencertown" is then you have served to define it,
> > because I can interrogate those people and get them to tell me what they
> > think the boundaries of Spencertown are.
>
> I doubt I can do that either.  Common acceptation is, to my mind,
> sui generis: it cannot be reduced to talk of specific individuals.
>
> > You seem to be saying "I want an authority, but there can be no authority,
> > so the problem is unsolvable, but I must have solution" and I'm saying
> > "there is an authority, so there is no problem."  The choice seems obvious
> > to me.
>
> No, I am saying "I want a universally intelligible FPI, but there is no
> useful authority to catalog the entry, so I am forced to use an ad-hoc
> authority (such as myself) or to forgo FPIs."
>
> > HOW DO YOU FRIGGIN' KNOW IT?
>
> I don't know *how* I know it, I just do.  As a Theodore Sturgeon
> character once said in a related connection, "How does your head
> know your arm isn't dead?"  :-)
>
> > If you can't
> > name the authority (even if that authority is your neighbor or what you
> > read in the paper or the intersection of all the opinions you've gotten),
> > then you are lying when you say you know what other people mean. You may
> > know what you *think* other people mean.
>
> This is always a possibility: that what I mean by "red" is what you
> mean by "green".  Trying to use terms in the Real World eventually cures all
> such problems, although not necessarily immediately.
>
> > The phrase "I know what other people mean by..." is either a lie or a
> > dangerous assumption, because opinion on most things is usually much less
> > consistent than one might think.
>
> In particular cases it may be dangerous.  But in fact we acquire
> almost all of our language (barring specialized vocabulary) before
> we know what a definition is, never mind how to apply one.
>
> > You may not know how
> > you got your notion of what Spencertown is, but you have a notion and can
> > communicate it in absolute terms by reference to an authority, such as a
> > map of Austerlitz within which you can address the region you call
> > Spencertown.
>
> I think we have incurably different definitions of what "authority"
> and "authoritative" mean.  For me, an authority is something that
> is *generally* agreed upon as authoritative: what the authority
> says is so (within its scope) is so *because* it says so. The Duden
> dictionary is an authority for (pre-1997) German orthography; there is
> no authority for English orthography.
>
> > If a thing can be named it can be defined, therefore all names resolve to
> > definitions of some sort.
>
> I don't believe this is true.  "Socrates" is a name with a well-understood
> referent, but *defining* Socrates, in the sense of specifying
> sufficient properties to distinguish him from Isocrates, is a
> non-trivial job, and (IMHO) you need not be able to define
> Socrates (in this sense) to know who he is.  Kripke's "Naming and
> Necessity" is heavy sledding, but puts this point with far
> greater thoroughness.
>
> > >The various maps and so on are not authoritative either.  A map *describes*,
> > >it does not constitute an authority.
> >
> > But a given map will either match or not match *your* understanding of what
> > Spencertown is, and therefore you can use it a fixed reference point to
> > define what Spencertown is, for you.
>
> But it is not, under my definitions, authoritative.  At most it says
> what other maps say and what other people say, none of whom are
> authoritative either.
>
> > If I move to Austerlitz and someone tells me "you should really
> > live in Spencertown", the first thing I have to do is ask them what
> > Spencertown is, at which point I'll get their definition of it.  I might
> > accept their definition at face value or I might say "by what authority to
> > do you use that definition?" and they might say "everybody knows that's
> > what Spencertown is", at which point I say fine.
>
> Just so.  What I would like is an FPI which encodes that understanding.
>
> > Thus, every person who has an idea of what Spencertown is is an
> > authority--the question is, how much weight do you give them?
>
> Under the theory of ad-hoc authority (every man his own FPI-maker),
> it seems we have to give them all equal weight, and the notion of
> *the* Spencertown gets lost in a welter of random opinions, informed
> or otherwise.
>
> > That doesn't mean that they have an *absolute* definition of what
> > life is, it simply means that they've said "when I use the term 'life', I
> > mean things that exhibit the following properties...".
>
> It seems to me that you can be a perfectly good molecular biologist
> without being able to define "life", or equally a good snail
> biologist.  I cannot define "information" or "computing" in a
> perfectly satisfactory way, but I think I understand something
> about those subjects.
>
> Is there a useful tutorial about biblocs?  I would like to
> understand them further.
>
> --
> John Cowan      http://www.ccil.org/~cowan              cowan@ccil.org
>         You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
>         You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
>                 Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)
>
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 20:05:55 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <3.0.32.19980930095042.00b68100@pop.intergate.bc.ca>
Message-ID: <3.0.5.32.19980930130001.00881d70@dns.isogen.com>

At 09:54 AM 9/30/98 -0700, Tim Bray wrote:
>At 12:34 PM 9/30/98 -0400, david@megginson.com wrote:
>>Quite right.  Right now, XML 1.0 has notations for this purpose, as
>>Eliot keeps reminding us; it would also be possible to invent a
>>standard attribute like 'xml:content', for use whether or not an
>>element's content was Base64-encoded:
>
>Yeah, but NOTATIONs require the use of a validating processor, and
>lots of non-validating apps would like to use base64.  Having said
>that, I think that your proposed xml:content is more or less exactly
>what NOTATION is for? -Tim

<xml-dev@ic.ac.uk>

Notations don't require validating parsers, they only require parsers that
read and understand notation declarations. But they also require that there
be a DOCTYPE declaration so you can specify the notation declaration.

In this case, I think that a conventialized attribute would be sufficient,
but a notation of the same name as the value of the attribute should be
interpreted in the same way.

Or, said another way, an "xml:content" attribute could be presumed to be
declared with a value prescription of "NOTATION". The notation declaration
for the named notation is then implied.

This is analogous to "PI targets", where the spec says that the PI target
name is, semantically, the name of a notation, but you don't have to
declare the notation if you don't feel like it, but if there is a notation
declared with that name, it's rules govern the interpretation of the PI.
It would, I think, make sense to say the same thing for this.

And note that the notation of the data is, as previously mentioned,
orthoganal to how that data is encoded in the source document.  Therefore,
you would not expect to have a notation of "base64".

Of course, if XML had data attributes, you could define a conventional data
attribute that specified the instance encoding, allowing different
notations to define what encodings their processors should support, but
nobody ever listens to me....

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From ricko at allette.com.au  Wed Sep 30 20:09:10 1998
From: ricko at allette.com.au (ricko)
Date: Mon Jun  7 17:05:16 2004
Subject: Binary Data in XML
References: <3.0.32.19980930090817.00b61210@pop.intergate.bc.ca>
Message-ID: <361274DB.5FB5B404@allette.com.au>

Tim Bray �g�D�G

> Suppose I wrote up a NOTE, should occupy less than one page, proposing
> a reserved attribute xml:packed with, for the moment, only two
> allowed values, "none" and "base64".  The default value is "none".

...

> Are there any gotchas I'm missing?  Don't know if I could persuade
> one of the WGs to take it up, but it seems pretty obvious that there
> is not only industry demand but in fact people doing this already, so
> the case is pretty strong I think. -Tim

I think it would be an excellent idea. Apart from its intrinsic worth,
it should also serve as an exemplar for other WGs of a document
specifying an embedded notation within XML elements. I agree
it should include an FPI for the notation, but I also think it should
make it clear that it notationally it is a post-parse. (And in the
general case, there is no reason why such post-parsing might not
result in element nodes in the result DOM/grove: this would
provide a clear path for other structured languages embedded in
XML: CSS, etc.)

I would guess that anyone interested in having proprietary field
encyption of the content of particular element types would be
interested in such an exemplar.

Is this a schema issue?  Well, I think not, in that I tend to think
of encoding as orthogonal to schemas (in a similar to fashion to
how xml:lang is orthogonal to schemas).

One approach for naming would be to use the HTTP/MIME
header names, where we can.  A PCDATA element type with
some encoded data reveals that there is a class of NDATA entities
which may be too small or frequent in a document to warrant being
stored as external resources.  So it is logical that much of the HTTP/MIME
header information may also be relevent and useful to these "inlined
NDATA entities".

I am not suggesting that all the HTTP/MIME headers need to be duplicated
now; certainly if Tim makes his note, it should be modest. But I
think it probable that over time more HTTP/MIME headers will
be found to be appropriate to add as attributes in this kind of element.

Rick Jelliffe


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 20:12:23 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:17 2004
Subject: Binary Data in XML : Turning back the clock
In-Reply-To: <13842.25598.983064.317813@localhost.localdomain>
Message-ID: <000801bdec9c$5d221550$d3228018@jabr.ne.mediaone.net>

david@megginson.com writes:

>  > Yeah, but NOTATIONs require the use of a validating processor, and
>  > lots of non-validating apps would like to use base64.  Having said
>  > that, I think that your proposed xml:content is more or less exactly
>  > what NOTATION is for? -Tim
>
> Yes, it was intended as a lighter-weight alternative to the use of
> NOTATION attributes.
>

The proposed xml:content attribute does serve as a lightweight alternative
to NOTATIONs.

If xml:content values are MIME types, this is a simple alternative to FPI's
etc. In the same way that people don't wish to be forced into the use of a
DTD, we have a similar need to label types without FPIs (it is apparent from
a parallel thread that the whole idea of FPI namespaces is likely to
engender heated debate for some time). MIME types while not perfect, have
practical and widespread use. When used with xml:packed="base64" the default
is xml:content="application/octet-stream". When xml:packed="none" the
default is xml:content="text/plain". Is there a good way to specify this?

We have had discussions about why XML isn't as widespread as it might be.
The conclusion of some was that things are getting bogged down in needless
complexity. If most of HTML is going to be replaced by XML (and there is no
reason that this might not happen), then these issues need to be solved in a
simple and lightweight fashion.

Jonathan Borden
JABR Technology Corporation
mailto:jborden@mediaone.net


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 20:31:05 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:17 2004
Subject: Binary Data in XML
In-Reply-To: <361274DB.5FB5B404@allette.com.au>
Message-ID: <000901bdec9f$3cde51c0$d3228018@jabr.ne.mediaone.net>

ricko wrote:
>
> One approach for naming would be to use the HTTP/MIME
> header names, where we can.  A PCDATA element type with
> some encoded data reveals that there is a class of NDATA entities
> which may be too small or frequent in a document to warrant being
> stored as external resources.  So it is logical that much of the HTTP/MIME
> header information may also be relevent and useful to these "inlined
> NDATA entities".
>
> I am not suggesting that all the HTTP/MIME headers need to be duplicated
> now; certainly if Tim makes his note, it should be modest. But I
> think it probable that over time more HTTP/MIME headers will
> be found to be appropriate to add as attributes in this kind of element.
>
Actually, there's no reason not to develop a complete and interoperable
mapping between MIME and XML. This would create an XML native transport
protocol (XMTP? or SXTP? hmmm :-). People with validating parsers could turn
off acceptance of undefined MIME types etc. e.g. "application/x-whatever".

What do people think this mapping should look like. Based upon some of the
work we are doing, we have a need for this and will probably develop a
mapping ourselves unless there is an interest from others in specifying
this.

Jonathan Borden
mailto:jborden@mediaone.net


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Wed Sep 30 20:42:37 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:17 2004
Subject: Greetings
In-Reply-To: <36124C1F.7C75@programmar.com>
Message-ID: <3.0.1.16.19980930173757.b4ff57b6@pop3.demon.co.uk>

At 10:19 30/09/98 -0500, Norman E. Wilson wrote:
>Im new to the group and wanted to (briefly) introduce myself.  I have

And greetings to you and to all those who have joined recently. XML-DEV is
aimed towards making XML work and concentrates on providing a coordinating
environment for people who like an open approach. (A very large amount of
code posted here is high-quality OpenSource-like).
There are a lot of things to be done and crossfertilisation from people new
to XML is often very useful.

	P.


Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From peter at ursus.demon.co.uk  Wed Sep 30 20:42:45 1998
From: peter at ursus.demon.co.uk (Peter Murray-Rust)
Date: Mon Jun  7 17:05:17 2004
Subject: Binary Data in XML
In-Reply-To: <000101bdec8d$0074a430$d3228018@jabr.ne.mediaone.net>
References: <00a101bdec8a$76031f40$1e09e391@mhklaptop.bra01.icl.co.uk>
Message-ID: <3.0.1.16.19980930182613.b9afab84@pop3.demon.co.uk>

At 12:11 30/09/98 -0400, Jonathan  A. Borden wrote:
[...]
>
>The Web has exploded in use not because of HTML itself, rather this in
>conjunction with HTTP's MIME variant -- to be realistic what would the Web
>be like without pictures i.e. gif and jpeg). If base64 is used within XML, a
>similar typing mechanism is also required.

This seems reasonable to me - and could be accompanied by a mime attribute
- e.g:

<CML:molecule xml:packed="base64" xml:mime="chemical/x-chemdraw">... base64
encoding of horrid binary file ... </CML:molecule>

I know that this can also be done using NOTATION but also that - like many
others - I don't know how to do it and have never seen a NOTATION used in
XML documents. I therefore support Tim's suggestion of bolting xml:packed
into the language and suggest xml:mime as a companion.

	P.

I also know - and approve - that it can be done with XLink, but note that
these are untyped so would also benefit from xml:mime, e.g:

<p> This is a <a href="mol.cdw" xml:mime="chemical/x-chemdraw">molecule</a>
... </p>

	P.

Peter Murray-Rust, Director Virtual School of Molecular Sciences, domestic
net connection
VSMS http://www.nottingham.ac.uk/vsms, Virtual Hyperglossary
http://www.venus.co.uk/vhg

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 21:09:46 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:17 2004
Subject: Binary Data in XML
References: <00a101bdec8a$76031f40$1e09e391@mhklaptop.bra01.icl.co.uk> <3.0.1.16.19980930182613.b9afab84@pop3.demon.co.uk>
Message-ID: <361281C6.3402A6E1@locke.ccil.org>

Peter Murray-Rust wrote:

> <CML:molecule xml:packed="base64" xml:mime="chemical/x-chemdraw">... base64
> encoding of horrid binary file ... </CML:molecule>
> 
> I know that this can also be done using NOTATION but also that - like many
> others - I don't know how to do it and have never seen a NOTATION used in
> XML documents. I therefore support Tim's suggestion of bolting xml:packed
> into the language and suggest xml:mime as a companion.

Here's how.  Insert the following declarations into the DTD
(internal or external as desired):

<!NOTATION chemical-x-chemdraw PUBLIC "-//whoever/whatever">
<!ATTLIST CML:molecule
	xml:mime NOTATION(chemical-x-chemdraw | another | another) #REQUIRED>

and then just use it as above.  A validating parser will guarantee
that the value of the xml:mime attribute is one of the specified
notations, and it will be possible to retrieve the external
identifier for each notation so that you know what it means.

The nice thing about this is that it just works if you are DTD-blind,
but provides the information that a generalized processor for
notation-governed elements needs to figure out what to do
(e.g. render using a chemical/x-chemdraw renderer).

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From James.Anderson at mecomnet.de  Wed Sep 30 21:12:17 1998
From: James.Anderson at mecomnet.de (james anderson)
Date: Mon Jun  7 17:05:17 2004
Subject: Looking for an XML parser in Common Lisp
References: <36125BED.1003A9F2@research.nokia.com>
Message-ID: <36128408.1FE0F436@mecomnet.de>

look in the current release for the cl-http server. there's at least one in there.

(http://www.ai.mit.edu/projects/iiip/doc/cl-http/home-page.html)


Ora Lassila wrote:
> 
> I am looking for an XML parser written in Common Lisp.
> Does anybody know if such a thing exists?
> 
>     - Ora
> 
> --
> Ora Lassila, <ora.lassila@research.nokia.com>
> Agent Technology, Nokia Research Center / Boston
> phone: +1 (781) 238-4908
> 
> xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
> Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
> To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
> (un)subscribe xml-dev
> To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
> subscribe xml-dev-digest
> List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 21:31:34 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:17 2004
Subject: MIME/RFC822 <-> XML interop (was RE: Binary Data in XML)
In-Reply-To: <361281C6.3402A6E1@locke.ccil.org>
Message-ID: <000a01bdeca7$ad59d0c0$d3228018@jabr.ne.mediaone.net>

So what you are saying here is that the use of a attribute is totally
compatible with the use of NOTATIONs and when DTDs are used both are useful.
I suggest that xml:content (or xml:content-type) be used instead of xml:mime
because MIME has lots of potential headers. In the past few minutes I've
hacked my MIME class library to add XML serialization support. What I've
found is that while the vast majority of RFC 821/822 headers are single
values, several are frequently multivalues e.g. the SMTL Received: header
that everyone has seen a gazzilion times :-) for this reason, I have
represented generic RFC 821/822 messages (which incorporate MIME) as: (the
point being that elements may be needed to represent generic RFC822 headers)

<MIME>
<Received> ... </Received>
<Received> ... </Received>
<From>jborden@mediaone.net</From>
<To>xml-dev@ic.ac.ul</To>
<Reply-To>jborden@mediaone.net</Reply-To>
<Content-Type>multipart/mixed</Content-Type>
<Body>
<MIME>
<Content-Type>text/plain</Content-Type>
<Body>This is an example of an e-mail message
however the text here will need to be encoded
</Body>
</MIME>
<MIME>
<Content-Type>image/jpeg</Content-Type>
<Content-transfer-encoding>base64</Content-transfer-encoding>
<Body> ... base64 encoded data here </Body>
</MIME>
</Body>
</MIME>

Is this reasonable?

John Cowan wrote:

>
> Here's how.  Insert the following declarations into the DTD
> (internal or external as desired):
>
> <!NOTATION chemical-x-chemdraw PUBLIC "-//whoever/whatever">
> <!ATTLIST CML:molecule
> 	xml:mime NOTATION(chemical-x-chemdraw | another | another)
> #REQUIRED>
>
> and then just use it as above.  A validating parser will guarantee
> that the value of the xml:mime attribute is one of the specified
> notations, and it will be possible to retrieve the external
> identifier for each notation so that you know what it means.
>
> The nice thing about this is that it just works if you are DTD-blind,
> but provides the information that a generalized processor for
> notation-governed elements needs to figure out what to do
> (e.g. render using a chemical/x-chemdraw renderer).
>
Jonathan Borden


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jtauber at jtauber.com  Wed Sep 30 21:36:08 1998
From: jtauber at jtauber.com (James Tauber)
Date: Mon Jun  7 17:05:17 2004
Subject: NOTATION Example (was Re: Binary Data in XML)
Message-ID: <00e401bdeca8$b00dd7c0$e86118cb@caleb>

-----Original Message-----
From: Peter Murray-Rust <peter@ursus.demon.co.uk>
>I know that this can also be done using NOTATION but also that - like many
>others - I don't know how to do it and have never seen a NOTATION used in
>XML documents.

Here is an example based on one from my XML course. Note that attributes are
just one use of notations, others are unparsed entities and PI targets which
I can also give examples of if anyone would like.

Say you want a different notation for US dates, Australian dates and ISO
dates.

First of all the declaration:

    <!NOTATION USDATE SYSTEM "http://www.schema.net/usdate.not">
    <!NOTATION AUSDATE SYSTEM "http://www.schema.net/ausdate.not">
    <!NOTATION ISODATE SYSTEM "http://www.schema.net/isodate.not">

Then if an element type is to use this notation, we declare a notation
attribute:

    <!ATTLIST DATE
        FORMAT NOTATION (USDATE|AUSDATE|ISODATE) "ISODATE")

Then we can use:

    <DATE>19980414</DATE>
    <DATE FORMAT="ISODATE">19980414</DATE>
    <DATE FORMAT="AUSDATE">14/4/1998</DATE>
    <DATE FORMAT="USDATE">4/14/1998</DATE>

in our documents and the notation identified by the system identifier in the
NOTATION declarations will be associated with the content of the elements
and a parser provides this information for the application.

James
--
James Tauber / jtauber@jtauber.com      http://www.jtauber.com/
Lecturer and Associate Researcher
Electronic Commerce Network             ( http://www.xmlinfo.com/
Curtin Business School                  ( http://www.xmlsoftware.com/
Perth, Western Australia                ( http://www.schema.net/


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From jborden at mediaone.net  Wed Sep 30 21:40:47 1998
From: jborden at mediaone.net (Jonathan  A. Borden)
Date: Mon Jun  7 17:05:17 2004
Subject: MIME/RFC822 <-> XML interop (minor correction)
In-Reply-To: <000a01bdeca7$ad59d0c0$d3228018@jabr.ne.mediaone.net>
Message-ID: <000c01bdeca8$abd7f640$d3228018@jabr.ne.mediaone.net>

Minor correction:

that's 'SMTP'
>... several are frequently multivalues e.g. the SMTL Received: header
> that everyone has seen a gazzilion times :-) ...

my apologies,

Jonathan

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From cowan at locke.ccil.org  Wed Sep 30 21:42:43 1998
From: cowan at locke.ccil.org (John Cowan)
Date: Mon Jun  7 17:05:17 2004
Subject: MIME/RFC822 <-> XML interop (was RE: Binary Data in XML)
References: <000a01bdeca7$ad59d0c0$d3228018@jabr.ne.mediaone.net>
Message-ID: <361288F3.DFB5552B@locke.ccil.org>

Jonathan A. Borden wrote:

> So what you are saying here is that the use of a attribute is totally
> compatible with the use of NOTATIONs and when DTDs are used both are useful.

Just so.

> I suggest that xml:content (or xml:content-type) be used instead of xml:mime
> because MIME has lots of potential headers.

I agree.

[example snipped]

> Is this reasonable?

Yes.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From eliot at dns.isogen.com  Wed Sep 30 22:12:46 1998
From: eliot at dns.isogen.com (W. Eliot Kimber)
Date: Mon Jun  7 17:05:17 2004
Subject: Bibloc Tutorial (was Re: Ownership of Names (was Re: Public
  identifiers and topicmaps)
In-Reply-To: <361269E1.6069227E@locke.ccil.org>
References: <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <360C0929.A4DC3C69@locke.ccil.org>
 <360C0929.A4DC3C69@locke.ccil.org>
 <3.0.5.32.19980926140208.0095fc10@dns.isogen.com>
 <3.0.5.32.19980928103117.00966620@dns.isogen.com>
 <3.0.5.32.19980928135613.00993d70@dns.isogen.com>
 <3.0.5.32.19980928171700.00958ba0@dns.isogen.com>
 <3.0.5.32.19980929104734.008fd660@dns.isogen.com>
 <3.0.5.32.19980929121602.0096b100@dns.isogen.com>
Message-ID: <3.0.5.32.19980930151203.008f4a30@dns.isogen.com>

At 01:26 PM 9/30/98 -0400, John Cowan wrote:

>Is there a useful tutorial about biblocs?  I would like to
>understand them further.

Bibliographic location addresses as defined by the HyTime architecture are
nothing more than containers that describe how to find something that is
not directly addressible by electronic means (that is, that your computer
cannot deliver to you directly). In formal HyTime terms, anything from
which a grove cannot be constructed can only be addressed by a bibloc.

They are called "bibliographic locations" because they use the
"bibliographic" model of addressing, that is, the way we cite things like
books by giving enough information about them to allow another person to
find the thing. 

The purpose biblocs serve in a HyTime context is to allow HyTime's
addressing facilities to be closed over all possible things: anything you
can't address electronically you can address through a bibloc, which serves
an electronic proxy for the non-electronic thing.

Biblocs are represented in SGML or XML documents as elements and are
therefore inherently electronically addressible:

<lost.book.loc id="lost.book.1" hytime="bibloc">
A book that is lost to history, known only by mentions of it
in other books.
</lost.book>

The online result of addressing a bibloc is that you get the bibloc, as
there's no way for the computer to get what the bibloc addresses.

A bibloc can have a "bibliographic source", which is another bibloc that
establishes the addressing context for the first bibloc:

<library.definition id="UT.PCL" hytime="bibloc">
PCL library, University of Texas at Austin
</library.definition>

<book id="some.book" bibsrc="UT.PCL" hytime="bibloc">
Top floor, third cubby on the left,
under the chair, you'll find a book I left last time I was there.
</book>

The 'library.definition' bibloc is the bibliographic source for the the
'book' bibloc.  A reference to the book bibloc should return both the book
bibloc and the library definition bibloc.

If I wanted to create a hyperlink to the book, I could do this:

<link linkend="#id(some.book)" hytime=clink>A book I lost</link>

The content of a bibloc is not defined or constrainted by the HyTime
architecture. It could be anything, including data in a notation defined by
some other standard, such as MARC records, Library of Congress catalog
numbers, etc.

The formal definition of the bibloc element form is to be found at
<http://www.ornl.gov/sgml/wg8/docs/n1920/html/clause-7.12.html>, clause
7.12 of ISO/IEC 10744:1997.

Cheers,

E.
--
<Address HyTime=bibloc>
W. Eliot Kimber, Senior Consulting SGML Engineer
ISOGEN International Corp.
2200 N. Lamar St., Suite 230, Dallas, TX 75202.  214.953.0004
www.isogen.com
</Address>

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Wed Sep 30 22:55:36 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:18 2004
Subject: [Fwd: Re: Why XSL?]
Message-ID: <361293F2.704FA5C1@technologist.com>


-------- Original Message --------
Subject: Re: Why transformation?
Date: Wed, 30 Sep 1998 14:43:50 -0500
From: Paul Prescod <papresco@technologist.com>
To: xsl-list@mulberrytech.com
References: <199809301757.NAA29160@hesketh.com>

BTW, I liked your Chicago Tribune article.

"Simon St.Laurent" wrote:
> 
>...
> Is there a good reason for this, or is it just that XSL has transformations
> because DSSSL did?  There must be something more to it than the comfort
> level of those used to working with SGML tools, but so far, I haven't heard
> much exciting.

Your question begs another: "Why did DSSSL have transformations?" FOSI's,
which preceded DSSSL did not. But they were found to be inadequate. In
fact, almost every SGML user has wrestled with a series of style languages
that do not do transformations.

> Where does XSL fit in an XML application architecture?

I discussed this in a talk on XSL recently. Slides are at
http://www.prescod.net/xsl/slides/17.html . The fundamental point is that
you cannot predict how your data will be used in the future, so you cannot
decide on the "optimal" encoding for it. Even if you knew exactly how it
was going to be used, the needs of document renditions and data storage
are often different. 

In a rendition, redundancy is your friend. In document maintenance, it is
your enemy. Actually, redundancy is probably the most important point.
Often you want to get rid of redundant markup ("Why should I always wrap
this series of elements when the wrapper can be logically implied?").
Often you want to get rid of redundant text ("Why should I type titles for
these columns, when I use the same column titles for every table of this
type?") Sometimes you want to get rid of completely redundant elements:
("Why should I the chapter title both in the document, and in the table of
contents, and in a dozen cross references")?

In a rendition, data should often be sorted according to some rule that
will help human navigation. In your document database, you probably want
to allow authors to enter it in any order. You may even need to sort the
same data according to different rules according to the rendering.

Transformations are the basis of all XML processing. I expect that within
a few years all XML-processing applications will have transformation
engines built in. Style application are just the start.

> Why is this more more useful than CSS?

Apples and oranges. CSS is a logical choice for the output of an XSL
transformation application. See http://www.w3.org/TR/NOTE-XSL-and-CSS .
XSL and CSS are only competitors in-so-far as XSL has its own formatting
model based on, but not identical to, CSS's. Presumably people who have
looked at the issue more closely than I have decided that CSS's formatting
model was not sufficient.

> What is the value of conflating style and transformation?

The XSL spec. does not conflate style and transformation. They are in the
same physical document, and are both called "XSL", but have separate URLs,
editors and processing models. If you are saying that even the names and
physical documents should be separate, then I agree that that would seem
simpler (except perhaps for the political/bureaucratic implications).

 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

Bart: Dad, do I really have to brush my teeth?
Homer: No, but at least wash your mouth out with soda.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


From papresco at technologist.com  Wed Sep 30 23:47:24 1998
From: papresco at technologist.com (Paul Prescod)
Date: Mon Jun  7 17:05:18 2004
Subject: XSL: Why?
References: <199809301552.LAA27368@hesketh.com> <36126292.1BEAD074@locke.ccil.org>
Message-ID: <3612A1EC.C0E07952@technologist.com>

John Cowan wrote:
> 
> Well, I think XSL should probably be split into two parts (and the
> current state of the spec suggests that it has been, de facto):

There was a thread to the affect that XSL should be split in the XSL List,
and there were no dissenting voices that I can recall (apologies if I
forgot someone). On the other hand, nobody from the working group
commented (same disclaimer).

> the transformation language, which is useful in contexts having
> nothing to do with styling, and an XML encoding of CSSn, for n >= 2.

I presume that there is a reason that CSS was deemed not suitable as the
formatting model for XSL. CSS did not really have a concept of "formatting
objects". It only knew how to attach formatting semantics to existing
objects. At one point, some of the existing objects semantics had to be
already known: e.g. tables and links. I don't know if that has changed
recently. If not, CSS would need an overhaul to be sufficient for
formatting XML documents (or else you would have to merge CSS and HTML
somehow). Perhaps the formatting objects part of XSL *is* that overhaul.
 
 Paul Prescod  - http://itrc.uwaterloo.ca/~papresco

Bart: Dad, do I really have to brush my teeth?
Homer: No, but at least wash your mouth out with soda.

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)