XML QuotedCData question

Norbert H. Mikula nmikula at edu.uni-klu.ac.at
Mon Mar 10 15:30:31 GMT 1997

lee at sq.com wrote:
> Most programming languages talk explicitly about tokenisation,
> or tokenization if you prefer :-), and in doing so explain how
> the sequence of tokens that a compiler (say) sees is derived from
> an input stream.  Usually, comments are stripped at this stage,
> and in languages such as C or SGML that have (in effect) macros,
> the macros are expanded at input time.

I don't think that C and SGML/XML use or rather can use the 
same principle of includes/macros.

C uses a pre-processor that resolves includes. Then the actual
compiler gets started without having to worry about includes
anymore. (To my understanding of things..)

For practical reasons, at least for XML processors for online
browsers, I think, we don't want to first do the include and then do 
the parsing, keeping all that stuff in memory while we do so.

Furthermore I see problems arise if we have the following scenario :

<!ENTITY %UnixSpecifics SYSTEM "http....">
<!ENTITY %DosSpecifics SYSTEM "http....">



Too much to do for a pre-processor, I guess, it can, or 
at least should, include the appropriate external 
entity only after it has parsed and resolved the content 
of %Dos and %Unix.

I am not sure whether I have addressed what you had in mind,
but I do believe that XML is too smart for a pre-processor,
thus we need other ways to look at PE resolving.

Best regards,
Norbert H. Mikula

= SGML, DSSSL, Intra- & Internet, AI, Java 
= mailto:nmikula at edu.uni-klu.ac.at 
= http://www.edu.uni-klu.ac.at/~nmikula

xml-dev: A list for W3C XML Developers
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To unsubscribe, send to majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
List coordinator, Henry Rzepa (rzepa at ic.ac.uk)

More information about the Xml-dev mailing list