XML Torture Test: Parsers Fail

Elliotte Rusty Harold elharo at metalab.unc.edu
Tue Apr 6 21:10:16 BST 1999

At 12:30 PM -0400 4/5/99, Kar Yan Ng wrote:
>Hello, it seems that the files refers to a bunch of external
>dtd's. Could u put these into a zip file so that we don't
>have to download all the dtd's one by one.

Such a file is now available from
I've fixed the problem with the non-deterministic content model.
I'll be updating that soon with a new file that actually uses all 1000+
entities, as opposed to the current version which defines 1000+ entities
but only uses one. I'll also fix the problems with multiple definitions of
some entities.

After my most recent changes, I now feel reasonably confident that this
file is in fact valid. xmlproc and Java Project X have both been reported
as validating it. IBM's xml4j trips over the relative URLs used in entity
references. I stil don't know what IE5 and the DCXML parsers are tripping
over, but there's definitely something. One parser failed simply because of
the number of entities, and the resulting overflow of Solaris's maximum
number of file descriptors per process.

| Elliotte Rusty Harold | elharo at metalab.unc.edu | Writer/Programmer |
|        XML: Extensible Markup Language (IDG Books 1998)            |
|   http://www.amazon.com/exec/obidos/ISBN=0764531999/cafeaulaitA/   |
|  Read Cafe au Lait for Java News:  http://sunsite.unc.edu/javafaq/ |
|  Read Cafe con Leche for XML News: http://sunsite.unc.edu/xml/     |

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)

More information about the Xml-dev mailing list