Lotsa laughs

Chris Lilley chris at w3.org
Tue May 25 16:44:17 BST 1999



John Cowan wrote:
> 
> James Clark wrote:
> 
> > This is in conformance with the W3C Character Model WD. See
> 
> More specifically, it conforms to the HTML 4.0 recommendation
> (http://www.w3.org/TR/REC-html40/appendix/notes.html#h-B.2) which
> recommends that user agents should fix up URLs containing
> illegal characters.

Thanks, I had forgotten that HTML 4.0 already said that. In that case
IE5 is correctly carrying out error recovery on an illegal URL, in
accordance with the HTML 4.0 spec.

As Appendix B.2 makes clear, the actual URL is still illegal and the
document is still wrong. It could be trivially fixed by first converting
the illegal characters to bytes in UTF-8 and then URL-encoding those
bytes.
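
For illustration, here is a rough sketch of that fix in Python. The
function name fix_url and the example URL are made up for this note, and
the sketch assumes that the only illegal characters are the non-ASCII
ones; it leaves every ASCII character alone, including any existing
%HH escapes.

```python
def fix_url(url: str) -> str:
    """Escape each non-ASCII character as the %HH form of its UTF-8 bytes.

    Hypothetical helper: ASCII characters are passed through unchanged,
    so other illegal ASCII characters (such as spaces) are NOT handled.
    """
    out = []
    for ch in url:
        if ord(ch) < 128:
            # ASCII: keep as-is (assumed already legal in this sketch).
            out.append(ch)
        else:
            # Non-ASCII: convert to UTF-8 bytes, then escape each byte.
            out.extend(f"%{b:02X}" for b in ch.encode("utf-8"))
    return "".join(out)


if __name__ == "__main__":
    # Made-up example URL containing one non-ASCII character (U+00E9).
    print(fix_url("http://example.org/caf\u00e9"))
```

A document author would apply something like this once, when writing
the href, rather than relying on each user agent to recover from the
error.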
 
> Nevertheless, it is an embarrassment that such a site exports
> illegal HTML.

Well, yes. 

--
Chris

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo at ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
