ANN: XML Schema Regular Expression Parser

Daniel Potter dpotter at
Tue Jan 4 19:15:34 GMT 2000

A working model of a regular expression parser designed to parse and
match strings based on the current XML Schema pattern definition is now
available at <>.  This is a Java applet
(Java 1.1) and will require a Java enabled browser.  It is still
currently a test, meaning that there are still bugs which have yet to be
fixed or even found.

Keep in mind the differences between regular expressions used in Perl
and regular expressions used to specify patterns in the XML Schema spec!
For example, the non-greedy operators aren't used (*?, +?);
backreferencing (\1) is unavailable, and others.

However, one known lacking feature is the \p{xx} forms.  Although the
parser will recognize \p (or \P) it will ignore it and move on, causing
the {xx} after it to be parsed, interpretting it as an {m,n} form.  This
in most cases will cause an error.

The source code is currently not available for general download, but
anyone interested can e-mail me (<mailto:dpotter at>).

Try it out and see what you think!  Please send me any bugs that you
discover.  Thanks.

- Dan Potter

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at
Archived as: and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo at the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at

More information about the Xml-dev mailing list