DOM -> SQL -> DOM: Serialization...

Fabio Arciniegas A. l-arcini at uniandes.edu.co
Thu Nov 11 21:46:06 GMT 1999


Kevin, I agree 100% with Prakash about the time restriction. During the
development of Samxa and Munin(two of my xml projects) i was faced with a
similar problem and definetly the cost of  deserializing/serializing DOM
docs was excesive to my needs (which were lighter than yours!)

The kind of solution you need depends largely on what kind of analysis you
need to run on the docs... for me, things were solved watching for regular
expresions on the startElement and characterData callbacks of a sax
parser... then again, i somehow suspect you need much more than that.
anyway,if you want, email me to tell me more about your kind of analysis and
maybe i can help with a bit of samxa code.

Fabio



- -----Original Message-----
- From: owner-xml-dev at ic.ac.uk [mailto:owner-xml-dev at ic.ac.uk]On Behalf Of
- Prakash Mandgi
- Sent: Jueves, 11 de Noviembre de 1999 12:17 a.m.
- To: Kevin A. Burton; xml-dev at ic.ac.uk; cocoon at list.working-dogs.com
- Subject: Re: DOM -> SQL -> DOM: Serialization...
-
-
- Hi Kevin
-
- with sql server / oracle i doubt you can retrieve that data so fast
- especially over
- the network let alone parse it.
- You may want to consider using Object databases
- 1) Persist DOM documents as is in an object database and then run XQL /
- XPath etc
- 2) Use entity java beans to store the DOM document in memory
-
- You can pretty much forget about serializing / deserializing DOM documents
- of that size in less than a second.
-
- Prakash
-
- ----- Original Message -----
- From: Kevin A. Burton <burton at relativity.yi.org>
- To: <xml-dev at ic.ac.uk>; <cocoon at list.working-dogs.com>
- Sent: Thursday, November 11, 1999 4:56 AM
- Subject: DOM -> SQL -> DOM: Serialization...
-
-
- > I need to take a ton of XML documents... about say 100M and run analysis
- > on them in realtime.  (less than 1 second)
- >
- > I am planning on using a parser to return a DOM, then take that DOM and
- > Serialize it to SQL with maybe ENTITY|ATTRIBUTE|CDATA tables, etc.  And
- > then write an interface (maybe XMLQL) to get the data.
- >
- > Has anyone seen an Open Source project to do this?
- >
- > Kevin
- >
- > xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
- > Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
- CD-ROM/ISBN 981-02-3594-1
- > To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
- > unsubscribe xml-dev
- > To subscribe to the digests, mailto:majordomo at ic.ac.uk the following
- message;
- > subscribe xml-dev-digest
- > List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
- >
- >
-
- xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
- Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on
- CD-ROM/ISBN 981-02-3594-1
- To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
- unsubscribe xml-dev
- To subscribe to the digests, mailto:majordomo at ic.ac.uk the
- following message;
- subscribe xml-dev-digest
- List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)
-
-
-


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev at ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo at ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo at ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa at ic.ac.uk)





More information about the Xml-dev mailing list