What are the advantages and disadvantages of using a search engine as a key value store?

Question

Given a search engine like Lucene and a set of XML documents which need to be fully preserved, what are the advantages and disadvantages of using the search engine as key value store for returning XML doucments given a unique primary key which each document contains?

I can't make out what you mean. Is the idea that you do conventional full-text indexing, or that you use some sort of schema mapping to turn data items from XML into many fields in Lucene? — bmargulies
I want to be able to fetch (or reconstruct I suppose, but preferably fetch) the original document I put in by a simple query for an unambiguous unique key for that given document. Basically I want to treat the search engine as a SQL database with a primary key field and a clob field for each row. — Jherico
Whoever voted to close as not a real question: like hell. My boss specifically wants me to come up with the pros and cons of doing this as an implementation of a KV store for documents (as opposed to using the filesystem or something like couch db) — Jherico
Perhaps you might edit the question to include the material in the your comment? — bmargulies
You need ACID (or near ACID) sematics which Lucene doesn't guarantee. To put it simply, DB can recover from failure to last consistent state. A crash while writing to Lucene index might render it useless. — Shashikant Kore

skaffman skaffman · Accepted Answer · 2010-01-11T23:25:46

If you use something like Compass, and it's XML-to-Lucene mapping engine, it's a great solution for storing and querying XML documents, without going all the way to a XML database.

One downside is that the XML documents can only be retrieved via the Lucene API (the underlying data store is pretty impenetrable), but I can live with that.

What are the advantages and disadvantages of using a search engine as a key value store?

3 Answers