4
votes

I need to figure out how many unique terms are in our Solr (Lucene) index. And by unique terms, I dont mean by document. I mean total unique terms from all documents in the index. The Solr admin console shows how many documents, but not total terms.

I found a tool called Luke (Lucene Index Toolbox) that will give you that info, but its a UI tool, and my Sorl index is on a Ubuntu server so I cant run it.

Any Ideas?

1
do you need to do this programmatically? what do you intend to do with that information? - Mauricio Scheffer

1 Answers

3
votes

You can use the LukeRequestHandler. You should see the tag "numTerms" in the xml returned.