How to write a Lucene query that returns all words containing the letter "t"?

Question

I tried this Lucene code example, which worked:
http://snippets.dzone.com/posts/show/8965

However changing:
Query query = parser.parse("st.");
to
Query query = parser.parse("t");

returned zero hits.

How to write a Lucene query that returns all words containing the letter "t" ?
(max nbr of hits to return = 20)

Edit: here's what worked:

RegexQuery regexquery = new RegexQuery(new Term("fieldname", ".t."));
isearcher.search(regexquery, collector);
System.out.println("collector.getTotalHits()=" + collector.getTotalHits());

Yuval F Yuval F · Accepted Answer · 2010-02-01T12:47:21

You need a different Analyzer. The example uses StandardAnalyzer, which removes punctuation and breaks words according to white space and some other more elaborate rules. It does not, however, break words into characters. You will probably need to build your own custom analyzer to do this, and it seems it will be costly in both run time and memory consumption. Another (probably better) option is to use a RegexQuery.

How to write a Lucene query that returns all words containing the letter "t"?

2 Answers