I am working on indexing a large text file whose text contains no spaces. Currently I have an n-gram method that generates strings of length 12, which I then index. Searching works the same way: I take the string from the user, generate n-grams of length 12, and use them to build the query. While searching around, I read about the n-gram tokenizer in Lucene, but I couldn't find many examples.
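To make the setup concrete, here is a minimal sketch of the fixed-length n-gram approach described above, assuming a plain sliding window that advances one character at a time. The class and method names (`NGramSketch`, `ngrams`) are hypothetical placeholders, not the actual code, and the real method may differ (for example in how it handles input shorter than 12 characters).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the hand-rolled n-gram step; not Lucene code.
final class NGramSketch {

    // Returns every substring of length n, sliding one character at a time.
    // Assumption: input shorter than n (or n <= 0) produces no grams at all.
    static List<String> ngrams(String text, int n) {
        List<String> grams = new ArrayList<>();
        if (n <= 0 || text.length() < n) {
            return grams;
        }
        for (int i = 0; i + n <= text.length(); i++) {
            grams.add(text.substring(i, i + n));
        }
        return grams;
    }

    public static void main(String[] args) {
        // The same method would be applied to both the indexed text and the user's query.
        for (String gram : ngrams("abcdefghijklmn", 12)) {
            System.out.println(gram);
        }
    }
}
```

Under this assumption, a query string shorter than 12 characters yields no n-grams, so it could not match anything in the index.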
How do I implement an n-gram tokenizer in Lucene 4.0?