Modifying .tim and .tip files in Lucene Index

Question

I have a Lucene application with multiple indices in which the relevancy scoring suffers due to differences in the term frequencies across the different indices. My understanding is that the Term Dictionary (.tim file) contains "term statistics" such as the document frequency statistics on each term. I was thinking that one approach might be to modify the .tim files for each index (and related segments) and update the "term statistics". Is it possible to overwrite or modify the .tim and .tip files in such a way?

mindas mindas · Accepted Answer · 2014-06-25T15:43:29

relevancy scoring suffers

From the FAQ:

score values are meaningful only for purposes of comparison between other documents for the exact same query and the exact same index. when you try to compute a percentage, you are setting up an implicit comparison with scores from other queries.

Modifying .tim and .tip files in Lucene Index

2 Answers