I want Lucene Scoring function to have no bias based on the length of the document. This is really a follow up question to Calculate the score only based on the documents have more occurance of term in lucene
I was wondering how Field.setOmitNorms(true) works? I see that there are two factors that make short documents get a high score:
- "boost" that shorter length posts - using doc.getBoost()
- "lengthNorm" in the definition of norm(t,d)
I was wondering - if I wanted no bias towards shorter documents, is Field.setOmitNorms(true) enough?