We're running SOLR 4.3.1, and I have a question about controlling how SOLR scores certain documents.
In some cases, we have indexed documents that are of "poor" quality - the main body (a description field, in this case) may only have 3 or 4 words. Other documents may have much better descriptions. The problem arises when a search is performed, and a the searched term is found in both "good" (longer) and "poor" (shorter) documents.
SOLR seems to score the matches in the shorter documents higher, which makes sense, as the term searched for may be 1 of only 3 or 4 words, so it's a higher percentage than on a document with a longer description, where only 1 or 2 matches are found in 100 words (for example).
Is it possible to somehow penalize or reduce the score on really short documents? I know it's possible that some very short documents are ok, but as a general rule, really short documents in our case are usually "poor quality".
Suggestions?
We're using edismax searching.
Thanks,
Bill