I have looked at this question - Indexing multilingual words in lucene and it confirmed some of my suspicions.
I have an entity with a number of fields I wish to index. One of these fields can be one of several languages, and I need to use different analyzers for each language.
Am I best to implement this as different fields in the same index or as different indexes for each language?
I am guessing that the trade off is between the overhead of running multiple indexes and the suckiness of cluttering up a single index.
Any advice appreciated.