1 vote

We use Solr 4.8 for our project.

One colleague created two cores in the same instance to index 80 GB of XML documents, all from the same source. He said that a single core can hold a maximum of 50 GB of indexed data, so we split the 80 GB across the two cores. Both cores have the same config files and schema. For indexing, he puts odd-numbered documents in the first core and even-numbered documents in the second core. For search, he uses the SolrJ API to query across all documents from both cores.
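As I understand it, the search side looks roughly like this (a simplified sketch; the host and core names are placeholders, and I'm assuming the standard distributed-search `shards` parameter):

```java
import org.apache.solr.client.solrj.SolrQuery;
import org.apache.solr.client.solrj.SolrServerException;
import org.apache.solr.client.solrj.impl.HttpSolrServer;
import org.apache.solr.client.solrj.response.QueryResponse;

public class TwoCoreQuery {
    public static void main(String[] args) throws SolrServerException {
        // Send the query to one core, but fan it out to both cores
        // with the "shards" parameter (placeholder host/core names).
        HttpSolrServer server = new HttpSolrServer("http://localhost:8983/solr/core1");
        SolrQuery query = new SolrQuery("title:example");
        query.set("shards", "localhost:8983/solr/core1,localhost:8983/solr/core2");
        QueryResponse response = server.query(query);
        System.out.println("Found " + response.getResults().getNumFound() + " documents");
    }
}
```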

Since we have only one server, distribution and replication are not used in this project.

My question: is this architecture a correct use case for Solr multiple cores? Does anyone have suggestions?


3 Answers

3 votes

Instead of keeping two separate indexes and manually deciding which documents go to which core, you should set up SolrCloud, which distributes the data among shards automatically. It also allows you to spread your data across multiple machines.

It will also improve performance, make querying much easier, and let you add multiple collections (with different schemas) as well.
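With SolrCloud, the SolrJ client talks to ZooKeeper and the routing happens for you. Something like this should work with the 4.x client (the ZooKeeper address and collection name are just placeholders):

```java
import org.apache.solr.client.solrj.SolrQuery;
import org.apache.solr.client.solrj.impl.CloudSolrServer;
import org.apache.solr.client.solrj.response.QueryResponse;
import org.apache.solr.common.SolrInputDocument;

public class CloudIndexAndQuery {
    public static void main(String[] args) throws Exception {
        // Connect through ZooKeeper (placeholder address); SolrCloud decides
        // which shard each document lands on, so no odd/even logic is needed.
        CloudSolrServer server = new CloudSolrServer("localhost:2181");
        server.setDefaultCollection("mycollection");

        SolrInputDocument doc = new SolrInputDocument();
        doc.addField("id", "1");
        doc.addField("title", "hello solrcloud");
        server.add(doc);
        server.commit();

        // A query against the collection fans out to all shards automatically.
        QueryResponse rsp = server.query(new SolrQuery("title:hello"));
        System.out.println(rsp.getResults().getNumFound());
        server.shutdown();
    }
}
```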

2 votes

You should be using SolrCloud, with a collection that has two shards. Take a look at https://cwiki.apache.org/confluence/display/solr/SolrCloud
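Creating such a collection goes through the Collections API. One way to call it from SolrJ (the collection name and ZooKeeper address below are placeholders):

```java
import org.apache.solr.client.solrj.impl.CloudSolrServer;
import org.apache.solr.client.solrj.request.QueryRequest;
import org.apache.solr.common.params.ModifiableSolrParams;

public class CreateCollection {
    public static void main(String[] args) throws Exception {
        CloudSolrServer server = new CloudSolrServer("localhost:2181");

        // Collections API: CREATE a collection with 2 shards.
        ModifiableSolrParams params = new ModifiableSolrParams();
        params.set("action", "CREATE");
        params.set("name", "mycollection");
        params.set("numShards", "2");
        params.set("replicationFactor", "1");
        // Allow both shards to live on the single server.
        params.set("maxShardsPerNode", "2");

        QueryRequest request = new QueryRequest(params);
        request.setPath("/admin/collections");
        server.request(request);
        server.shutdown();
    }
}
```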

1 vote

Generally, cores are created to separate application data into distinct collection entities. This also comes in handy when migrating core data from a lower Solr version to a higher one. You can have many cores in Solr. For example, if you harvest data from two different sources, X and Y, you would typically store them in two separate cores.
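As an illustration, one core per source just means one endpoint per core on the same instance (a sketch with hypothetical core names "source_x" and "source_y"):

```java
import org.apache.solr.client.solrj.impl.HttpSolrServer;
import org.apache.solr.common.SolrInputDocument;

public class PerSourceCores {
    public static void main(String[] args) throws Exception {
        // One core per data source on the same Solr instance.
        HttpSolrServer coreX = new HttpSolrServer("http://localhost:8983/solr/source_x");
        HttpSolrServer coreY = new HttpSolrServer("http://localhost:8983/solr/source_y");

        SolrInputDocument doc = new SolrInputDocument();
        doc.addField("id", "x-1");
        coreX.add(doc);   // documents harvested from source X
        coreX.commit();

        doc = new SolrInputDocument();
        doc.addField("id", "y-1");
        coreY.add(doc);   // documents harvested from source Y
        coreY.commit();
    }
}
```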

In your case, splitting the same collection of data across two cores can make sense given the large data volume, but a single core can generally accommodate a large amount of data on its own. In my opinion, it mostly comes down to your resource capacity (hardware configuration such as RAM and disk).