I am using Solr with an index of about 20 million records. I use the DataImportHandler to index data into Solr. Lately a full index takes a very long time because the MySQL query has become very heavy. Please suggest some techniques to speed up the indexing process.
I was thinking of pre-processing the data in some other data store, such as Hadoop, and then indexing from there. Please suggest which data store would be a good fit for storing the pre-processed data.
I am using MySQL as the master database.
Delta updates amount to around 100,000 records in the last hour.
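To put those numbers in perspective, here is a rough back-of-the-envelope sketch in Python. The two volumes are the figures quoted above; the batch size of 1,000 is purely an assumed, untuned value for illustration and not something taken from my actual configuration.

```python
# Figures quoted in the question.
TOTAL_RECORDS = 20_000_000   # approximate size of the full index
DELTA_PER_HOUR = 100_000     # approximate changed records in the last hour

# Assumption: a hypothetical batch size, chosen only for illustration.
BATCH_SIZE = 1_000

# Average rate the delta path has to sustain.
delta_per_second = DELTA_PER_HOUR / 3600

# Number of batches a full index would need (ceiling division).
full_index_batches = -(-TOTAL_RECORDS // BATCH_SIZE)

# Fraction of the index that changed in the last hour.
delta_share = DELTA_PER_HOUR / TOTAL_RECORDS

print(f"delta rate: {delta_per_second:.2f} records/second")
print(f"full index: {full_index_batches} batches of {BATCH_SIZE}")
print(f"hourly delta as share of index: {delta_share:.2%}")
```

If these estimates are in the right range, the hourly delta is a small fraction of the index, so the cost is dominated by the full index rather than by the delta updates.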