Apache Nifi : Oracle To Mongodb data transfer

Question

I want to transfer data from oracle to MongoDB using apache nifi. Oracle has a total of 9 million records. I have created nifi flow using QueryDatabaseTable and PutMongoRecord processors. This flow is working fine but has some performance issues.

After starting the nifi flow, records in the queue for SplitJson -> PutMongoRecord are increasing. Is there any way to slow down records putting into the queue by SplitJson processor?

OR

Increase the rate of insertion in PutMongoRecord?

Right now, in 30 minutes 100k records are inserted, how to speed up this process?

steven-matison steven-matison · Accepted Answer · 2020-06-29T11:12:19

@Vishal. The solution you are looking for is to increase the concurrency of PutMongoRecord:

You can also experiment with the the BATCH size in the configuration tab:

You can also reduce the execution time splitJson. However you should remember this process is going to take 1 flowfile and make ALOT of flowfiles regardless of the timing.

How much you can increase concurrency is going to depend on how many nifi nodes you have, and how many CPU Cores each node has. Be experimental and methodical here. Move up in single increments (1-2-3-etc) and test your file in each increment. If you only have 1 node, you may not be able to tune the flow to your performance expectations. Tune the flow instead for stability and as fast as you can get it. Then consider scaling.

How much you can increase concurrency and batch is also going to depend on the MongoDB Data Source and the total number of connections you can get fro NiFi to Mongo.

Apache Nifi : Oracle To Mongodb data transfer

2 Answers