I'm using the MergeContent in the following way to "batch" incoming responses from a number of ExecuteSQL. In the MergeContent Processor, I have the Minimum Number of Entries set to 1000 and the Max Bin Age set to 30 seconds. I then have a Correlation Attribute Name that bins the incoming FlowFiles. This seems to be working as I expect, but my question is two fold:
A. Is this a sensible approach or is there a better/more efficient way to be doing this? Maybe a combo of ListFile/GetFile/MergeContent, etc...
B. Is there a performance/scalability issue with "larger" numbers of Minimum Number of Entries?
My end goal is to try to merge as many of the results coming out from ExecuteSQL commands into a single file, binned by its Correlation Attribute Name.