0 votes

I am not able to decide whether the Spring Batch framework is applicable for the requirement below. I need experts' input on this.

Following is my requirement:

Read multiple Oracle tables (at least 10 tables, including both transaction and master tables), perform complex calculations based on the business rules, and insert/update/delete records in the transaction tables.

I have identified the following two designs:

Design # 1:

ItemReader: Select the eligible records from the key transaction table.

ItemProcessor: Fetch additional details from the DB using the key available in the record retrieved by the ItemReader (this would require multiple DB transactions). Do the validation and computation and add the details to be written to the DB as objects in a list.

ItemWriter: Write the details available in the objects using a custom ItemWriter (insert/update/delete operations).

With this design, we can achieve parallel processing, but we increase the number of DB transactions.
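Roughly, I picture Design # 1 as a single chunk-oriented step, something like the sketch below. This is only a sketch in Spring Batch 4.x style Java configuration; the table names, SQL, and the use of Map<String, Object> as the item type are placeholders, and the DataSource, JdbcTemplate and custom writer beans are assumed to be defined elsewhere.

    import java.util.Map;
    import javax.sql.DataSource;

    import org.springframework.batch.core.Step;
    import org.springframework.batch.core.configuration.annotation.EnableBatchProcessing;
    import org.springframework.batch.core.configuration.annotation.StepBuilderFactory;
    import org.springframework.batch.item.ItemProcessor;
    import org.springframework.batch.item.ItemWriter;
    import org.springframework.batch.item.database.JdbcCursorItemReader;
    import org.springframework.batch.item.database.builder.JdbcCursorItemReaderBuilder;
    import org.springframework.context.annotation.Bean;
    import org.springframework.context.annotation.Configuration;
    import org.springframework.jdbc.core.JdbcTemplate;

    @Configuration
    @EnableBatchProcessing
    public class Design1Config {

        @Bean
        public JdbcCursorItemReader<Long> keyReader(DataSource dataSource) {
            // ItemReader: stream only the eligible keys from the driving transaction table
            return new JdbcCursorItemReaderBuilder<Long>()
                    .name("keyReader")
                    .dataSource(dataSource)
                    .sql("SELECT order_id FROM orders WHERE status = 'ELIGIBLE'")
                    .rowMapper((rs, rowNum) -> rs.getLong("order_id"))
                    .build();
        }

        @Bean
        public ItemProcessor<Long, Map<String, Object>> enrichingProcessor(JdbcTemplate jdbcTemplate) {
            // ItemProcessor: fetch the related rows for this key, then validate and compute
            return key -> {
                Map<String, Object> details = jdbcTemplate.queryForMap(
                        "SELECT * FROM order_master WHERE order_id = ?", key);
                // ... apply the business rules here and build the object(s) to be written ...
                return details;
            };
        }

        @Bean
        public Step design1Step(StepBuilderFactory steps,
                                JdbcCursorItemReader<Long> keyReader,
                                ItemProcessor<Long, Map<String, Object>> enrichingProcessor,
                                ItemWriter<Map<String, Object>> customItemWriter) {
            // One DB commit per chunk of 100 processed items
            return steps.get("design1Step")
                    .<Long, Map<String, Object>>chunk(100)
                    .reader(keyReader)
                    .processor(enrichingProcessor)
                    .writer(customItemWriter)
                    .build();
        }
    }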

Design # 2:

Step # 1

ItemReader: Use a composite ItemReader (a group of ItemReaders) to read all the required tables.

ItemWriter: Save the result sets as lists of objects (one list per table) in the execution context.

Step # 2

ItemReader: Retrieve the lists of objects available in the execution context and group them into one list of objects, based on the business processing, so that the processor can process them.

ItemProcessor: Process the chunk of objects returned by the ItemReader. Do the validation and computation and add the details to be written to the DB as objects in a list.

ItemWriter: Write the details available in the objects using a custom ItemWriter (insert/update/delete operations).

With this design, we can REDUCE the number of DB transactions, but we delay the processing until all table records have been retrieved and stored in the execution context, i.e. we are not using the parallel processing provided by Spring Batch.
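For Design # 2, the extra glue I have in mind is roughly the following sketch (again with made-up key names and a placeholder TableRow class): step 1's writer saves the per-table lists in the step execution context, and an ExecutionContextPromotionListener promotes them to the job execution context so that step 2 can pick them up.

    import org.springframework.batch.core.Step;
    import org.springframework.batch.core.configuration.annotation.StepBuilderFactory;
    import org.springframework.batch.core.listener.ExecutionContextPromotionListener;
    import org.springframework.batch.item.ItemReader;
    import org.springframework.batch.item.ItemWriter;
    import org.springframework.context.annotation.Bean;
    import org.springframework.context.annotation.Configuration;

    @Configuration
    public class Design2StepOneConfig {

        /** Placeholder for a row read from any of the source tables. */
        public static class TableRow { }

        @Bean
        public ExecutionContextPromotionListener promotionListener() {
            // Promote the lists saved by step 1 into the job execution context
            ExecutionContextPromotionListener listener = new ExecutionContextPromotionListener();
            listener.setKeys(new String[] {"transactionRows", "masterRows"}); // hypothetical keys
            return listener;
        }

        @Bean
        public Step loadTablesStep(StepBuilderFactory steps,
                                   ItemReader<TableRow> compositeReader,
                                   ItemWriter<TableRow> contextSavingWriter) {
            return steps.get("loadTablesStep")
                    .<TableRow, TableRow>chunk(1000)
                    .reader(compositeReader)        // composite reader over the source tables
                    .writer(contextSavingWriter)    // stores the rows under the keys above
                    .listener(promotionListener())  // runs after the step and promotes the keys
                    .build();
        }
    }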

Please advise whether the above is feasible using Spring Batch or whether we need to use a conventional Java program.

Don't store everything in the execution context, as that is serialized to the storage used to store the execution details. Besides that, reading everything into memory isn't really smart, as you will eventually run into memory issues. What is wrong with more transactions? They also give you more control and the possibility to restart from a certain point (i.e. where it failed). - M. Deinum
We will be having an OLTP database which will be used by web application(s) and these kinds of batches. These batches need to update the transaction tables used by the web application(s) quite frequently. As we cannot control the online transactions, we are planning to reduce the number of batch transactions to avoid overloading the OLTP DB. - Vijay
The transactions should be small enough not to cause (that much) trouble for your OLTP. If you run a long transaction, you hold a lock for a very long time, and that is guaranteed to trouble your online process. - M. Deinum

2 Answers

0 votes

From my understanding, Spring Batch has nothing to do with database batch operations (or at least the word 'batch' has a different meaning in these two contexts). Spring Batch is used to create processes with multiple steps, and it gives you the chance to restart a process if one of its steps fails (without repeating the previously finished steps).
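For illustration (the bean and parameter names here are made up): launching the same job again with the same identifying JobParameters restarts the failed job instance, and steps that already finished with status COMPLETED are not executed again.

    import org.springframework.batch.core.Job;
    import org.springframework.batch.core.JobExecution;
    import org.springframework.batch.core.JobParameters;
    import org.springframework.batch.core.JobParametersBuilder;
    import org.springframework.batch.core.launch.JobLauncher;

    public class RestartExample {

        public JobExecution runTwice(JobLauncher jobLauncher, Job multiStepJob) throws Exception {
            JobParameters parameters = new JobParametersBuilder()
                    .addString("runDate", "2016-01-31")   // identifying parameter
                    .toJobParameters();

            // First launch: suppose a later step fails, leaving the job execution FAILED
            jobLauncher.run(multiStepJob, parameters);

            // Second launch with the SAME parameters: the failed job instance is restarted;
            // previously completed steps are skipped and processing resumes at the failed step
            return jobLauncher.run(multiStepJob, parameters);
        }
    }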

0 votes

The good news is that your problem description matches a very common use case for Spring Batch. The bad news is that the problem description is too generic to allow much meaningful input about the specific design beyond the comments already provided.

Spring Batch brings facilities similar to JCL and ISPF from the mainframe world into the Java context.

Spring Batch provides a framework for organizing and managing the boundaries of your process. It is a natural fit for a lot of ETL and big data operations, but it is not the only way to write these processes.

If your process can be broken down into discrete steps, then Spring Batch is a good choice for you.

The ItemReader should (logically) be an iterator returning a single object representing the start of one logical unit of work (LUW). The LUW object is captured by the chunker and assembled into collections of the size you configure, and then passed to the processor. The result of the processor is then passed to the writer. In the context of an RDBMS-centric process, the commit happens at the end of the writer's operation.
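To make that reader contract concrete, here is a tiny sketch (Luw is just a placeholder class): read() returns one logical unit of work per call and null once the input is exhausted; the framework then assembles the returned items into chunks of the configured size before calling the processor and writer.

    import java.util.Iterator;
    import java.util.List;

    import org.springframework.batch.item.ItemReader;

    public class LuwReader implements ItemReader<LuwReader.Luw> {

        /** Placeholder for one logical unit of work. */
        public static class Luw { }

        private final Iterator<Luw> source;

        public LuwReader(List<Luw> units) {
            this.source = units.iterator();
        }

        @Override
        public Luw read() {
            // One item per call; returning null signals that the input is exhausted
            return source.hasNext() ? source.next() : null;
        }
    }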

What happens in each of those pieces of the step is 100% whatever you need (plain old Java). The point of the framework is to free you from that complexity and let you concentrate on solving the problem.