I am trying to figure out how to do aggregation with Spring Batch. For example, I have a CSV file with a list of names:
name
John
Amy
John
Ryan
And I want name count in text file:
name, count
Amy, 1
John, 2
Ryan, 1
From what I learned from Spring Batch, the ETL batch process (itemReader -> ItemProcessor -> ItemWriter) is more like just a mapping phase in map-reduce lingo. How do I do the reduce(aggregation) phase in Spring Batch?
Is Spring Batch the right tool to use? Or should I use Spark for this? Thanks.