This is a very basic question about Hadoop:
Suppose I have 3 mappers and 2 reducers. The mappers produced the following output:
Mapper 1 output : {1 -> "a1", 2 -> "b1"},
Mapper 2 output : {2 -> "b2", 3 -> "c2"},
Mapper 3 output : {1 -> "a3", 3 -> "c3"}
Now, as I understand, the framework partitions the output into 2 parts (a part per reducer). Does the framework sort all output before partitioning? Is it possible that the reducers get the following input ?
Reducer 1 input : {1 -> "a1", 2 -> "b1", "b2"}
Reducer 2 input : {1 -> "a3", 3 -> "c2", "c3"}