In Hadoop, suppose the number of nodes is fixed (no server crashes during the run). If I use the same partitioner (e.g., hash partitioning on the map output key) to partition the mapper output, and I run the same job on the same data set twice, is it guaranteed that data with the same key will go to the same reducer in both runs?
For example, suppose my map output consists of two rows:
Key | Value
A | anything
B | anything
Suppose I have two reducers, 1 and 2. In the first run, the row "A | anything" goes to reducer 1 and "B | anything" goes to reducer 2. If I run the job again, is it possible that "A | anything" goes to reducer 2 and "B | anything" goes to reducer 1?
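To make the question concrete: as far as I understand, Hadoop's default HashPartitioner picks the partition as (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks. Below is a minimal standalone sketch of that formula, not Hadoop code. The class name PartitionSketch and the method partitionFor are hypothetical, and plain String keys stand in for the real map output key type (a Hadoop Text key has a different hashCode, so the actual partition numbers would differ). Note also that partitions here are numbered from 0, unlike the reducers 1 and 2 in my example.

```java
// Standalone sketch of hash partitioning; no Hadoop dependency.
// Assumption: String keys stand in for the real map output key type.
class PartitionSketch {

    // Same arithmetic as the default hash partitioner is understood to use:
    // mask off the sign bit so the result is never negative, then take the
    // remainder modulo the number of reduce tasks.
    static int partitionFor(String key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    public static void main(String[] args) {
        int numReduceTasks = 2; // two reducers, as in the example above

        // Simulate two separate runs over the same keys.
        for (int run = 1; run <= 2; run++) {
            for (String key : new String[] {"A", "B"}) {
                System.out.println("run " + run + ": key " + key
                        + " -> partition " + partitionFor(key, numReduceTasks));
            }
        }
    }
}
```

In this sketch the result depends only on key.hashCode() and numReduceTasks, so it is repeatable as long as both stay the same between runs and the key's hashCode() is computed from its contents (which holds for String, but not for classes that inherit the identity-based Object.hashCode()). What the sketch does not show is which physical node ends up running the reduce task for a given partition.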
Thanks!