I am very new to the Hadoop system and in learning phase.
One thing i noticed in Shuffle and Sort phase that Spill will happen whenever the MapOutputBuffer reaches 80% ( i think this can also be configurable).
Now why spilling phase is required ?
Is it because MapOutputBuffer is a circular buffer and if we don't empty it than it may cause data overwrite and memory leak?