There is a hadoop map-red job on a large amount of data. The map phase takes a long time to complete (~2-3 days). It completed.
But the task failed at ~92% in the reduce phase. Is it possible to retrieve the output/computations performed by the successful map tasks so that only reduce phase may need to be re-run?
Running Hadoop 1.2.1, Java7, Single node linux system.