I'm using MultipleOutputs to write three files ie, name, attrib, others and using 6 redcuers. I get these files in my Output Directory:
attrib-r-00003 name-r-00004 part-r-00000 part-r-00002 part-r-00004 _SUCCESS
_logs other-r-00001 part-r-00001 part-r-00003 part-r-00005
My Question is, how are these files named(As in why is a -r-0003 appended to attrib file, is it that the task 0003 compiled this file?). I'm currently running Hadoop in Pseudo Mode, on a real cluster would there be a need to combine files(ie would attrib have diffrent files by diff reducers) ? Also, is there a way that i can remove -r-xxxxx from my output file name ?
P.S my knowledge of Hadoop is pretty limited.