I have setup the hadoop on my ubuntu, and ran example codes to test. One of the common examples is https://github.com/tomwhite/hadoop-book/tree/master/ch02/src/main/python
I have tested this code w/ given sample file(https://github.com/tomwhite/hadoop-book/blob/master/input/ncdc/sample.txt). However, when I modified the mapper code acording to my data file, reducer goes from 0% to 33% and then back to 0%. Can anyone help on why that happens or how should I modify the code. My data looks like:
STN---,WBAN , YEARMODA, TEMP, , DEWP, , SLP , , STP , , VISIB, , WDSP, , MXSPD, GUST, MAX , MIN ,PRCP ,SNDP , FRSHTT,
690190,13910, 20120101, 42.9,18, 29.4,18, 1033.3,18, 968.7,18, 10.0,18, 8.7,18, 15.0, 999.9, 52.5*, 31.6*, 0.00I,999.9, 000000,