For a Python Hadoop streaming job, how do I pass a parameter to, for example, the reducer script so that it behaves differently depending on the value passed in?
I understand that streaming jobs are invoked in this form:
hadoop jar hadoop-streaming.jar -input -output -mapper mapper.py -reducer reducer.py ...
I want to affect reducer.py.
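A minimal sketch of one way this might work, assuming the streaming jar runs the -reducer value as a command line (so a quoted string such as "reducer.py max" would reach the script as sys.argv[1]) and that -cmdenv NAME=VALUE exports an environment variable to the task. The names REDUCER_MODE, get_mode, reduce_lines and aggregate, the "sum"/"max" modes, and the tab-separated numeric input are all hypothetical and only there for illustration:

```python
#!/usr/bin/env python
import os
import sys


def get_mode(argv, environ):
    """Return the parameter from argv[1], else from REDUCER_MODE, else 'sum'."""
    if len(argv) > 1:
        return argv[1]
    # REDUCER_MODE is a hypothetical variable name, e.g. set via -cmdenv.
    return environ.get("REDUCER_MODE", "sum")


def aggregate(values, mode):
    """Combine the values of one key according to the chosen mode."""
    if mode == "max":
        return max(values)
    return sum(values)


def reduce_lines(lines, mode):
    """Aggregate 'key<TAB>value' lines; input is assumed to be sorted by key."""
    results = []
    current_key, values = None, []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        key, value = line.split("\t", 1)
        # A new key means the previous group is complete.
        if current_key is not None and key != current_key:
            results.append((current_key, aggregate(values, mode)))
            values = []
        current_key = key
        values.append(float(value))
    if current_key is not None:
        results.append((current_key, aggregate(values, mode)))
    return results


if __name__ == "__main__":
    mode = get_mode(sys.argv, os.environ)
    for key, result in reduce_lines(sys.stdin, mode):
        print("%s\t%s" % (key, result))
```

Under those assumptions the job would be launched with either -reducer "reducer.py max" or -cmdenv REDUCER_MODE=max added to the command above, with the script shipped to the cluster (for example via -file reducer.py). Reading the value in one small function keeps the rest of the reducer unaware of how the parameter arrived.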