1
votes

I am running a simple count program on Hadoop. My input file is of size 4 GB. For some reason the job keeps failing with the errors:

enter image description here

enter image description here

However if I try the same code with a small input file say 100MB , it works perfectly fine. I am new at this and I can't seem to find any viable solution. My set up is Psuedo Distributed.

Do I need to make any configuration changes? I have made the standard configurations for the Psuedo distributed set up as provided by the hadoop documentation.

Any help will be highly appreciated.

1

1 Answers

0
votes

From the Error Stacktrace you have posted, the ConnectionRefused exception is for JobHistoryServer.

For Pseudo Distributed setup, no configurational changes are required to start the JobHistoryServer. Use this command to start it,

$HADOOP_HOME/sbin/mr-jobhistory-daemon.sh start historyserver