cannot start hadoop namenode

Question

I am following this official site and I am sure I am doing all right. However, I cannot run sbin/start-dfs.sh for a Pseudo-Distributed node. Everytime I am getting the following error:

starting namenodes on [localhost]
pdsh@ML-Learning: localhost:connect: Connection refused
starting datanodes
pdsh@ML-Learning: localhost:connect: Connection refused
starting secondary namenodes [ML-Learning]
pdsh@ML-Learning: localhost:connect: Connection refused

I executed ssh locaohost without password.

Previously, I was able to run the following commands to start namenode and datanode.

bin/hadoop --daemon start namenode
bin/hadoop --daemon start datanode

However, I cannot stop them with the following commands, and the error is saying those daemons are running with process

bin/hadoop --daemon stop namenode
bin/hadoop --daemon stop datanode

So I was simply killing those processes, and it seems after that I ruined my whole set up. I cannot start namenode and datanode anymore.

bin/hadoop --daemon start namenode
bin/hadoop --daemon start datanode

It stuck in the following message:

INFO blockmanagement.CacheReplicationMonitor: Starting CacheReplicationMonitor with interval 30000 milliseconds.

Any idea on this issue? I even restarted my VM, but the issue is still there.

Deno George Deno George · Accepted Answer · 2016-11-17T12:55:05

Remove Your current setup by formatting NameNode using,

$HADOOP_HOME/bin/hadoop namenode -format

Follow this without any failure,

  http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/

cannot start hadoop namenode

2 Answers