1
votes

I am following this official site and I am sure I am doing all right. However, I cannot run sbin/start-dfs.sh for a Pseudo-Distributed node. Everytime I am getting the following error:

starting namenodes on [localhost]
pdsh@ML-Learning: localhost:connect: Connection refused
starting datanodes
pdsh@ML-Learning: localhost:connect: Connection refused
starting secondary namenodes [ML-Learning]
pdsh@ML-Learning: localhost:connect: Connection refused

I executed ssh locaohost without password.

Previously, I was able to run the following commands to start namenode and datanode.

bin/hadoop --daemon start namenode
bin/hadoop --daemon start datanode

However, I cannot stop them with the following commands, and the error is saying those daemons are running with process

bin/hadoop --daemon stop namenode
bin/hadoop --daemon stop datanode

So I was simply killing those processes, and it seems after that I ruined my whole set up. I cannot start namenode and datanode anymore.

bin/hadoop --daemon start namenode
bin/hadoop --daemon start datanode

It stuck in the following message:

INFO blockmanagement.CacheReplicationMonitor: Starting CacheReplicationMonitor with interval 30000 milliseconds.

Any idea on this issue? I even restarted my VM, but the issue is still there.

2

2 Answers

0
votes

Remove Your current setup by formatting NameNode using,

$HADOOP_HOME/bin/hadoop namenode -format

Follow this without any failure,

  http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
-1
votes

run this command: export PDSH_RCMD_TYPE=ssh