I have installed a hadoop cluster with total 3 machines, with 2 nodes acting as datanodes and 1 node acting as Namenode and as well as a Datanode. I wanted to clear certain doubts regarding hadoop cluster installation and architecture. Here is a list of questions I am looking answers for----
- I uploaded a data file around 500mb size in the cluster and then checked the hdfs report. I noticed that the namenode I made is also occupying 500mb size in the hdfs, along with datanodes with a replication factor of 2. The problem here is that I want the namenode not to store any data on it, in short i dont want it to work as a datanode as it is also storing the file I am uploading. So what is the way of making it only act as a Master Node and not like a datanode?
I tried running the command hadoop -daemon.sh stop on the Namenode to stop the datanode services on it but it wasnt of any help.
- How much metadata does a Namenode generate for a filesize typically of 1 GB? Any approximations?