This is what I saw on Web UI recently
Configured Capacity : 232.5 GB
DFS Used : 112.44 GB
Non DFS Used : 119.46 GB
DFS Remaining : 613.88 MB
DFS Used% : 48.36 %
DFS Remaining% : 0.26 %
and I'm so confused that non-dfs Used takes up more than half of capacity,
which I think means half of hadoop storage is being wasted
After spending meaningless time searching, I just formatted namenode, and started from scratch.
And then I copied one huge text file(about 19gigabytes) from local to HDFS (successed).
Now the UI says
Configured Capacity : 232.5 GB
DFS Used : 38.52 GB
Non DFS Used : 45.35 GB
DFS Remaining : 148.62 GB
DFS Used% : 16.57 %
DFS Remaining% : 63.92 %
before copying, DFS Used and Non DFS Used were both 0.
Because DFS Used is approximately double the original text file size and I configured 2 copy,
I guess that DFS Used is composed up of 2 copies of original and meta.
But still I don't have any idea where Non DFS Used came from and why is that takes up so much capcity more than DFS Used.
What happend? Did I made mistake?