When I copy the data file to HDFS by using -copyFromLocal
command` data gets copied into to HDFS. When I see this file through web browser, it shows that the replication factor is 3 and file is in location "/user/hduser/inputData/TestData.txt" with a size of 250 MB.
I have 3 CentOS servers as DataNodes, CentOS Desktop as NameNode and client.
When I copy from local to the above mentioned path, where exactly it copies to? Does it copy to NameNode or DataNode as blocks of 64 MB? Or, it won't replicate until I run MapReduce job and map prepares splits and replicates the data to DataNodes?
Please clarify my queries.