
I recently started studying Hadoop and did an experiment to really understand it. Here is the tutorial: http://www.codeproject.com/Articles/757934/Apache-Hadoop-for-Windows-Platform What I want to ask is: what are the background processes that occur in the computer system after I run the commands from the tutorial, such as:

  • hadoop namenode -format
  • javac -classpath C:\hadoop-2.3.0\share\hadoop\common\hadoop-common-2.3.0.jar;C:\hadoop-2.3.0\share\hadoop\mapreduce\hadoop-mapreduce-client-core-2.3.0.jar;C:\hadoop-2.3.0\share\hadoop\common\lib\gson-2.2.4.jar;C:\hadoop-2.3.0\share\hadoop\common\lib\commons-cli-1.2.jar Recipe.java
  • jar -cvf Recipe.jar *.class
  • hadoop fs -mkdir /in
  • hadoop fs -copyFromLocal c:\Hwork\recipeitems-latest.json /in
  • hadoop jar c:\Hwork\Recipe.jar Recipe /in /out
  • hadoop fs -ls /out
  • hadoop fs -cat /out/part-r-00000

2 Answers


hadoop fs runs Unix-like commands (ls, cp, cat etc.) on the HDFS file system; you can see the full list in the FileSystem Shell documentation
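
If you want to see what those map to in code: the fs shell is a thin wrapper over Hadoop's Java FileSystem API. Here is a minimal sketch of the same operations done programmatically (my own illustration, not the shell's actual code):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class FsExample {
        public static void main(String[] args) throws Exception {
            // Reads fs.defaultFS from core-site.xml, same as the shell does
            FileSystem fs = FileSystem.get(new Configuration());

            // roughly "hadoop fs -mkdir /in"
            fs.mkdirs(new Path("/in"));

            // roughly "hadoop fs -copyFromLocal c:\Hwork\recipeitems-latest.json /in"
            fs.copyFromLocalFile(new Path("c:/Hwork/recipeitems-latest.json"),
                                 new Path("/in"));

            // roughly "hadoop fs -ls /out"
            for (FileStatus status : fs.listStatus(new Path("/out"))) {
                System.out.println(status.getPath());
            }
        }
    }

FileSystem.get() picks up the same configuration the shell commands use, so it talks to the same HDFS.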

hadoop namenode -format is initialization of the namenode, i.e. erasing everything stored in Hadoop - note that on newer Hadoop versions you'd do that via hdfs namenode -format, see here

The other two commands (javac and jar) have to do with compiling a Java program and packing the resulting classes into a jar
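
I don't know what the tutorial's Recipe.java actually contains, but a job you compile and pack that way typically has the standard MapReduce skeleton below (the mapper/reducer bodies here are hypothetical placeholders, not the tutorial's logic):

    import java.io.IOException;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class Recipe {

        public static class RecipeMapper
                extends Mapper<Object, Text, Text, IntWritable> {
            private static final IntWritable ONE = new IntWritable(1);

            @Override
            protected void map(Object key, Text value, Context context)
                    throws IOException, InterruptedException {
                // hypothetical: emit one count per input record
                context.write(new Text("records"), ONE);
            }
        }

        public static class RecipeReducer
                extends Reducer<Text, IntWritable, Text, IntWritable> {
            @Override
            protected void reduce(Text key, Iterable<IntWritable> values,
                                  Context context)
                    throws IOException, InterruptedException {
                int sum = 0;
                for (IntWritable v : values) sum += v.get();
                context.write(key, new IntWritable(sum));
            }
        }

        public static void main(String[] args) throws Exception {
            Job job = Job.getInstance(new Configuration(), "recipe");
            job.setJarByClass(Recipe.class);
            job.setMapperClass(RecipeMapper.class);
            job.setReducerClass(RecipeReducer.class);
            job.setOutputKeyClass(Text.class);
            job.setOutputValueClass(IntWritable.class);
            FileInputFormat.addInputPath(job, new Path(args[0]));   // /in
            FileOutputFormat.setOutputPath(job, new Path(args[1])); // /out
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }

hadoop jar c:\Hwork\Recipe.jar Recipe /in /out then runs main() with /in and /out as its arguments.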


You can track the flow by browsing through the 'hadoop' file in the /bin folder under your Hadoop home directory - despite being called a binary, it is a readable shell script.

When you submit a Hadoop command, it behaves like a normal UNIX shell command (cat, ls, awk). The shell goes to the Hadoop /bin directory and executes the hadoop script with the other options (fs, jar, distcp, job, namenode, jt, ...) as arguments. Depending on the option given to the hadoop command, the next script is invoked with the remaining options as arguments, and finally a Java class is executed with the requested options. I've provided a brief overview of how this works for 'hadoop fs -cat '

bin/hadoop

    COMMAND=$1
    case $COMMAND in
      # usage flags
      --help|-help|-h)
        print_usage
        exit
        ;;
      ...
      namenode|secondarynamenode|datanode|dfs|dfsadmin|fsck|balancer|fetchdt|oiv|dfsgroups|portmap|nfs3)
        ...
        if [ -f "${HADOOP_HDFS_HOME}"/bin/hdfs ]; then
          exec "${HADOOP_HDFS_HOME}"/bin/hdfs ${COMMAND/dfsgroups/groups} "$@"
        elif [ -f "${HADOOP_PREFIX}"/bin/hdfs ]; then
          exec "${HADOOP_PREFIX}"/bin/hdfs ${COMMAND/dfsgroups/groups} "$@"

bin/hdfs

    elif [ "$COMMAND" = "dfs" ] ; then
    CLASS=org.apache.hadoop.fs.FsShell
    HADOOP_OPTS="$HADOOP_OPTS $HADOOP_CLIENT_OPTS"
elif [ "$COMMAND" = "dfsadmin" ] ; then
    CLASS=org.apache.hadoop.hdfs.tools.DFSAdmin
    HADOOP_OPTS="$HADOOP_OPTS $HADOOP_CLIENT_OPTS"
.
.
exec "$JAVA" -Dproc_$COMMAND $JAVA_HEAP_MAX $HADOOP_OPTS $CLASS "$@"

A sample Java class implementation is here: http://grepcode.com/file/repository.cloudera.com/content/repositories/releases/com.cloudera.hadoop/hadoop-core/0.20.2-320/org/apache/hadoop/fs/FsShell.java

There you can see how a command (cat) is implemented in Java. I believe this should give you a brief overview of how Hadoop commands work in the background.
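
If you dig into that class, the heart of cat boils down to opening an HDFS input stream and copying it to stdout. A stripped-down equivalent using the public FileSystem API (my sketch, not FsShell's exact code):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataInputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IOUtils;

    public class Cat {
        public static void main(String[] args) throws Exception {
            FileSystem fs = FileSystem.get(new Configuration());
            // open the HDFS file and stream it to stdout, which is
            // essentially what "hadoop fs -cat" prints to your console
            FSDataInputStream in = fs.open(new Path(args[0]));
            try {
                IOUtils.copyBytes(in, System.out, 4096, false);
            } finally {
                IOUtils.closeStream(in);
            }
        }
    }

So the whole chain is: the hadoop wrapper script decides which Java class handles the command (org.apache.hadoop.fs.FsShell for file system commands), execs the JVM with it, and FsShell performs the stream-copy above for cat.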