I was trying to merge 80 GB files in a cluster using hadoop get merge ,
but as hadoop get merge has the property of copying the files from hdfs to local file system i have to copy to local and then copyFromLocal to hdfs again ,
hadoop fs -getmerge hdfs:///path_in_hdfs/* ./local_path
hadoop fs -copyFromLocal ./local_path hdfs://Destination_hdfs_Path/
My problem here is The datanode local is less than 80 GB,
I need to know is there an alternative to -getmerge where merge happens directly from HDFS to HDFS
I tried hadoop -cat also but it is not working..