0 votes

I have installed Spark and Hadoop in standalone mode on Ubuntu in VirtualBox for learning purposes. I am able to run normal Hadoop MapReduce operations on HDFS without using Spark. But when I run the code below in spark-shell,

scala> val file = sc.textFile("hdfs://localhost:9000/in/file")
scala> file.count()

I get "input path does not exist." error. The core-site.xml has fs.defaultFS with value hdfs://localhost:9000. If I give localhost without the port number, I get "Connection refused" error as it is listening on default port 8020. Hostname and localhost are set to loopback addresses 127.0.0.1 and 127.0.1.1 in etc/hosts. Kindly let me know how to resolve this issue. Thanks in advance!

2
Try this in a terminal: hadoop fs -ls hdfs://localhost:9000/in/. Is the file available? – WoodChopper

2 Answers

1 vote

I am able to read and write into HDFS using

"hdfs://localhost:9000/user/<user-name>/..."

Thank you for your help.
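
For completeness, the full round trip in spark-shell would look something like this (a sketch along the same lines; <user-name> is a placeholder for your HDFS user, and in/file is the path from the question):

scala> val file = sc.textFile("hdfs://localhost:9000/user/<user-name>/in/file")
scala> file.count()
scala> file.saveAsTextFile("hdfs://localhost:9000/user/<user-name>/out")

saveAsTextFile creates the out directory in HDFS and writes the RDD into it as part files.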

0 votes

Your configuration is probably alright, but the file is missing or in an unexpected location...
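
One way to verify which filesystem spark-shell actually resolves (a minimal sketch; sc is the SparkContext that spark-shell creates for you):

scala> sc.hadoopConfiguration.get("fs.defaultFS")

If this prints file:/// instead of hdfs://localhost:9000, Spark is not picking up your core-site.xml, and the relative hdfs:/// paths below will not work either.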

1) try:

sc.textFile("hdfs://in/file")
sc.textFile("hdfs:///user/<USERNAME>/in/file")

with USERNAME=hadoop, or your own username.

2) try, from the command line (outside spark-shell), to access that directory/file:

hdfs dfs -ls /in/file
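
If that listing comes back empty or errors out, the file was likely never uploaded to HDFS in the first place. A minimal sketch to create the directory and copy a local file in (the local file.txt name is just an example):

hdfs dfs -mkdir -p /in
hdfs dfs -put file.txt /in/file
hdfs dfs -ls /in

After that, the sc.textFile("hdfs://localhost:9000/in/file") call from the question should find the input path.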