I am trying to read a large matrix of doubles from a tab separated text file, row by row. This is in Scala/Apache Spark.
If I do the following:
val obs = sc.textFile("path_to_text_file")
I get obs: org.apache.spark.rdd.RDD[String]
However, the requirement is to have an RDD of vectors. Would you kindly help?
Thanks and regards,