Reorder Key-Value Pairs using map() in Spark Scala

Question

What is the equivalent of the following pySpark code in Spark-Scala?

rddKeyTwoVal = sc.parallelize([("cat", (0,1)), ("spoon", (2,3))])
rddK2VReorder = rddKeyTwoVal.map(lambda (key, (val1, val2)) : ((key, val1) ,
val2))
rddK2VReorder.collect()
// [(('cat', 0), 1), (('spoon', 2), 3)] -- This is the output.

banjara banjara · Accepted Answer · 2016-09-06T04:25:55

val rddKeyTwoVal = sc.parallelize(Seq(("cat", (0,1)), ("spoon", (2,3))))
val rddK2VReorder = rddKeyTwoVal.map{case (key, (val1, val2)) => ((key, val1), val2)}
rddK2VReorder.collect

or

val rddKeyTwoVal = sc.parallelize(Seq(("cat", (0,1)), ("spoon", (2,3))))
val rddK2VReorder = rddKeyTwoVal.map(r=> ((r._1, r._2._1),r._2._2))
rddK2VReorder.collect

output:

 Array(((cat,0),1), ((spoon,2),3))

Thanks @Alec for suggesting first approach

Reorder Key-Value Pairs using map() in Spark Scala

2 Answers