Scala Spark - empty map on DataFrame column for map(String, Int)

I am joining two DataFrames that have columns of type Map[String, Int].

I want the merged DataFrame to have an empty map (Map()) rather than null in the Map-typed columns.

import org.apache.spark.sql.functions.{coalesce, col, lit}
import org.apache.spark.sql.types.{IntegerType, MapType, StringType}

val df = dfmerged.select(
  col("id"),
  coalesce(col("map_1"), lit(null).cast(MapType(StringType, IntegerType))).alias("map_1"),
  coalesce(col("map_2"), lit(Map.empty[String, Int])).alias("map_2"))

For the map_1 column a null is inserted, but I'd like to have an empty map there as well; the map_2 expression gives me this error:

java.lang.RuntimeException: Unsupported literal type class scala.collection.immutable.Map$EmptyMap$ Map()

I've also tried with a udf function like:

case class myStructMap(x:Map[String, Int])
val emptyMap = udf(() => myStructMap(Map.empty[String, Int]))

That also did not work.
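
(For reference, a plain udf that returns Map.empty[String, Int] directly, rather than wrapping it in a case class — which yields a struct column, not a map — does appear to work as a workaround. A rough, untested sketch:)

import org.apache.spark.sql.functions.{coalesce, col, udf}

// Sketch only: the udf's return type is inferred as map<string,int>,
// so it can be coalesced with the question's map_2 column.
val emptyIntMap = udf(() => Map.empty[String, Int])

val patched = dfmerged.withColumn("map_2", coalesce(col("map_2"), emptyIntMap()))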

When I try something like:

.select( coalesce(col("myMapCol"), lit(map())).alias("brand_viewed_count")...

or

.select(coalesce(col("myMapCol"), lit(map().cast(MapType(LongType, LongType)))).alias("brand_viewed_count")...

I get the error:

cannot resolve 'map()' due to data type mismatch: cannot cast MapType(NullType,NullType,false) to MapType(LongType,IntType,true);

1 Answer

6 votes

    In Spark 2.2

    import org.apache.spark.sql.functions.{coalesce, typedLit}
    import spark.implicits._   // assumes a SparkSession named spark

    val df = Seq((1L, null), (2L, Map("foo" -> 1))).toDF("id", "map")

    df.withColumn("map", coalesce($"map", typedLit(Map[String, Int]()))).show
    // +---+-------------+
    // | id|          map|
    // +---+-------------+
    // |  1|        Map()|
    // |  2|Map(foo -> 1)|
    // +---+-------------+
    

    Before 2.2:

    df.withColumn("map", coalesce($"map", map().cast("map<string,int>"))).show
    // +---+-----------------+
    // | id|              map|
    // +---+-----------------+
    // |  1|            Map()|
    // |  2|Map(foobar -> 42)|
    // +---+-----------------+
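
    Applied to the dfmerged select from the question (column names taken from there), the Spark 2.2 version would look roughly like this untested sketch:

    import org.apache.spark.sql.functions.{coalesce, col, typedLit}

    // Sketch only: fill both map columns with an empty Map[String, Int] when null.
    val result = dfmerged.select(
      col("id"),
      coalesce(col("map_1"), typedLit(Map.empty[String, Int])).alias("map_1"),
      coalesce(col("map_2"), typedLit(Map.empty[String, Int])).alias("map_2"))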
    
