I am using Spark with Java 8. I have a dataframe where one of the columns contains a single mllib.linalg.Vector. I want to groupBy one of the other columns in the dataframe, say an ID column, and "collect_list" the feature vectors into a list. I'm getting the error below. I don't understand why. This is a generic operation, why does it care about the type of the data in the column? it works fine for scalar numbers, or strings, etc, but does not seem to work for mllib Vector. Is there a workaround this?, maybe another function other than collect_list()?
No handler for Hive udf class org.apache.hadoop.hive.ql.udf.generic.GenericUDAFCollectList because: org.apache.spark.mllib.linalg.VectorUDT@f71b0bce (of class org.apache.spark.mllib.linalg.VectorUDT)