I'm trying to run one of MLlib algorithms, namely LogisticRegressionWithLBFGS on my database.
This algorithm takes the training set as LabeledPoint. Since LabeledPoint requires a double label ( LabeledPoint( double label, Vector features) ) and my database contains some null values, how can I solve this problem?
Here you can see the piece of code related to this issue :
val labeled = table.map{ row =>
var s = row.toSeq.toArray
s = s.map(el => if (el != null) el.toString.toDouble)
LabeledPoint(row(0), Vectors.dense((s.take(0) ++ s.drop(1))))
}
And the error that I get:
error : type mismatch;
found : Any
required: Double
Without using LabeledPoint can I run this algorithm or how can I overcome this "null value" issue?