I have a CSV file with two columns
id, features
the id column is a string and the features column is a comma delimited list of feature values for a Machine Learning algorithm ie. "[1,4,5]" I basically just need to call Vectors.parse() on the value to get a vector, but I don't want to convert to an RDD first.
I want to get this into a Spark Dataframe where the features column is a org.apache.spark.mllib.linalg.Vector
I am reading this into a dataframe with the databricks csv api and I'm trying to convert the features column to a Vector.
Does anyone know how to do this in Java?