I was trying to read a csv file using the latest built of spark (2.1.0 (Dec 28 2016) for Windows in Scala. I did notice that when reading the csv file and inferring the schema, the parsing of the file will fail. The inferschema option would fail and some of the fields that were supposed to be double are listed as strings. I could force the columns to be a specif type of data, but still parsing rows would leave null values. I believe this happened because there were commas inside the string column.