I am parsing csv file using AWS athena from java code. Some columns in csv are of date type and one column has comma in the value.
If the athena table is created with
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
then it is unable to parse the column with comma correctly
However it parses correctly if I use
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
But the issue with OpenCSVSerde
is that it requires all columns to be of string data type and I need to carry out date operations in the query so can't use OpenCSVSerde
.
Any other solution? Please help!