I'm using Spark 1.3.1 from AWS EMR. I've created a spark table using HiveContext and can see this from Hive (using "show tables").
However when I try to query the table(Select..),
it throws following error:
hdfs://IP:9000/user/hive/warehouse/tablename/part-r-00001.parquet not a SequenceFile
When I use "describe tablename", it shows:
col array<string> from deserializer
"Show table" returns the table name properly.
Any idea why the parquet file is not a sequenced file that Spark is generating and how to resolve that? I need to query Spark tables from Hive and using JDBC connections from RStudio or other tools.