I have a log file, and the first column would be my partition in Hive table.
logSchemaRDD.registerTempTable("logs")
hiveContext.sql("insert overwrite table logs_parquet PARTITION(create_date=select ? from logs) select * from logs")
How do I construct the query to select the first column (marked as ? here) and ensure that the one I selected in partition matches the 2nd select (*)?