3
votes

Cloudera documentation, shows a simple way to "create a Avro backed Hive table by using an Avro schema file." This works great. I would like to do the same thing for a Parquet backed Hive table, but the relevant documentation in this case lists out every column type rather than reading from a schema. Is it possible to read the Parquet columns from a schema, in the same way as Avro data?

1

1 Answers

2
votes

Currently, the answer appears to be no. There is an open issue with Hive. https://issues.apache.org/jira/browse/PARQUET-76

The issue has been active recently, so hopefully in the near future Hive will offer the same functionality for Parquet as it does for Avro.