Difference between Spark-SQL and Hive on Spark. I am going through the documentation of spark and sql and trying to understand the difference between Spark-SQL and HIVE on Spark.
- Consider a case when I initiate a spark session without any obvious hive support like copying
hive-site.xml
and then persist a table in my spark program, where will the data and metadata be stored. Will spark create a new Hive Metastore (like derby)? - Consider a case when I initiate a spark session with hive support like copying
hive-ste.xml
and making spark aware of existing hive. Then if I persist the table will data and metadata be stored in my existing Hive Metastore and Data in Warehouse directory of HDFS. - If I run HIVE by changing the execution engine property to Spark then is it same as above mentioned case 2 ?
Thanks.