I am trying to understand hive
in terms of architecture, and I am referring to Tom White's book on Hadoop.
I came across the following terms in regards to hive: Hive Services
, hiveserver2
, metastore
among others.
Referring to below diagrams from the Book (Hadoop: The definitive Guide).
Hive Architecture:
MetaStore configuration:
Hive Architecture which shows what "Driver" is:
I am not able to understand the following:
1) What is Hive Services
in Hive architecture diagram? Is it same when we say hiveserver2
?
2) What is Driver
in Hive architecture diagram?
3) What is MetaStore
(I am NOT referring to Metastore Database). Is it some process which runs? If so, is this part of hiveserver2
? As per the diagram MetaStore
can be remote, so if this is a JVM process, to which component it belongs to?
4) It say Hive service JVM
, MetaStore JVM Server
. But, where do these components gets installed? Are they part of the "server" side of "hive"?
5) In "Hive Architecture" diagram, it say "Hive Server"? What is this? Is this the one which we say "Hive Server 1" , "Hive Server2".
Can anyone help understand this?