Some doubts on HDFS, HBase and Hive

Question

I have several doubts about the Hadoop ecosystem and am eager to understand the concepts well.

These are very basic questions that are answered in any book or article. Please spend some time on the groundwork first and then get back. - Praveen Sripati

Harsh J · Accepted Answer · 2012-12-31T00:01:28

Answers, in order:

1. Hive typically stores table data in directories named after each table, under its configured warehouse directory. This is usually the HDFS directory /user/hive/warehouse, and it can be changed through the hive.metastore.warehouse.dir property in hive-site.xml.
2. Hive and HBase are two different table storage concepts. Hive has no notion of record-level access or random reads/writes. The only thing they have in common is a connector that Hive provides for reading table data stored on HBase's servers in HBase's formats.
3. The HBase Reference Guide covers this in full detail. The simplest way is to use the hbase shell.
4. HDFS is a plain filesystem (only distributed), similar to your Unix or Windows filesystems, so it does not care what type of data you store on it. You can store whatever you want, provided you also have reader/writer logic available for digesting it later.
5. Pig does provide HBaseStorage, a built-in storage access method that is part of its core, which lets you access and represent HBase row data in Pig scripts.
6. See (2). The two are unrelated unless you want them to be, so the answer is yes.
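
As a rough illustration of the first answer, the sketch below computes where a managed Hive table's directory would usually end up under the warehouse directory. The function name managed_table_dir and the example table names are made up for illustration. The `<database>.db` subdirectory for non-default databases reflects Hive's usual default layout, but tables created with an explicit LOCATION (and external tables) can live elsewhere, so treat this as an approximation rather than what Hive itself runs.

```python
import posixpath

# Usual default for hive.metastore.warehouse.dir, as mentioned in the answer.
DEFAULT_WAREHOUSE_DIR = "/user/hive/warehouse"


def managed_table_dir(table, database="default",
                      warehouse_dir=DEFAULT_WAREHOUSE_DIR):
    """Return the directory a managed Hive table would usually occupy.

    Hypothetical helper for illustration only; it does not talk to Hive.
    """
    # Hive table and database names are case-insensitive and stored lowercase.
    table = table.lower()
    database = database.lower()
    if database == "default":
        # Tables in the default database sit directly under the warehouse dir.
        return posixpath.join(warehouse_dir, table)
    # Assumption: other databases get a "<database>.db" subdirectory.
    return posixpath.join(warehouse_dir, database + ".db", table)


print(managed_table_dir("page_views"))
print(managed_table_dir("Orders", database="sales"))
print(managed_table_dir("page_views", warehouse_dir="/data/hive"))
```

The last call mimics a cluster where hive.metastore.warehouse.dir has been pointed at a different directory in hive-site.xml.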