I am very new to Hadoop and HBase concepts. So please forgive me if the answer to my question is too obvious.
I need to get sales report from two HBase tables. I am trying to represent here the most simplified form of the problem I am dealing with. There are two tables, Products and Sales.
Products Table
ProductCode ProductName
----------- -----------
APL Apple
BAN Banana
MNG Mango
ORG Orange
Sales Table
ProductCode Quantity
--------- ----------
MNG 100
BAN 8
MNG 3
APL 24
APL 57
BAN 33
ORG 40
ORG 15
The kind of reduced output I need :
Report
Product Name Total Sales
========== ========
Apple 81
Banana 41
Mango 103
Orange 55
Only difference in real is that both the table contains 100s of millions of records.
I am trying to use the map reduce example from the Apache HBase Documentation here : http://hbase.apache.org/book/mapreduce.example.html
But I cant find a way to use two tables in Map Reduce.
What is the correct way of doing this ?
Any suggestion would be of great help at this point.