I am working on Proof of Concept task. The task is to implement a feature of our product using Hadoop technology.
Feature is quite simple, we have a UI which will let you insert details about "Network Issue". All details about such a issue are captured and inserted into a table in Oracle DB. We then process data in this table and calculate a Health Score.
I have to use Hadoop instead of a traditional Db So my question is what to go for? Impala on HDFS? or Impala on Hbase ? or Hbase?
I am using a cloudera VM for the POC implementation.
As per my understanding, Hbase is NoSQL distributed database, which is actually a layer on HDFS , which provides java APIs to access data. Impala is a tool which also provides JDBC access to access data over Hbase or directly over HDFS. I am very new to hadoop, can some one please help?