At work we are attempting to do the following:
- Run Elastic MapReduce jobs via Amazon, which freezes Hadoop at version 0.20.205
- Write output to HBase running on EC2, specifically, 0.92.1-cdh4.0.1 from Cloudera
What I've discovered so far is that my WordCount test appears to work when I use Apache HBase 0.92.1 in my Hadoop job (packaged via Maven). I'm scared that this is working accidentally and that it may blow up as my usage matures.
However, when I package HBase 0.92.1-cdh4.0.1 in my Hadoop job, I get a ClassNotFoundException:
https://emr-qa.eventbrite.com.s3.amazonaws.com/logs/j-RWJ75VR11SLB/steps/1/stderr
- Does the Apache HBase jar play nicely with the CDH HBase server?
- Is mixing versions and packages like this a horrible idea?