I'm trying to reduce the size of an HBase table and have encountered this:
http://prafull-blog.blogspot.co.uk/2012/06/how-to-calculate-record-size-of-hbase.html
which says that the rowkey is stored for every column in the table. This seems incredibly wasteful and requires me to pay careful attention to every byte added to the rowkey since the number of bytes added to the table will be the total number of columns stored. Also, I expected every row to be stored as a document in the database under a single key, but this doesn't follow that. So, why is this implemented this way?