hbase rowkey design

Question

I am moving from mysql to hbase due to increasing data.

I am designing rowkey for efficient access pattern.

I want to achieve 3 goals.

Get all results of email address
Get all results of email address + item_type
Get all results of particular email address + item_id

I have 4 attributes to choose from

user email
reverse timestamp
item_type
item_id

What should my rowkey look like to get rows efficiently?

Thanks

select * from table where email = "xyz" and item_type=2 order by ts desc...I have unique key index on (email, item_type ,item_id) — Amit Patil
Are email and item_type mandatory in where statement? Do you have statements like select * from table where email = "xyz" or select * from table where item_type=2 ? And you have mentioned you want to "Get all results of particular item_id" in your post. I don't see it from your SQL. — ericson

Arnon Rotem-Gal-Oz Arnon Rotem-Gal-Oz · Accepted Answer · 2013-03-04T04:59:42

Assuming your main access is by email you can have your main table key as email + reverse time + item_id (assuming item_id gives you uniqueness)

You can have an additional "index" table with email+item_type+reverse time+item_id and email+item_id as keys that maps to the first table (so retrieving by these is a two step process)

hbase rowkey design

2 Answers