Time series storage design

Question

I checked the time series paper from google cloud https://cloud.google.com/bigtable/docs/schema-design-time-series and also the scheme design of opentsdb which is based on hbase that is very similar to bigtable.

The scheme design of opentsdb uses a lot of tricks to encode the data points and row keys into wide rows so that the size of per data point is smaller. But in the paper of Google just suggests to use narrow rows.

My question is that, can I get some real benefit from opentsdb scheme design for time series storage with bigtable. And, is it true that the compression of bigtable can help me remove redundancy so that the opentsdb schema makes very little difference?

Les Vogel - Google DevRel Les Vogel - Google DevRel · Accepted Answer · 2015-12-02T04:32:21

To design schemas for your application is typically very specific to your needs. You can have general recommendations, but you might be better served with a radically different design for your specific application.

Many of the suggestions in the StumbleUpon deck and MapR's video (below) are excelent design ideas that were not included in the Time Series paper. To answer your questions:

Can I get some real benefit from opentsdb scheme design for time series storage with bigtable?

Yes - the design ideas from OpenTSDB are good ideas and are compatible with the Cloud Bigtable paper.

Is it true that the compression of bigtable can help me remove redundancy so that the opentsdb schema makes very little difference?

Cloud Bigtable's compression makes a big difference. (Smaller things often compress smaller than bigger things even with redundancies.)

Schema Design

The Google Time Series paper has the engineering team's recommendations in it and has the benefit many years of experience desigining with Bigtable.

Of course you should start with the HBase and Schema Design and Designing your Schema for Cloud Bigtable. Ian Varley's Masters Thesis No Relation: The Mixed Blessings of Non-Relational Databases is also worth reading.

Time Series Design

Cloudera has a good chapter on Schema case studies which talks about Time Series.

OpenTSDB design

MapR's HBase Key Design with OpenTSDB video is short and worth watching. Looking into OpenTSDB there is an interesting deck from StumbleUpon.

Time series storage design

2 Answers

Schema Design

Time Series Design

OpenTSDB design