I want to store massive amounts of time series, as efficiently as possible. Speed is important, but not as important as storage.
My data consists of the name of a stock, followed by 1 minute data for 15 years. The data begins precised on Jan 1, 2000, and the number of minutes each day is precisely 390.
So I don't need to store the timeStamp of each series, because I can calculate that automatically.
So instead of this:
Apple [timeStamp:value][timeStamp:value]
I want this:
Apple [value][value]
Is there a way to load this sort of data in Cassandra so it only stores the sequential value, and not the timestamp for each value.
Presumably, by using a timestamp for each series it would double the storage required: if each timestamp and value is 8 bytes, it would take up 50 terabytes instead of the 25 terabytes if only the value were stored.