Modelling Cassandra Data Struncture for both real-time and search from all?

Question

My project serves both real-time data and past data. It works like feed, so it shows real-time data through socket and past data(if scroll down) through REST api. To get real-time data efficiently, I set date as partition key and time as clustering key. For real-time service, I think this data structure is well modeled. But I also have to get limited number of recent datas(like pagination), which should able to show whole data if requested. To serve data like recent 0~20 / 20~40 / 40~60 through REST api calls, my data-serving server has to remember what it showed before to load next 20 datas continuously, as bookmark. If it was SQL I would use IDs or page&offset things but I cannot do that with Cassandra. So I tried:

SELECT * FROM examples WHERE date<='DATEMARK' AND create_at < 'TIMEMARK' AND entities CONTAINS 'something' limit 20 ALLOW FILTERING;

But since date is the partition key, I cannot use comparison operation >, <. The past data could be created very far from now.

Can I satisfy my real-time+past requirements with Cassandra? I wonder if I have to make another DB for accessing past data.

Mohamadreza Rostami Mohamadreza Rostami · Accepted Answer · 2020-02-15T09:06:02

Yes you can, but you must change your mindset and think like NoSQL patterns, in this scenarios you can save your data in duplicate manner and save your data in other table with another partition key and cluster column that satisfies your needs.

Modelling Cassandra Data Struncture for both real-time and search from all?

2 Answers