We use Cassandra wide rows heavily to store time-series as they are perfect for that use-case.
create table user_events (
user_id text,
timestmp timestamp,
event text,
primary key((user_id), timestmp));
Let’s assume I want to do the following aggregation using spark: select all user_ids which had at least 1 event during the last month.
What is the most efficient way to do that?