Here's my scenario. I've got a table with 5 million + rows. One particular map column has two keys (some entries may be missing one or both keys, but any row has at most two keys for that column).
I'm looking to "clear" the values for that column across all rows. I don't want to get rid of the column, as I'll run something afterwards that'll set some values. I'd imagine simply doing update table set column.key=null ... would fail due to timeout.
What would be the most cassandra friendly way of achieving this? I have access to Spark. Would it be to use spark, read in rdds and issue update queries per row and do that in partitions?
Thanks, Ashic.
PS: Apache Cassandra 2.1.2, Spark 1.1.1.
========================
Edit: I can tolerate downtime.
update
? – maasgmap<text,text>
or something like that? – G Quintana