Apache Cassandra 3.11.x Restoration Process

Question

We are working with Apache Cassandra 3.11 and we are trying to prepare for a DR situation. For instance, we have two keyspaces that we back up via snapshot. For instance, via Ansible:

- name: Create snapshots
  shell: "nodetool snapshot --tag {{ item.tag }} {{ item.name }}"
  loop: "{{ keyspaces }}"

Now If we had to create a new cluster and use these snapshots, from what we understand, we would need to do the following:

Create new cluster (3)
Create keyspaces
Restore each table via the snapshot schema.cql
Use sstableloader

per node (only 2, third node has no snapshot), then

Nodetool repair

per node

A reference to this path can be found here

However, the tables are loaded, but the data is duplicated, in our case, as we only have two nodes with snapshots as the third has none (failed node).

Are we following the correct path here? We have tried copying the snapshot data to the appropriate directory and used nodetool refresh -- keyspace table but the data never appeared.

Any ideas/help would be greatly appreciated.