How does Spark's read/write process through the spark-cassandra-connector differ from the cqlsh read/write process?

Question

I am new to Spark and am trying to understand how Spark is advantageous when it is used through the spark-cassandra-connector on a Cassandra cluster.

How does a write (for example, saveToCassandra) to Cassandra work through the spark-cassandra-connector (Spark SQL queries)? Does it still involve a coordinator node?
How does a read from Cassandra work through the spark-cassandra-connector (Spark SQL queries)? Does it still involve a coordinator node?
What lets Spark overcome the load on Cassandra during heavy range-scan reads on the cluster?
How does a large range-scan CQL read query get executed on a Cassandra cluster through the spark-cassandra-connector?
Is using an IN clause through the spark-cassandra-connector on a Cassandra cluster an advantage?

Artem Aliev · Accepted Answer · 2018-01-23T12:23:12

Here is a good explanation: "Cassandra and Spark: Optimizing for Data Locality" by Russell Spitzer (DataStax), https://www.youtube.com/watch?v=ikCzILOpYvA. I also recommend Russell's other talks if you want to understand the spark-cassandra-connector internals.
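
For the range-scan part of the question, a rough sketch of the token-range idea may help. It is not connector code and not taken from the talk. It only assumes the default Murmur3 partitioner, whose tokens span -2^63 to 2^63 - 1, and shows how one full-table scan can be cut into many small `token()` range queries that can then be run in parallel. The keyspace `ks`, table `users`, and partition key `id` are hypothetical names, and the helper functions are made up for illustration.

```python
# Illustrative sketch only: divide the Murmur3 token ring into contiguous
# ranges and build one CQL query per range. Names below are hypothetical.

MIN_TOKEN = -2**63       # lowest Murmur3 token value
MAX_TOKEN = 2**63 - 1    # highest Murmur3 token value


def split_token_ring(num_splits):
    """Return num_splits contiguous (start, end) pairs covering the ring."""
    if num_splits < 1:
        raise ValueError("num_splits must be at least 1")
    span = MAX_TOKEN - MIN_TOKEN
    # Evenly spaced boundaries; the last one lands exactly on MAX_TOKEN.
    bounds = [MIN_TOKEN + (span * i) // num_splits for i in range(num_splits + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def token_range_queries(keyspace, table, partition_key, num_splits):
    """Build one CQL SELECT per token range."""
    queries = []
    for index, (start, end) in enumerate(split_token_ring(num_splits)):
        # The first range includes its lower bound so no token is skipped;
        # later ranges exclude it because the previous range already covers it.
        lower = ">=" if index == 0 else ">"
        queries.append(
            f"SELECT * FROM {keyspace}.{table} "
            f"WHERE token({partition_key}) {lower} {start} "
            f"AND token({partition_key}) <= {end}"
        )
    return queries


if __name__ == "__main__":
    for query in token_range_queries("ks", "users", "id", 4):
        print(query)
```

Each query touches only a slice of the ring, so no single coordinator has to serve the whole scan at once. The real connector does more than this (for example, it takes replica placement into account when it builds Spark partitions), which this sketch leaves out.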
