I'm using Spark Structured Streaming for Machine Learning purpose in real time, and I want to stored predictions in my Cassandra cluster.
Since I am in a streaming context, executing multiple times per seconds the same request, one mandatory optimization is to use PreparedStatement.
In the cassandra spark driver (https://github.com/datastax/spark-cassandra-connector) there is no way to use PreparedStatement (in scala or python, i'm not considering java as a option)
Should i use a scala (https://github.com/outworkers/phantom) / python (https://github.com/datastax/python-driver) cassandra driver ? How does it work then, my connection object need to be serializable to be passed to workers ?
If anyone can help me !
Thanks :)