3
votes

There is a way to enable graceful shutdown of spark streaming by setting property spark.streaming.stopGracefullyOnShutdown to true and then kill the process with kill -SIGTERM command. However I don't see such option available for structured streaming (SQLContext.scala).

Is the shutdown process different in structured streaming? Or is it simply not implemented yet?

3
We had a similar case recently and solved it by using the filesystem to stop the streaming job gracefullyabiratsis

3 Answers

4
votes

This feature is not implemented yet. But the write ahead logs of spark structured steaming claims to recover state and offsets without any issues.

0
votes

This Feature is not implemented yet and also it will give you duplicates if you kill the job from resource Manager while the batch is running.