I am running flink cluster over K8 with ~1TB of state.
One of the problems I am facing is taking a savepoint and restoring a job back. Now, these updates are simple code updates at times and not parallelism changes. But the time to take a savepoint and then restoring the new job with the old state is pretty high.
Is there a way to do an in-place update of the job so that the local states and jobid do not change and hence can avoid the time consumed in doing the savepoint+restore?