0 votes

I have a streaming job that:

reads from Kafka --> maps events to another DataStream --> keyBy(0) --> reduces over a 15-second processing-time window and writes the results back to a Redis sink.
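For reference, a stripped-down sketch of the job (Flink 1.3-era DataStream API; the topic name, parsing logic, and Redis configuration are simplified placeholders, not the real job):

    import java.util.Properties;

    import org.apache.flink.api.common.functions.MapFunction;
    import org.apache.flink.api.java.tuple.Tuple2;
    import org.apache.flink.streaming.api.datastream.DataStream;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
    import org.apache.flink.streaming.api.windowing.time.Time;
    import org.apache.flink.streaming.connectors.kafka.FlinkKafkaConsumer010;
    import org.apache.flink.streaming.connectors.redis.RedisSink;
    import org.apache.flink.streaming.connectors.redis.common.config.FlinkJedisPoolConfig;
    import org.apache.flink.streaming.connectors.redis.common.mapper.RedisCommand;
    import org.apache.flink.streaming.connectors.redis.common.mapper.RedisCommandDescription;
    import org.apache.flink.streaming.connectors.redis.common.mapper.RedisMapper;
    import org.apache.flink.streaming.util.serialization.SimpleStringSchema;

    public class KafkaToRedisJob {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

            Properties kafkaProps = new Properties();
            kafkaProps.setProperty("bootstrap.servers", "localhost:9092");
            kafkaProps.setProperty("group.id", "my-group");

            DataStream<Tuple2<String, Long>> counts = env
                    .addSource(new FlinkKafkaConsumer010<>("events", new SimpleStringSchema(), kafkaProps))
                    // map raw events to (key, count) pairs -- parsing is a placeholder
                    .map(new MapFunction<String, Tuple2<String, Long>>() {
                        @Override
                        public Tuple2<String, Long> map(String line) {
                            return Tuple2.of(line.split(",")[0], 1L);
                        }
                    })
                    .keyBy(0)                        // key by the first tuple field
                    .timeWindow(Time.seconds(15))    // 15 s processing-time window
                    .reduce((a, b) -> Tuple2.of(a.f0, a.f1 + b.f1));

            counts.addSink(new RedisSink<>(
                    new FlinkJedisPoolConfig.Builder().setHost("localhost").build(),
                    new RedisMapper<Tuple2<String, Long>>() {
                        @Override
                        public RedisCommandDescription getCommandDescription() {
                            return new RedisCommandDescription(RedisCommand.SET);
                        }
                        @Override
                        public String getKeyFromData(Tuple2<String, Long> data) {
                            return data.f0;
                        }
                        @Override
                        public String getValueFromData(Tuple2<String, Long> data) {
                            return data.f1.toString();
                        }
                    }));

            env.execute("kafka-to-redis");
        }
    }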

When starting up, everything works great. The problem is that after a while, the disk fills up with what I think are Flink's checkpoints.

My question is: are the checkpoints supposed to be cleaned up/deleted while the Flink job is running? I could not find any resources on this.

I'm using a filesystem state backend that writes to /tmp (no HDFS setup).
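
The backend is set up roughly like this (the path and checkpoint interval are illustrative; the same can also be configured in flink-conf.yaml via state.backend: filesystem and state.backend.fs.checkpointdir):

    import org.apache.flink.runtime.state.filesystem.FsStateBackend;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class CheckpointSetup {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
            // checkpoint every 10 seconds
            env.enableCheckpointing(10_000);
            // filesystem backend writing checkpoint data under /tmp
            env.setStateBackend(new FsStateBackend("file:///tmp/flink-checkpoints"));
            // ... rest of the job as above ...
        }
    }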

After how much time do you run out of disk space? – Robert Metzger

2 Answers

2 votes

Flink cleans up checkpoint files while it is running. There were some corner cases where it "forgot" to clean up all files in the case of system failures, but the community is working on fixing all of these issues for Flink 1.3.

In your case, I'm assuming that you don't have enough disk space to store the data of your windows on disk.

0 votes

Checkpoints are by default not persisted externally and are only used to resume a job from failures. They are deleted when a program is cancelled.

If you are taking externalized checkpoints, there are two cleanup policies, configured on the CheckpointConfig as shown in the sketch below:

ExternalizedCheckpointCleanup.RETAIN_ON_CANCELLATION: Retain the externalized checkpoint when the job is cancelled. Note that you have to manually clean up the checkpoint state after cancellation in this case.

ExternalizedCheckpointCleanup.DELETE_ON_CANCELLATION: Delete the externalized checkpoint when the job is cancelled. The checkpoint state will only be available if the job fails.
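
A minimal sketch of selecting the policy (the checkpoint interval is illustrative; API available since Flink 1.2):

    import org.apache.flink.streaming.api.environment.CheckpointConfig.ExternalizedCheckpointCleanup;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class ExternalizedCheckpointSetup {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
            env.enableCheckpointing(10_000);
            // RETAIN_ON_CANCELLATION keeps the checkpoint after cancellation,
            // so its state must be cleaned up manually
            env.getCheckpointConfig().enableExternalizedCheckpoints(
                    ExternalizedCheckpointCleanup.RETAIN_ON_CANCELLATION);
        }
    }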

For more details, see https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/state/checkpoints.html