Flink taskmanager out of memory and memory configuration

Question

We are using Flink streaming to run a few jobs on a single cluster. Our jobs are using rocksDB to hold a state. The cluster is configured to run with a single Jobmanager and 3 Taskmanager on 3 separate VMs. Each TM is configured to run with 14GB of RAM. JM is configured to run with 1GB.

We are experiencing 2 memory related issues: - When running Taskmanager with 8GB heap allocation, the TM ran out of heap memory and we got heap out of memory exception. Our solution to this problem was increasing heap size to 14GB. Seems like this configuration solved the issue, as we no longer crash due to out of heap memory. - Still, after increasing heap size to 14GB (per TM process) OS runs out of memory and kills the TM process. RES memory is rising over time and reaching ~20GB per TM process.

1. The question is how can we predict the maximal total amount of physical memory and heap size configuration?

2. Due to our memory issues, is it reasonable to use a non default values of Flink managed memory? what will be the guideline in such case?

Further details: Each Vm is configured with 4 CPUs and 24GB of RAM Using Flink version: 1.3.2

Hard to say without knowing your code. I would suggest that you use some Raw State in some operator: ci.apache.org/projects/flink/flink-docs-release-1.3/dev/stream/…. Can you identify in which operator the problem appears and show some code? — TobiSH

Till Rohrmann Till Rohrmann · Accepted Answer · 2018-06-12T12:09:20

The total amount of required physical and heap memory is quite difficult to compute since it strongly depends on your user code, your job's topology and which state backend you use.

As a rule of thumb, if you experience OOM and are still using the FileSystemStateBackend or the MemoryStateBackend, then you should switch to RocksDBStateBackend, because it can gracefully spill to disk if the state grows too big.

If you are still experiencing OOM exceptions as you have described, then you should check your user code whether it keeps references to state objects or generates in some other way large objects which cannot be garbage collected. If this is the case, then you should try to refactor your code to rely on Flink's state abstraction, because with RocksDB it can go out of core.

RocksDB itself needs native memory which adds to Flink's memory footprint. This depends on the block cache size, indexes, bloom filters and memtables. You can find out more about these things and how to configure them here.

Last but not least, you should not activate taskmanager.memory.preallocate when running streaming jobs, because streaming jobs currently don't use managed memory. Thus, by activating preallocation, you would allocate memory for Flink's managed memory which is reduces the available heap space.

Flink taskmanager out of memory and memory configuration

2 Answers