Does Flink JobManager on Yarn need Zookeeper for HA setup

Question

Flink documentation says "When running a highly available YARN cluster, we don’t run multiple JobManager (ApplicationMaster) instances, but only one, which is restarted by YARN on failures.". Then down below "high-availability: zookeeper".

I don't have experience with yarn, but why do we need to setup zookeeper if Yarn takes care of the restarts and we only have one JobManager? Or is this the zookeeper for resource manager(s)?

ImbaBalboa ImbaBalboa · Accepted Answer · 2017-04-03T08:24:59

To insure "high-availability", a Zookeeper-based implementation of YARN is often recommended. With YARN, only one instance of the RessourceManager runs, a Zookeeper based implementation provides high availibility to the RessourceManager, which allows a failover of the RessourceManager to another instance when the active one crashes.

This implementation works by storing the current internal state of the RessourceManager in Zookeeper.

Source : Apache Zookeeper Essentials, Saurav Haloi

Does Flink JobManager on Yarn need Zookeeper for HA setup

2 Answers