How do I make Kubernetes scale my deployment based on the "ready"/ "not ready" status of my Pods?

Question

I have a deployment with a defined number of replicas. I use readiness probe to communicate if my Pod is ready/ not ready to handle new connections – my Pods toggle between ready/ not ready state during their lifetime.

I want Kubernetes to scale the deployment up/ down to ensure that there is always the desired number of pods in a ready state.

Example:

If replicas is 4 and there are 4 Pods in ready state, then Kubernetes should keep the current replica count.
If replicas is 4 and there are 2 ready pods and 2 not ready pods, then Kubernetes should add 2 more pods.

How do I make Kubernetes scale my deployment based on the "ready"/ "not ready" status of my Pods?

Then you'd have 4 not-ready -- because the new pods will spin up in this state first, by which point the first not-ready pod will have become ready. — Software Engineer
@EngineerDollery this is not just for spin-up, this is mainly for general lifecycle — orirab
You can scale up / down the deployment based on CPU, memory, etc utilization of such resources.. not based on pod status — src3369

Rajesh Deshpande Rajesh Deshpande · Accepted Answer · 2019-02-07T09:06:44

I don't think this is possible. If pod is not ready, k8 will not make it ready as It is something which releated to your application.Even if it create new pod, how readiness will be guaranted. So you have to resolve the reasons behind non ready status and then k8. Only thing k8 does it keep them away from taking world load to avoid request failure

How do I make Kubernetes scale my deployment based on the "ready"/ "not ready" status of my Pods?

3 Answers