I have a few basic questions about Kubernetes.
Consider the deployment below: a Layer 7 load balancer routes requests to the NGINX servers through a Kubernetes Service, and NGINX in turn routes to the Tomcat servers through another Kubernetes Service.
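For concreteness, the two Services in the diagram would look roughly like the following (a sketch only; the names, labels, and ports are placeholders, not taken from my actual manifests):

```yaml
# Hypothetical Service fronting the NGINX tier
apiVersion: v1
kind: Service
metadata:
  name: nginx-svc
spec:
  selector:
    app: nginx        # assumed pod label
  ports:
    - port: 80
      targetPort: 80
---
# Hypothetical Service fronting the Tomcat tier
apiVersion: v1
kind: Service
metadata:
  name: tomcat-svc
spec:
  selector:
    app: tomcat       # assumed pod label
  ports:
    - port: 8080
      targetPort: 8080
```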
My questions:
Is a Kubernetes Service a single point of failure? Or, since it is backed by multiple pods and is only a virtual layer implemented by kube-proxy on each node, should it not be considered a single point of failure?
The diagram above shows a single Kubernetes cluster. Is that cluster itself a single point of failure, or should I plan for multiple clusters for a system that must support HA with zero downtime?
The diagram above relies on Kubernetes Services, which by default provide only L4 load balancing (round robin). So if one Tomcat server is heavily loaded, round robin will not distribute load evenly based on actual usage. How can I achieve load distribution based on system resource consumption, usage, or the number of open connections in this topology?
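For reference, one option I have come across is running kube-proxy in IPVS mode, which supports a least-connection scheduler instead of round robin. A sketch of the kube-proxy configuration (assuming the nodes have the required IPVS kernel modules loaded) would be:

```yaml
# Hypothetical KubeProxyConfiguration enabling IPVS with
# the "lc" (least connection) scheduler instead of round robin.
apiVersion: kubeproxy.config.k8s.io/v1alpha1
kind: KubeProxyConfiguration
mode: "ipvs"
ipvs:
  scheduler: "lc"
```

Would this be the right approach for the topology above, or is something like an Ingress controller with connection- or load-aware balancing the more common solution?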
Note: the number of rectangular boxes in the diagram is representative only; I will be deploying 10 to 20 pods per tier to meet my workload.