I'm looking for clarification regarding HPA in my GKE cluster. I've read these docs https://cloud.google.com/kubernetes-engine/docs/how-to/horizontal-pod-autoscaling but I wanted to know, by setting the minReplicas and maxReplicas are these the min and max values across all nodes in the cluster or is this the min and max per zone?
For instance if I have a cluster with 3 zones and 2 nodes per zone, and I set minReplicas=1 and maxReplicas=3 what will be the min and max number of pods in the entire cluster? Assuming the HPA is operating on a deployment that creates a single node.