DNS problem on AWS EKS when running in private subnets

Question

I have an EKS cluster setup in a VPC. The worker nodes are launched in private subnets. I can successfully deploy pods and services.

However, I'm not able to perform DNS resolution from within the pods. (It works fine on the worker nodes, outside the container.)

Troubleshooting using https://kubernetes.io/docs/tasks/administer-cluster/dns-debugging-resolution/ results in the following from nslookup (timeout after a minute or so):

Server: 172.20.0.10 Address 1: 172.20.0.10

nslookup: can't resolve 'kubernetes.default'

When I launch the cluster in an all-public VPC, I don't have this problem. Am I missing any necessary steps for DNS resolution from within a private subnet?

Many thanks, Daniel

is kube-dns or core-dns up? what does it say when you type kubectl get pods -n kube-system? check the the /etc/resolv.conf in the container in the pod, it should point to the kube-dns/core-dns internap IP address — Rico
Rico, kube-dns is up and runnning. Not sure how I find the internal IP of the kube-dns, but the resolv.conf in the container looks like this: nameserver 10.100.0.10 search default.svc.cluster.local svc.cluster.local cluster.local eu-west-1.compute.internal us-west-2.compute.internal options ndots:5 — Daniel
Found the IP of the kube-dns service, and it's 10.100.0.10, i.e. the same that is specified in /etc/resolv.conf in my container,. — Daniel
You're right! The problem was the network ACLs in our custom VPC. Had to open up UDP traffic for kube-dns to work properly. Haven't been able to figure out which ports yet, seems like multiple ports (including 53) are required. Thanks for helping out! — Daniel
@TommyAdamski simply allowing outbound UDP traffic on port 53 on my ACL worked for me - give it a few seconds to update before trying — apdm

apdm apdm · Accepted Answer · 2018-12-07T07:41:46

I feel like I have to give this a proper answer because coming upon this question was the answer to 10 straight hours of debugging for me. As @Daniel said in his comment, the issue I found was with my ACL blocking outbound traffic on UDP port 53 which apparently kubernetes uses to resolve DNS records.

The process was especially confusing for me because one of my pods worked actually worked the entire time since (I think?) it happened to be in the same zone as the kubernetes DNS resolver.

DNS problem on AWS EKS when running in private subnets

6 Answers