My development environment worked well all of 2019, but when entering 2020 one of my services was not working, when looking at the detail what causes the failure is that the health check indicates that the VM is not healthy.
The scheme is simple, my WCF service runs on IIS andI use a TCP load balancer. This same scheme is replicated to have access by two different IPs but only one of the health checks fails. The service is available from localhost, it is available from the ephemeral IP of the instance, but it fails if I access from the static IP that I assigned to the load balancer.
Since it is the same application running in the two VMs and all the configuration is similar, I want to validate what makes the health check to fail, checking with wireshark the connection to the enabled port, I find that all connections from the load balancer say TCP Retransmission but the connections to the ephemeral IP arrive well and receive the 200 OK.
My knowledge about this topic is minimal and I can not identify information that is useful to solve the problem, taking into account that everything is working well last year.
I want to see what information stackdriver can give me, but for some reason I can't find the TCP load balancers on the list.
EDIT: Apparently the problem was due to the fact that the "Google Compute Engine Agent" service was not running, but I couldn't see any log indicating what caused the service to stop. When starting the service again everything worked normally.