We have deployed our solution for high availability using Azure Traffic Manager with default settings.
The routing method we selected is Performance.
We expected that as soon as the primary server goes down, users would be transferred to the secondary server. Unfortunately, there is a delay of about 30 seconds. In our testing, users got "not responding" errors and request timeouts during those 30 seconds, and it took almost a minute for everything to work again. Our Azure Traffic Manager profile has a 30-second DNS TTL. We generally do not observe such dropouts on Facebook or Microsoft sites, which surely run high-availability solutions of their own.
Do we need to add code to our application to handle these dropouts gracefully, for example by showing a client-side dialog saying that we will be back soon? What would be the best solution to make the user experience seamless?
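
To make the question concrete, this is roughly the kind of client-side handling we have in mind: a minimal Python sketch that retries a failed request with exponential backoff instead of surfacing the first timeout to the user. The name `call_with_retry`, its parameters, and the delay values are our own hypothetical choices, not part of any Azure SDK, and `request` stands in for whatever call the client makes.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def call_with_retry(
    request: Callable[[], T],
    attempts: int = 5,          # assumed value, tune to the failover window
    base_delay: float = 1.0,    # seconds before the first retry (assumption)
    max_delay: float = 30.0,    # cap on any single wait (assumption)
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `request`, retrying on connection errors and timeouts.

    Waits base_delay, 2*base_delay, 4*base_delay, ... (capped at max_delay)
    between attempts, and re-raises the last error if every attempt fails.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")

    last_error: Optional[Exception] = None
    for attempt in range(attempts):
        try:
            return request()
        except (ConnectionError, TimeoutError) as exc:
            last_error = exc
            if attempt == attempts - 1:
                break  # no more retries left, fall through and re-raise
            sleep(min(max_delay, base_delay * 2 ** attempt))

    assert last_error is not None
    raise last_error
```

With something like this, the client could show a "reconnecting" message while the retries run and only show an error dialog once `call_with_retry` finally raises. Whether this is the right approach, or whether the failover itself can be made faster, is what we are unsure about.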