1 vote

I have several aws_instance nodes that are in a load balancer target group in Terraform. I made a change that requires destroying each instance and recreating it. By default, Terraform will destroy and recreate all of these instances at the same time. Destroying all of them at once is bad because then no nodes will be in the load balancer.

Is there a way to configure Terraform so it waits for one instance to be fully destroyed/recreated before destroying/creating the other instances?

2
Do you just have straight instances (using the aws_instance resource) or are you using an autoscaling group? This is easier to achieve with an ASG even if you don't need to actually autoscale (set min and max to the same value and/or attach no autoscaling policy). – ydaetskcoR
Instances using the aws_instance resource, though I may follow your suggestion below. – Kevin Burke

2 Answers

2 votes

You can use the create_before_destroy lifecycle customisation to force Terraform to create the new resource before destroying the old one during a replacement action.
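As a minimal sketch (the AMI variable is a placeholder, not from your configuration), the lifecycle block looks like this:

```hcl
resource "aws_instance" "app" {
  ami           = var.ami_id # hypothetical variable
  instance_type = "t3.micro"

  lifecycle {
    # Build the replacement instance first, then destroy the old one.
    create_before_destroy = true
  }
}
```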

Unfortunately, if your instance takes a while to start the service you need, you're still going to have a problem: as soon as the AWS API reports that the instance is running, Terraform considers the replacement done and starts terminating the old instance.

You can solve this by putting the instances in an autoscaling group (even if you don't need them to autoscale, by setting the same min and max size or attaching no autoscaling policy) and setting the health_check_type to ELB. This makes sure an instance isn't considered healthy until it passes the load balancer's health checks, rather than the default EC2 health checks (i.e. the instance is running and has no system or instance status check failures). Terraform will then wait until the new ASG has the minimum number of instances passing the load balancer health checks (and is attached to the relevant target group or ELB) before it considers the replacement complete and starts removing the old ASG.
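A sketch of the ASG approach, assuming a launch configuration and target group defined elsewhere in your configuration (the names here are placeholders):

```hcl
resource "aws_autoscaling_group" "app" {
  name_prefix          = "app-"
  min_size             = 3
  max_size             = 3 # same min and max: no actual autoscaling
  launch_configuration = aws_launch_configuration.app.name # hypothetical
  target_group_arns    = [aws_lb_target_group.app.arn]     # hypothetical

  # Use load balancer health checks, not just EC2 status checks.
  health_check_type         = "ELB"
  health_check_grace_period = 300

  # Terraform waits until this many instances pass ELB health checks
  # before considering the new ASG created.
  min_elb_capacity = 3

  lifecycle {
    create_before_destroy = true
  }
}
```

With create_before_destroy on the ASG (and a name_prefix so the names don't collide), a replacement builds the new group, waits for min_elb_capacity healthy instances, and only then tears down the old group.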

0 votes

There is the depends_on attribute, which lets you set up explicit dependencies and create things in order. It is limited for your scenario because it only waits for the new instances to be created, not for them to be "ready".
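For example (the resource names and variable here are illustrative):

```hcl
resource "aws_instance" "first" {
  ami           = var.ami_id # hypothetical variable
  instance_type = "t3.micro"
}

resource "aws_instance" "second" {
  ami           = var.ami_id
  instance_type = "t3.micro"

  # Created only after the first instance exists; Terraform does not
  # wait for the first instance to be serving traffic.
  depends_on = [aws_instance.first]
}
```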

One idea I had when reading your scenario: you could use the external data source. I'm not positive it was intended for this kind of thing, but I think it could work. Essentially, you would write a script that uses the AWS CLI (and whatever else is needed) to check whether the instance is created and ready. If you combine this with depends_on, or chain the output of the external data source into the next instance (use the output to set a tag?), I think it would have the effect you want.
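A rough sketch of the chaining idea; the wait_for_instance.sh script is hypothetical and would need to poll the AWS CLI and emit a JSON object on stdout, as the external data source requires:

```hcl
data "external" "first_ready" {
  # Hypothetical script: polls `aws ec2 describe-instance-status` until
  # the instance passes its checks, then prints e.g. {"status": "ok"}.
  program = ["bash", "${path.module}/wait_for_instance.sh", aws_instance.first.id]
}

resource "aws_instance" "second" {
  ami           = var.ami_id # hypothetical variable
  instance_type = "t3.micro"

  tags = {
    # Referencing the data source's result forces Terraform to run the
    # readiness check before creating this instance.
    FirstInstanceStatus = data.external.first_ready.result["status"]
  }
}
```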

This design smells a bit to me though. There are other AWS services and features that can do this kind of thing for you e.g. ECS rolling deployments with load balancer health checks.

Resources:

https://www.terraform.io/docs/providers/external/data_source.html

https://docs.aws.amazon.com/AmazonECS/latest/developerguide/service-create-loadbalancer-rolling.html

Edit:

If you are stuck with EC2 instances, another native AWS way to solve this might be to use lifecycle hooks. I have used EC2 user data scripts in combination with lifecycle hooks calling Lambda functions to do rolling deployments and configuration of a custom Kafka cluster (before MSK). That required bringing up instances in order and assigning each instance a unique broker id, which sounds similar to your scenario.

Resource: https://docs.aws.amazon.com/autoscaling/ec2/userguide/lifecycle-hooks.html
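For reference, a launch lifecycle hook on an ASG can be declared like this (the names are placeholders; the Lambda or script that completes the hook is not shown):

```hcl
resource "aws_autoscaling_lifecycle_hook" "launch" {
  name                   = "wait-for-configuration"
  autoscaling_group_name = aws_autoscaling_group.app.name # hypothetical ASG
  lifecycle_transition   = "autoscaling:EC2_INSTANCE_LAUNCHING"

  # Keep the instance in Pending:Wait until something (e.g. a Lambda or
  # the instance's own user data) calls complete-lifecycle-action.
  heartbeat_timeout = 600
  default_result    = "ABANDON" # fail the launch if nothing completes the hook
}
```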