Google Cloud: Update instances on instance group when instance_template is changed #3875

premist · 2015-11-12T08:55:47Z

On resource google_compute_instance_group_manager, you can specify instance_template resource for the instance group. When I modify google_compute_instance_template resource, associated google_compute_instance_group_manager is updated but its instances are not updated. I had to manually delete VM instances so Instance Group can recreate instances with new instance template. I can think two possible approach to this:

1. Recreate instances after updating instance-groups

By using gcloud compute instance-groups managed recreate-instances after updating an instance group, you can trigger recreation of the instances. This is probably the easiest way to solve a problem, but instance group will have no active instances for a while, which is not ideal on production environment.

2. Use rolling update

By using gcloud alpha compute rolling-updates start, you can gracefully retire old instances and create new instances. It has several flags that can be used to control the instance count and interval between each batch updates. We've used this command before we investigate Terraform, and it worked pretty well even though it's still in alpha state.

The text was updated successfully, but these errors were encountered:

lwander · 2015-11-12T20:06:51Z

Good catch on the update issue. I will implement approach 1 since we'd rather not support the Alpha API as it's subject to change.

lwander · 2015-11-12T20:57:03Z

As you pointed out, it's bad in prod to restart all the instances at once. So I added the update_strategy field where you can specify NONE and then rely on running the alpha command by hand until that makes its way into the stable v1 API, where Terraform will give you the option to do that automatically.

premist · 2015-11-13T06:54:27Z

Thank you @lwander, it sounds like a good idea! Since we're testing Terraform on our new staging environment, I'll try out your PR locally to see if it works well.

sparkprime · 2015-11-13T18:39:55Z

If you make the IGM depend on the template, then changing the template should recreate the IGM (and all its instances). If this doesn't happen this is a core bug.

Of course that's not the behavior you want :)

What I would actually do is create an IGM for each version of your software -- then you can independently scale them / tear them down and implement whatever phased rollout policy you want.

lwander · 2015-11-13T18:44:44Z

No, instance_template isn't marked as ForceNew, so the IGM shouldn't restart on a change to the template.

sparkprime · 2015-11-13T19:08:45Z

Let's navel-gaze... In an ideal would one could imagine a declarative deployment agent being able to do 2 things:

You update the template, it shuts everything down and restarts it. This is appropriate for local development, staging environment, automated integration tests, load tests, etc.
You update the template, you get a rolling update of your fleet that ensures that sufficient instances are always running at any given time and automatically rolls back on failures that it detects (e.g. a spike in HTTP errors, latency spike, not meeting SLO, etc.) Only one update is allowed at a time (new updates fail) but there is an explicit rollback command. This is appropriate for managing a highly available production service.

This IGM rollout stuff gets you some but not all of the way to 2. You actually need application knowledge to do it properly, and I've only seen it done in PAAS providers. What IGM currently supports is definitely useful though, although unfortunately it is exposed imperatively so requires some mapping to expose naturally in Terraform.

However you can, with a bit of work, build (2) on top of (1) by executing multiple Terraform applies from a continuous deployment agent that is aware of your application's monitoring. However you'd need a ForceNew instance_template for that, which would invalidate the current support for (2).

Maybe the right thing to do is like the startup_script -- have one forcenew field and one updateable field.

lwander · 2015-11-13T20:08:16Z

For anyone reading this in the future:

@sparkprime and I talked this out and agreed to keep update_strategy for the following reasons:

A policy of "RESTART" is the same as modifying instance_template designated as ForceNew, but with the added benefit that it does not recreate the entire IGM. This allows a user to implement @sparkprime's rollout approach detailed above.
A policy of "NONE" is the same as modifying instance_template designated as not ForceNew.
Future policies will all be supported, all without creating extra fields for the same attribute.

lwander · 2015-12-15T18:09:35Z

Closing since #3892 has been merged

premist · 2015-12-16T02:38:49Z

Awesome, thanks!

ghost · 2020-04-29T02:08:53Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

jen20 added enhancement provider/google-cloud labels Nov 12, 2015

lwander mentioned this issue Nov 12, 2015

provider/google: Fix instance group manager instance restart policy #3892

Merged

lwander closed this as completed Dec 15, 2015

ghost locked and limited conversation to collaborators Apr 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Google Cloud: Update instances on instance group when instance_template is changed #3875

Google Cloud: Update instances on instance group when instance_template is changed #3875

premist commented Nov 12, 2015

lwander commented Nov 12, 2015

lwander commented Nov 12, 2015

premist commented Nov 13, 2015

sparkprime commented Nov 13, 2015

lwander commented Nov 13, 2015

sparkprime commented Nov 13, 2015

lwander commented Nov 13, 2015

lwander commented Dec 15, 2015

premist commented Dec 16, 2015

ghost commented Apr 29, 2020

Google Cloud: Update instances on instance group when instance_template is changed #3875

Google Cloud: Update instances on instance group when instance_template is changed #3875

Comments

premist commented Nov 12, 2015

1. Recreate instances after updating instance-groups

2. Use rolling update

lwander commented Nov 12, 2015

lwander commented Nov 12, 2015

premist commented Nov 13, 2015

sparkprime commented Nov 13, 2015

lwander commented Nov 13, 2015

sparkprime commented Nov 13, 2015

lwander commented Nov 13, 2015

lwander commented Dec 15, 2015

premist commented Dec 16, 2015

ghost commented Apr 29, 2020