Don't recreate resources as we're tearing a thing down. #2678

Closed
mattmoor opened this issue Dec 9, 2018 · 8 comments
Labels: area/API, area/test-and-release, kind/bug, kind/cleanup

mattmoor commented Dec 9, 2018

We recently started registering DeleteFunc handlers for our resources, which enables us to quickly recreate resources if they are deleted out from under us.

A byproduct of this (still needs confirmation) is that I noticed that sometimes when a Revision is being torn down, right at the moment the Deployment's Pod starts Terminating, another Pod (under a different ReplicaSet) appears. At first I thought this was us updating the Deployment (e.g. #2632), but our logs clearly show us creating two Deployments:

[screenshot: controller logs showing two Deployment creations]

I think what's happening is that we Reconcile the Revision when it has a DeletionTimestamp (mid delete) and see no Deployment, so we recreate it.

cc @lichuqiang @dgerd
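
If that theory holds, the fix would be to make Reconcile sensitive to the tombstone. A minimal sketch of such a guard, assuming upstream apimachinery types (the helper name is hypothetical, not the actual knative/serving code):

```go
package revision

import (
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// isBeingDeleted reports whether the object has been tombstoned for deletion.
// Foreground GC sets metadata.deletionTimestamp before dependents are reaped,
// so a reconciler that recreates children should early-return when this is true.
func isBeingDeleted(obj metav1.Object) bool {
	return obj.GetDeletionTimestamp() != nil
}
```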

@knative-prow-robot added the area/API, area/test-and-release, kind/bug, and kind/cleanup labels Dec 9, 2018

dgerd commented Dec 10, 2018

Your explanation makes the most sense to me. Is this reproducible all the time? If so, it should be easy to add some instrumentation to confirm. Happy to take a stab at doing this. For reference, this is the reconciler path that would do the recreation:

```go
ns := rev.Namespace
deploymentName := resourcenames.Deployment(rev)
logger := logging.FromContext(ctx).With(zap.String(logkey.Deployment, deploymentName))
deployment, err := c.deploymentLister.Deployments(ns).Get(deploymentName)
if apierrs.IsNotFound(err) {
	// Deployment does not exist. Create it.
	rev.Status.MarkDeploying("Deploying")
	deployment, err = c.createDeployment(ctx, rev)
	if err != nil {
		logger.Errorf("Error creating deployment %q: %v", deploymentName, err)
		return err
	}
	logger.Infof("Created deployment %q", deploymentName)
}
```
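
One low-risk way to confirm would be a warning right at the top of that IsNotFound branch (a sketch reusing logger, rev, and deploymentName from the snippet above):

```go
// Instrumentation sketch: record whether we are about to recreate the
// Deployment for a Revision that is already mid-delete.
if rev.GetDeletionTimestamp() != nil {
	logger.Warnf("Recreating deployment %q for a Revision tombstoned at %v",
		deploymentName, rev.GetDeletionTimestamp())
}
```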


dgerd commented Dec 20, 2018

/milestone Serving 0.4

@knative-prow-robot knative-prow-robot added this to the Serving 0.4 milestone Dec 20, 2018

dgerd commented Dec 21, 2018

/assign @dgerd


dgerd commented Dec 21, 2018

From @mattmoor: "I think the key is that we didn't make Reconcile() sensitive to metadata.deletionTimestamp when we added the OnDelete hooks. I believe deletionTimestamp gets set as a tombstone as Foreground GC runs"
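
For reference, foreground GC tombstones the owner by setting deletionTimestamp and adding the foregroundDeletion finalizer, both of which are visible from the informer cache. A small checker, assuming upstream apimachinery types:

```go
package revision

import (
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// isForegroundTombstone reports whether foreground GC has marked the object:
// deletionTimestamp is set, and the object carries the foregroundDeletion
// finalizer until all of its dependents have been reaped.
func isForegroundTombstone(obj metav1.Object) bool {
	if obj.GetDeletionTimestamp() == nil {
		return false
	}
	for _, f := range obj.GetFinalizers() {
		if f == metav1.FinalizerDeleteDependents { // "foregroundDeletion"
			return true
		}
	}
	return false
}
```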


dgerd commented Dec 21, 2018

I added some logging and attempted to reproduce this at HEAD by creating a runLatest Service and then deleting it. I tried multiple times, deleting immediately after creation, after a few seconds, and after a few minutes. I have yet to reproduce this.

I am using kubectl apply -f to create and kubectl delete -f to delete the service. I will try throwing traffic at it before deleting to see if that changes the behavior at all.

Let me know if you have any other reproduction advice.

mattmoor commented

I can see this in the scale test (at least with it cranked up to 150, as I have it right now), e.g.

```
scale-00150-015-zztulrto-00001-deployment-6977d4bcc-xd749    1/2     Terminating   0          3m
scale-00150-015-zztulrto-00001-deployment-97785665f-8t659    1/2     Terminating   0          1m
```

Judging by the name suffixes, these Pods came from distinct ReplicaSets, and the main way that happens is if we created a second Deployment while the Pods from the first were still tearing down.

mattmoor commented

I added some logic in the revision controller to log when we see Revisions with a DeletionTimestamp, and running N=100 a bunch, I'm not seeing my log statement. The only other thing that comes to mind would be adding a delay to the queuing of Deployment deletion events using this.
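
That delayed-enqueue idea could look roughly like the following with client-go's delaying workqueue (a sketch; the handler shape and the one-second delay are assumptions):

```go
package revision

import (
	"time"

	"k8s.io/client-go/tools/cache"
	"k8s.io/client-go/util/workqueue"
)

// Sketch: delay Deployment delete events so the owning Revision's own
// tombstone has time to land in the informer cache before we reconcile.
func deleteHandler(queue workqueue.DelayingInterface) cache.ResourceEventHandlerFuncs {
	return cache.ResourceEventHandlerFuncs{
		DeleteFunc: func(obj interface{}) {
			key, err := cache.DeletionHandlingMetaNamespaceKeyFunc(obj)
			if err != nil {
				return
			}
			queue.AddAfter(key, time.Second) // delay value is an arbitrary guess
		},
	}
}
```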

@mattmoor mattmoor modified the milestones: Serving 0.4, Ice Box Jan 29, 2019
mattmoor commented

I'd initially thought this was us doing something obviously wrong, but given that DeletionTimestamp is unset when this happens, this seems cosmetic and somewhat tough to fix. Moving out of v1.

mattmoor added a commit to mattmoor/serving that referenced this issue Jan 30, 2019
When we previously had finalizers, we would race with K8s GC to recreate our children as K8s reaped them.

The simplest way to test this is to enable "foreground" deletion in our e2e tests, which is implemented as a finalizer.

Fixes: knative#2678
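
Enabling foreground deletion programmatically amounts to setting a propagation policy on the delete call. A sketch against a plain Deployment, assuming the context-threading client-go signature (0.18+); the helper name is illustrative:

```go
package e2e

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// deleteForeground deletes a Deployment with foreground propagation, so the
// API server tombstones it with a deletionTimestamp and only removes it once
// its dependents are gone; that is the window in which this bug reproduced.
func deleteForeground(ctx context.Context, kc kubernetes.Interface, ns, name string) error {
	fg := metav1.DeletePropagationForeground
	return kc.AppsV1().Deployments(ns).Delete(ctx, name, metav1.DeleteOptions{
		PropagationPolicy: &fg,
	})
}
```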
knative-prow-robot pushed a commit that referenced this issue Jan 30, 2019
When we previously had finalizers, we would race with K8s GC to recreate our children as K8s reaped them.

The simplest way to test this is to enable "foreground" deletion in our e2e tests, which is implemented as a finalizer.

Fixes: #2678
@dprotaso dprotaso removed this from the Ice Box milestone Oct 6, 2021