🐛 Adjust machinepool helper e2e timeout #8739

killianmuldoon · 2023-05-24T14:05:54Z

Adjust the timeout in the PollImmediate call in getMachinePoolInstanceVersions.

In this function we don't get the MachinePool so the nodeRefs stay the same on each call. Because the timeout for this function is 3 minutes per Node and the timeout of the wrapping Eventually call is set at 5 minuted in our end to end test the end result is that we only ever run one get request for the MachinePool - there are two nodes each gets a 3 minute timeout.

If upgrades aren't finished or it's out of sync when the function is initalized the nodes being looked for are never updated.

Also added some better logging - improving on ##8728 so if this doesn't fix the issue, or if there's additional flakes in future, we might get more information from the logs.

Fixes (Hopefully) #8718

killianmuldoon · 2023-05-24T14:07:36Z

@chrischdi Maybe this is the cause of the flake. What I'm not certain about is how this ever really worked given we call this function right after the patch call and I don't understand how the NodeRefs are updated that quickly in a passing test.

chrischdi

Woah that's a very ugly timing bug!

👍 Huge thanks for digging more into it!

chrischdi · 2023-05-24T14:38:50Z

/lgtm

k8s-ci-robot · 2023-05-24T14:38:57Z

LGTM label has been added.

Git tree hash: 5afd629d9bf973857452bdf686b8bae8a7cbae97

killianmuldoon · 2023-05-24T15:02:36Z

/test pull-cluster-api-e2e-full-main

sbueringer · 2023-05-24T16:51:08Z

Let's please merge the CR bump first

killianmuldoon · 2023-05-24T16:51:52Z

/hold

To merge Controller Runtime bump first

Thanks for the heads up @sbueringer

test/framework/machinepool_helpers.go

killianmuldoon · 2023-05-24T18:22:09Z

/retest

sbueringer · 2023-05-24T19:23:59Z

lgtm pending the rebase in a bit

Signed-off-by: killianmuldoon <[email protected]>

sbueringer · 2023-05-24T19:46:20Z

/lgtm
/approve

feel free to hold cancel obviously :)

k8s-ci-robot · 2023-05-24T19:46:26Z

LGTM label has been added.

Git tree hash: 90117fb38f2388abfdd93125685f8e3ccf22aad9

k8s-ci-robot · 2023-05-24T19:46:27Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sbueringer

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [sbueringer]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

killianmuldoon · 2023-05-24T19:47:08Z

/hold cancel

Just saw this flake again in the CI - hopefully this gets ahead of it 😄

killianmuldoon · 2023-05-24T19:49:24Z

/cherry-pick release-1.3

k8s-infra-cherrypick-robot · 2023-05-24T19:49:26Z

@killianmuldoon: once the present PR merges, I will cherry-pick it on top of release-1.3 in a new PR and assign it to you.

In response to this:

/cherry-pick release-1.3

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

killianmuldoon · 2023-05-24T19:49:30Z

/cherry-pick release-1.4

k8s-infra-cherrypick-robot · 2023-05-24T19:49:31Z

@killianmuldoon: once the present PR merges, I will cherry-pick it on top of release-1.4 in a new PR and assign it to you.

In response to this:

/cherry-pick release-1.4

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

killianmuldoon · 2023-05-24T19:50:02Z

We should hold the cherry-picks until we have some signal that this works - but I'd prefer to have them in the queue as a reminder.

k8s-infra-cherrypick-robot · 2023-05-24T19:57:27Z

@killianmuldoon: #8739 failed to apply on top of branch "release-1.3":

Applying: Adjust machinepool helper e2e timeout
Using index info to reconstruct a base tree...
M	test/framework/machinepool_helpers.go
Falling back to patching base and 3-way merge...
Auto-merging test/framework/machinepool_helpers.go
CONFLICT (content): Merge conflict in test/framework/machinepool_helpers.go
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0001 Adjust machinepool helper e2e timeout
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/cherry-pick release-1.3

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-infra-cherrypick-robot · 2023-05-24T19:58:01Z

@killianmuldoon: #8739 failed to apply on top of branch "release-1.4":

Applying: Adjust machinepool helper e2e timeout
Using index info to reconstruct a base tree...
M	test/framework/machinepool_helpers.go
Falling back to patching base and 3-way merge...
Auto-merging test/framework/machinepool_helpers.go
CONFLICT (content): Merge conflict in test/framework/machinepool_helpers.go
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0001 Adjust machinepool helper e2e timeout
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/cherry-pick release-1.4

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

johannesfrey · 2023-06-05T05:34:49Z

/area machinepool

killianmuldoon · 2023-06-05T09:10:34Z

area e2e-testing

killianmuldoon · 2023-06-05T09:10:51Z

/area e2e-testing

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label May 24, 2023

k8s-ci-robot requested review from chrischdi and stmcginnis May 24, 2023 14:06

k8s-ci-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label May 24, 2023

killianmuldoon force-pushed the pr-mp-helper-timeout branch from 17f0f9d to a618773 Compare May 24, 2023 14:08

chrischdi reviewed May 24, 2023

View reviewed changes

k8s-ci-robot assigned chrischdi May 24, 2023

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 24, 2023

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 24, 2023

sbueringer reviewed May 24, 2023

View reviewed changes

test/framework/machinepool_helpers.go Outdated Show resolved Hide resolved

killianmuldoon force-pushed the pr-mp-helper-timeout branch from a618773 to 5269466 Compare May 24, 2023 17:05

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 24, 2023

k8s-ci-robot requested a review from chrischdi May 24, 2023 17:05

sbueringer reviewed May 24, 2023

View reviewed changes

test/framework/machinepool_helpers.go Show resolved Hide resolved

killianmuldoon force-pushed the pr-mp-helper-timeout branch from 5269466 to c0884ca Compare May 24, 2023 17:54

Adjust machinepool helper e2e timeout

a26fe0e

Signed-off-by: killianmuldoon <[email protected]>

killianmuldoon force-pushed the pr-mp-helper-timeout branch from c0884ca to a26fe0e Compare May 24, 2023 19:41

k8s-ci-robot assigned sbueringer May 24, 2023

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 24, 2023

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 24, 2023

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 24, 2023

k8s-ci-robot merged commit 1f69d07 into kubernetes-sigs:main May 24, 2023

k8s-ci-robot added this to the v1.5 milestone May 24, 2023

This was referenced May 26, 2023

🐛 [release-1.3] Adjust machinepool helper e2e timeout #8755

Merged

🐛 [release-1.4] Adjust machinepool helper e2e timeout #8756

Merged

k8s-ci-robot added the area/machinepool Issues or PRs related to machinepools label Jun 5, 2023

killianmuldoon removed the area/machinepool Issues or PRs related to machinepools label Jun 5, 2023

k8s-ci-robot added the area/e2e-testing Issues or PRs related to e2e testing label Jun 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐛 Adjust machinepool helper e2e timeout #8739

🐛 Adjust machinepool helper e2e timeout #8739

killianmuldoon commented May 24, 2023

killianmuldoon commented May 24, 2023

chrischdi left a comment

chrischdi commented May 24, 2023

k8s-ci-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

sbueringer commented May 24, 2023

killianmuldoon commented May 24, 2023 •

edited

Loading

killianmuldoon commented May 24, 2023

sbueringer commented May 24, 2023

sbueringer commented May 24, 2023

k8s-ci-robot commented May 24, 2023

k8s-ci-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

johannesfrey commented Jun 5, 2023

killianmuldoon commented Jun 5, 2023

killianmuldoon commented Jun 5, 2023

🐛 Adjust machinepool helper e2e timeout #8739

🐛 Adjust machinepool helper e2e timeout #8739

Conversation

killianmuldoon commented May 24, 2023

killianmuldoon commented May 24, 2023

chrischdi left a comment

Choose a reason for hiding this comment

chrischdi commented May 24, 2023

k8s-ci-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

sbueringer commented May 24, 2023

killianmuldoon commented May 24, 2023 • edited Loading

killianmuldoon commented May 24, 2023

sbueringer commented May 24, 2023

sbueringer commented May 24, 2023

k8s-ci-robot commented May 24, 2023

k8s-ci-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

killianmuldoon commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

k8s-infra-cherrypick-robot commented May 24, 2023

johannesfrey commented Jun 5, 2023

killianmuldoon commented Jun 5, 2023

killianmuldoon commented Jun 5, 2023

killianmuldoon commented May 24, 2023 •

edited

Loading