Kubelet cannot be restarted via API if it's not already running #4665
Comments
Thanks, we should get this addressed. The root cause (that `condition failed` error) shouldn't happen either.
This is a bug in Talos, but interestingly enough this code has been removed from Talos since 0.14.0-beta.0. So the actual issue with the kubelet failing to restart is not fixed, but the `condition failed` bug shouldn't exist anymore.
In addition to the restart action, also allow the start action. If the service fails to start, it transitions to the `Failed` state, and it has to be started to bring it back to the running state.
Fixes siderolabs#4665
Also GC'ed the now-unused condition (it had been used before kubelet started being controlled via COSI).
Signed-off-by: Andrey Smirnov <[email protected]>
(cherry picked from commit ab42886)
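The idea in the commit message above can be sketched roughly as follows. This is only an illustration, not the Talos implementation: the types `ServiceState` and `Action` and the function `RecoveryAction` are hypothetical names invented for this example, and the only rule taken from the thread is that a running service can be restarted while a service in the `Failed` state needs to be started.

```go
package main

import "fmt"

// ServiceState is a hypothetical, simplified stand-in for a service's
// lifecycle state. It is not a Talos type.
type ServiceState string

const (
	StateRunning ServiceState = "Running"
	StateFailed  ServiceState = "Failed"
)

// Action is a hypothetical name for the operation requested for a service.
type Action string

const (
	ActionStart   Action = "start"
	ActionRestart Action = "restart"
)

// RecoveryAction picks the action that brings a service back to the running
// state: a running service is restarted, anything else (for example a
// service that failed to start) is started.
func RecoveryAction(state ServiceState) Action {
	if state == StateRunning {
		return ActionRestart
	}
	return ActionStart
}

func main() {
	for _, s := range []ServiceState{StateRunning, StateFailed} {
		fmt.Printf("%s -> %s\n", s, RecoveryAction(s))
	}
}
```

With only restart allowed, the `Failed` case has no valid action, which matches the behaviour described in this issue.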
So this would potentially fix #4574 (sorry for not providing those logs in a reasonable amount of time, it's been hectic lately).
This feels like a completely different issue; we should probably move this to #4574. I think it might be helpful to compare the logs before the error, as there might be something about the bond setup that makes a difference.
Bug Report
Description
If the kubelet service is in the "Failed" state, it cannot be restarted.
Restarting works fine if the kubelet is already running.
Ref #4407.
Logs
Environment
talosctl version --nodes <problematic nodes>
kubectl version --short