-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MHC support #224
MHC support #224
Conversation
Skipping CI for Draft Pull Request. |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: slintes The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test ? |
@slintes: The following commands are available to trigger required jobs:
Use In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test 4.13-openshift-e2e |
1 similar comment
/test 4.13-openshift-e2e |
return result, err | ||
} | ||
|
||
if !r.ReconcileMHC { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
shouldn't this be at the top of method ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no, the MHCChecker above isn't new
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What I mean is, in case the feature is disabled wouldn't it make sense to return immediately and skip other tests ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
MHCChecker needs to always run 🤷🏼♂️
checking old NHC e2e tests /test 4.11-openshift-e2e |
/test 4.11-openshift-e2e |
/test 4.11-openshift-e2e |
/test 4.12-openshift-e2e |
/test 4.12-openshift-e2e |
/test 4.12-openshift-e2e |
After a lot of "fun" with rebase, fixing merge conflicts, and getting leases to work with MHC, unit tests are green again (locally at least). Let's see how old NHC e2e looks like now. Not ready for review! Still several TODOs in code. And no e2e for MHC yet. /test 4.13-openshift-e2e |
Signed-off-by: Marc Sluiter <[email protected]>
MHC needs to also remediate failed Machines which don't have a node yet. Because of that, the CR name always is the Machine name. And we can't create a node lease in this case. Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Works on OCP 4.14+ only Signed-off-by: Marc Sluiter <[email protected]>
Otherwise we might miss MHCChecker update in case getting MHC fails Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
Signed-off-by: Marc Sluiter <[email protected]>
rebased on #272 |
/lgtm |
Signed-off-by: Marc Sluiter <[email protected]>
/test 4.12-openshift-e2e |
@slintes: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/test 4.13-openshift-e2e |
/hold cancel |
@slintes: Overrode contexts on behalf of slintes: e2e-k8s In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/lgtm |
Handle failed machines and unhealthy nodes based on MachineHealthChecks.
This basically is a copy of the controller and unit tests from https://github.com/openshift/machine-api-operator/tree/master/pkg/controller/machinehealthcheck.
Where applicable, existing NHC code was reused.
Limitations, can potentailly be handled in a follow up:
Done:
ECOPROJECT-773