MCM to mark terminating nodes #488

Closed
guydc opened this issue Jul 16, 2020 · 6 comments · Fixed by #492 or #505
Labels: kind/enhancement (Enhancement, improvement, extension), status/in-progress (Issue is in progress/work)

Comments

@guydc
Contributor

guydc commented Jul 16, 2020

What would you like to be added:

When MCM replaces nodes (unhealthy node termination, node rolling updates, ...), the nodes being terminated should be marked in some way (e.g. with an annotation) to reflect the reason for their termination.

Why is this needed:
End users can set up monitoring for the progress of node rolling updates. If PodDisruptionBudgets (PDBs) are used, end users can also automate the removal of PDBs from pods running on nodes that are being terminated due to health issues.

@guydc guydc added the kind/enhancement Enhancement, improvement, extension label Jul 16, 2020
@hardikdr
Member

Having an indication on the node object whenever it is being terminated, for example due to health conditions, sounds like a good idea to me.

@prashanth26
Contributor

I heard that you are working on a fix for this, so I'm assigning the issue to you.

/assign @guydaichs

@guydc
Contributor Author

guydc commented Jul 21, 2020

Hi @prashanth26. Yes, I'm working on this. I noticed that there's already a mechanism for syncing the machine config with a node annotation. However, in my case, I want the annotation to capture a certain transition, and not the current state.

So I was thinking of annotating the node in the machineDelete method with the value of machine.Status.CurrentStatus.Phase, provided that phase is not Terminating. For example, when a machine is replaced due to health issues, the node would be annotated with node.machine.sapcloud.io/termination: failed until it is actually removed from the cluster.

WDYT?
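
To make the idea concrete, here is a minimal sketch in Go of the annotation logic only. It is not the MCM implementation: the helper name annotateTerminatingNode, the plain map used in place of a real node object, the string value of the Terminating phase, and the lowercasing of the phase are all assumptions made for illustration. Only the annotation key and the "failed" example value come from the proposal above.

```go
package main

import (
	"fmt"
	"strings"
)

// terminationAnnotation is the annotation key proposed in this thread.
const terminationAnnotation = "node.machine.sapcloud.io/termination"

// phaseTerminating stands in for the machine's Terminating phase.
// The exact string value is an assumption for this sketch.
const phaseTerminating = "Terminating"

// annotateTerminatingNode (hypothetical helper) returns a copy of the node
// annotations with the termination reason set to the lowercased machine phase.
// It leaves the annotations untouched when the phase is empty or already
// Terminating, because in that case the earlier phase is no longer known.
func annotateTerminatingNode(annotations map[string]string, phase string) (map[string]string, bool) {
	if phase == "" || phase == phaseTerminating {
		return annotations, false
	}
	value := strings.ToLower(phase)
	if annotations[terminationAnnotation] == value {
		// Already annotated with the same reason; nothing to update.
		return annotations, false
	}
	// Copy before modifying so the caller's map is not mutated.
	updated := make(map[string]string, len(annotations)+1)
	for k, v := range annotations {
		updated[k] = v
	}
	updated[terminationAnnotation] = value
	return updated, true
}

func main() {
	nodeAnnotations := map[string]string{"example.com/owner": "team-a"}
	updated, changed := annotateTerminatingNode(nodeAnnotations, "Failed")
	fmt.Println(changed, updated[terminationAnnotation])
}
```

In a real controller the returned map would presumably be written back to the node through the Kubernetes API before the machine moves to the Terminating phase; that part is left out here.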

@prashanth26
Contributor

/status in-progress

@gardener-robot gardener-robot added the status/in-progress Issue is in progress/work label Aug 16, 2020
@prashanth26
Contributor

/reopen to port these changes to OOT.

@prashanth26 prashanth26 added this to the v0.34.0 milestone Aug 29, 2020
@prashanth26
Contributor

/unassign @guydaichs
/assign

4 participants