Unclear wording in “Disruptions” concept page #22391

Closed
mltsy opened this issue Jul 6, 2020 · 14 comments
Labels
kind/cleanup: Categorizes issue or PR as related to cleaning up code, process, or technical debt.
language/en: Issues or PRs related to English language.
lifecycle/stale: Denotes an issue or PR has remained open with no activity and has become stale.
priority/backlog: Higher priority than priority/awaiting-more-evidence.
sig/scheduling: Categorizes an issue or PR as relevant to SIG Scheduling.

Comments

@mltsy (Contributor) commented Jul 6, 2020

This is a Bug Report

Problem:
Under the "Dealing with Disruptions" heading, it says "The frequency of voluntary disruptions varies. On a basic Kubernetes cluster, there are no voluntary disruptions at all." This is just not true... unless I'm missing some very limited definition of "Basic Kubernetes Cluster".

Essentially every possible "voluntary disruption" mentioned earlier on the page does happen on a basic Kubernetes cluster (deploying a new revision, deleting a pod accidentally, upgrading the cluster, etc.). I assume what is meant here is that no voluntary disruptions are automated on a basic K8s cluster? But I'm not sure...

Proposed Solution:
Possibly change the wording to "On a basic Kubernetes cluster, there is no automated process that causes voluntary disruptions (all processes involving voluntary disruption are initiated manually, by default)."

Page to Update:
https://kubernetes.io/docs/concepts/workloads/pods/disruptions/

@sftim (Contributor) commented Jul 6, 2020

On a basic Kubernetes cluster, there are no voluntary disruptions at all.

This seems valid to me, although the text doesn't back this up with an explanation. The blue-green deployment pattern is simpler to implement than, say, a canary release.

If you use blue-green deployments then you have two Deployments (blue and green) and neither of these has any voluntary disruptions whilst in service.

If rewording this page to make things clearer, bear in mind that coarse-grained deployments are easier to implement than the fine-grained kind that need to heed PodDisruptionBudget etc.
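For readers who haven't used one, a PodDisruptionBudget is a small standalone object; the sketch below is a minimal, hypothetical example (the name, selector labels, and minAvailable value are illustrative, and the apiVersion depends on your cluster version).

```yaml
# Hypothetical example: keep at least 2 replicas of the "blue" Deployment
# available while voluntary disruptions (drains, evictions) are in progress.
apiVersion: policy/v1        # policy/v1beta1 on clusters older than v1.21
kind: PodDisruptionBudget
metadata:
  name: blue-pdb             # illustrative name
spec:
  minAvailable: 2            # never let voluntary disruptions drop below 2 pods
  selector:
    matchLabels:
      app: web               # illustrative labels matching the "blue" Deployment
      track: blue
```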

@mltsy the text you suggested:

On a basic Kubernetes cluster, there is no automated process that causes voluntary disruptions (all processes involving voluntary disruption are initiated manually, by default).

also doesn't feel quite right, because you can have a very basic-looking cluster where deployments (i.e., updates to Deployments) are triggered automatically. I think that might even be on the CKAD syllabus; it's that fundamental.

(Aside, but relevant: PodDisruptionBudget and EvenPodsSpread are both beta features).

@mltsy (Contributor, Author) commented Jul 6, 2020

Interesting... though I'm not sure I understand what your opinion is. First you say that the phrase "there are no voluntary disruptions at all" seems valid to you, but then you also say it's incorrect to claim there are no automated voluntary disruptions, because "you can have a very basic-looking cluster where deployments are triggered automatically". So does a basic cluster have voluntary disruptions or not? If it has "no voluntary disruptions at all", then it certainly doesn't have any automated voluntary disruptions, right?

Even if we ignore the case of deployments for the time being, and assume that on a "basic cluster" you should be using blue-green deployments, surely someone who launches a basic cluster will upgrade it to a new Kubernetes version at some point, right? That's a voluntary disruption. It just seemed confusing to me, right after reading about all the things that might cause a "voluntary disruption", to be told that they never happen on a "basic Kubernetes cluster".

I guess I'm just not sure what the point of that sentence is in the first place. I think it's meant to illustrate the difference between a hosted cluster (like GKE) and the most basic kind you might set up on your own: in the hosted case there are automations, by default, that will cause voluntary disruptions, whereas in a default cluster you set up yourself there would not be, unless you add them. (But that doesn't mean there will never be voluntary disruptions on that cluster; it just means they aren't going to happen unless you make them happen.)

@sftim (Contributor) commented Jul 6, 2020

We'd have to ask the original author to uncover their intent.

I suspect they were trying to distinguish between cases where people aren't worried about voluntary disruptions and cases where someone is actively keen to manage the impact of voluntary disruptions, whether those come from application rollouts or from cluster upgrades.

@mltsy (Contributor, Author) commented Jul 6, 2020

Sure - that makes sense. Maybe better wording, then, would be:

"On a basic Kubernetes cluster, there are no hidden/default automated voluntary disruptions to worry about."

That seems more to the point and a bit less confusing to me. Does it seem at least as accurate as the current wording?

@sftim (Contributor) commented Aug 10, 2020

/kind cleanup
/language en
/priority backlog
/retitle Unclear wording in “Disruptions” concept page

@k8s-ci-robot changed the title from "Issue with k8s.io/docs/concepts/workloads/pods/disruptions/" to "Unclear wording in “Disruptions” concept page" on Aug 10, 2020
@k8s-ci-robot added the kind/cleanup, language/en, and priority/backlog labels on Aug 10, 2020
@fejta-bot commented:

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label on Nov 8, 2020
@mltsy (Contributor, Author) commented Nov 9, 2020

/remove-lifecycle stale

The confusing documentation still exists.

@k8s-ci-robot removed the lifecycle/stale label on Nov 9, 2020
@fejta-bot commented:

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label on Feb 7, 2021
@mltsy (Contributor, Author) commented Feb 8, 2021

/remove-lifecycle stale

I think this could be resolved by replacing:
"On a basic Kubernetes cluster, there are no voluntary disruptions at all."
with:
"On a basic Kubernetes cluster, there are no automated voluntary disruptions."

Although, is this actually true? The text mentions "Removing a pod from a node to permit something else to fit on that node," which I believe is something a basic Kubernetes cluster will do if you try to deploy something that doesn't currently fit on any node but could fit if other pods were shifted around... isn't it? That strikes me as an automated process (albeit with a manual catalyst) that causes a voluntary disruption.

If that's the case, it might be better to say: "Rescheduling (moving) other pods during a deployment is the only voluntary disruption that may be considered automated (or indirectly triggered at least) on a basic Kubernetes cluster." Or maybe more succinctly and more to the point... "Every multi-node cluster is, by default, subject to some voluntary disruption"

@k8s-ci-robot removed the lifecycle/stale label on Feb 8, 2021
@sftim (Contributor) commented Feb 8, 2021

Removing a pod from a node to permit something else to fit on that node.

AIUI Kubernetes does not come with a descheduler, but you can add one in, such as: https://github.com/kubernetes-sigs/descheduler
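To illustrate what "adding one in" involves, the descheduler is configured with a policy object roughly like the sketch below. This follows the v1alpha1 policy format; treat the exact field and strategy names as assumptions to verify against the descheduler README for the version you run.

```yaml
# Rough sketch of a DeschedulerPolicy (v1alpha1-style); field and strategy
# names vary between descheduler releases, so check the project's docs.
apiVersion: "descheduler/v1alpha1"
kind: "DeschedulerPolicy"
strategies:
  "RemoveDuplicates":           # evict duplicate pods crowded onto one node
    enabled: true
  "LowNodeUtilization":         # rebalance pods off overloaded nodes
    enabled: true
    params:
      nodeResourceUtilizationThresholds:
        thresholds:             # nodes below these are considered underutilized
          "cpu": 20
          "memory": 20
        targetThresholds:       # nodes above these are candidates for eviction
          "cpu": 50
          "memory": 50
```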

/sig scheduling

@k8s-ci-robot added the sig/scheduling label on Feb 8, 2021
@mltsy (Contributor, Author) commented Feb 8, 2021

Ah! Okay, well that simplifies it then. (I'm using GKE, so it must use a custom descheduler if Kubernetes doesn't come with a standard one that has this behavior.) It looks like the default plugin that determines this behavior is the DefaultPreemption plugin, which only evicts pods when pod priorities are set, and that is not the default, so I think it's safe to say preemption is not a "default" behavior (since you have to enable it by creating and using PriorityClasses).
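For anyone following along, enabling preemption means creating a PriorityClass and referencing it from a Pod spec; the sketch below is a minimal, hypothetical example (the class name, value, and image are placeholders).

```yaml
# Hypothetical PriorityClass; without objects like this, pods all get the
# default priority (0) and the scheduler has nothing to preempt for.
apiVersion: scheduling.k8s.io/v1
kind: PriorityClass
metadata:
  name: high-priority          # placeholder class name
value: 100000                  # higher value = higher scheduling priority
globalDefault: false
description: "May preempt lower-priority pods when nodes are full."
---
# A pod opts in by naming the class; only then can scheduling it evict others.
apiVersion: v1
kind: Pod
metadata:
  name: important-app          # placeholder pod name
spec:
  priorityClassName: high-priority
  containers:
  - name: app
    image: example.com/app:latest   # placeholder image
```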

So, how about: "On a basic Kubernetes cluster, there are no automated voluntary disruptions (only user-triggered ones)."

@sftim (Contributor) commented Mar 3, 2021

So, how about: "On a basic Kubernetes cluster, there are no automated voluntary disruptions (only user-triggered ones)."

(Sounds good to me)

mltsy pushed a commit to mltsy/website-1 that referenced this issue Mar 4, 2021
mltsy added a commit to mltsy/website-1 that referenced this issue Mar 4, 2021
@fejta-bot commented:

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label on Jun 1, 2021
@mltsy (Contributor, Author) commented Jun 2, 2021

This is resolved :)

@mltsy closed this as completed on Jun 2, 2021