TaskRun support stop/wait status #2217

withlin · 2020-03-12T03:56:16Z

Support a stop/wait status, which will stop (pause) the TaskRun Pod.

Idea:

1. The user sets the task spec status to a stop/wait status.
2. A CRD finds out which machine the current TaskRun Pod is on.
3. It then gets the running container ID.
4. It executes kill -STOP ${pid}, which puts the Pod on hold (using a DaemonSet).
5. When the task status is set to rerunning, it executes kill -CONT ${pid}.
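
To make steps 4 and 5 concrete, here is a minimal sketch of the signal part only. It assumes an agent on the same node (for example one DaemonSet Pod per node) that already knows the host PID of the container process and is allowed to signal it. The function names are hypothetical, and looking up the PID from the container ID is not shown.

```python
import os
import signal


def pause_process(pid: int) -> None:
    """Suspend the process, equivalent to `kill -STOP ${pid}`."""
    os.kill(pid, signal.SIGSTOP)


def resume_process(pid: int) -> None:
    """Let a suspended process continue, equivalent to `kill -CONT ${pid}`."""
    os.kill(pid, signal.SIGCONT)
```

SIGSTOP cannot be caught or ignored by the target process, so the container does not need to cooperate. The process keeps its memory while it is stopped.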

Do you have any better ideas?

withlin · 2020-03-12T03:57:04Z

/cc @vdemeester @sbwsg

vdemeester · 2020-03-12T07:17:31Z

/kind feature

ghost · 2020-03-12T11:50:26Z

It would be good to know a bit more about the intended use case for this feature. Why do you want it?

Is it to support something like #2159 ?

In general I think we would not want the Pod hanging around consuming resources while the TaskRun is waiting. Ideally when a TaskRun begins waiting its resources are released so that other tasks can be scheduled while it's sleeping. When the TaskRun is told to continue it would spin up a new Pod at the point where it left off.

The use cases for this behaviour (that I know of) are often systems that run for 24 hours or more. Examples include:

- Manual approvals that are performed by teams operating in different timezones
- Approvals that trigger actions which are not allowed on the last day of the week and so must wait until the start of the following week
- Canary analysis systems that observe metrics over a long period of time before making decisions
- Global rollouts that span multiple days

In all of these cases it would be much better if the TaskRun released its resources while in a paused state.
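
As a rough illustration of that behaviour, here is a toy model, not Tekton code. It assumes the only state that must survive a pause is the index of the next step to run. The `PausableRun` class and all of its fields are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PausableRun:
    """Hypothetical model of a run that gives up its Pod while paused."""
    steps: List[str]
    next_step: int = 0           # checkpoint: index of the first step not yet run
    pod: Optional[str] = None    # name of the Pod currently held, if any
    pods_created: int = 0
    log: List[str] = field(default_factory=list)

    def _start_pod(self) -> None:
        self.pods_created += 1
        self.pod = f"pod-{self.pods_created}"

    def run_until(self, pause_before: Optional[str] = None) -> None:
        """Run steps in order, pausing before the named step if one is given."""
        if self.pod is None:
            self._start_pod()    # resuming (or starting) always takes a fresh Pod
        while self.next_step < len(self.steps):
            step = self.steps[self.next_step]
            if step == pause_before:
                self.pod = None  # release the Pod so its resources are freed
                return
            self.log.append(f"{step}@{self.pod}")
            self.next_step += 1
        self.pod = None          # run finished, nothing left to hold
```

Calling `run_until(pause_before="approve")` on a run with steps `build`, `approve`, `deploy` executes `build` and then releases the Pod. A later `run_until()` picks up at `approve` in a new Pod.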

laik · 2020-03-12T14:11:42Z

We tested this on a Pod during a TaskRun: we sent a stop signal to the container's process by its system PID. At that point the process stops and no longer puts any load on the machine. I think this is an effective way to control how resources are used. If there were also a rerun function, it would be even better.

laik · 2020-03-12T14:12:20Z

/cc @sbwsg

ghost · 2020-03-12T14:18:39Z

> the process stops and does not take up the load

Curious about this - What state does the Pod enter into? Does Kubernetes release the resource quota it has set aside for the Pod?

laik · 2020-03-12T14:46:51Z

> the process stops and does not take up the load
>
> Curious about this - What state does the Pod enter into? Does Kubernetes release the resource quota it has set aside for the Pod?

The Pod is still running, but its process no longer does any work, much like suspending and resuming a job with fg/bg in a shell. My idea is to record a paused state in the TaskRun.

withlin · 2020-03-20T09:36:11Z

/assign

tekton-robot · 2020-11-09T05:54:08Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close.

/lifecycle stale

Send feedback to tektoncd/plumbing.

vdemeester · 2020-11-09T06:55:29Z

/remove-lifecycle stale

tekton-robot · 2021-02-07T07:18:08Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot · 2021-03-09T08:05:11Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
Rotten issues close after an additional 30d of inactivity.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle rotten

Send feedback to tektoncd/plumbing.

tekton-robot · 2021-05-07T16:40:46Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen with a justification.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

tekton-robot · 2021-05-07T16:40:47Z

@tekton-robot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen with a justification.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tekton-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Mar 12, 2020

dibyom added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Mar 12, 2020

tekton-robot assigned withlin Mar 20, 2020

vdemeester removed the priority/backlog Higher priority than priority/awaiting-more-evidence. label Jun 29, 2020

FogDong mentioned this issue Sep 14, 2020

Feat: Add pause condition in pipeline run #3223

Closed

4 tasks

withlin mentioned this issue Sep 16, 2020

TEP-0015 - Add a pending setting to Tekton PipelineRun and TaskRuns tektoncd/community#203

Merged

tekton-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 9, 2020

tekton-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 9, 2020

tekton-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 7, 2021

tekton-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 9, 2021

tekton-robot closed this as completed May 7, 2021
