Skip to content

Commit

Permalink
TEP-0065: Retry failed tasks on demand in a pipeline, feedback
Browse files Browse the repository at this point in the history
  • Loading branch information
ScrapCodes committed May 10, 2021
1 parent 2454c29 commit 9d849a9
Showing 1 changed file with 2 additions and 10 deletions.
12 changes: 2 additions & 10 deletions teps/0065-retry-failed-tasks-on-demand.md
Original file line number Diff line number Diff line change
Expand Up @@ -84,8 +84,8 @@ individually, mark tests for rerun.

1. CI/CD use case, manually rerun all the failed jobs for a particular pull
request.
2. As a backend for Kubeflow Pipelines (KFP), one would want to manually retry a failed
`pipelineRun` to optimally use the resources. KFP already implemented this feature as the [pipeline 'retry' API](https://www.kubeflow.org/docs/components/pipelines/reference/api/kubeflow-pipeline-api-spec/#operation--apis-v1beta1-runs--run_id--retry-post) on the Argo backend.
2. As a backend for kubeflow, one would want to manually retry a failed
`pipelineRun` to optimally use the resources.
3. If the pipeline failed due to external factors such as cluster node
failure and image registry disconnect, user can retry from the same
stage at a later time.
Expand Down Expand Up @@ -199,14 +199,6 @@ Why should this TEP _not_ be implemented?

## Alternatives

1. One of design alternative can be, instead of giving an API dedicated for retry.
Allow setting the status of failed pipeline to something like:
[TEP-0015 Pending](0015-pending-pipeline.md).
This may be cumbersome, as opposed to
an API for retry. Also retry is also used by other pipeline engines e.g.
[Kubeflow pipelines](https://www.kubeflow.org/docs/components/pipelines/reference/api/kubeflow-pipeline-api-spec/#operation--apis-v1beta1-runs--run_id--retry-post)


<!--
What other approaches did you consider and why did you rule them out? These do
not need to be as detailed as the proposal, but should include enough
Expand Down

0 comments on commit 9d849a9

Please sign in to comment.