Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix gpu sample bug and add same notebook sample #605

Closed
wants to merge 0 commits into from

Conversation

hongye-sun
Copy link
Contributor

@hongye-sun hongye-sun commented Jan 2, 2019

This change is Reviewable

@hongye-sun
Copy link
Contributor Author

/assign @Ark-kun

Copy link
Contributor

@qimingj qimingj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@hongye-sun
Copy link
Contributor Author

/test kubeflow-pipeline-build-image

@Ark-kun
Copy link
Contributor

Ark-kun commented Jan 3, 2019

/lgtm

@Ark-kun
Copy link
Contributor

Ark-kun commented Jan 3, 2019

/test kubeflow-pipeline-sample-test

@@ -102,7 +102,7 @@ def kubeflow_training(output, project,
# TODO: use the argo job name as the workflow
workflow = '{{workflow.name}}'
# set the flag to use GPU trainer
use_gpu = True
use_gpu = False
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why change the use_gpu to false?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By default, the gpu is not enabled in kubeflow's deployment. I am going to add a doc about how to enable gpu in new/existing cluster and will put the doc link in comment here.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SG

@hongye-sun
Copy link
Contributor Author

/test kubeflow-pipeline-sample-test

@k8s-ci-robot
Copy link
Contributor

New changes are detected. LGTM label has been removed.

@k8s-ci-robot k8s-ci-robot removed the lgtm label Jan 3, 2019
@hongye-sun
Copy link
Contributor Author

/assign @Ark-kun @qimingj
Need another lgtm label. Thanks.

@gaoning777
Copy link
Contributor

gaoning777 commented Jan 4, 2019

Discussed offline that we need to add sample tests to cover this GPU testing.
but feel free to push in another PR

@hongye-sun
Copy link
Contributor Author

/test kubeflow-pipeline-build-image

@hongye-sun
Copy link
Contributor Author

The ml-pipeline-test doesn't have gpu quota. Requesting quota will take several days. I will revert my testing changes and continue without gpu disabled in sample in this PR.

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

1 similar comment
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot
Copy link
Contributor

@hongye-sun: The following test failed, say /retest to rerun them all:

Test name Commit Details Rerun command
kubeflow-pipeline-e2e-test 5abc1a4 link /test kubeflow-pipeline-e2e-test

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants