/tests/assets/crs/experiments/ are out of date #214

Closed
DnPlas opened this issue Jul 10, 2024 · 1 comment · Fixed by #222
Labels
bug Something isn't working

Comments

Contributor

DnPlas commented Jul 10, 2024

Bug Description

It looks like the experiments in that directory are out of date. I noticed this while testing on a live deployment, where one of the experiments failed with the following kubelet event: Normal BackOff 15s (x111 over 25m) kubelet Back-off pulling image "docker.io/kubeflowkatib/mxnet-mnist:v0.17.0-rc.1".

We need to make sure all experiments in that directory are up to date with the upstream examples.
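As a rough sketch of how the directory could be audited, the snippet below collects every `image:` reference from the YAML files so the tags can be compared by hand against the upstream examples. The helper name `collect_images` and the line-based regex matching are assumptions made for illustration, not part of this repository's tooling; a proper YAML parser would be more robust.

```python
import re
from pathlib import Path

# Matches lines such as `image: repo/name:tag` or `- image: repo/name:tag`.
# Quotes around the value are optional; trailing comments are ignored.
IMAGE_RE = re.compile(r"^\s*(?:-\s*)?image:\s*[\"']?([^\s\"'#]+)")


def collect_images(directory):
    """Return a sorted list of unique image references found in YAML files."""
    images = set()
    for path in Path(directory).rglob("*"):
        # Only look at regular files with a YAML extension.
        if not path.is_file() or path.suffix not in {".yaml", ".yml"}:
            continue
        for line in path.read_text(encoding="utf-8").splitlines():
            match = IMAGE_RE.match(line)
            if match:
                images.add(match.group(1))
    return sorted(images)
```

Calling `collect_images("tests/assets/crs/experiments")` from the repository root would list the images the experiment CRs currently reference; whether each tag still exists upstream has to be checked separately.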

To Reproduce

  1. Deploy Kubeflow: juju deploy kubeflow --channel 1.9/beta --trust
  2. Access the dashboard and create a new experiment
  3. Wait for it to start running
     kubectl get experiments -ntest
     NAMESPACE   NAME                    TYPE      STATUS   AGE
     test        bayesian-optimization   Running   True     32m
  4. List the pods in the namespace and describe the problematic one
     kubectl get pods -ntest | grep baye
     bayesian-optimization-bayesianoptimization-78dcf9b7f9-wb8fq   1/1     Running            0          34m
     bayesian-optimization-mk9ptnxd-2v6zm                          2/3     ImagePullBackOff   0          34m
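
The last step can also be scripted. The following is a minimal sketch, assuming the default `kubectl get pods` column order (NAME, READY, STATUS, RESTARTS, AGE); the function name `failing_image_pulls` is hypothetical and only filters text that has already been captured from the command.

```python
def failing_image_pulls(kubectl_output):
    """Return names of pods whose STATUS indicates a failed image pull."""
    bad_statuses = {"ImagePullBackOff", "ErrImagePull"}
    names = []
    for line in kubectl_output.splitlines():
        fields = line.split()
        # Assumed columns: NAME READY STATUS RESTARTS AGE
        if len(fields) >= 3 and fields[2] in bad_statuses:
            names.append(fields[0])
    return names


# Sample taken from the output shown in this report.
sample = """\
bayesian-optimization-bayesianoptimization-78dcf9b7f9-wb8fq   1/1     Running            0          34m
bayesian-optimization-mk9ptnxd-2v6zm                          2/3     ImagePullBackOff   0          34m
"""
print(failing_image_pulls(sample))
```

Each returned name can then be passed to `kubectl describe pod <name> -ntest` to see which image failed to pull.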

Environment

microk8s 1.29-strict/stable
juju 3.4/stable (3.4.4)

Relevant Log Output

Normal  BackOff  15s (x111 over 25m)  kubelet  Back-off pulling image "docker.io/kubeflowkatib/mxnet-mnist:v0.17.0-rc.1"

Additional Context

No response

DnPlas added the bug label Jul 10, 2024

Thank you for reporting your feedback to us!

The internal ticket has been created: https://warthogs.atlassian.net/browse/KF-5984.

This message was autogenerated

orfeas-k added a commit that referenced this issue Jul 30, 2024
Update CRs according to upstream examples.

Closes #214