-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wait for job does not work as expected #60
Comments
I most definitely use it to wait for a job to be completed. example: - name: wait-for-onezone
image: {{ .Values.wait_for.image }}
imagePullPolicy: {{ template "imagePullPolicy" dict "root" . "context" .Values.wait_for }}
args:
- "job"
- "{{ template "onezone_name" . }}-ready-check" Please try image |
Will do. Thanks to the quick response. |
Version 1.5.1 works as expected. I'm not in production yet so I'm willing to help isolate the issue. I'll try a 1.6 version tomorrow and let you know the results. |
Had a hunch and it was right that's a diff between < Start Time: Wed, 21 Sep 2022 11:03:23 +0200
< Pods Statuses: 1 Active / 0 Succeeded / 0 Failed
---
> Start Time: Wed, 21 Sep 2022 09:03:23 +0000
> Pods Statuses: 1 Running / 0 Succeeded / 0 Failed They changed Running to Active... not sure how it could break the code yet, since it uses regexp-es that should be ok with that... |
Version 1.6 does not work. I diff'd wait_for.sh and don't see anything that would change its behavior. v1.5.1 uses kubectl 1.21.0 and v1.6 uses kubectl 1.24.0 so there is probably a change there. |
Switched to |
Got hit by this as well, switched to |
Also got hit by this in v1.7, is someone working on a fix? |
I found the problem. After all, the regexp was not working after k8s changed this: Pods Statuses: 0 Running / 1 Succeeded / 0 Failed
Pods Statuses: 1 Active (0 Ready) / 0 Succeeded / 0 Failed The change is connected with feature gate JobReadyPods that as far as I find, was introduced k8s v1.23. It adds As far as I understand Furthermore, it seems that @fdutton , @anleib, @DARB-CCM-S-20, @stephenpope if you could possibly share on which k8s version did you experience your problems? So that we can be sure that my conclusions here are correct. |
@groundnuty Great work! 1.24 for me. I've internalized v1.7 for now and changed to |
I am on 1.24 K8s as well |
Running v1.24.14 and ended up having to use v1.5.1- newer versions just completed immediately |
I was expecting for this app to wait until a job completed successfully but it only waited for the job to be ready. Am I misunderstanding something?
This is a portion of my deployment resource and I have verified that my job runs to completion and exits with a status code of 0.
The text was updated successfully, but these errors were encountered: