You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
test_sample_rayservice_yamls.py is very flaky (~20% to fail). Previously, it was very stable. #731 said that "I have run more than ten times on my cluster, and the result is always pass.".
[root cause]: RayServiceAddCREvent fails to converge due to the absence of rayservice-sample-serve-svc in 90 seconds. It is too slow.
The RayServiceAddCREvent converges in 32 seconds. I guessed the root cause of this issue is RAY_IMAGE. The current test framework will not replace the RAY_IMAGE automatically, so the Pod will pull the image from DockerHub instead if RAY_IMAGE is not the same as the value of image in the YAML file.
Search before asking
KubeRay Component
ray-operator
What happened + What you expected to happen
See #837 for more details.
test_sample_rayservice_yamls.py is very flaky (~20% to fail). Previously, it was very stable. #731 said that "I have run more than ten times on my cluster, and the result is always pass.".
rayservice-sample-serve-svc
in 90 seconds. It is too slow.Reproduction script
Run the following command several times.
Anything else
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: