-
Notifications
You must be signed in to change notification settings - Fork 402
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix config frequent update #1014
Fix config frequent update #1014
Conversation
…rve application Signed-off-by: Sihan Wang <[email protected]>
d1ab8c7
to
4cb0037
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This change needs to be unit tested. I'm approving this for now since there are users that need this fix to go through asap, and the current testing framework doesn't let us easily test timing interactions with the runtime_env
. We should follow up with a PR that adds tests.
What's the relationship between runtime env preparation and the value of |
When the runtime env take very long time (e.g. user has a long list with pip install), serve will not provide deployment status since there is no serve deployment starting yet. |
Sync with @sihanwang41 offline.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Wait for the testing result of test_sample_rayservice_yamls.py
.
Ran the test_sample_rayservice_yamls.py, there are
it is consistent error even with latest master. I don't think it is caused by this pr. |
Sync with Sihan offline. This error is due to use x86 image on Mac M1. |
…rve application (ray-project#1014) Ignore deployments status to decide whether to deploy serve application
Why are these changes needed?
Current logic will cause the ray-operator keep sending requests when there is no deployment statuses. When the runtime env preparation time is long, ray-operator will keep redeploying the ray serve cluster.
So after ray-operator sends the config request, it will eventually get the status from ray serve.
Collaborating with customer to fix the runtime env issue, and verify RayService can be started with this change.
Config:
Change
serveConfig
inray-operator/config/samples/ray_v1alpha1_rayservice.yaml
toRelated issue number
Checks