[Doc] [KubeRay] Add end-to-end tutorial for real-world RayJob workload (batch inference) #38857
…nchmark.md Co-authored-by: angelinalg <[email protected]> Signed-off-by: Kai-Hsun Chen <[email protected]>
Do you want to add this example to the Examples Gallery? Instructions are at go/example-gallery.
Thanks for the review!
@angelinalg I'm not sure how to decide. Should all examples be in the gallery, or are there criteria? Also, would you tag this with code-example or tutorial?
…example.md Co-authored-by: angelinalg <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]>
The test failure in tests:test_object_assign_owner_client_mode is unrelated.
…d (batch inference) (ray-project#38857) This PR adds a tutorial for running a batch inference workload on KubeRay using the RayJob CRD. It also updates the GPU/GKE doc (which is used as a subroutine in this tutorial) to remove the instructions related to taints and tolerations and GPU driver installation, both of which are currently handled automatically by GKE. --------- Signed-off-by: Kai-Hsun Chen <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: angelinalg <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]>
…38857) (#39186) * [Doc] [KubeRay] Add tutorial for connecting to google cloud storage bucket from GKE RayCluster (#38858) This PR adds a self contained tutorial for connecting to a google cloud storage bucket. (Mostly self contained, we do link out to the google cloud docs for creating a bucket.) --------- Signed-off-by: Kai-Hsun Chen <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: angelinalg <[email protected]> * [Doc] [KubeRay] Add end-to-end tutorial for real-world RayJob workload (batch inference) (#38857) This PR adds a tutorial for running a batch inference workload on KubeRay using the RayJob CRD. It also updates the GPU/GKE doc (which is used as a subroutine in this tutorial) to remove the instructions related to taints and tolerations and GPU driver installation, both of which are currently handled automatically by GKE. --------- Signed-off-by: Kai-Hsun Chen <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: angelinalg <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]> --------- Signed-off-by: Kai-Hsun Chen <[email protected]> Signed-off-by: Archit Kulkarni <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: Kai-Hsun Chen <[email protected]> Co-authored-by: angelinalg <[email protected]>
Why are these changes needed?
This PR adds a tutorial for running a batch inference workload on KubeRay using the RayJob CRD.
It also updates the GPU/GKE doc (which is used as a subroutine in this tutorial) to remove the instructions related to taints and tolerations and GPU driver installation, both of which are currently handled automatically by GKE.
Related issue number
Checks
… git commit -s) in this PR.
… scripts/format.sh to lint the changes in this PR.
… method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.