[Doc] Workspace template examples #32802

justinvyu · 2023-02-24T02:25:36Z

Why are these changes needed?

This PR adds an initial set of examples that can be used as a template for users to get started with certain workloads.

This includes:

Batch prediction w/ AIR
Many model training w/ AIR
Serving a stable diffusion model

For each example, we include:

Jupyter notebook
(Optional) requirements.txt for additional dependencies
Entry in templates.yaml
Entry in release tests yaml

See the contributing guide to see the steps needed to add a new template.

TODOs

Finish READMEs
Create hello world vs. production versions for all examples. Currently, there's just one version of each.
Add release tests

To add in a follow-up PR

Full Ray cluster launcher support
- Guide on how to run these templates on OSS cluster
- Show these templates somewhere in OSS docs
- Create an script to convert OSS cluster config -> Anyscale configs

Related issue number

N/A

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Justin Yu <[email protected]>

into doc/example_templates

Signed-off-by: Justin Yu <[email protected]>

…example_templates

Signed-off-by: Justin Yu <[email protected]>

…example_templates

Signed-off-by: Justin Yu <[email protected]>

…example_templates

Signed-off-by: Justin Yu <[email protected]>

doc/source/examples/01_batch_inference/batch_inference.py

ericl

Let's actually do two things for now:

Don't use BatchPredictor or Checkpoint APIs.
Instead, use the Dataset ActorPool APIs and callable classes.

We are having some internal discussions on possibly deprecating / unifying BatchPredictor: #32929

doc/source/examples/02_many_model_training/many_model_training.py

richardliaw · 2023-03-16T18:46:27Z

doc/source/conf.py

@@ -160,6 +160,7 @@
    "_build",
    "source/workflows/api/doc/ray.workflow.*",
    "source/serve/api/doc/ray.serve.*",
+    "source/templates",


maybe source/templates/*?

Signed-off-by: Justin Yu <[email protected]>

…example_templates

ericl · 2023-03-16T19:32:08Z

doc/source/templates/01_batch_inference/batch_inference.ipynb

+    "NUM_WORKERS: int = 1\n",
+    "\n",
+    "USE_GPU: bool = True\n",
+    "NUM_GPUS_PER_WORKER: float = 1\n"


This cell is repeated twice.

This is how I'm getting around loading a small vs. large version of the template. On product side, we can run some post init command to process the notebook:

Filter out all notebook cells tagged as large. Vice versa for large scale examples. This way, the configurations get set to use the small example defaults.

jupyter nbconvert --TagRemovePreprocessor.remove_input_tags='large' --to notebook --output batch_inference.ipynb batch_inference.ipynb

How does that sound @ericl?

Can we comment these cells like # This is the large case, etc?

Yes, sounds good!

ericl · 2023-03-16T19:34:21Z

doc/source/templates/01_batch_inference/batch_inference.ipynb

+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "preds = predictions.fully_executed()\n",


Suggested change

"preds = predictions.fully_executed()\n",

"preds = predictions.cache()\n",

The fully executed call is deprecated, use cache() instead.

doc/source/templates/02_many_model_training/many_model_training.ipynb

Signed-off-by: Justin Yu <[email protected]>

…example_templates

Signed-off-by: Justin Yu <[email protected]>

…example_templates

Signed-off-by: Edward Oakes <[email protected]>

Signed-off-by: chaowang <[email protected]>

Signed-off-by: elliottower <[email protected]>

Signed-off-by: Jack He <[email protected]>

justinvyu and others added 25 commits February 16, 2023 11:15

Check in example directories

fb94c45

Signed-off-by: Justin Yu <[email protected]>

Add example code to test

636d95b

Signed-off-by: Justin Yu <[email protected]>

Fix batch inference + MMT examples to run properly

ebde3ec

Signed-off-by: Justin Yu <[email protected]>

Add working hello world version of serving stable diffusion

e0cc6f1

Signed-off-by: Justin Yu <[email protected]>

Fix batch prediction example to use numpy

32d50eb

Signed-off-by: Justin Yu <[email protected]>

Update README.md

3d45c13

Signed-off-by: Justin Yu <[email protected]>

Add example image

bd1072e

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'doc/example_templates' of https://github.com/justinvyu/ray

144081e

into doc/example_templates

Add requirements.txt to stable diffusion example

4ad8594

Signed-off-by: Justin Yu <[email protected]>

Add readme skeletons

979663c

Signed-off-by: Justin Yu <[email protected]>

Fix requirements (pin numpy + tensorboard)

53bde79

Signed-off-by: Justin Yu <[email protected]>

Try out runtime env for dependencies

ee68212

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

32f1aaa

…example_templates

Working serve example with runtime env

8f22df6

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

f10ac4b

…example_templates

Load from requirements.txt file

73e0076

Signed-off-by: Justin Yu <[email protected]>

Add cluster env, compute config, templates.yaml for hello world examples

e136726

Signed-off-by: Justin Yu <[email protected]>

Add 'large scale' stable diffusion serving example

02ded0c

Signed-off-by: Justin Yu <[email protected]>

Reorganize configs to reduce duplication

36b2254

Signed-off-by: Justin Yu <[email protected]>

Update mmt example with larger version

3643274

Signed-off-by: Justin Yu <[email protected]>

Update batch inference example with image workload + large scale version

db4b688

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

4f1babd

…example_templates

Fix lint

3a677ac

Signed-off-by: Justin Yu <[email protected]>

Update templates.yaml

242fcdb

Signed-off-by: Justin Yu <[email protected]>

Make batch inference work for small scale

26f1f4e

Signed-off-by: Justin Yu <[email protected]>

ericl reviewed Mar 10, 2023

View reviewed changes

doc/source/examples/01_batch_inference/batch_inference.py Outdated Show resolved Hide resolved

ericl requested changes Mar 10, 2023

View reviewed changes

ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Mar 10, 2023

ericl self-assigned this Mar 10, 2023

ericl reviewed Mar 10, 2023

View reviewed changes

doc/source/examples/02_many_model_training/many_model_training.py Outdated Show resolved Hide resolved

justinvyu assigned richardliaw Mar 16, 2023

justinvyu requested a review from richardliaw March 16, 2023 16:45

justinvyu assigned matthewdeng Mar 16, 2023

richardliaw requested a review from ericl March 16, 2023 17:04

richardliaw reviewed Mar 16, 2023

View reviewed changes

richardliaw approved these changes Mar 16, 2023

View reviewed changes

justinvyu added 2 commits March 16, 2023 11:50

Try fixing exclude pattern

05d0957

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

40d50ae

…example_templates

ericl reviewed Mar 16, 2023

View reviewed changes

doc/source/templates/02_many_model_training/many_model_training.ipynb Show resolved Hide resolved

ericl approved these changes Mar 16, 2023

View reviewed changes

ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Mar 16, 2023

justinvyu added 5 commits March 16, 2023 14:49

fully_executed -> cache, clarify duplicate cells

47472a5

Signed-off-by: Justin Yu <[email protected]>

Fix paths to be relative to *source* dir

c1daf10

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

f9eb7fb

…example_templates

Remove previously useless exclude patterns

45b00d6

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

2947cd4

…example_templates

justinvyu added tests-ok The tagger certifies test failures are unrelated and assumes personal liability. and removed @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. labels Mar 19, 2023

richardliaw changed the title ~~[Doc][Anyscale] Workspace template examples~~ [Doc] Workspace template examples Mar 20, 2023

richardliaw merged commit ed2982f into ray-project:master Mar 20, 2023

edoakes pushed a commit to edoakes/ray that referenced this pull request Mar 22, 2023

[Doc] Workspace template examples (ray-project#32802)

dde6f1b

Signed-off-by: Edward Oakes <[email protected]>

clarng pushed a commit to clarng/ray that referenced this pull request Mar 23, 2023

[Doc] Workspace template examples (ray-project#32802)

3117186

chaowanggg pushed a commit to chaowanggg/ray-dev that referenced this pull request Apr 4, 2023

[Doc] Workspace template examples (ray-project#32802)

0cfe1fd

Signed-off-by: chaowang <[email protected]>

elliottower pushed a commit to elliottower/ray that referenced this pull request Apr 22, 2023

[Doc] Workspace template examples (ray-project#32802)

8668841

Signed-off-by: elliottower <[email protected]>

ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request May 4, 2023

[Doc] Workspace template examples (ray-project#32802)

7bac030

Signed-off-by: Jack He <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc] Workspace template examples #32802

[Doc] Workspace template examples #32802

justinvyu commented Feb 24, 2023 •

edited

Loading

ericl left a comment

richardliaw Mar 16, 2023

ericl Mar 16, 2023

justinvyu Mar 16, 2023

ericl Mar 16, 2023

justinvyu Mar 16, 2023

ericl Mar 16, 2023

	"preds = predictions.fully_executed()\n",
	"preds = predictions.cache()\n",

[Doc] Workspace template examples #32802

[Doc] Workspace template examples #32802

Conversation

justinvyu commented Feb 24, 2023 • edited Loading

Why are these changes needed?

TODOs

To add in a follow-up PR

Related issue number

Checks

ericl left a comment

Choose a reason for hiding this comment

richardliaw Mar 16, 2023

Choose a reason for hiding this comment

ericl Mar 16, 2023

Choose a reason for hiding this comment

justinvyu Mar 16, 2023

Choose a reason for hiding this comment

ericl Mar 16, 2023

Choose a reason for hiding this comment

justinvyu Mar 16, 2023

Choose a reason for hiding this comment

ericl Mar 16, 2023

Choose a reason for hiding this comment

justinvyu commented Feb 24, 2023 •

edited

Loading