
[train][docs] update docstrings/quickstarts to work when use_gpu=True #31692

Merged 4 commits into ray-project:master on Jan 26, 2023

Conversation

matthewdeng (Contributor)

Signed-off-by: Matthew Deng [email protected]

Fixes Trainer docstrings and quickstarts to work when use_gpu=True.

Why are these changes needed?

  1. Updated iter_torch_batches to move batches to the proper device (see the sketch after this list).
    1. This is needed for both the Torch and Horovod quickstarts, which use iter_torch_batches.
    2. This is not needed for the TensorFlow and HuggingFace quickstarts, which handle device transfer natively.
  2. Extracted use_gpu to the top of each code snippet so readers can toggle it in one place.
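
For concreteness, here is a minimal sketch of the resulting pattern, using the Torch quickstart as an example and assuming the AIR-era APIs these snippets are built on (session.get_dataset_shard, iter_torch_batches with a device argument, and ray.train.torch.get_device). The toy dataset and batch size are made up for illustration; the actual snippets are in the diff.

import ray.data
from ray.air import session
from ray.air.config import ScalingConfig
from ray.train.torch import TorchTrainer, get_device

# If using GPUs, set this to True.
use_gpu = False

def train_loop_per_worker():
    # Each worker iterates over its shard of the dataset. Passing device=
    # moves every batch onto this worker's device (CPU, or GPU when use_gpu=True).
    dataset_shard = session.get_dataset_shard("train")
    for batch in dataset_shard.iter_torch_batches(batch_size=32, device=get_device()):
        pass  # training step on `batch` goes here

trainer = TorchTrainer(
    train_loop_per_worker=train_loop_per_worker,
    scaling_config=ScalingConfig(num_workers=2, use_gpu=use_gpu),
    datasets={"train": ray.data.from_items([{"x": float(i), "y": 2.0 * i} for i in range(64)])},
)
result = trainer.fit()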

Related issue number

Closes #31684

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

@matthewdeng changed the title from "Train quickstart gpu" to "[train][docs] update docstrings/quickstarts to work when use_gpu=True" on Jan 16, 2023
@Yard1 (Member) left a comment

Orthogonal to this, but I think that if we detect we are in a train session, iter_torch_batches should automatically use the default device unless specified otherwise.
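
Purely to illustrate that idea (this is not Ray's implementation; the two helper stubs below are made-up placeholders for the real session plumbing), the resolution order could look something like:

from typing import Optional, Union
import torch

def _in_train_session() -> bool:
    # Placeholder: "are we running inside a Ray Train worker?"
    return False

def _session_default_device() -> torch.device:
    # Placeholder: the device Ray Train assigned to this worker.
    return torch.device("cuda", 0)

def resolve_device(device: Optional[Union[str, torch.device]] = None) -> torch.device:
    # Decide where iter_torch_batches should put batches.
    if device is not None:
        return torch.device(device)       # an explicit caller choice always wins
    if _in_train_session():
        return _session_default_device()  # otherwise default to the worker's device
    return torch.device("cpu")            # outside a train session, stay on CPU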

@@ -12,6 +12,10 @@
from ray.train.huggingface import HuggingFaceTrainer
from ray.air.config import ScalingConfig


# If using GPUs, set this to True.
use_gpu = False
Contributor

A dumb yet effective trick for cases where you want to show users that they can use GPUs, but want CPUs on CI:

use_gpu = True  # include in docs

use_gpu = False # exclude

<code using use_gpu>
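
As a sketch of how that trick could be wired up in a doc-code file (the file name and marker comments below are hypothetical, not the markers the Ray docs actually use): the docs render only the marked regions, while CI executes the whole file, so the second assignment wins at test time.

# torch_quickstart.py (hypothetical doc-code file run by CI)

# __quickstart_begin__
# If using GPUs, set this to True.
use_gpu = True
# __quickstart_end__
use_gpu = False  # executed in CI, but outside the region rendered in the docs

# __quickstart_continued_begin__
# ... rest of the example (which reads `use_gpu`), inside its own rendered region ...
# __quickstart_continued_end__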

@matthewdeng (Contributor, Author)

Orthogonal to this, but I think that if we detect we are in a train session, iter_torch_batches should automatically use the default device unless specified otherwise.

@Yard1 yeah I was thinking the same thing, cc @stephanie-wang this could be a nice extension to the DatasetIterator!

@Yard1 (Member)

Yard1 commented Jan 18, 2023

I've got a PR here, comments appreciated - #31745

@matthewdeng (Contributor, Author)

matthewdeng commented Jan 26, 2023

@amogkam let me know how you want to coordinate this with #31753. If you're able to get your changes in by 2.3 I can update this PR to not set device.

@amogkam (Contributor)

amogkam commented Jan 26, 2023

Ah, sorry - let's get this in ASAP. Let me know when it's ready to merge.

I can update this documentation in that PR.

@matthewdeng (Contributor, Author)

@amogkam I think it's good to merge as is!

@amogkam amogkam merged commit da79ae9 into ray-project:master Jan 26, 2023

Successfully merging this pull request may close these issues.

[Train] Ray Train PyTorch documentation example does not work out of the box with GPUs