[Doc][AIR] Improve visibility of Trainer restore and stateful callback restoration #34350

justinvyu · 2023-04-13T00:34:15Z

Why are these changes needed?

This PR moves the FAQ section on Train experiment restoration into the DL and GBDT user guides, and makes the accompanying code snippets tested. It also improves the docstrings of the methods to implement for saving and restoring stateful Callbacks.
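For context, here is a minimal sketch of the save/restore pattern that a stateful callback follows. It is plain Python and does not import Ray: `CountingCallback`, `save_callback_state`, and `restore_callback_state` are hypothetical names used only for illustration, and the `get_state`/`set_state` method names are an assumption about the hooks whose docstrings this PR touches.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, Optional


class CountingCallback:
    """Hypothetical stand-in for a stateful callback (not a Ray class)."""

    def __init__(self) -> None:
        # State that would be lost on restore unless it is saved explicitly.
        self.results_seen = 0

    def on_result(self, result: Dict) -> None:
        # Update internal state every time a result is reported.
        self.results_seen += 1

    def get_state(self) -> Optional[Dict]:
        # Return a serializable snapshot of the callback's state.
        return {"results_seen": self.results_seen}

    def set_state(self, state: Dict) -> None:
        # Load a snapshot previously produced by get_state().
        self.results_seen = state["results_seen"]


def save_callback_state(callback: CountingCallback, path: Path) -> None:
    # Assumed persistence step: write the snapshot next to the experiment state.
    path.write_text(json.dumps(callback.get_state()))


def restore_callback_state(callback: CountingCallback, path: Path) -> None:
    # On restore, a fresh callback instance is handed the saved snapshot.
    callback.set_state(json.loads(path.read_text()))
```

The point of the sketch is that the restored run constructs a new callback object, so anything not returned from `get_state` starts from its initial value again.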

Links for reviewers

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

matthewdeng

Very nice!

doc/source/train/config_guide.rst

doc/source/train/doc_code/key_concepts.py

doc/source/train/dl_guide.rst

doc/source/train/doc_code/dl_guide.py

matthewdeng · 2023-04-15T00:37:14Z

doc/source/train/gbdt.rst

+    Once checkpointing is enabled, you can follow :ref:`this guide <train-fault-tolerance>`
+    to enable fault tolerance.


Probably don't want to link this since this is in the Deep Learning guide.

Yeah, I was trying to share some content between the two. I linked it to a section that doesn't really depend on DL trainers. I think it's okay for now, and we can rethink the docs/user guide structure as part of the docs side of the layering project. wdyt?

justinvyu added 6 commits April 12, 2023 16:34

Move trainer restore from faq to user guide + improvements

8e64cd6

Signed-off-by: Justin Yu <[email protected]>

Add a section for gbdt trainers

ca0d3a9

Signed-off-by: Justin Yu <[email protected]>

Fix config guide

63ebf35

Signed-off-by: Justin Yu <[email protected]>

improve config guide

8679042

Signed-off-by: Justin Yu <[email protected]>

Improve docstrings of callback stateful methods

d371b8e

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

4b14653

…restore_loose_ends

justinvyu assigned matthewdeng Apr 13, 2023

justinvyu requested review from richardliaw, krfricke and xwjiang2010 as code owners April 13, 2023 00:34

justinvyu assigned richardliaw Apr 13, 2023

justinvyu requested review from amogkam, matthewdeng, Yard1, maxpumperla and a team as code owners April 13, 2023 00:34

justinvyu added 5 commits April 14, 2023 10:59

Fix doc code

3cee93e

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

6dc3912

…restore_loose_ends Signed-off-by: Justin Yu <[email protected]>

Fix merge conflict remainder

6169a3b

Signed-off-by: Justin Yu <[email protected]>

Fix missing reference + incorrect code line highlight

21489f1

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

0e80203

…restore_loose_ends

justinvyu requested a review from gjoliver as a code owner April 14, 2023 21:25

matthewdeng approved these changes Apr 15, 2023

justinvyu added 5 commits April 17, 2023 17:47

Framework -> <Framework>

3251197

Signed-off-by: Justin Yu <[email protected]>

Separate into two code blocks

11abbc4

Signed-off-by: Justin Yu <[email protected]>

Fix header levels for config guide + add better descriptions

9c6c0d4

Signed-off-by: Justin Yu <[email protected]>

Clarify the run config example snippet

a614ea2

Signed-off-by: Justin Yu <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

59e8317

…restore_loose_ends

justinvyu requested a review from matthewdeng April 18, 2023 01:14

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

9bda935

…restore_loose_ends

Merge branch 'master' of https://github.com/ray-project/ray into doc/…

454e14d

…restore_loose_ends

justinvyu added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Apr 19, 2023

richardliaw merged commit 3d94498 into ray-project:master Apr 20, 2023

elliottower pushed a commit to elliottower/ray that referenced this pull request Apr 22, 2023

[Doc][AIR] Improve visibility of Trainer restore and stateful callbac…

374cab9

…k restoration (ray-project#34350) Signed-off-by: elliottower <[email protected]>

ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request May 4, 2023

[Doc][AIR] Improve visibility of Trainer restore and stateful callbac…

e3e7abb

…k restoration (ray-project#34350) Signed-off-by: Jack He <[email protected]>

architkulkarni pushed a commit to architkulkarni/ray that referenced this pull request May 16, 2023

[Doc][AIR] Improve visibility of Trainer restore and stateful callbac…

76d2c9f

…k restoration (ray-project#34350)
