[Rllib] - Example Cleanups new API Stack - Autoregressive Action Module #45525

simonsays1980 · 2024-05-23T17:52:16Z

Why are these changes needed?

Implements an example of how to define an RLModule with dependent action distributions.
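
For readers unfamiliar with the term, here is a minimal sketch of what "dependent action distributions" can mean. It is written in plain Python and is not the RLlib implementation added in this PR. All function names (`prior_logits`, `posterior_logits`, `sample_autoregressive`, `joint_prob`) and the toy logit formulas are hypothetical stand-ins for learned network heads. The first action component `a1` is sampled from a distribution conditioned on the observation, the second component `a2` from a distribution conditioned on both the observation and `a1`. The joint log-probability then factorizes as log p(a1 | obs) + log p(a2 | obs, a1).

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def prior_logits(obs):
    # Hypothetical stand-in for a learned head: logits for a1 given obs only.
    return [obs, 0.0, -obs]


def posterior_logits(obs, a1):
    # Hypothetical stand-in: logits for a2 depend on obs AND the sampled a1.
    return [obs + a1, obs - a1]


def sample_categorical(probs, rng):
    """Draw an index according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def sample_autoregressive(obs, rng):
    """Sample (a1, a2) autoregressively and return the joint log-probability."""
    p1 = softmax(prior_logits(obs))
    a1 = sample_categorical(p1, rng)
    # The second distribution is built only after a1 is known.
    p2 = softmax(posterior_logits(obs, a1))
    a2 = sample_categorical(p2, rng)
    logp = math.log(p1[a1]) + math.log(p2[a2])
    return (a1, a2), logp


def joint_prob(obs, a1, a2):
    """Probability of the pair (a1, a2) under the factorized distribution."""
    p1 = softmax(prior_logits(obs))
    p2 = softmax(posterior_logits(obs, a1))
    return p1[a1] * p2[a2]


if __name__ == "__main__":
    rng = random.Random(0)
    action, logp = sample_autoregressive(0.5, rng)
    print(action, logp)
```

Because `a2` is conditioned on `a1`, the two components cannot be sampled independently in a single forward pass, which is why such a module needs its own sampling logic.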

Related issue number

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Simon Zehnder <[email protected]>

… in testing. Signed-off-by: Simon Zehnder <[email protected]>

rllib/examples/envs/classes/correlated_actions_env.py

rllib/examples/rl_modules/classes/autoregressive_actions_rlm.py

…docstrings and annotations and moved experiments to own file in parent folder. In addition fixed some minor bugs and tested the example script for bugs and learning. Signed-off-by: Simon Zehnder <[email protected]>

sven1977 · 2024-05-24T14:14:13Z

rllib/examples/rl_modules/autoregressive_actions_rlm.py

@@ -0,0 +1,107 @@
+"""An example script showing how to define and load an `RLModule` with


Nice nice nice! Thanks for adding this.

sven1977

LGTM! Could we add this script to the CI (new stack only), with a reachable learning criterion via the --as-test option?

simonsays1980 added 9 commits May 10, 2024 12:16

Changed comment.

c748df8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

6409007

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

d2f9030

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

a3416a8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

8582ad9

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

b565f34

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

c0eed1f

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

341cb95

Signed-off-by: Simon Zehnder <[email protected]>

Implemented Autoregressive Action RLModule with implementation. Still…

1287ec0

… in testing. Signed-off-by: Simon Zehnder <[email protected]>

simonsays1980 self-assigned this May 23, 2024

simonsays1980 added the labels rllib (RLlib related issues), rllib-docs-or-examples (Issues related to RLlib documentation or rllib/examples), and rllib-newstack May 23, 2024

sven1977 marked this pull request as ready for review May 23, 2024 18:59

sven1977 requested review from sven1977 and ArturNiederfahrenhorst as code owners May 23, 2024 18:59

sven1977 assigned sven1977 and unassigned simonsays1980 May 23, 2024