[RLlib] Cleanup examples folder #10: Add custom_rl_module.py example script and matching RLModule example class (tiny CNN).
#45774
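To give a rough sense of scale, here is a plain-PyTorch sketch of the kind of "tiny CNN" stack such an example RLModule could wrap. This is an illustration only; the class name, layer sizes, and interface are assumptions, not the actual example class added by this PR.

import torch
from torch import nn

class TinyCNN(nn.Module):
    """Tiny conv stack mapping image observations to per-action logits."""

    def __init__(self, in_channels: int = 3, num_actions: int = 4):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=4, stride=2),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=4, stride=2),
            nn.ReLU(),
            nn.Flatten(),
            nn.LazyLinear(num_actions),  # lazily infers the flattened input size
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        # obs: [B, C, H, W] float tensor -> [B, num_actions] action logits.
        return self.net(obs)

if __name__ == "__main__":
    print(TinyCNN()(torch.rand(2, 3, 64, 64)).shape)  # torch.Size([2, 4])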
Conversation
LGTM.
# The default implementation is to return TorchCategorical for Discrete action
# spaces and TorchDiagGaussian for Box action spaces. For all other spaces,
# raise a NotImplementedError.
if isinstance(self.config.action_space, gym.spaces.Discrete):
Why not use TorchMultiCategorical and TorchMultiDistribution - the things that get assembled inside the Catalog?
Not sure either, tbh. I just wanted to automate the simplest possible setup. I feel like users who just want to "hack together an RLModule" shouldn't have to worry about picking the categorical distribution for their CartPole action space :)
Yes, I think we should extend this method with even better defaults.
Let's continue brainstorming how to simplify the general RLModule experience for the user ...
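For reference, a minimal sketch of the Discrete/Box dispatch discussed in this thread. The import path for the Torch distribution classes is an assumption based on recent RLlib versions; this is an illustration, not the code merged in this PR.

import gymnasium as gym
from ray.rllib.models.torch.torch_distributions import (
    TorchCategorical,
    TorchDiagGaussian,
)

def default_action_dist_cls(action_space: gym.Space):
    """Picks a default Torch action-distribution class for simple action spaces."""
    if isinstance(action_space, gym.spaces.Discrete):
        return TorchCategorical
    elif isinstance(action_space, gym.spaces.Box):
        return TorchDiagGaussian
    # MultiDiscrete, Tuple, Dict, etc. would need TorchMultiCategorical /
    # TorchMultiDistribution (or the Catalog) and are not covered by this default.
    raise NotImplementedError(f"No default action distribution for {action_space}.")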
@@ -91,12 +91,9 @@ def __init__(self, config: AlgorithmConfig, **kwargs):
    try:
        module_spec: SingleAgentRLModuleSpec = self.config.rl_module_spec
        module_spec.observation_space = self._env_to_module.observation_space
        # TODO (simon): The `gym.Wrapper` for `gym.vector.VectorEnv` should
Great that this is gone now.
Yeah, it didn't seem to be a problem anymore (e.g. for PPO on Pendulum, everything looks completely fine without any weird space errors on the Box actions), so I removed this comment.
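A hypothetical way to reproduce that check (not part of this PR): run a few PPO iterations on Pendulum-v1 and confirm nothing complains about the Box action space.

from ray.rllib.algorithms.ppo import PPOConfig

config = PPOConfig().environment("Pendulum-v1")
algo = config.build()
for i in range(3):
    algo.train()  # would raise if the Box action space were mishandled
    print(f"iteration {i} finished without action-space errors")
algo.stop()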
Cleanup examples folder #10: Add custom_rl_module.py example script and matching RLModule example class (tiny CNN).

Why are these changes needed?
Related issue number

Checks

I've signed off every commit (git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
If I've added a new method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.