[RLlib] Fix calling of callback `on_episode_created` to conform to docstring (after reset). #45651

simonsays1980 · 2024-05-31T08:54:10Z

Why are these changes needed?

The docstring of the on_episode_created callback states clearly that this callback should be called before the env.reset.

Related issue number

Closes #45544

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Simon Zehnder <[email protected]>

…fter the 'env.reset' instead before. The docstring in the callback clearly states, it should come before the reset. Signed-off-by: Simon Zehnder <[email protected]>

Signed-off-by: Simon Zehnder <[email protected]>

sven1977 · 2024-05-31T09:25:33Z

rllib/env/multi_agent_env_runner.py

        # Create a new multi-agent episode.
        _episode = self._new_episode()
        self._make_on_episode_callback("on_episode_created", _episode)
        _shared_data = {
            "agent_to_module_mapping_fn": self.config.policy_mapping_fn,
        }

+        # Reset the environment.


Hey @simonsays1980 , thanks for looking into this problem. I think this issue in general here is unfortunately more complex that what meets the eye right now :( . Let me explain:

On single-agent, users are currently not even allowed to override the on_episode_created callback :D . This is because in single-agent, we use gym's vector env, which resets envs automatically after a terminal is hit, which makes it impossible to call the on_episode_created callback before this auto-reset happens. See here and here.

For multi-agent (where currently we don't use gym.vector) this does actually work and I therefore would suggest, we only fix this for now on the multi-agent env runner.

In MultiAgentEnvRunner, however, we should then also fix it for _sample_episodes().

We should update the docstring in callbacks.py to reflect that this callback is NOT currently valid for new API stack + single-agent.

We should remove the on_episode_created callback call entirely from single-agent env runner.

sven1977

Thanks for raising this issue @simonsays1980 , an important one.

I wrote my thoughts and the current problems with this particular callback below and suggested some changes. Then we can merge this. :)

Signed-off-by: Simon Zehnder <[email protected]>

sven1977

We still need to:

Add the on_episode_created callback call to MultiAgentEpisode._sample_episodes().

rllib/algorithms/callbacks.py

Signed-off-by: Simon Zehnder <[email protected]>

…cstring (after reset). (ray-project#45651) Signed-off-by: Richard Liu <[email protected]>

simonsays1980 added 15 commits May 10, 2024 12:16

Changed comment.

c748df8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

6409007

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

d2f9030

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

a3416a8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

8582ad9

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

b565f34

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

c0eed1f

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

341cb95

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

b76807f

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

af9c9e9

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

e422c42

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

26e0926

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

562f586

Signed-off-by: Simon Zehnder <[email protected]>

Fixed a minor bug that was calling the calback 'on_episode_created' a…

b95848c

…fter the 'env.reset' instead before. The docstring in the callback clearly states, it should come before the reset. Signed-off-by: Simon Zehnder <[email protected]>

Modified callback order in 'MultiAgentEnvRunner'.

ca98704

Signed-off-by: Simon Zehnder <[email protected]>

sven1977 reviewed May 31, 2024

View reviewed changes

Implemented @sven1977's review.

d8eee30

Signed-off-by: Simon Zehnder <[email protected]>

sven1977 marked this pull request as ready for review May 31, 2024 12:15

sven1977 requested a review from ArturNiederfahrenhorst as a code owner May 31, 2024 12:15

sven1977 changed the title ~~[RLlib] - Fix calling of callback on_episode_created to conform to docstring (after reset).~~ [RLlib] Fix calling of callback on_episode_created to conform to docstring (after reset). May 31, 2024

sven1977 approved these changes May 31, 2024

View reviewed changes

sven1977 reviewed May 31, 2024

View reviewed changes

rllib/algorithms/callbacks.py Show resolved Hide resolved

simonsays1980 added 2 commits June 5, 2024 10:56

Merge branch 'master' of https://github.com/ray-project/ray

a694987

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' into fix-callback-on-episode-created

4ba3105

Signed-off-by: Simon Zehnder <[email protected]>

simonsays1980 self-assigned this Jun 5, 2024

simonsays1980 added bug Something that is supposed to be working; but isn't rllib RLlib related issues rllib-newstack labels Jun 5, 2024

sven1977 enabled auto-merge (squash) June 5, 2024 10:48

github-actions bot added the go add ONLY when ready to merge, run all tests label Jun 5, 2024

sven1977 merged commit 61bc5d4 into ray-project:master Jun 5, 2024
7 checks passed

richardsliu pushed a commit to richardsliu/ray that referenced this pull request Jun 12, 2024

[RLlib] Fix calling of callback on_episode_created to conform to do…

9eed717

…cstring (after reset). (ray-project#45651) Signed-off-by: Richard Liu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Fix calling of callback `on_episode_created` to conform to docstring (after reset). #45651

[RLlib] Fix calling of callback `on_episode_created` to conform to docstring (after reset). #45651

simonsays1980 commented May 31, 2024 •

edited

Loading

sven1977 May 31, 2024

sven1977 left a comment

sven1977 left a comment

[RLlib] Fix calling of callback on_episode_created to conform to docstring (after reset). #45651

[RLlib] Fix calling of callback on_episode_created to conform to docstring (after reset). #45651

Conversation

simonsays1980 commented May 31, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 May 31, 2024

Choose a reason for hiding this comment

sven1977 left a comment

Choose a reason for hiding this comment

sven1977 left a comment

Choose a reason for hiding this comment

[RLlib] Fix calling of callback `on_episode_created` to conform to docstring (after reset). #45651

[RLlib] Fix calling of callback `on_episode_created` to conform to docstring (after reset). #45651

simonsays1980 commented May 31, 2024 •

edited

Loading