[RLlib] Fix wrong `env` being passed into `on_episode_end` callback on MultiAgentEnvRunner when sampling whole episodes. #45617

sven1977 · 2024-05-29T15:17:20Z

Fix wrong `env` being passed into `on_episode_end` callback on `MultiAgentEnvRunner` when sampling whole episodes.

Enhance test cases to check that the proper callback arguments (`env`, `env_runner`, `metrics_logger`) are passed.
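For illustration only, the kind of check described above can be sketched without RLlib. Everything in this block is a hypothetical stand-in (`DummyEnv`, `RecordingCallbacks`, `ToyEnvRunner`, `check_callback_args`), not the actual `MultiAgentEnvRunner` or the tests changed in this PR. It assumes only that the callback receives `env`, `env_runner`, and `metrics_logger` as keyword arguments.

```python
from dataclasses import dataclass, field


class DummyEnv:
    """Hypothetical stand-in for an environment; only object identity matters."""


@dataclass
class RecordingCallbacks:
    """Hypothetical callbacks object that records what on_episode_end receives."""

    calls: list = field(default_factory=list)

    def on_episode_end(self, *, episode, env_runner, metrics_logger, env, **kwargs):
        # Store the arguments so a test can inspect them afterwards.
        self.calls.append(
            {"env": env, "env_runner": env_runner, "metrics_logger": metrics_logger}
        )


class ToyEnvRunner:
    """Hypothetical runner (NOT RLlib's MultiAgentEnvRunner)."""

    def __init__(self, callbacks):
        self.env = DummyEnv()
        self.metrics = object()  # placeholder for a metrics logger
        self.callbacks = callbacks

    def sample_episodes(self, num_episodes):
        # Fire the callback once per finished episode, passing the runner's own env.
        for episode_id in range(num_episodes):
            self.callbacks.on_episode_end(
                episode=episode_id,
                env_runner=self,
                metrics_logger=self.metrics,
                env=self.env,
            )


def check_callback_args(runner, callbacks):
    """Assert every recorded call got the runner's own objects; return the call count."""
    for call in callbacks.calls:
        assert call["env"] is runner.env
        assert call["env_runner"] is runner
        assert call["metrics_logger"] is runner.metrics
    return len(callbacks.calls)


callbacks = RecordingCallbacks()
runner = ToyEnvRunner(callbacks)
runner.sample_episodes(3)
num_checked = check_callback_args(runner, callbacks)
```

The identity checks (`is`) are the point of the sketch: a callback that receives some other env object than the one the runner actually stepped would fail the first assertion.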

Why are these changes needed?

- I've signed off every commit (by using the -s flag, i.e., `git commit -s`) in this PR.
- I've run `scripts/format.sh` to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: sven1977 <[email protected]>

simonsays1980

LGTM.

wip

9938cd4

Signed-off-by: sven1977 <[email protected]>

sven1977 requested review from ArturNiederfahrenhorst and simonsays1980 as code owners May 29, 2024 15:17

sven1977 assigned simonsays1980 May 29, 2024

sven1977 enabled auto-merge (squash) May 29, 2024 15:18

github-actions bot added the `go` label (add ONLY when ready to merge, run all tests) May 29, 2024

simonsays1980 approved these changes May 29, 2024

sven1977 merged commit 3f29274 into ray-project:master May 29, 2024
7 checks passed