[RLlib] Fix MultiAgentEpisode getter bugs. #44898

sven1977 · 2024-04-22T13:20:20Z

Fix MultiAgentEpisode getter bugs.

Improved MultiAgentEpisode test cases.

e.g.:

        observations = [
            {"a0": 0, "a1": 0},
            {"a0": 1, "a1": 1},
            {"a1": 2},
            {"a1": 3},
            {"a1": 4},
        ]
        actions = observations[:-1]
        rewards = observations[:-1]
        episode = MultiAgentEpisode(
            observations=observations, actions=actions, rewards=rewards, len_lookback_buffer="auto"
        )
        episode.get_actions(0)  # <- should result in an index error (b/c all actions are in the lookback buffer by default), but didn't

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <[email protected]>

sven1977 · 2024-04-23T09:37:01Z

rllib/env/utils/infinite_lookback_buffer.py

@@ -464,7 +464,7 @@ def _get_int_index(
        # If index >= 0 -> Ignore lookback buffer.
        # Otherwise, include lookback buffer.
        if idx >= 0 or neg_indices_left_of_zero:
-            idx = self.lookback + idx - (_ignore_last_ts is True)
+            idx = self.lookback + idx


This is the actual bug fix.

sven1977 · 2024-04-23T09:37:45Z

rllib/env/tests/test_multi_agent_episode.py

@@ -1023,6 +1023,50 @@ def test_get_actions(self):
        act = episode.get_actions(-4, env_steps=False, fill="skip")
        check(act, {"a0": "skip", "a1": 0})

+        episode.add_env_step(


Added this to the tests. Mostly to figure out, whether a hanging action at the edge of the episode or further back would make a difference in get_actions(-1).

sven1977 · 2024-04-23T09:37:58Z

rllib/env/tests/test_multi_agent_episode.py

@@ -975,7 +975,7 @@ def test_get_actions(self):
            check(act, actions[i])
        # Access >=0 integer indices (expect index error as everything is in
        # lookback buffer).
-        for i in range(1, 5):
+        for i in range(0, 5):


idx=0 was NOT working properly before this fix.

sven1977 · 2024-04-23T09:38:10Z

rllib/env/multi_agent_episode.py

@@ -1633,21 +1633,30 @@ def __repr__(self):
        )

    def print(self) -> None:
+        """Prints this MultiAgentEpisode as a table of observations for the agents."""


Made this a little nicer. :)

simonsays1980

LGTM. All tests passed for the MAERB, too. This was the remaining part in the puzzle. Awesome!

simonsays1980 · 2024-04-23T09:37:42Z

rllib/env/utils/infinite_lookback_buffer.py

@@ -464,7 +464,7 @@ def _get_int_index(
        # If index >= 0 -> Ignore lookback buffer.
        # Otherwise, include lookback buffer.
        if idx >= 0 or neg_indices_left_of_zero:
-            idx = self.lookback + idx - (_ignore_last_ts is True)
+            idx = self.lookback + idx


Amazing how such a small modification changes the landscape completely.

sven1977 added 2 commits April 22, 2024 14:05

wip

5384dff

Signed-off-by: sven1977 <[email protected]>

wip

a61cb72

Signed-off-by: sven1977 <[email protected]>

sven1977 marked this pull request as ready for review April 22, 2024 13:22

sven1977 requested review from avnishn, ArturNiederfahrenhorst, maxpumperla, kouroshHakha and simonsays1980 as code owners April 22, 2024 13:22

sven1977 assigned simonsays1980 Apr 22, 2024

sven1977 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Apr 22, 2024

Merge branch 'master' into fix_multi_agent_episode_getter_bugs

3b3b58d

sven1977 commented Apr 23, 2024

View reviewed changes

simonsays1980 approved these changes Apr 23, 2024

View reviewed changes

sven1977 merged commit 930f31a into ray-project:master Apr 23, 2024
5 checks passed

sven1977 deleted the fix_multi_agent_episode_getter_bugs branch April 23, 2024 10:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Fix MultiAgentEpisode getter bugs. #44898

[RLlib] Fix MultiAgentEpisode getter bugs. #44898

sven1977 commented Apr 22, 2024 •

edited

Loading

sven1977 Apr 23, 2024

sven1977 Apr 23, 2024

sven1977 Apr 23, 2024

sven1977 Apr 23, 2024

simonsays1980 left a comment

simonsays1980 Apr 23, 2024

[RLlib] Fix MultiAgentEpisode getter bugs. #44898

[RLlib] Fix MultiAgentEpisode getter bugs. #44898

Conversation

sven1977 commented Apr 22, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 Apr 23, 2024

Choose a reason for hiding this comment

sven1977 Apr 23, 2024

Choose a reason for hiding this comment

sven1977 Apr 23, 2024

Choose a reason for hiding this comment

sven1977 Apr 23, 2024

Choose a reason for hiding this comment

simonsays1980 left a comment

Choose a reason for hiding this comment

simonsays1980 Apr 23, 2024

Choose a reason for hiding this comment

sven1977 commented Apr 22, 2024 •

edited

Loading