[RLlib] [WIP] [MultiAgentEnv Refactor #1] Add new methods to base env #21027
Conversation
Force-pushed from 4755aea to 1c6d566.
Force-pushed from 1c6d566 to 956759a.
2 quick questions.
logger.warning("last has not been implemented for this environment.")
return {}, {}, {}, {}, {}

@PublicAPI
def observation_space_contains(self, x: MultiEnvDict) -> bool:
wait, is the type still MultiEnvDict here and below??
I thought we were saying gym and multi-agent envs return different types now?
Yes -- the return types of poll() and try_reset() are MultiEnvDicts, so I thought it would be appropriate for observations and actions produced by the environment/policy to be easily passed back to the environment for checking.
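For context, a minimal sketch of that MultiEnvDict shape (the values below are made up purely for illustration; the nesting is (sub-)env index first, then agent ID):

```python
# Illustrative only: poll()/try_reset() return MultiEnvDicts keyed by
# (sub-)env index and then by agent ID.
obs = {
    0: {"agent_0": [0.1, 0.2], "agent_1": [0.3, 0.4]},
    1: {"agent_0": [0.5, 0.6], "agent_1": [0.7, 0.8]},
}

# With the new methods, that dict can be handed straight back for checking:
# base_env.observation_space_contains(obs)
```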
ok, it seems to me we are settling on using the multi-agent API for single-agent envs as well, which is totally fine, and probably logical.
The BaseEnv is never a single-agent env. If there is only one agent and we derive the BaseEnv from e.g. a gym.Env, we auto-create "DUMMY_AGENT_ID" in the env as the agent's ID.
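A small sketch of what that wrapping looks like from the outside (the constant name follows _DUMMY_AGENT_ID in rllib/env/base_env.py; the observation values are invented for illustration):

```python
from ray.rllib.env.base_env import _DUMMY_AGENT_ID

# Hypothetical illustration: a single-agent gym.Env wrapped as a BaseEnv still
# reports MultiEnvDicts; the lone agent simply appears under the auto-created
# dummy agent ID, keyed by the (sub-)env index.
single_agent_obs = [0.03, -0.01, 0.02, 0.04]  # e.g. a CartPole observation
wrapped_obs = {0: {_DUMMY_AGENT_ID: single_agent_obs}}
```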
return self._space_contains(self.observation_space, x)

@PublicAPI
def action_space_contains(self, x: MultiEnvDict) -> bool:
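For what it's worth, here is a minimal sketch of how I read the shared _space_contains() helper working over a MultiEnvDict (my own approximation under the assumption that the space is a per-agent dict of gym spaces, not the exact code in this diff):

```python
from typing import Any, Dict

import gym


def _space_contains(space: Dict[str, gym.Space], x: Dict[Any, Dict[Any, Any]]) -> bool:
    # x is a MultiEnvDict: {env_id: {agent_id: value}}. Every value must fall
    # inside the corresponding per-agent space for the whole dict to count
    # as contained.
    return all(
        space[agent_id].contains(value)
        for multi_agent_dict in x.values()
        for agent_id, value in multi_agent_dict.items()
    )
```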
a very minor question, do you envision obs/action_space_contains() getting used outside of the environment checking module?
I can imagine it having other uses for users who are developing their own environments -- I have definitely used functions like this while building my own.
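For example (hypothetical snippet; env here stands in for a custom BaseEnv under development):

```python
# Quick ad-hoc sanity check while iterating on a custom environment:
obs = env.try_reset(0)
assert env.observation_space_contains(obs), "reset() returned out-of-space observations"
```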
Looks good to me!
RLlib environments are difficult to test, particularly BaseEnvs and MultiAgentEnvs, because they are missing fields required to test them.
The biggest issue I face right now is that the MultiAgentEnv API is too loosely defined to write basic environment tests following this rough workflow (a sketch of the kind of check I have in mind is shown below):
Because the MultiAgentEnv API is too loosely defined to follow this workflow, I can't write a BaseEnv checking module.
This PR adds the BaseEnv methods necessary to make BaseEnvs unit-testable and checkable with an environment checking module.
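A hedged sketch of that checking workflow, assuming helper names such as check_base_env() and action_space_sample() purely for illustration (they are assumptions, not necessarily part of this PR's diff):

```python
# Hypothetical checking pass over a BaseEnv using the new containment methods.
# check_base_env() and action_space_sample() are assumed names for illustration.
def check_base_env(env) -> None:
    obs = env.try_reset(0)
    assert env.observation_space_contains(obs), (
        "Reset observations are not contained in the declared observation space."
    )

    actions = env.action_space_sample()
    assert env.action_space_contains(actions), (
        "Sampled actions are not contained in the declared action space."
    )

    env.send_actions(actions)
    next_obs, rewards, dones, infos, _ = env.poll()
    assert env.observation_space_contains(next_obs), (
        "Stepped observations are not contained in the declared observation space."
    )
```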
Checks
I've run scripts/format.sh to lint the changes in this PR.