[rllib] Env to base env refactor #20785

avnishn · 2021-11-30T01:41:25Z

Why are these changes needed?

BaseEnv is awkwardly the owner of a significant amount of code that does not belong to BaseEnv. This PR addresses that by moving BaseEnv wrappers to their corresponding class files, and (in progress) removing the to_base_env method from the BaseEnv class, as this class awkwardly returns an instance of itself doesn't rely on having access to itself, is a static function, and should probably be a standalone function.

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Should we add tests for these wrappers? There weren't tests for them already, so we probably should go back and add them eventually?

gjoliver

May be my favorite PR of the week :)
There are also a few test failures. Let me know if you need help debugging those. Something may be off with MultiAgentEnv.

gjoliver · 2021-11-30T02:23:29Z

rllib/env/base_env.py

+    @Deprecated(
+        old="ray.rllib.env.base_env.BaseEnv.to_base_env",
+        new="ray.rllib.env.base_env.convert_to_base_env",
+        error=False)
    def to_base_env(


@sven1977 @avnishn do you think we can just delete these internal functions classes?
these are neither @publicapi nor @DeveloperAPI, so not part of the APIs we publish.

Hmm, let's leave those in for a while. I feel like users should always have n releases to catch the new deprecation warnings and change their code accordingly before we completely remove the old code. I know we have lots of deprecation_warnings in the code right now, but feel free to - here and there - flip some of these to error=True to start the next stage in their deprecation (the last stage being to remove the code entirely).

gjoliver · 2021-11-30T09:13:33Z

rllib/env/base_env.py

 def _with_dummy_agent_id(env_id_to_values: Dict[EnvID, Any],
                         dummy_id: "AgentID" = _DUMMY_AGENT_ID
                         ) -> MultiEnvDict:
    return {k: {dummy_id: v} for (k, v) in env_id_to_values.items()}


+@DeveloperAPI


why make this @DeveloperAPI?

Yeah, probably not necessary here. Let's not make this DevAPI.

gjoliver · 2021-11-30T09:35:41Z

rllib/env/base_env.py

+    # `env` is not a BaseEnv yet -> Need to convert/vectorize.
+
+    # MultiAgentEnv (which is a gym.Env).
+    if isinstance(env, MultiAgentEnv):


I think things will be even cleaner if we can create some kind of common module level apis for these Envs.
Something like the following:

external_env.py

def is_env_type(env) -> bool: return isinstance(env, ExternalEnv)

def to_base_env(env, make_env, num_envs, ...) -> BaseEnv:
... logics for converting ExternalEnv to BaseEnv.

multi_agent_env.py

def is_env_type(env) -> bool: return isinstance(env, MultiAgentEnv)

def to_base_env(env, make_env, num_envs, ...) -> BaseEnv:
... logics for converting MultiAgentEnv to BaseEnv.

vector_env.py

def is_env_type(env) -> bool: return isinstance(env, VectorEnv)

def to_base_env(env, make_env, num_envs, ...) -> BaseEnv:
... logics for converting VectorEnv to BaseEnv.

generic_env.py

def is_env_type(env) -> bool: return isinstance(env, EnvType)

def to_base_env(env, make_env, num_envs, ...) -> BaseEnv:
... logics for converting basically gym.Env to BaseEnv.
... basically the logics in the last big else clause.

Then with this structure, we can clean up this function to be as simple as:

def convert_to_base_env(env, ...) -> BaseEnv: if isinstance(env, BaseEnv): return env from ray.rllib.env import external_env, multi_agent_env, vector_env, generic_env for env_type in [external_env, multi_agent_env, vector_env, generic_env]: if env_type.is_env_type(env): return env_type.to_base_env(env, make_env, ...) raise ValueError("Unknown Env type: ", type(env))

Let me know if there is anything that's not clear here.
I can also help send a patch if it makes things faster.

Not sure about the is_env_type() methods. Wouldn't these just add extra weight to the code?
One could also just do (w/o needing these methods):

def convert_to_base_env(env, ...) -> BaseEnv: if isinstance(env, BaseEnv): return env from ray.rllib.env import external_env, multi_agent_env, vector_env, generic_env for env_type in [external_env, multi_agent_env, vector_env, generic_env]: if isinstance(env, env_type): return env_type.to_base_env(env, make_env, ...) raise ValueError("Unknown Env type: ", type(env))

Happy to move the to_base_env logics into the individual classes. That makes a lot of sense.
Every env-class that has such a method is then directly visible as RLlib-supported.

@avnishn , we can do this in a follow-up PR, though. This one already has lots of changes in it.

is there a reason why we can't instead keep make this a static but standalone function, like we do in this PR anyways? The reason that we have this to_base_env function right now is because RLlib requires a BaseEnv for most of its operations. Is that reason enough that we should have this to_base_env function?

Perhaps it is cleaner, but it does definitely add weight to the class overall.

I like the above solution though, because it removes the need for circular imports in the BaseEnv class

we could probably call it something else? to_base_env implies that the function doesn't return a new class. Maybe we could call it get_base_env or get_new_base_env

good point actually. I think if we can't delete the static BaseEnv.to_base_env() anyways, we might as well just keep using it. No need to add the top-level convert_to_base_env().
the part I care about is to keep BaseEnv.to_base_env() as minimal as possible, and make per-Env conversion logic live with those specific modules.

regarding is_env_type(), that's just so we don't have to import 4 Env classes using 4 import statements. and it provides a little bit of flexibility when you need more logics to decide whether a specific Env should handle the conversion. I won't insist on this either. the code will look like:

def convert_to_base_env(env, ...) -> BaseEnv: if isinstance(env, BaseEnv): return env from ray.rllib.env.external_env import ExternalEnv from ray.rllib.env.multi_agent_env import MultiAgentEnv ... import VectorEnv ... import generic_env_to_base_env for env_type in [ExternalEnv, MultiAgentEnv, VectorEnv]: if isinstance(env, env_type): return env_type.to_base_env(env, make_env, ...) return generic_env_to_base_env(env, make_env, ...)

Just a little less concise.

Either way is fine. Let's worry about it with the followup PR.

Rename _XXXEnvToBaseEnv classes into XXXBaseEnvWrapper(BaseEnv)

avnishn · 2021-11-30T21:12:27Z

@gjoliver I removed the developer API tag, and got the tests passing I think. Could you please re-review?

gjoliver

cool man. looking forward to the followup PR.

gjoliver · 2021-11-30T22:10:21Z

rllib/env/base_env.py

+    # `env` is not a BaseEnv yet -> Need to convert/vectorize.
+
+    # MultiAgentEnv (which is a gym.Env).
+    if isinstance(env, MultiAgentEnv):


good point actually. I think if we can't delete the static BaseEnv.to_base_env() anyways, we might as well just keep using it. No need to add the top-level convert_to_base_env().
the part I care about is to keep BaseEnv.to_base_env() as minimal as possible, and make per-Env conversion logic live with those specific modules.

regarding is_env_type(), that's just so we don't have to import 4 Env classes using 4 import statements. and it provides a little bit of flexibility when you need more logics to decide whether a specific Env should handle the conversion. I won't insist on this either. the code will look like:

def convert_to_base_env(env, ...) -> BaseEnv: if isinstance(env, BaseEnv): return env from ray.rllib.env.external_env import ExternalEnv from ray.rllib.env.multi_agent_env import MultiAgentEnv ... import VectorEnv ... import generic_env_to_base_env for env_type in [ExternalEnv, MultiAgentEnv, VectorEnv]: if isinstance(env, env_type): return env_type.to_base_env(env, make_env, ...) return generic_env_to_base_env(env, make_env, ...)

Just a little less concise.

Either way is fine. Let's worry about it with the followup PR.

avnishn requested review from sven1977 and gjoliver November 30, 2021 01:41

gjoliver reviewed Nov 30, 2021

View reviewed changes

sven1977 self-assigned this Nov 30, 2021

avnishn added 5 commits November 30, 2021 11:54

Temp

7b5006a

Base Env Refactors

a224a94

Rename _XXXEnvToBaseEnv classes into XXXBaseEnvWrapper(BaseEnv)

Deprecate to_base_env

dc2d53b

Add try_reset back to MultiAgentEnvWrapper

1a4ca99

Remove DevAPI decorator

0bbe71b

avnishn force-pushed the env_to_base_env_refactor branch from e9a8da4 to 0bbe71b Compare November 30, 2021 20:03

Remove unused package for lint

2ce4a49

gjoliver approved these changes Nov 30, 2021

View reviewed changes

richardliaw changed the title ~~Env to base env refactor~~ [rllib] Env to base env refactor Dec 1, 2021

richardliaw merged commit 3ddc095 into ray-project:master Dec 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Env to base env refactor #20785

[rllib] Env to base env refactor #20785

avnishn commented Nov 30, 2021

gjoliver left a comment

gjoliver Nov 30, 2021

sven1977 Nov 30, 2021

gjoliver Nov 30, 2021

sven1977 Nov 30, 2021

avnishn Nov 30, 2021

gjoliver Nov 30, 2021

sven1977 Nov 30, 2021

sven1977 Nov 30, 2021

sven1977 Nov 30, 2021

avnishn Nov 30, 2021

avnishn Nov 30, 2021

avnishn Nov 30, 2021

gjoliver Nov 30, 2021

avnishn commented Nov 30, 2021

gjoliver left a comment

gjoliver Nov 30, 2021

[rllib] Env to base env refactor #20785

[rllib] Env to base env refactor #20785

Conversation

avnishn commented Nov 30, 2021

Why are these changes needed?

Checks

gjoliver left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

avnishn commented Nov 30, 2021

gjoliver left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment