[RLlib] [Connectors] Fix test nested action spaces connectors #30459

avnishn · 2022-11-18T03:52:22Z

Action flattening was never actually happening in the agent collector. This PR introduces it.

Why are these changes needed?

Related issue number

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

ArturNiederfahrenhorst · 2022-11-18T04:36:46Z

rllib/evaluation/collectors/agent_collector.py

@@ -552,6 +551,18 @@ def _unflatten_as_buffer_struct(
        """Unflattens the given to match the buffer struct format for that key."""
        if key not in self.buffer_structs:
            return data[0]
+        if key == SampleBatch.ACTIONS and not self.disable_action_flattening:


Thanks for doing this! Can you explain a little what's happening here for me and Jun?
Have you tested whether this breaks other tests with connectors?

Action flattening never actually happens at any point in the episode. This fix enables it.
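To make "action flattening" concrete, here is a minimal sketch in plain Python. It assumes a nested action built from dicts, tuples, and numbers, and concatenates the leaves into one flat list, visiting dict keys in sorted order (an assumption made here for determinism). `flatten_action` is a hypothetical stand-in for illustration only; it is not RLlib's `flatten_to_single_ndarray`, which returns a NumPy array.

```python
from typing import Any, List


def flatten_action(action: Any) -> List[float]:
    """Hypothetical helper: flatten a nested action into one flat list of floats."""
    if isinstance(action, dict):
        # Visit keys in sorted order so the flat layout is deterministic.
        return [x for key in sorted(action) for x in flatten_action(action[key])]
    if isinstance(action, (list, tuple)):
        return [x for item in action for x in flatten_action(item)]
    # Leaf: a single scalar component of the action.
    return [float(action)]


# A nested action with two named components (names are made up for the example).
nested_action = {"move": (0.5, -1.0), "gripper": 1}
flat_action = flatten_action(nested_action)
print(flat_action)
```

With flattening enabled, a value shaped like `flat_action` is what would be stored in the buffer instead of the nested structure.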

ArturNiederfahrenhorst

Thanks man! I'll merge this into the collective branch soon to see if this breaks anything else. One question though?

kouroshHakha

I don't understand the flattening you are trying to do.

rllib/evaluation/collectors/agent_collector.py

This commit flattens actions only if action flattening is not disabled. It flattens each action as elements are added to the agent_collector buffer.

Signed-off-by: Avnish <[email protected]>

kouroshHakha

One nit, and approval is contingent on tests passing. Let's wait for the tests to finish before another push.

kouroshHakha · 2022-11-18T23:04:58Z

rllib/algorithms/algorithm_config.py

@@ -223,7 +223,7 @@ def __init__(self, algo_class=None):
        self.sample_collector = SimpleListCollector
        self.create_env_on_local_worker = False
        self.sample_async = False
-        self.enable_connectors = False
+        self.enable_connectors = True


Don't forget to revert this once the tests pass.

kouroshHakha · 2022-11-18T23:12:11Z

rllib/evaluation/collectors/agent_collector.py

@@ -266,6 +269,8 @@ def add_action_reward_next_obs(self, input_values: Dict[str, TensorType]) -> Non
                or k.startswith("state_out_")
                or (k == SampleBatch.ACTIONS and not self.disable_action_flattening)
            ):
+                if k == SampleBatch.ACTIONS and not self.disable_action_flattening:


nit: we can rewrite the code to avoid repeating the condition in two places:

    should_flatten_action_key = (
        k == SampleBatch.ACTIONS and not self.disable_action_flattening
    )
    if should_flatten_action_key:
        v = flatten_to_single_ndarray(v)
    if x or y or should_flatten_action_key:
        self.buffers[k][0].append(v)
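The suggested refactor can be tried out in isolation with a small, self-contained sketch. Everything below is a hypothetical, heavily simplified stand-in for the agent collector: `MiniCollector`, `flatten_to_list`, and the string constants are assumptions made for illustration and are not RLlib code. The point is only that the condition is computed once and reused for both the flattening and the branch selection.

```python
from typing import Any, Dict, List

ACTIONS = "actions"  # stand-in for SampleBatch.ACTIONS
INFOS = "infos"      # stand-in for SampleBatch.INFOS


def flatten_to_list(value: Any) -> List[float]:
    """Hypothetical pure-Python stand-in for flatten_to_single_ndarray."""
    if isinstance(value, dict):
        return [x for key in sorted(value) for x in flatten_to_list(value[key])]
    if isinstance(value, (list, tuple)):
        return [x for item in value for x in flatten_to_list(item)]
    return [float(value)]


class MiniCollector:
    """Hypothetical, simplified collector that stores one row per call to add()."""

    def __init__(self, disable_action_flattening: bool = False) -> None:
        self.disable_action_flattening = disable_action_flattening
        self.buffers: Dict[str, List[List[Any]]] = {}

    def add(self, values: Dict[str, Any]) -> None:
        for k, v in values.items():
            # Compute the condition once and reuse it below.
            should_flatten_action_key = (
                k == ACTIONS and not self.disable_action_flattening
            )
            if should_flatten_action_key:
                v = flatten_to_list(v)
            if k == INFOS or k.startswith("state_out_") or should_flatten_action_key:
                # Store the value whole in a single sub-buffer.
                self.buffers.setdefault(k, [[]])[0].append(v)
            else:
                # Store each leaf of the nested value in its own sub-buffer.
                leaves = flatten_to_list(v)
                sub_buffers = self.buffers.setdefault(k, [[] for _ in leaves])
                for sub_buffer, leaf in zip(sub_buffers, leaves):
                    sub_buffer.append(leaf)


collector = MiniCollector()
collector.add({"actions": {"a": 1, "b": (2, 3)}, "obs": (0.1, 0.2)})
print(collector.buffers)
```

Setting `disable_action_flattening=True` in this sketch sends the action through the per-leaf branch instead, which mirrors the behavior the condition guards.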

kouroshHakha · 2022-11-18T23:12:23Z

rllib/evaluation/collectors/agent_collector.py

@@ -511,6 +516,8 @@ def _build_buffers(self, single_row: Dict[str, TensorType]) -> None:
                or col.startswith("state_out_")
                or (col == SampleBatch.ACTIONS and not self.disable_action_flattening)
            ):
+                if col == SampleBatch.ACTIONS and not self.disable_action_flattening:


same thing here

kouroshHakha

LGTM. merge contingent on tests passing.

gjoliver

I understand this is an easy change, but do we have to flatten the action here inside agent_collector?
As I imagine it, we should create a super simple FlatteningActionConnector and make it part of the action connector pipeline if config.disable_action_flattening is False.
We will then be able to look at the action connectors and say "oh, ok, dude wants actions to be flattened ...".
Does this make sense? Are we trying to say that the actual action output doesn't need flattening, it only requires flattening when being added to agent_collector?
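A rough sketch of that idea, under stated assumptions: the connector below is a plain callable, and the pipeline is a plain list. `FlatteningActionConnector`, `build_action_pipeline`, `run_pipeline`, and `flatten_leaves` are all hypothetical names written for this example; none of this is the RLlib connector API.

```python
from typing import Any, Callable, List

ActionConnector = Callable[[Any], Any]


def flatten_leaves(action: Any) -> List[float]:
    """Hypothetical helper: flatten a nested action into one flat list of floats."""
    if isinstance(action, dict):
        return [x for key in sorted(action) for x in flatten_leaves(action[key])]
    if isinstance(action, (list, tuple)):
        return [x for item in action for x in flatten_leaves(item)]
    return [float(action)]


class FlatteningActionConnector:
    """Hypothetical connector whose only job is to flatten the action."""

    def __call__(self, action: Any) -> Any:
        return flatten_leaves(action)


def build_action_pipeline(disable_action_flattening: bool) -> List[ActionConnector]:
    """Add the flattening connector only when flattening is not disabled."""
    pipeline: List[ActionConnector] = []
    if not disable_action_flattening:
        pipeline.append(FlatteningActionConnector())
    return pipeline


def run_pipeline(pipeline: List[ActionConnector], action: Any) -> Any:
    """Apply each connector in order."""
    for connector in pipeline:
        action = connector(action)
    return action


pipeline = build_action_pipeline(disable_action_flattening=False)
# Inspecting the pipeline shows at a glance whether actions get flattened.
print([type(connector).__name__ for connector in pipeline])
print(run_pipeline(pipeline, {"a": 1, "b": (2, 3)}))
```

The design benefit being argued for is visibility: the flattening step shows up by name in the pipeline rather than being hidden inside the collector.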

avnishn · 2022-11-19T15:07:10Z

Action flattening should happen only during training, not during inference.

Is there a way to write an action connector that is invoked only during training?

avnishn · 2022-11-19T15:09:24Z

The action should be fed to the environment unflattened, based on some flag that I should be able to set on the connector.
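One way to picture the requirement, as a sketch only: a connector carries a flag, the environment always receives the original nested action, and only the copy recorded for training is flattened. `TrainingOnlyFlattener`, its `is_training` flag, `step`, and `flatten_leaves` are hypothetical names invented for this example; whether the real connector framework supports such a flag is exactly the open question above.

```python
from typing import Any, List


def flatten_leaves(action: Any) -> List[float]:
    """Hypothetical helper: flatten a nested action into one flat list of floats."""
    if isinstance(action, dict):
        return [x for key in sorted(action) for x in flatten_leaves(action[key])]
    if isinstance(action, (list, tuple)):
        return [x for item in action for x in flatten_leaves(item)]
    return [float(action)]


class TrainingOnlyFlattener:
    """Hypothetical connector that flattens only when is_training is set."""

    def __init__(self, is_training: bool) -> None:
        self.is_training = is_training

    def __call__(self, action: Any) -> Any:
        return flatten_leaves(action) if self.is_training else action


def step(action: Any, connector: TrainingOnlyFlattener, buffer: List[Any]) -> Any:
    """Record the connector's output for the learner; return the raw action for the env."""
    buffer.append(connector(action))
    return action


nested_action = {"move": (0.5, -1.0), "gripper": 1}

training_buffer: List[Any] = []
env_action = step(nested_action, TrainingOnlyFlattener(is_training=True), training_buffer)

inference_buffer: List[Any] = []
step(nested_action, TrainingOnlyFlattener(is_training=False), inference_buffer)
```

In both cases `env_action` keeps its nested structure; only what lands in the buffer differs.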

he is ooo

avnishn requested review from sven1977, gjoliver, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners November 18, 2022 03:52

ArturNiederfahrenhorst reviewed Nov 18, 2022

kouroshHakha reviewed Nov 18, 2022

kouroshHakha reviewed Nov 18, 2022

Actual action flattening fix

1b26e68

This commit flattens actions only if action flattening is not disabled. It flattens each action as elements are added to the agent_collector buffer.

Signed-off-by: Avnish <[email protected]>

avnishn force-pushed the fix_test_nested_action_spaces_connectors branch from 7818310 to 1b26e68 November 18, 2022 22:34

avnishn added 2 commits November 18, 2022 14:35

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

aac7257

…test_nested_action_spaces_connectors

Enable connectors for testing

70beb1f

Signed-off-by: Avnish <[email protected]>

kouroshHakha reviewed Nov 18, 2022

avnishn added 3 commits November 18, 2022 16:27

Refactor boolean

55bd5fb

Signed-off-by: Avnish <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

a6f6e88

…test_nested_action_spaces_connectors

Disable enable_connectors

1cd8a30

Signed-off-by: Avnish <[email protected]>

kouroshHakha approved these changes Nov 19, 2022

gjoliver previously requested changes Nov 19, 2022

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

5b94475

…test_nested_action_spaces_connectors

sven1977 merged commit 760fbc4 into ray-project:master Nov 21, 2022

WeichenXu123 pushed a commit to WeichenXu123/ray that referenced this pull request Dec 19, 2022

[RLlib] [Connectors] Fix test nested action spaces connectors. (ray-p…

39213a6

…roject#30459) Signed-off-by: Weichen Xu <[email protected]>
