[RLlib] Added `expectation` advantage_type to CRR #26142

kouroshHakha · 2022-06-27T22:48:58Z

Why are these changes needed?

This allows the advantage to be computed analytically when the action space is discrete.
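For readers unfamiliar with the idea, here is a minimal sketch of what an analytical ("expectation") advantage can look like for discrete actions. It is not the code from this PR. It assumes the state value is the policy-weighted sum of the Q-values over all actions, so no action sampling is needed. The function names and the numbers are hypothetical.

```python
import math


def softmax(logits):
    """Convert unnormalized action logits into probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expectation_advantage(q_values, logits, action):
    """Return Q(s, a) - sum over a' of pi(a'|s) * Q(s, a').

    Assumption: V(s) is the exact expectation of Q under the policy,
    which is only tractable when the action set is finite.
    """
    probs = softmax(logits)
    v = sum(p * q for p, q in zip(probs, q_values))
    return q_values[action] - v


# Hypothetical numbers for one state with three discrete actions.
q_values = [1.0, 2.0, 4.0]
logits = [0.0, 0.0, 0.0]  # equal logits give a uniform policy
adv = expectation_advantage(q_values, logits, action=2)
```

Because the value is an exact expectation, the advantages weighted by the policy probabilities sum to zero, which is a convenient sanity check for an implementation along these lines.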

Related issue number

Checks

- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

sven1977 · 2022-06-28T13:39:16Z

rllib/BUILD

@@ -281,6 +281,20 @@ py_test(
    args = ["--yaml-dir=tuned_examples/crr", '--framework=torch']
 )

+py_test(


sven1977

LGTM. Thanks for adding this functionality and the corresponding test cases, @kouroshHakha!

kouroshHakha added 3 commits June 27, 2022 15:38

added expectation as a possible adv_type for discrete actions in CRR

46ba041

fixed lint

649510a

added tests + temp to pendulum config

1e2dcdf

kouroshHakha requested review from sven1977, gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla and krfricke as code owners June 27, 2022 22:48

kouroshHakha assigned gjoliver Jun 27, 2022

sven1977 reviewed Jun 28, 2022

Nice.

sven1977 approved these changes Jun 28, 2022

sven1977 merged commit f421730 into ray-project:master Jun 28, 2022
