[RLlib]: Fix FQE Policy call #26671

Rohan138 · 2022-07-18T19:11:08Z

Why are these changes needed?

Hotfix to make OPE work for both PolicyV2 and DQNTorchPolicy (Policyv1)

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: rapotdar <[email protected]>

kouroshHakha

LGTM.

richardliaw

what's the reason for not having a unit test here?

kouroshHakha · 2022-07-19T00:03:00Z

rllib/offline/estimators/tests/test_ope.py

@@ -162,6 +166,44 @@ def test_ope_in_algo(self):
        print(*list(std_est.items()), sep="\n")
        print("\n\n\n")

+    def test_fqe_model(self):
+        # Miscellaneous tests for FQETorchModel


write a docstring on what this function is testing?

kouroshHakha

Tests are added. Reviewed again and approved @richardliaw

Signed-off-by: rapotdar <[email protected]>

Signed-off-by: Xiaowei Jiang <[email protected]>

Signed-off-by: Stefan van der Kleij <[email protected]>

Fix FQE Policy

ce3b3d5

Rohan138 requested review from sven1977, gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners July 18, 2022 19:11

Rohan138 assigned kouroshHakha and Rohan138 Jul 18, 2022

Rohan138 added rllib RLlib related issues rllib-offline-rl Offline RL problems labels Jul 18, 2022

Rohan138 changed the title ~~[RLlib]: Fix FQE Policy~~ [RLlib]: Fix FQE Policy call Jul 18, 2022

Rohan138 added 4 commits July 18, 2022 12:13

fix

4cb387d

Signed-off-by: rapotdar <[email protected]>

fix

7038f47

Signed-off-by: rapotdar <[email protected]>

Add copy to fqe

a8172e8

fix

e2a0211

Signed-off-by: rapotdar <[email protected]>

kouroshHakha approved these changes Jul 18, 2022

View reviewed changes

richardliaw requested changes Jul 18, 2022

View reviewed changes

Rohan138 added 2 commits July 18, 2022 16:49

Add FQE unittests

177ffa7

Add numpy()

560ac3d

kouroshHakha reviewed Jul 19, 2022

View reviewed changes

kouroshHakha approved these changes Jul 19, 2022

View reviewed changes

Rohan138 added 2 commits July 18, 2022 17:06

Improve comment

e241cd4

Improve comment

4d38f5b

Signed-off-by: rapotdar <[email protected]>

Rohan138 force-pushed the ope-more-fixes branch from 77aef7f to 4d38f5b Compare July 19, 2022 03:37

richardliaw approved these changes Jul 19, 2022

View reviewed changes

richardliaw merged commit 4fded80 into ray-project:master Jul 19, 2022

xwjiang2010 pushed a commit to xwjiang2010/ray that referenced this pull request Jul 19, 2022

[RLlib]: Fix FQE Policy call (ray-project#26671)

7d0dcbf

Signed-off-by: Xiaowei Jiang <[email protected]>

Stefan-1313 pushed a commit to Stefan-1313/ray_mod that referenced this pull request Aug 18, 2022

[RLlib]: Fix FQE Policy call (ray-project#26671)

28c7eab

Signed-off-by: Stefan van der Kleij <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib]: Fix FQE Policy call #26671

[RLlib]: Fix FQE Policy call #26671

Rohan138 commented Jul 18, 2022 •

edited

Loading

kouroshHakha left a comment

richardliaw left a comment

kouroshHakha Jul 19, 2022

kouroshHakha left a comment

[RLlib]: Fix FQE Policy call #26671

[RLlib]: Fix FQE Policy call #26671

Conversation

Rohan138 commented Jul 18, 2022 • edited Loading

Why are these changes needed?

Checks

kouroshHakha left a comment

Choose a reason for hiding this comment

richardliaw left a comment

Choose a reason for hiding this comment

kouroshHakha Jul 19, 2022

Choose a reason for hiding this comment

kouroshHakha left a comment

Choose a reason for hiding this comment

Rohan138 commented Jul 18, 2022 •

edited

Loading