[RLlib]: Raise deprecation warning in MARWIL OPE methods #26893

Merged 6 commits on Jul 23, 2022

Rohan138
Contributor

MARWIL currently defaults to off_policy_estimation_methods = {"is": {"type": ImportanceSampling}, "wis": {"type": WeightedImportanceSampling}}, whereas all of the other algorithms default to {}. This default should be deprecated now and removed in a future release. We can't remove it outright because some users may be relying on MARWIL's current default.
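As a rough illustration of the approach described above, here is a minimal sketch in plain Python. It does not use RLlib; `SketchMARWILConfig`, its `evaluation()` and `validate()` methods, the `_ope_set_by_user` flag, and the stub estimator classes are all hypothetical names chosen for the example, and the actual change in this PR may be structured differently. The idea is to keep the legacy default for now but warn whenever it is used implicitly.

```python
import warnings


# Hypothetical stand-ins for the estimator classes named in the description.
class ImportanceSampling:
    pass


class WeightedImportanceSampling:
    pass


class SketchMARWILConfig:
    """Hypothetical config object; NOT RLlib's actual MARWILConfig."""

    def __init__(self):
        # Legacy default that is slated for removal.
        self.off_policy_estimation_methods = {
            "is": {"type": ImportanceSampling},
            "wis": {"type": WeightedImportanceSampling},
        }
        # Tracks whether the user chose the value explicitly (assumed flag).
        self._ope_set_by_user = False

    def evaluation(self, *, off_policy_estimation_methods=None):
        # Any explicit value, including {}, counts as a user choice.
        if off_policy_estimation_methods is not None:
            self.off_policy_estimation_methods = off_policy_estimation_methods
            self._ope_set_by_user = True
        return self

    def validate(self):
        # Warn only when the non-empty legacy default is used implicitly.
        if not self._ope_set_by_user and self.off_policy_estimation_methods:
            warnings.warn(
                "The non-empty default for off_policy_estimation_methods "
                "is deprecated; set it explicitly.",
                DeprecationWarning,
                stacklevel=2,
            )
```

With this shape, users who rely on the old default keep their behavior and see a warning, while users who pass a value explicitly (for example `{}`) see nothing.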

Closes #26667

Checks

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Rohan138 <[email protected]>
Member

@avnishn avnishn left a comment

love it

# TODO: Remove this when the off_policy_estimation_methods
# default config is removed from MARWIL
# No off-policy estimation.
self.off_policy_estimation_methods = {}
Contributor

Nice :)

Contributor

@ArturNiederfahrenhorst ArturNiederfahrenhorst left a comment

Any implications to our CI?

@Rohan138
Contributor Author

Nope, but there might be a few minor changes once we actually do the eventual deprecation.

@sven1977 sven1977 merged commit a53bbe4 into ray-project:master Jul 23, 2022
Rohan138 added a commit to Rohan138/ray that referenced this pull request Jul 28, 2022
Stefan-1313 pushed a commit to Stefan-1313/ray_mod that referenced this pull request Aug 18, 2022
Development

Successfully merging this pull request may close these issues.

[RLlib] MARWIL does is+wis OPE by default
4 participants