[RLlib] Finish testing matthewearl's Gaussian squashed gaussian PR #13292

sven1977 · 2021-01-08T10:31:33Z

This PR follows up on Matthew Earl's PR, which adds a GaussianSquashedGaussian distribution (with entropy and KL methods) for use with PPO.

#7609
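For readers unfamiliar with the idea, here is a minimal, illustrative sketch of why such a distribution can offer entropy and KL in closed form. It is not the code from this PR. It assumes, as the name suggests, that a Gaussian sample x ~ N(loc, scale^2) is squashed into (low, high) through the standard normal CDF. The class name `GaussianSquashedGaussianSketch` and all of its parameters are hypothetical.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # Phi, the squashing function assumed in this sketch
LOG_SQRT_2PI = 0.5 * math.log(2.0 * math.pi)


class GaussianSquashedGaussianSketch:
    """Hypothetical: y = low + (high - low) * Phi(x), x ~ N(loc, scale**2)."""

    def __init__(self, loc, scale, low=-1.0, high=1.0):
        assert scale > 0.0 and high > low
        self.loc, self.scale, self.low, self.high = loc, scale, low, high

    def sample(self, rng=random):
        # Draw from the underlying Gaussian, then squash into (low, high).
        x = rng.gauss(self.loc, self.scale)
        return self.low + (self.high - self.low) * STD_NORMAL.cdf(x)

    def logp(self, y):
        # Invert the squashing; y must lie strictly inside (low, high).
        x = STD_NORMAL.inv_cdf((y - self.low) / (self.high - self.low))
        log_px = (-0.5 * ((x - self.loc) / self.scale) ** 2
                  - math.log(self.scale) - LOG_SQRT_2PI)
        # Change of variables: log|dy/dx| = log(high - low) + log phi(x).
        log_jac = math.log(self.high - self.low) - 0.5 * x * x - LOG_SQRT_2PI
        return log_px - log_jac

    def entropy(self):
        # H[Y] = H[X] + E[log|dy/dx|], using E[x^2] = loc^2 + scale^2.
        h_x = 0.5 + LOG_SQRT_2PI + math.log(self.scale)
        e_log_jac = (math.log(self.high - self.low)
                     - 0.5 * (self.loc ** 2 + self.scale ** 2)
                     - LOG_SQRT_2PI)
        return h_x + e_log_jac

    def kl(self, other):
        # KL is invariant under a shared bijection, so it equals the KL
        # between the two underlying Gaussians.
        assert (self.low, self.high) == (other.low, other.high)
        return (math.log(other.scale / self.scale)
                + (self.scale ** 2 + (self.loc - other.loc) ** 2)
                / (2.0 * other.scale ** 2)
                - 0.5)
```

A quick sanity check under these assumptions: with loc=0, scale=1 and bounds (0, 1), the squashed variable is uniform on (0, 1), so both its entropy and its log-density are zero.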

Why are these changes needed?

Related issue number

Checks

- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

diegoferigo · 2021-06-18T17:28:05Z

Is there any plan to finalize this PR? Alternatively, is there any way to use a fixed value for the variance of the policy distribution (perhaps via the free_log_std parameter)?

bveeramani · 2022-01-30T05:57:03Z

‼️ ACTION REQUIRED ‼️

We've switched our code formatter from YAPF to Black (see #21311).

To prevent issues with merging your code, here's what you'll need to do:

Install Black

pip install -I black==21.12b0

Format changed files with Black

curl -o format-changed.sh https://gist.githubusercontent.com/bveeramani/42ef0e9e387b755a8a735b084af976f2/raw/7631276790765d555c423b8db2b679fd957b984a/format-changed.sh
chmod +x ./format-changed.sh
./format-changed.sh
rm format-changed.sh

Commit your changes.

git add --all
git commit -m "Format Python code with Black"

Merge master into your branch.

git pull upstream master

Resolve merge conflicts (if necessary).

After running these steps, you'll have the updated format.sh.

stale · 2022-03-13T18:39:17Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

stale · 2022-03-28T02:16:51Z

Hi again! The issue will be closed because there has been no more activity in the 14 days since the last message.

Please feel free to reopen or open a new issue if you'd still like it to be addressed.

Again, you can always ask for help on our discussion forum or Ray's public slack channel.

Thanks again for opening the issue!

matthewearl and others added 15 commits March 15, 2020 16:04

Implement GaussianSquashedGaussian. Still buggy

8e63d3c

fix bug in gsg logp

005c524

Fix bugs in KL and entropy methods

ba69bb7

Initial attempt at integrating GSG into catalog

113fc4f

Still some bugs to fix

Fix up the shapes returned by SG

c8e53ce

Reformatting according to scripts/format.sh

f4521f7

code review markup

b0c2323

Bound loc for numerical stability

0e161fc

Merge branch 'master' of github.com:ray-project/ray into me/gsg

511eef6

Merge branch 'me/gsg' of github.com:matthewearl/ray into me/gsg

86527ec

Fix squashed gaussian unit test

f226d2e

Fix gaussian squashed gaussian following the previous commit

3e1d345

add test for gaussian squashed gaussian

9c9b8bc

linter fixes

731afbd

WIP.

a80db8b

sven1977 assigned michaelzhiluo Jan 8, 2021

sven1977 requested a review from michaelzhiluo January 9, 2021 14:18

michaelzhiluo approved these changes Jan 11, 2021

sven1977 added 12 commits January 11, 2021 22:45

Merge branch 'master' of https://github.com/ray-project/ray into me/gsg

cd9cef2

WIP.

7e89931

LINT.

9218430

Fix.

ed7d261

Merge branch 'master' of https://github.com/ray-project/ray into gaus…

544b730

…sian_squashed_gaussian

Torch version and LINT.

6098dda

LINT.

37f6986

Fix and LINT.

32f4201

Merge branch 'master' of https://github.com/ray-project/ray into gaus…

44d96f9

…sian_squashed_gaussian

wip

c61739c

Merge branch 'master' of https://github.com/ray-project/ray into gaus…

4f131af

…sian_squashed_gaussian

Merge branch 'master' of https://github.com/ray-project/ray into gaus…

c6319c1

…sian_squashed_gaussian # Conflicts: # rllib/models/catalog.py

sven1977 added 2 commits April 11, 2021 18:38

LINT.

ec3b6dc

fix and LINT.

4878362

stale bot added the stale label Mar 13, 2022

stale bot closed this Mar 28, 2022

sven1977 deleted the gaussian_squashed_gaussian branch June 2, 2023 20:14
