[RLlib] quick fix for learning rate schedule for APPO algorithm. #28013

gjoliver · 2022-08-19T10:17:30Z

Why are these changes needed?

LearningRateSchedule must be initialized after base.init() call.
Longer term, base classes should rely as little as possible on class member variables.
E.g., all these base modules could have simply taken config as input when necessary.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- [*] Unit tests
- Release tests
- This PR is not tested :(

avnishn · 2022-08-19T17:12:52Z

will need to check whether the appo vtrace failing test is related or is flaking

gjoliver · 2022-08-19T17:29:55Z

will need to check whether the appo vtrace failing test is related or is flaking

yeah, good call, I am gonna do it now.

Signed-off-by: Jun Gong <[email protected]>

gjoliver · 2022-08-19T21:33:16Z

will need to check whether the appo vtrace failing test is related or is flaking

yeah, good call, I am gonna do it now.

doesn't seem related. the test also doesn't use lr schedule at all.

[RLlib] quick fix for learning rate schedule for APPO algorithm.

b2bca98

gjoliver requested review from sven1977, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners August 19, 2022 10:17

gjoliver assigned sven1977 Aug 19, 2022

avnishn approved these changes Aug 19, 2022

View reviewed changes

lint

d095b68

Signed-off-by: Jun Gong <[email protected]>

richardliaw merged commit ec38b96 into ray-project:master Aug 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] quick fix for learning rate schedule for APPO algorithm. #28013

[RLlib] quick fix for learning rate schedule for APPO algorithm. #28013

gjoliver commented Aug 19, 2022

avnishn commented Aug 19, 2022

gjoliver commented Aug 19, 2022

gjoliver commented Aug 19, 2022

[RLlib] quick fix for learning rate schedule for APPO algorithm. #28013

[RLlib] quick fix for learning rate schedule for APPO algorithm. #28013

Conversation

gjoliver commented Aug 19, 2022

Why are these changes needed?

Related issue number

Checks

avnishn commented Aug 19, 2022

gjoliver commented Aug 19, 2022

gjoliver commented Aug 19, 2022