[RLlib] Make the KL coefficient traced in appo tf #34293

avnishn · 2023-04-12T00:49:09Z

previously, if the appo learner update was traced with tf.function, then the kl coefficient wouldn't be automatically be update because python side affects aren't allowed inside of tf.function traced functions.

This pr makes the kl coefficient a tf variable and then updates it in the loop. It also adds a test to check if the kl coefficient changed after training.

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Avnish <[email protected]>

kouroshHakha

Looks good. Just one comment.

rllib/algorithms/appo/tests/tf/test_appo_learner.py

Signed-off-by: Avnish <[email protected]>

…_appo_kl_coeff_traced

Signed-off-by: Avnish <[email protected]>

Signed-off-by: Avnish <[email protected]> Signed-off-by: elliottower <[email protected]>

Signed-off-by: Avnish <[email protected]> Signed-off-by: Jack He <[email protected]>

[RLlib] Make the KL coefficient traced in appo tf

b50c9ad

Signed-off-by: Avnish <[email protected]>

avnishn requested review from sven1977, gjoliver, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners April 12, 2023 00:49

avnishn assigned kouroshHakha Apr 12, 2023

kouroshHakha approved these changes Apr 12, 2023

View reviewed changes

rllib/algorithms/appo/tests/tf/test_appo_learner.py Outdated Show resolved Hide resolved

avnishn added 2 commits April 12, 2023 23:46

Address feedback:

ea0068b

Signed-off-by: Avnish <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into make…

2f28b2b

…_appo_kl_coeff_traced

gjoliver merged commit f1b14d2 into ray-project:master Apr 13, 2023

vitsai pushed a commit to vitsai/ray that referenced this pull request Apr 17, 2023

[RLlib] Make the KL coefficient traced in appo tf (ray-project#34293)

4df5dcf

Signed-off-by: Avnish <[email protected]>

elliottower pushed a commit to elliottower/ray that referenced this pull request Apr 22, 2023

[RLlib] Make the KL coefficient traced in appo tf (ray-project#34293)

dba4144

Signed-off-by: Avnish <[email protected]> Signed-off-by: elliottower <[email protected]>

ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request May 4, 2023

[RLlib] Make the KL coefficient traced in appo tf (ray-project#34293)

2e91f05

Signed-off-by: Avnish <[email protected]> Signed-off-by: Jack He <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Make the KL coefficient traced in appo tf #34293

[RLlib] Make the KL coefficient traced in appo tf #34293

avnishn commented Apr 12, 2023 •

edited

Loading

kouroshHakha left a comment

[RLlib] Make the KL coefficient traced in appo tf #34293

[RLlib] Make the KL coefficient traced in appo tf #34293

Conversation

avnishn commented Apr 12, 2023 • edited Loading

Why are these changes needed?

Related issue number

Checks

kouroshHakha left a comment

Choose a reason for hiding this comment

avnishn commented Apr 12, 2023 •

edited

Loading