[RLlib] Check that results has learner info appo test #34381

avnishn · 2023-04-13T21:16:33Z

The APPO KL coefficient learner test is flaky because it runs training only
until some results are returned. Training can run long enough that evaluation
results are available while learner results are not.
This PR fixes that by training until learner results are available,
not just evaluation results.
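
For illustration, the waiting logic described above can be sketched roughly as follows. This is not the code from the PR: `FakeAsyncAlgo`, `has_learner_info`, and `train_until_learner_results` are hypothetical names, and the layout of the results dict (learner stats under `results["info"]["learner"]`) is an assumption made for this example. The stand-in class only imitates an asynchronous algorithm whose first few `train()` calls return evaluation results without learner results.

```python
from typing import Any, Dict

# Assumed key under which learner stats appear in the results dict.
LEARNER_INFO = "learner"


class FakeAsyncAlgo:
    """Hypothetical stand-in for an asynchronous algorithm such as APPO."""

    def __init__(self, eval_only_iters: int = 2) -> None:
        self.calls = 0
        self._eval_only_iters = eval_only_iters

    def train(self) -> Dict[str, Any]:
        self.calls += 1
        # Evaluation results are present on every call in this sketch.
        results: Dict[str, Any] = {
            "evaluation": {"episode_reward_mean": 0.0},
            "info": {},
        }
        # Learner results only show up after a few iterations.
        if self.calls > self._eval_only_iters:
            results["info"][LEARNER_INFO] = {"default_policy": {"kl_coeff": 0.2}}
        return results


def has_learner_info(results: Dict[str, Any]) -> bool:
    """Return True only if the results carry non-empty learner info."""
    return bool(results.get("info", {}).get(LEARNER_INFO))


def train_until_learner_results(algo: Any, max_iters: int = 100) -> Dict[str, Any]:
    """Call train() until learner results arrive, with an upper bound."""
    for _ in range(max_iters):
        results = algo.train()
        if has_learner_info(results):
            return results
    raise RuntimeError(f"No learner results after {max_iters} iterations")
```

The `max_iters` bound is a design choice of this sketch so that a test fails loudly instead of hanging if learner results never arrive.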

Signed-off-by: Avnish [email protected]

Why are these changes needed?

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(


ArturNiederfahrenhorst

Thanks!



avnishn requested review from sven1977, gjoliver, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners April 13, 2023 21:16

ArturNiederfahrenhorst approved these changes Apr 13, 2023


gjoliver merged commit 4571f1c into ray-project:master Apr 13, 2023
