[RLlib] Aggregate impala learner info #25856

avnishn · 2022-06-16T15:55:27Z

Aggregate learner infos for the IMPALA training step function.

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

sven1977 · 2022-06-17T18:51:42Z

rllib/algorithms/impala/impala.py

@@ -795,7 +795,8 @@ def place_processed_samples_on_learner_queue(self) -> None:

    def process_trained_results(self) -> ResultDict:
        # Get learner outputs/stats from output queue.


The comment is outdated here. Could you explain via some more one-line comments what we do here?

sven1977

Looks great! Thanks for the fix, @avnishn. Could you just add a few more one-line comments explaining why we sometimes need to deepcopy the thread's infos (when there is nothing to process ...)?

avnishn · 2022-06-21T17:50:37Z

rllib/algorithms/impala/impala.py

@@ -795,7 +795,8 @@ def place_processed_samples_on_learner_queue(self) -> None:

    def process_trained_results(self) -> ResultDict:
        # Get learner outputs/stats from output queue.


Suggested change

# Get learner outputs/stats from output queue.

# Combine learner stats from the learner queue and update relevant timestep counters

What do you think of this, @sven1977?
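
For readers following the thread, here is a minimal, standalone sketch of the behavior the suggested comment describes: drain a learner output queue, combine the stats, and update a trained-steps counter. It is not the code from `impala.py`. The function name `aggregate_learner_results`, the `(num_steps_trained, info)` tuple layout, the flat dict of numeric stats, and the `"num_steps_trained"` counter key are all assumptions made for illustration. The deepcopy branch reflects one plausible reading of the review question: when nothing is queued, return a copy of the last known info so the caller never shares a dict that another thread may still mutate.

```python
import copy
import queue
from typing import Dict, List


def aggregate_learner_results(
    out_queue: "queue.Queue",
    last_info: Dict[str, float],
    counters: Dict[str, int],
) -> Dict[str, float]:
    """Hypothetical helper: combine queued learner stats and count trained steps."""
    infos: List[Dict[str, float]] = []

    # Drain everything currently in the queue without blocking.
    while True:
        try:
            steps, info = out_queue.get_nowait()
        except queue.Empty:
            break
        # Count steps *trained* (not sampled) for each processed batch.
        counters["num_steps_trained"] = counters.get("num_steps_trained", 0) + steps
        infos.append(info)

    if not infos:
        # Nothing to process: hand back a deep copy of the last known info
        # so the caller cannot observe or cause later mutations of it.
        return copy.deepcopy(last_info)

    # Average each stat over the batches that reported it.
    aggregated: Dict[str, float] = {}
    for key in infos[0]:
        values = [i[key] for i in infos if key in i]
        aggregated[key] = sum(values) / len(values)
    return aggregated
```

Averaging is only one possible reduction; the PR itself does not state which one is used.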

Aggregate impala learner info

e722d14

avnishn requested review from sven1977, gjoliver, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners June 16, 2022 15:55

ArturNiederfahrenhorst approved these changes Jun 16, 2022

avnishn added 4 commits June 16, 2022 18:00

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

6ea812a

…impala_learner_info_2

Change scheduler variable to num steps trained, not sampled

f958cac

Lint

cae81e1

Merge branch 'master' of https://github.com/ray-project/ray into fix_…

8770b8f

…impala_learner_info_2

sven1977 reviewed Jun 17, 2022

sven1977 reviewed Jun 17, 2022

ArturNiederfahrenhorst approved these changes Jun 17, 2022

avnishn commented Jun 21, 2022

sven1977 merged commit 871aef8 into ray-project:master Jun 22, 2022
