[RLlib] Fix stateful module errors with inference only mode. #45465

simonsays1980 · 2024-05-21T13:25:18Z

Why are these changes needed?

Stateful models need to collect states from the critic during sampling, therefore they cannot be inference-only. This PR fixes this error by setting inference-only=False for stateful modules and checking statefulness in the get_state.

Related issue number

Related to #44758

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Simon Zehnder <[email protected]>

…' mode. Signed-off-by: Simon Zehnder <[email protected]>

Signed-off-by: Simon Zehnder <[email protected]>

sven1977 · 2024-05-21T15:58:12Z

rllib/algorithms/ppo/ppo_rl_module.py

@@ -20,6 +20,9 @@ class PPORLModule(RLModule, abc.ABC):
    def setup(self):
        # __sphinx_doc_begin__
        catalog = self.config.get_catalog()
+        # If we have a stateful model states for the critic need to be collected


Shouldn't we also use is_stateful() here? What if the user doesn't use the built-in use_lstm option, but comes with their own stateful model?

@sven1977 this was my first intend, however at this point in time is_stateful() cannot be called, yet b/c the encoder is not yet defined.

I agree that this is not a nice solution, but at this point in the code we need to know, if the module is stateful or not, but the is_stateful() depends on the encoder which is defined depending on inference-only being True/False.

sven1977

Just one question about use_lstm not being generic enough as a criterion to determine statefulnes..

…ecurrentEncoderConfig' and added an additional check for 'inference-only' b/c negation resulted in learner modules being 'inference-only'. This is fixed now. Signed-off-by: Simon Zehnder <[email protected]>

simonsays1980 · 2024-05-21T17:52:34Z

Just one question about use_lstm not being generic enough as a criterion to determine statefulnes..

I replaced this now with a more generic approach using RecurrentEncoderConfig (we might need to define in the future a StatefulEncoderConfig when attention is joining the club).

simonsays1980 added 8 commits May 10, 2024 12:16

Changed comment.

c748df8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

6409007

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

d2f9030

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

a3416a8

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

8582ad9

Signed-off-by: Simon Zehnder <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray

b565f34

Signed-off-by: Simon Zehnder <[email protected]>

Fixed an error that occurred with stateful modules in 'inference-only…

f19120e

…' mode. Signed-off-by: Simon Zehnder <[email protected]>

LINTER.

d4f0e30

Signed-off-by: Simon Zehnder <[email protected]>

sven1977 reviewed May 21, 2024

View reviewed changes

sven1977 marked this pull request as ready for review May 21, 2024 15:58

sven1977 requested review from avnishn, ArturNiederfahrenhorst, maxpumperla and kouroshHakha as code owners May 21, 2024 15:58

sven1977 changed the title ~~[RLlib] - Fix stateful module errors with inference only mode~~ [RLlib] Fix stateful module errors with inference only mode. May 21, 2024

sven1977 approved these changes May 21, 2024

View reviewed changes

Replaced testing for 'use_lstm' with a more generic approach using 'R…

c26298d

…ecurrentEncoderConfig' and added an additional check for 'inference-only' b/c negation resulted in learner modules being 'inference-only'. This is fixed now. Signed-off-by: Simon Zehnder <[email protected]>

sven1977 enabled auto-merge (squash) May 22, 2024 10:21

github-actions bot added the go add ONLY when ready to merge, run all tests label May 22, 2024

sven1977 merged commit 1afa2ab into ray-project:master May 22, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Fix stateful module errors with inference only mode. #45465

[RLlib] Fix stateful module errors with inference only mode. #45465

simonsays1980 commented May 21, 2024 •

edited

Loading

sven1977 May 21, 2024

simonsays1980 May 21, 2024

simonsays1980 May 21, 2024

sven1977 left a comment

simonsays1980 commented May 21, 2024

[RLlib] Fix stateful module errors with inference only mode. #45465

[RLlib] Fix stateful module errors with inference only mode. #45465

Conversation

simonsays1980 commented May 21, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 May 21, 2024

Choose a reason for hiding this comment

simonsays1980 May 21, 2024

Choose a reason for hiding this comment

simonsays1980 May 21, 2024

Choose a reason for hiding this comment

sven1977 left a comment

Choose a reason for hiding this comment

simonsays1980 commented May 21, 2024

simonsays1980 commented May 21, 2024 •

edited

Loading