-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RLLib] (P1 regression) Fixing view requirements in compute actions #15856
[RLLib] (P1 regression) Fixing view requirements in compute actions #15856
Conversation
…e used_for_compute_actions in inference
@sven1977 Do you think this can be merged before releasing 1.4.0 ? |
The previous fix (which then broke) wasn't merged in 1.3.0, I'd really like to be able to tell griddly users to download a stable release rather than the unstable wheels to get anything to work. |
I agree with you, it is quite a pity it did made it in 1.4.0. it is a very annoying bug. It took me way too long to understand why my code was not working, and there is no clean way to get around it apart from patch ray.rllib. |
Hey @Bam4d , @duburcqa , I'm really sorry for the hold-up. However, please be aware that we have limited resources to get these PRs reviewed and merged. Also, if you have noticed, your PR has some failing test cases, which someone has to look into (and debug) before we can merge this! If all tests were pass, we could have merged this some time ago. |
…_for_compute_actions_ignored
Ok, I looked into this again. Thanks for the fix - again :)
|
…_for_compute_actions_ignored
All failing tests passing locally now, just waiting for CI ... |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for raising this issue and for the suggested fix @Bam4d .
Had to make 2 small adjustments for this to also work with legacy algos (e.g. QMIX). Looks all good now.
@sven1977 Amazing ! Thank you for fixing and merging this PR 👍 |
Why are these changes needed?
@sven1977 This is a regression of a previous bug I raised. Griddly needs this to work!
The view requirments for inference do not just include the model's view requirements.
get_inference_input_dict
needs to return the input dict with all of the view requirements for the whole policy rather than just the model, otherwise all theused_for_compute_actions
get ignored.Related issue number
This bug is a regression of this fix
Regression from this fix: #14386
Closes #14385
Checks
scripts/format.sh
to lint the changes in this PR.