[RLlib] PyFlyt QuadX WayPoints #44758
@sven1977 @simonsays1980 can you take a look, please?
Awesome PR - the example is very nice! The requested changes mainly concern our move from the old to the new stack. @peterghaddad if you need help somewhere or want me to adapt the code, please ping me.
We appreciate your work @peterghaddad ! This is really a nice example I have not thought about before. Hope to see more from you in the future!
@simonsays1980 addressed feedback! Let me know if there is any more feedback!
@peterghaddad Thanks for the comments! The error occurring in the new stack needs a deeper look. I have a guess where it is rooted, and I will come back to you when I know more.
@simonsays1980 Appreciate you diving in! From a quality and training perspective, I think this example is good to go. LSTM isn't necessary for us to move forward; however, I'm glad we are finding things that can be fixed!
@simonsays1980 are we good to move forward here? Also, thanks for diving into the use_lstm fix.
LGTM. Needs to pull the newest master with a fix for LSTM. Needs a cleanup before or after merge to include RLlib test utils.
Included RLlib test utils!
Signed-off-by: peterghaddad <[email protected]>
Signed-off-by: peterghaddad <[email protected]>
LGTM. Awesome PR!
@simonsays1980 thanks for approving! I am still seeing the same error with LSTM. Let's create another issue and address it there, since the agent converges without LSTM. Once you reapprove, will you also be the one to merge it in? Thanks again for all the review!
LGTM!
c84d54f to 6c0a7e9
Signed-off-by: peterghaddad <[email protected]>
Signed-off-by: peterghaddad <[email protected]>
All good to go. Sorry @peterghaddad for the many delays! Will auto-merge this once all tests pass ...
Signed-off-by: Richard Liu <[email protected]>
Why are these changes needed?
An example to leverage the PyFlyt environment with RLlib enabling users to train drones in the PyBullet physics engine.
Related issue number
Closes #44738
Checks
git commit -s) in this PR.
scripts/format.sh to lint the changes in this PR.
method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.