Hindsight Experience Replay #6
Comments
Hi, what's the current status of this?
I'm currently working on it (on-and-off) on the following branch of my personal fork: https://github.com/prabhatnagarajan/pfrl/tree/her. I'm planning to apply HER to the bit-flip environment from the original paper that introduced HER. I'm fairly confident the Hindsight Experience Replay implementation is sound, as we've used a variant of it successfully in other projects. However, performance on the bit-flip environment is currently poor and needs investigation.
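For readers unfamiliar with the bit-flip task, here is a minimal sketch of it as described in the HER paper: the state and goal are bit strings, each action flips one bit, and the reward is -1 until the state matches the goal. The class name `BitFlipEnv`, its method signatures, and the episode limit of `n_bits` steps are assumptions made for illustration; this is not the environment on the branch above, nor part of PFRL.

```python
import random


class BitFlipEnv:
    """Hypothetical minimal bit-flip environment (illustrative, not PFRL's API)."""

    def __init__(self, n_bits=8, seed=None):
        self.n_bits = n_bits
        self.rng = random.Random(seed)
        self.state = None
        self.goal = None
        self.steps = 0

    def reset(self):
        # Sample a random start state and a random goal of the same length.
        self.state = [self.rng.randint(0, 1) for _ in range(self.n_bits)]
        self.goal = [self.rng.randint(0, 1) for _ in range(self.n_bits)]
        self.steps = 0
        return list(self.state), list(self.goal)

    def step(self, action):
        # The action is the index of the bit to flip.
        self.state[action] ^= 1
        self.steps += 1
        success = self.state == self.goal
        # Sparse reward: 0 on reaching the goal, -1 otherwise.
        reward = 0.0 if success else -1.0
        # Assumed episode limit: at most n_bits steps.
        done = success or self.steps >= self.n_bits
        return list(self.state), reward, done
```

With a sparse reward like this, a random policy almost never sees a non-negative reward once `n_bits` is large, which is the situation HER is meant to address.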
Ah cool, thanks for the update.
HER requires that we make updates to the agent's policy+Q-function at the end of the episode. But PFRL assumes that an
Note that the HindsightReplayBuffer extends the EpisodicReplayBuffer. If you look at the data structures within the EpisodicReplayBuffer, you can see that the episodic buffer maintains a
About the use of
Does this answer your question? If not, feel free to clarify and I'll do my best to answer.
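To make the need for whole episodes concrete, here is a rough sketch of hindsight relabeling using the "final" strategy from the HER paper: once an episode ends, every transition is stored a second time with the goal replaced by the state the episode actually reached, and the reward is recomputed for that goal. The dictionary keys and the functions `reward_fn` and `relabel_final` are hypothetical names chosen for this example; they are not taken from the HindsightReplayBuffer on the branch or from PFRL.

```python
def reward_fn(achieved, goal):
    # Sparse goal-conditioned reward: 0 if the goal is met, -1 otherwise.
    return 0.0 if achieved == goal else -1.0


def relabel_final(episode):
    """Relabel a finished episode with its final achieved state as the goal.

    `episode` is a list of dicts with keys "state", "action", "next_state",
    and "goal" (an assumed layout, for illustration only).
    """
    # The substitute goal is only known once the episode is over.
    new_goal = episode[-1]["next_state"]
    relabeled = []
    for t in episode:
        relabeled.append({
            "state": t["state"],
            "action": t["action"],
            "next_state": t["next_state"],
            "goal": new_goal,
            "reward": reward_fn(t["next_state"], new_goal),
        })
    return relabeled
```

Because the substitute goal comes from the last transition, relabeling cannot happen step by step; the buffer has to keep transitions grouped by episode, which is consistent with building on an episodic buffer.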
Hindsight Experience Replay with bit-flipping example: https://arxiv.org/abs/1707.01495