
ReplayBuffer size with on-policy algorithm #1161

Answered by MischaPanch
jvasso asked this question in Q&A

@jvasso essentially yes, the buffer size should not be smaller. A larger buffer is not a problem, since the buffer is reset before the next learning step.
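To illustrate the point about buffer size, here is a minimal sketch in plain Python. It does not use Tianshou's actual classes: the `deque` is only a stand-in for a ring buffer, and `collect_then_update`, `capacity`, and `steps_per_collect` are hypothetical names. It also assumes the comparison in question is between the buffer size and the number of transitions collected before each learning step.

```python
from collections import deque


def collect_then_update(capacity: int, steps_per_collect: int) -> int:
    """Return how many freshly collected transitions are still in the
    buffer when the learning step starts (toy model, not Tianshou code)."""
    # Stand-in for a ring replay buffer: once full, the oldest entry is dropped.
    buffer = deque(maxlen=capacity)

    # Collect phase: store one (dummy) transition per environment step.
    for t in range(steps_per_collect):
        buffer.append(t)

    # Learning step: an on-policy update would use everything collected.
    usable = len(buffer)

    # The buffer is reset before the next learning step, so spare capacity
    # never leaks stale data into a later update.
    buffer.clear()
    return usable


too_small = collect_then_update(capacity=100, steps_per_collect=256)
exact = collect_then_update(capacity=256, steps_per_collect=256)
oversized = collect_then_update(capacity=10_000, steps_per_collect=256)
```

In this toy model, the undersized buffer silently loses part of the collected batch, while the exact and oversized buffers keep the same number of transitions, so the extra capacity only costs memory.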

Having to deal with the buffer size at all for an on-policy algorithm is one of Tianshou's main technical debts. We're planning to address this with a major refactoring that will be part of Tianshou 2.0.0.

Answer selected by jvasso