-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] Add stochastic muzero implementation #77
Comments
Hey, this is on the roadmap however i dont have any immediate plans to implement this. If you'd like to give it a shot, id be more than happy to review it and assist with development. otherwise, it might be a while until this is implemented. |
Let me try then. I had a little difficult with the loss function. If you could help me in this part would be great. |
@EdanToledo PR #78 created. |
Hey, I havent forgotten about this. Sorry its an important PR and will hopefully get to it asap. |
Hey Edan, could I help you in another point to get this implemented? Regards. |
Hey Fernando, I'm sorry about the delay, I just haven't had time to complete something like this. Stochastic MuZero is a non-trivial algorithm that i would need to gain a good understanding of to ensure the algorithm is implemented correctly. Currently, I havent had too much time to do non-priority features. I promise i will get around to this at some point but i really dont have an ETA. Ideally, if there was more contributors and maintainers to this project it would be easier. |
Add stochastic muzero implementation - paper and the pseudocode
With this improved version of muzero the stoic could be able to train stochastic environments like the 2048 game and poker (leduc poker)
The text was updated successfully, but these errors were encountered: