[FEATURE] Add stochastic muzero implementation #77

ipsec · 2024-05-08T23:23:02Z

Add a Stochastic MuZero implementation (see the paper and the pseudocode).

With this improved version of MuZero, Stoix would be able to train on stochastic environments such as the game 2048 and poker (Leduc poker).

EdanToledo · 2024-05-09T10:17:46Z

Hey, this is on the roadmap; however, I don't have any immediate plans to implement it. If you'd like to give it a shot, I'd be more than happy to review it and assist with development. Otherwise, it might be a while until this is implemented.

ipsec · 2024-05-09T10:49:42Z

Let me try, then. I had some difficulty with the loss function. If you could help me with that part, it would be great.

ipsec · 2024-05-15T17:43:06Z

@EdanToledo PR #78 created.
As I said, I am having difficulty with the loss function, so a careful review is necessary.

EdanToledo · 2024-06-15T15:11:32Z

Hey, I haven't forgotten about this. Sorry, it's an important PR and I will hopefully get to it ASAP.

ipsec · 2024-09-05T10:53:56Z

Hey Edan, is there anything else I could help with to get this implemented?

Regards.

EdanToledo · 2024-09-12T09:32:40Z

Hey Fernando, I'm sorry about the delay; I just haven't had time to complete something like this. Stochastic MuZero is a non-trivial algorithm that I would need to understand well to ensure it is implemented correctly. Currently, I haven't had much time for non-priority features. I promise I will get around to this at some point, but I really don't have an ETA. Ideally, if there were more contributors and maintainers on this project, it would be easier.

ipsec added the enhancement label (New feature or request) May 8, 2024

ipsec changed the title ~~[FEATURE]~~ [FEATURE] Add stochastic muzero implementation May 9, 2024

EdanToledo added the Roadmap label (On the roadmap and will be addressed in time) May 9, 2024

ipsec linked a pull request May 15, 2024 that will close this issue

Add stochastic muzero #78

Open

EdanToledo linked a pull request Jun 15, 2024 that will close this issue

Add stochastic muzero #78

Open
