How exactly does Alphazero's MCTS work? #64

SheldonCurtiss · 2021-08-17T19:00:32Z

Is it directly simulating future boards or is it simulating predicted future boards if that makes sense?

My understanding is it's directly simulating future boards is that correct?

jonathan-laurent · 2021-08-17T19:22:04Z

The question is too vague to answer precisely.
AlphaZero's MCTS uses a perfect simulator of the environment to plan possible future scenarios but which scenarios are explored still depends on the neural network's heuristics.
In contrast, MuZero does not have access to an environment simulator during planning and explores futures scenarios using a learned state-transition model.

SheldonCurtiss closed this as completed Aug 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How exactly does Alphazero's MCTS work? #64

How exactly does Alphazero's MCTS work? #64

SheldonCurtiss commented Aug 17, 2021

jonathan-laurent commented Aug 17, 2021

How exactly does Alphazero's MCTS work? #64

How exactly does Alphazero's MCTS work? #64

Comments

SheldonCurtiss commented Aug 17, 2021

jonathan-laurent commented Aug 17, 2021