You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- MuZero [[Julian Schrittwieser et al.: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model](https://arxiv.org/abs/1911.08265)]
9
10
- AlphaZero as regularized policy optization [[Jean-Bastien Grill et al.: Monte-Carlo Tree Search as Regularized Policy Optimization](https://arxiv.org/abs/2007.12509)]
0 commit comments