-
Notifications
You must be signed in to change notification settings - Fork 5
References
Itomigna2 edited this page Mar 5, 2024
·
11 revisions
Human-Timescale Adaptation in an Open-Ended Task Space (Adaptive agent, Ada)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Muesli: Combining Improvements in Policy Optimization
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model(MuZero)
https://github.com/werner-duvaud/muzero-general
https://github.com/werner-duvaud/muzero-general/wiki/How-MuZero-works