References

Jump to bottom

Itomigna2 edited this page Mar 5, 2024 · 11 revisions

References

Papers

Human-Timescale Adaptation in an Open-Ended Task Space (Adaptive agent, Ada)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Muesli: Combining Improvements in Policy Optimization

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model(MuZero)

Repos

https://github.com/werner-duvaud/muzero-general

https://github.com/werner-duvaud/muzero-general/wiki/How-MuZero-works

https://github.com/facebookresearch/torchbeast/tree/main

Toggle table of contents Pages 5

Clone this wiki locally