-
-
Notifications
You must be signed in to change notification settings - Fork 111
Roadmap
Jun Tian edited this page Feb 14, 2019
·
13 revisions
- Rewrite core
- Distributed RL
- Some References:
- Ray Rllib: https://github.com/ray-project/ray/blob/master/doc/source/rllib.rst
- papers: Ray RLlib: http://proceedings.mlr.press/v80/liang18b.html Ray: https://arxiv.org/abs/1712.05889
- talks: Good Intro to distributed RL (first ~25 mins) then discussion of Ray RLlib (next ~15 mins, the rest of the talk is on imitation learning), by two of the RLlib authors: https://www.youtube.com/watch?v=Y6feXBY6_XQ&t=0s&list=PLkFD6_40KJIxJMR-j5A1mkxK26gh_qg37&index=6
- blogs: https://rise.cs.berkeley.edu/blog/scaling-multi-agent-rl-with-rllib/
- Ape-x: https://openreview.net/forum?id=H1Dy---0Z
- uber single machine implementation: https://github.com/uber-research/ape-x
- R2D2: Extension of Ape-x with recurrent networks: https://openreview.net/forum?id=r1lyTjAqYX
- IMPALA: https://icml.cc/Conferences/2018/Schedule?showEvent=3093
- Uber Faster Neuro-evolution: https://eng.uber.com/accelerated-neuroevolution/
- Ray Rllib: https://github.com/ray-project/ray/blob/master/doc/source/rllib.rst
- Some References:
- HyperParameter Optimisation
- Support for easily deploying experiments
- More environments
- Bullet
- Box2D