There are several nice implementations of RL algorithms in
https://github.com/openai/baselines
But they're all implemented in TensorFlow and have several onerous dependencies, such as MPI. It would be nice to have these implementations in PyTorch, using something other than MPI for communication (maybe [gRPC](https://github.com/grpc/grpc)?).
Someone has started a reimplementation of the baselines in PyTorch here:
https://github.com/nadavbh12/baselines-pytorch
but they've only implemented A2C so far. It would be nice if we also had implementations of ACKTR and PPO2.