PPO-Pytorch

Minimal implementation of PPO, running in Mujoco env, using Gym-mujoco. It is based on the code openai/baselines.

Now it is a Pytorch version and it works.

Requirement

Python3+
Pytorch 0.4
Mujoco
Gym, Mujoco_py

Using following command to train a model, more args can be set in 'main.py'.

git clone [email protected]:/qqadssp/PPO-Pytorch
cd PPO-Pytorch
python3 main.py --env Ant-v2

I have trained a model in Ant-v2, it is the file 'logdir/checkpoints/00100'. Using following command to run it.

python3 main.py --env Ant-v2 --updates 0 --play --checkpoint logdir/checkpoints/00100

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
env		env
logdir		logdir
util		util
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
main.py		main.py
ppo.py		ppo.py
runner.py		runner.py