GAIL

This project is implemented on two classic control problems, CartPole and Pendulum, which represent the discrete and continuous action-space cases respectively.

First, collect expert trajectories with the PPO algorithm.
Then use GAIL to imitate these expert trajectories.
The paper uses TRPO to optimize the policy network; here I use PPO with GAE instead.
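
For readers unfamiliar with GAE, the following is a minimal sketch of how generalized advantage estimates can be computed for one rollout. It is not taken from `gail.py`: the function name `compute_gae`, its parameters, and the default values of `gamma` and `lam` are assumptions made for illustration. In a GAIL setting, the `rewards` passed in would typically come from the discriminator rather than from the environment.

```python
from typing import List, Sequence, Tuple


def compute_gae(
    rewards: Sequence[float],
    values: Sequence[float],
    dones: Sequence[bool],
    last_value: float,
    gamma: float = 0.99,  # discount factor (assumed default)
    lam: float = 0.95,    # GAE smoothing parameter (assumed default)
) -> Tuple[List[float], List[float]]:
    """Return (advantages, returns) for a single rollout.

    rewards[t], values[t], dones[t] describe step t; last_value is the
    value estimate of the state that follows the final step.
    """
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    # Walk backwards so each step can reuse the advantage of the next one.
    for t in reversed(range(len(rewards))):
        # Do not bootstrap across an episode boundary.
        not_done = 0.0 if dones[t] else 1.0
        # One-step TD error.
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        # Exponentially weighted sum of TD errors.
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
        next_value = values[t]
    # Value-function targets are the advantages plus the value baseline.
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns
```

With `lam=1.0` and a zero value baseline, the advantages reduce to plain discounted returns, which is a quick way to sanity-check the function.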
