# Generative Adversarial Imitation Learning

## Gym environment

- Env: CartPole-v0
- State: continuous
- Action: discrete

## Generate expert data

Generate expert data with PPO: `python get_expert_data.py`

## Imitation Learning

Run: `python main.py`

## Result

Plot legend: orange: GAIL, blue: PPO

## Reference

[1] Generative Adversarial Imitation Learning
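
As a rough illustration of the imitation step for this environment (continuous state, discrete action), the sketch below trains a tiny logistic discriminator on (state, action) pairs and turns its output into a surrogate reward. It is not the code in `main.py`. The class `LogisticDiscriminator`, the feature layout (4 state values plus a one-hot action), the label convention (expert = 1, policy = 0), the reward form `-log(1 - D)`, and the synthetic toy batches are all assumptions made for this example. In practice the expert pairs would come from the data written by `get_expert_data.py` and the discriminator would usually be a neural network.

```python
import math
import random

STATE_DIM = 4   # CartPole observations have 4 continuous components
N_ACTIONS = 2   # CartPole has 2 discrete actions (push left / push right)


def features(state, action):
    """Concatenate the continuous state with a one-hot encoding of the action."""
    one_hot = [1.0 if i == action else 0.0 for i in range(N_ACTIONS)]
    return list(state) + one_hot


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class LogisticDiscriminator:
    """Hypothetical linear discriminator D(s, a) = P(pair came from the expert)."""

    def __init__(self):
        self.w = [0.0] * (STATE_DIM + N_ACTIONS)
        self.b = 0.0

    def prob_expert(self, state, action):
        x = features(state, action)
        return sigmoid(sum(wi * xi for wi, xi in zip(self.w, x)) + self.b)

    def update(self, expert_batch, policy_batch, lr=0.5):
        """One gradient step on binary cross-entropy (expert = 1, policy = 0)."""
        grad_w = [0.0] * len(self.w)
        grad_b = 0.0
        n = len(expert_batch) + len(policy_batch)
        for batch, label in ((expert_batch, 1.0), (policy_batch, 0.0)):
            for state, action in batch:
                x = features(state, action)
                err = self.prob_expert(state, action) - label
                for i, xi in enumerate(x):
                    grad_w[i] += err * xi / n
                grad_b += err / n
        self.w = [wi - lr * gi for wi, gi in zip(self.w, grad_w)]
        self.b -= lr * grad_b

    def reward(self, state, action, eps=1e-8):
        """Surrogate reward for the policy: larger when the pair looks expert-like."""
        return -math.log(1.0 - self.prob_expert(state, action) + eps)


def random_state(rng):
    # Small random values, similar in scale to a freshly reset CartPole state.
    return [rng.uniform(-0.05, 0.05) for _ in range(STATE_DIM)]


# Synthetic toy data (assumption): the "expert" always picks action 1,
# the current "policy" always picks action 0.
rng = random.Random(0)
expert_batch = [(random_state(rng), 1) for _ in range(64)]
policy_batch = [(random_state(rng), 0) for _ in range(64)]

disc = LogisticDiscriminator()
for _ in range(200):
    disc.update(expert_batch, policy_batch)

probe = [0.0] * STATE_DIM
print("reward for expert-like action:", disc.reward(probe, 1))
print("reward for policy-like action:", disc.reward(probe, 0))
```

In a full GAIL loop, this discriminator update would alternate with a policy update (PPO here) that maximizes `reward` in place of the environment reward.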