Skip to content

Latest commit

 

History

History
25 lines (19 loc) · 661 Bytes

README.md

File metadata and controls

25 lines (19 loc) · 661 Bytes

Categorical DQN

Attempt at CNTK implementation of Categorical DQN from 'A distributional Perspective on Reinforcement Learning' found here.

Dependencies

  1. Python 3
  2. CNTK v2

CartPole-v0

To train a model for CartPole from OpenAI Gym, use:

python -m experiments.train_cartpole

To watch the trained model in action, use:

python -m experiments.watch_cartpole

Here are the results from a sample run: cartpole rewards cartpole losses

Atari

Not currently planned. If you run it and get results, I'll be happy to include it.