Different existing techniques of reinforcement learning are implemented using gym framework.
- DQN with experience replay buffer and fixed Q targets on cartpole
- Double DQN with experience replay buffer and fixed Q targets on cartpole
| Name | Name | Last commit date | ||
|---|---|---|---|---|