This repository includes simple implementations of classic RL algorithms, along with a visual environment in which to run them. The implementations are based on Reinforcement Learning: An Introduction (Sutton and Barto, 2nd edition).
It also includes a cumulative reward visualizer, built with the Easy Charts Godot addon. Hold the space bar during an experiment to see a graph of the cumulative reward per episode.
RL Sandbox can be played directly in your browser on itch.

The following algorithms are implemented:
- Dynamic Programming
- Adaptive Dynamic Programming
- Monte Carlo ES
- SARSA
- Q-learning
- LFA Q-learning
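
As a rough illustration of one of the algorithms listed above, here is a minimal sketch of the tabular Q-learning update from Sutton and Barto, written in Python. It is not the code used in this repository; the function names (`q_learning_update`, `epsilon_greedy`), the dictionary-based Q-table, and the default values of `alpha`, `gamma`, and `epsilon` are assumptions made for the example.

```python
import random
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular Q-learning update and return the new Q(s, a).

    q is a hypothetical Q-table: a dict mapping (state, action) to a value.
    """
    # Bootstrap from the greedy action in the next state (0 if terminal).
    best_next = 0.0 if done else max(q[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the TD target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon=0.1, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


if __name__ == "__main__":
    q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
    actions = ["left", "right"]
    q_learning_update(q, "s0", "right", 1.0, "s1", actions, alpha=0.5)
    print(epsilon_greedy(q, "s0", actions, epsilon=0.0))
```

SARSA differs only in the target: it bootstraps from the action actually taken in the next state rather than from the maximum over actions.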