Releases: prabhatnagarajan/table-rl
Releases · prabhatnagarajan/table-rl
Release 0.0.2
This release introduces several new features:
- Double Q-learning
- Explorers: linear decay epsilon-greedy exploration and adds percentage decay epsilon-greedy exploration.
- Renames
env
toenvs
- Adds Riverswim environment
v0.1.0
Release 0.1.0
Algorithms
- #25: SARSA
Features/Enhancements
Explorers
- #24: Expands the
observe
method in explorers - #32: Adds
training_mode
to explorers'observe
- #31: Adds PolicyExecutor as an explorer that executes a specific policy
- #27: Provides the observation to explorers
Environments
- #26: Adds overestimation environment from Double Q-learning paper
Learners
- #28: Adds training mode to learners