v0.1.0

Latest

prabhatnagarajan released this 02 Dec 23:14

Release 0.1.0

Algorithms

#25: SARSA
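
For readers unfamiliar with SARSA, the on-policy update it performs can be sketched as follows. This is a generic tabular illustration, not this library's implementation: the function names `epsilon_greedy` and `sarsa_update`, the dictionary-based Q-table, and the default `alpha` and `gamma` values are all assumptions made for the example.

```python
import random
from collections import defaultdict


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one.

    Hypothetical helper for illustration only.
    """
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q[(state, a)] for a in range(n_actions)]
    return values.index(max(values))


def sarsa_update(q, s, a, r, s_next, a_next, done, alpha=0.1, gamma=0.99):
    """Apply one SARSA update to the tabular Q-function `q` in place.

    The target bootstraps from the action actually chosen in the next
    state (a_next), which is what makes SARSA on-policy.
    """
    target = r if done else r + gamma * q[(s_next, a_next)]
    q[(s, a)] += alpha * (target - q[(s, a)])
    return q[(s, a)]


# Usage: one update on an empty Q-table (all values start at 0.0).
q_table = defaultdict(float)
rng = random.Random(0)
next_action = epsilon_greedy(q_table, state=1, n_actions=2, epsilon=0.1, rng=rng)
sarsa_update(q_table, s=0, a=1, r=1.0, s_next=1, a_next=next_action, done=False)
```

Unlike Q-learning, which bootstraps from the maximum next-state value, this update uses the value of the next action the behaviour policy selected.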

Features/Enhancements

Explorers

#24: Expands the observe method in explorers
#32: Adds training_mode to explorers' observe
#31: Adds PolicyExecutor as an explorer that executes a specific policy
#27: Provides the observation to explorers
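
The explorer changes above can be pictured with a small sketch. Only the names `observe`, `training_mode`, and `PolicyExecutor` come from these notes; the class layout, method signatures, `select_action`, and the epsilon-decay behaviour are assumptions for illustration and may differ from the library's actual API.

```python
import random


class EpsilonGreedyExplorer:
    """Hypothetical explorer: signatures are assumed, not the library's."""

    def __init__(self, n_actions, epsilon=0.1, decay=0.99, seed=0):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.decay = decay
        self.rng = random.Random(seed)

    def select_action(self, obs, greedy_action):
        # Explore with probability epsilon, otherwise act greedily.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        return greedy_action

    def observe(self, obs, action, reward, next_obs, terminal, training_mode=True):
        # Assumed behaviour: only adapt exploration while training,
        # so evaluation episodes leave the schedule untouched.
        if training_mode and terminal:
            self.epsilon *= self.decay


class PolicyExecutor:
    """Hypothetical explorer that always follows a fixed policy."""

    def __init__(self, policy):
        self.policy = policy  # assumed: mapping from observation to action

    def select_action(self, obs, greedy_action=None):
        return self.policy[obs]

    def observe(self, obs, action, reward, next_obs, terminal, training_mode=True):
        # A fixed policy has nothing to update.
        pass


# Usage: an evaluation episode end does not decay epsilon; a training one does.
explorer = EpsilonGreedyExplorer(n_actions=2, epsilon=0.5, decay=0.5)
explorer.observe(0, 1, 0.0, 1, terminal=True, training_mode=False)
explorer.observe(0, 1, 0.0, 1, terminal=True, training_mode=True)
```

Passing the observation and a `training_mode` flag through `observe` lets an explorer keep state-dependent or schedule-dependent behaviour separate between training and evaluation runs.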

Environments

#26: Adds overestimation environment from Double Q-learning paper

Learners

#28: Adds training mode to learners

Bug fixes

#22: Fixes Transition function in River Swim Environment
#23: Fixes bug in explorer call in Q-learning
