Skip to content

v0.1.0

Latest
Compare
Choose a tag to compare
@prabhatnagarajan prabhatnagarajan released this 02 Dec 23:14
· 22 commits to main since this release
a57be4b

Release 0.1.0

Algorithms

Features/Enhancements

Explorers

  • #24: Expands the observe method in explorers
  • #32: Adds training_mode to explorers' observe
  • #31: Adds PolicyExecutor as an explorer that executes a specific policy
  • #27: Provides the observation to explorers

Environments

  • #26: Adds overestimation environment from Double Q-learning paper

Learners

  • #28: Adds training mode to learners

Bug fixes

  • #22: Fixes Transition function in River Swim Environment
  • #23: Bug in explorer call in Q-learning