Releases: prabhatnagarajan/table-rl

Release 0.0.2

25 Sep 06:21
217afb1

This release introduces the following changes:

  • Double Q-learning
  • Explorers: adds linear decay and percentage decay epsilon-greedy exploration
  • Renames env to envs
  • Adds Riverswim environment
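
To illustrate the two epsilon-greedy decay schedules named above, here is a minimal sketch. It does not use the table-rl API: the function names, parameters, and default values are hypothetical, and "percentage decay" is assumed to mean multiplying epsilon by a fixed fraction at every step.

```python
import random


def linear_decay_epsilon(step, start=1.0, end=0.1, decay_steps=1000):
    """Move epsilon linearly from start to end over decay_steps, then hold at end."""
    if step >= decay_steps:
        return end
    fraction = step / decay_steps
    return start + fraction * (end - start)


def percentage_decay_epsilon(step, start=1.0, end=0.1, decay_rate=0.99):
    """Assumed meaning of 'percentage decay': multiply epsilon by decay_rate
    once per step, never dropping below end."""
    return max(end, start * decay_rate ** step)


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a uniformly random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Greedy action: index of the largest value (first index wins ties).
    return max(range(len(q_values)), key=q_values.__getitem__)


if __name__ == "__main__":
    rng = random.Random(0)
    q_values = [0.0, 2.0, 1.0]
    for step in (0, 500, 2000):
        eps = linear_decay_epsilon(step)
        print(step, eps, epsilon_greedy(q_values, eps, rng))
```

With these hypothetical defaults, the linear schedule reaches its floor after exactly `decay_steps` steps, while the multiplicative schedule approaches its floor gradually and is clipped there.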

v0.1.0

02 Dec 23:14
a57be4b

Release 0.1.0

Algorithms

Features/Enhancements

Explorers

  • #24: Expands the observe method in explorers
  • #32: Adds training_mode to explorers' observe
  • #31: Adds PolicyExecutor as an explorer that executes a specific policy
  • #27: Provides the observation to explorers

Environments

  • #26: Adds overestimation environment from Double Q-learning paper
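
For context, an overestimation environment is useful for comparing Q-learning against Double Q-learning. The following is a minimal sketch of a tabular Double Q-learning update, written independently of table-rl. The function name, signature, and table layout (a list of per-state action-value lists) are assumptions made for illustration, not the library's API.

```python
import random


def double_q_update(q_a, q_b, state, action, reward, next_state, done,
                    alpha=0.1, gamma=0.99, rng=random):
    """Apply one tabular Double Q-learning update in place.

    q_a and q_b are lists indexed by state, each holding a list of action values.
    """
    # Randomly pick the table to update; the other table evaluates the target.
    if rng.random() < 0.5:
        update, evaluate = q_a, q_b
    else:
        update, evaluate = q_b, q_a

    if done:
        target = reward
    else:
        # Select the greedy next action with the table being updated...
        next_values = update[next_state]
        best_action = max(range(len(next_values)), key=next_values.__getitem__)
        # ...but score that action with the other table.
        target = reward + gamma * evaluate[next_state][best_action]

    update[state][action] += alpha * (target - update[state][action])
```

Decoupling action selection from action evaluation is what distinguishes this update from standard Q-learning, which uses a single table for both and therefore tends to overestimate values when the estimates are noisy.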

Learners

  • #28: Adds training mode to learners

Bug fixes

  • #22: Fixes transition function in Riverswim environment
  • #23: Fixes bug in explorer call in Q-learning