Skip to content
This repository has been archived by the owner on Feb 27, 2020. It is now read-only.

Latest commit

 

History

History
24 lines (15 loc) · 476 Bytes

README.md

File metadata and controls

24 lines (15 loc) · 476 Bytes

A collection of implementations of the bandit problem (school project, Reinforcement Learning class, 2018).

Features

Bandits

  • Normal multi-armed bandits
  • Bernouilli multi-armed bandits

Algorithms

  • Random bandit algorithm
  • Greedy bandit algorithm
  • Epsilon greedy bandit algorithm
  • UCB bandit algorithm
  • Thompson sampling algorithm

Using the code

See the notebook for examples of usage.

Credits

The notebook is from Valentin Emiya.