[FEATURE] support non-stationary rewards #12

thetawom · 2023-02-19T08:02:43Z

Is your feature request related to a problem? Please describe.
Currently, the reward distribution of each arm is fixed, so the library can't simulate non-stationary (restless) bandit problems.

Describe the solution you'd like
When implementing a custom Arm, there should be an update() function that can be overridden to update the reward distribution's parameters. Then, at each step, the ArmSet should call update() on every arm to advance its reward distribution.
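
A rough sketch of what this could look like, not the library's current API. The `play()` and `step()` method names, the `rng` argument, and the `FixedGaussianArm` and `DriftingGaussianArm` classes are all hypothetical and only illustrate the proposed `update()` hook:

```python
import random
from abc import ABC, abstractmethod


class Arm(ABC):
    """Hypothetical arm interface (an assumption, not mabby's actual classes)."""

    @abstractmethod
    def play(self, rng: random.Random) -> float:
        """Sample a reward from the arm's current distribution."""

    def update(self, rng: random.Random) -> None:
        """Advance the reward distribution by one step.

        No-op by default, so existing stationary arms keep working unchanged.
        """


class FixedGaussianArm(Arm):
    """Stationary arm: inherits the no-op update()."""

    def __init__(self, mean: float, std: float = 1.0) -> None:
        self.mean = mean
        self.std = std

    def play(self, rng: random.Random) -> float:
        return rng.gauss(self.mean, self.std)


class DriftingGaussianArm(FixedGaussianArm):
    """Restless arm: the mean follows a Gaussian random walk."""

    def __init__(self, mean: float, std: float = 1.0, drift_std: float = 0.1) -> None:
        super().__init__(mean, std)
        self.drift_std = drift_std

    def update(self, rng: random.Random) -> None:
        # Override the hook to perturb the distribution's parameters.
        self.mean += rng.gauss(0.0, self.drift_std)


class ArmSet:
    """Hypothetical arm container that advances every arm once per step."""

    def __init__(self, arms, seed=None) -> None:
        self.arms = list(arms)
        self.rng = random.Random(seed)

    def play(self, index: int) -> float:
        return self.arms[index].play(self.rng)

    def step(self) -> None:
        # Call update() on every arm, whether or not it was played.
        for arm in self.arms:
            arm.update(self.rng)
```

With this shape, a simulation loop would call `play()` on the chosen arm and then `step()` once per round, so unplayed arms also drift, which is what makes the problem restless.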


thetawom added the enhancement and backlog labels Feb 19, 2023

thetawom added this to mabby Mar 2, 2023

thetawom moved this to Todo in mabby Mar 2, 2023

thetawom changed the title ~~ENH: support non-stationary rewards~~ [FEATURE] support non-stationary rewards Mar 6, 2023
