[FEATURE] support non-stationary rewards #12

Open
thetawom opened this issue Feb 19, 2023 · 0 comments
Labels
backlog enhancement New feature or request

Comments

@thetawom
Owner

Is your feature request related to a problem? Please describe.
Currently, the reward distribution of each arm is fixed, so the library cannot simulate non-stationary (restless) bandit problems.

Describe the solution you'd like
When implementing a custom Arm, there should be an update() method that can be overridden to change the parameters of the arm's reward distribution. At each step, the ArmSet should then call update() on every arm to advance its reward distribution.
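A rough sketch of how this could look, to make the proposal concrete. It does not use the library's actual classes: `Arm`, `GaussianArm`, `DriftingGaussianArm`, `ArmSet`, `play()`, `step()`, and the `drift_std` parameter are all hypothetical stand-ins, and the random-walk drift is only one possible form of non-stationarity.

```python
import random
from abc import ABC, abstractmethod


class Arm(ABC):
    """Hypothetical stand-in for the library's Arm base class."""

    @abstractmethod
    def play(self, rng: random.Random) -> float:
        """Sample one reward from the arm's current distribution."""

    def update(self, rng: random.Random) -> None:
        """Advance the reward distribution by one step.

        The default is a no-op, so existing stationary arms keep working.
        """


class GaussianArm(Arm):
    """Stationary arm: inherits the no-op update()."""

    def __init__(self, mean: float, std: float) -> None:
        self.mean = mean
        self.std = std

    def play(self, rng: random.Random) -> float:
        return rng.gauss(self.mean, self.std)


class DriftingGaussianArm(GaussianArm):
    """Non-stationary arm whose mean follows a Gaussian random walk."""

    def __init__(self, mean: float, std: float, drift_std: float) -> None:
        super().__init__(mean, std)
        self.drift_std = drift_std  # assumed parameter, not from the issue

    def update(self, rng: random.Random) -> None:
        self.mean += rng.gauss(0.0, self.drift_std)


class ArmSet:
    """Hypothetical stand-in for the library's ArmSet."""

    def __init__(self, arms, seed=None) -> None:
        self.arms = list(arms)
        self.rng = random.Random(seed)

    def play(self, index: int) -> float:
        return self.arms[index].play(self.rng)

    def step(self) -> None:
        # Every arm advances each step, whether or not it was played
        # (the "restless" setting).
        for arm in self.arms:
            arm.update(self.rng)


arm_set = ArmSet([GaussianArm(1.0, 0.5), DriftingGaussianArm(0.0, 0.5, 0.1)], seed=0)
for _ in range(100):
    reward = arm_set.play(1)  # play the drifting arm
    arm_set.step()            # then advance all arms
```

Making the base-class update() a no-op would keep the change backward compatible: only arms that opt in by overriding it become non-stationary.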

@thetawom thetawom added enhancement New feature or request backlog labels Feb 19, 2023
@thetawom thetawom added this to mabby Mar 2, 2023
@thetawom thetawom moved this to Todo in mabby Mar 2, 2023
@thetawom thetawom changed the title ENH: support non-stationary rewards [FEATURE] support non-stationary rewards Mar 6, 2023