Active inference and multi-armed bandits

This repository contains the code accompanying the paper "An empirical evaluation of active inference in multi-armed bandits". We introduce several multi-armed bandit algorithms based on active inference and compare them to existing solutions. The current focus is on stationary and switching bandits.

Installation

The following instructions assume that you use the Anaconda or Miniconda package manager. First, create the environment using the provided yml file

conda env create -f environment.yml

Note that the repository's file listing spells this file environmnet.yml. If the command above fails because the file is not found, pass that file name instead.

Activate the environment

conda activate bandits

and follow the official instructions for installing jax. If you have a GPU available, we recommend installing the jax version with GPU support, as this speeds up execution of the code by roughly an order of magnitude. The last version used and tested with the provided code is jax 1.68 (likely jaxlib 0.1.68, with support for CUDA 11.1).
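
After installing jax, it can help to confirm which devices it actually detects. The following is a minimal sketch and not part of the repository: the helper name `describe_jax_devices` is made up for illustration, and it relies only on `jax.devices()` and the `platform` attribute of each returned device.

```python
def describe_jax_devices():
    """Return a short description of the devices jax can see.

    Hypothetical helper for illustration; not part of this repository.
    """
    try:
        import jax
    except ImportError:
        return "jax is not installed in this environment"

    # jax.devices() lists the devices of the default backend,
    # for example CPU devices only, or GPU devices if GPU support is installed.
    platforms = sorted({device.platform for device in jax.devices()})
    return "jax " + jax.__version__ + " sees: " + ", ".join(platforms)


if __name__ == "__main__":
    print(describe_jax_devices())
```

If only `cpu` is listed on a machine that has a GPU, the CPU-only build of jax was probably installed.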

Usage

The notebooks folder contains examples of code usage and scripts to reproduce the figures in the paper. The scripts available in the main folder

run_estimate_runtime.py
run_stationary_sims.py
run_switching_fixed_diff_sims.py
run_switching_varying_diff_sims.py

allow for command line execution of simulations in stationary and switching bandits. For example, running the following command

python run_estimate_runtime.py -n 1000 -k 10 20 40 80

will estimate the runtimes of the different decision-making algorithms on your machine by running 1000 parallel simulations for bandits with 10, 20, 40 and 80 arms. The other scripts are executed in a similar manner (open each file to find the list of available command line options).
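
To make the shape of the example command concrete, here is a rough sketch of how options such as `-n` and `-k` could be parsed with `argparse`. This is an assumption for illustration only: the function name `parse_args`, the defaults, and the help text are placeholders, and the actual scripts may define their options differently.

```python
import argparse


def parse_args(argv=None):
    """Parse options shaped like those in the example command.

    Hypothetical sketch: option names and defaults in the actual
    scripts may differ.
    """
    parser = argparse.ArgumentParser(description="Estimate algorithm runtimes")
    # -n: number of simulations run in parallel (meaning taken from the text above)
    parser.add_argument("-n", type=int, default=100, help="number of parallel simulations")
    # -k: one or more arm counts, e.g. -k 10 20 40 80
    parser.add_argument("-k", type=int, nargs="+", default=[10], help="numbers of arms")
    return parser.parse_args(argv)


# Same arguments as in the example command.
args = parse_args(["-n", "1000", "-k", "10", "20", "40", "80"])
for num_arms in args.k:
    print(f"would run {args.n} parallel simulations with {num_arms} arms")
```

Using `nargs="+"` is what lets a single flag accept the whole list of arm counts.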

For more details on the definition of different bandit environments see the methods section of the paper.

As running the simulations can take a long time, we provide pre-generated results on the OSF page of the project, osf.io/85ek4/. Create a data folder inside the repository and download the npz files from the data folder hosted on the project page. Running the notebooks stationary_bandits_plotting.ipynb and switching_bandits_comparison.ipynb will then recreate the figures from the paper.
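
Before running the notebooks, it may be worth checking that the downloaded files ended up in the data folder. The sketch below is not part of the repository; the helper name `list_npz_contents` is hypothetical. It uses only the standard library and relies on the fact that an npz file is a zip archive with one `.npy` member per stored array.

```python
import zipfile
from pathlib import Path


def list_npz_contents(data_dir="data"):
    """Map each .npz file in data_dir to the names of the arrays it stores.

    Hypothetical helper for illustration; not part of this repository.
    """
    contents = {}
    for path in sorted(Path(data_dir).glob("*.npz")):
        # An .npz file is a zip archive with one .npy member per array.
        with zipfile.ZipFile(path) as archive:
            contents[path.name] = [
                name[:-4] if name.endswith(".npy") else name
                for name in archive.namelist()
            ]
    return contents


if __name__ == "__main__":
    for filename, arrays in list_npz_contents().items():
        print(filename, arrays)
```

An empty result means no npz files were found in the folder. The arrays themselves would typically be read with `numpy.load`.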

Citing

To cite our work or the repository, please use the following BibTeX entry:

@article{markovic,
title = {An empirical evaluation of active inference in multi-armed bandits},
journal = {Neural Networks},
year = {2021},
issn = {0893-6080},
doi = {https://doi.org/10.1016/j.neunet.2021.08.018},
url = {https://www.sciencedirect.com/science/article/pii/S0893608021003233},
author = {Dimitrije Marković and Hrvoje Stojić and Sarah Schwöbel and Stefan J. Kiebel},
keywords = {Decision making, Bayesian inference, Multi-armed bandits, Active inference, Upper confidence bound, Thompson sampling}
}
