Anti-Concentrated Confidence Bonuses (ACB)

This repository contains a PyTorch implementation of the anti-concentrated confidence bonus for promoting exploration in deep reinforcement learning. For more information, check out our ICLR 2022 paper, Anti-Concentrated Confidence Bonuses for Scalable Exploration. Our code was built off of jcwleo's impelementation of Random Network Distillation.

Dependencies

Required dependencies can be found in setup.py.

Running an experiment

python run.py --intrinsic acb --env breakout
runs an experiment using ACB in the Atari game Breakout.

python run.py --intrinsic rnd --env seaquest --extrinsic
runs an experiment using RND in the atari game Seaquest. The extrinsic flag allows the agent to be trained jointly on intrinsic and extrinsic rewards; by default only the specified intrinsic rewards are used.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
agents.py		agents.py
config.conf		config.conf
config.py		config.py
envs.py		envs.py
model_rnd.py		model_rnd.py
nn_grads_proj.py		nn_grads_proj.py
setup.sh		setup.sh
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anti-Concentrated Confidence Bonuses (ACB)

Dependencies

Running an experiment

About

Releases

Packages

Languages

JordanAsh/acb

Folders and files

Latest commit

History

Repository files navigation

Anti-Concentrated Confidence Bonuses (ACB)

Dependencies

Running an experiment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages