Skip to content
/ acb Public

A PyTorch implementation of the Anti-concentrated Confidence Bonus (ACB) for promoting exploration in deep reinforcement learning.

Notifications You must be signed in to change notification settings

JordanAsh/acb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Anti-Concentrated Confidence Bonuses (ACB)

This repository contains a PyTorch implementation of the anti-concentrated confidence bonus for promoting exploration in deep reinforcement learning. For more information, check out our ICLR 2022 paper, Anti-Concentrated Confidence Bonuses for Scalable Exploration. Our code was built off of jcwleo's impelementation of Random Network Distillation.

Dependencies

Required dependencies can be found in setup.py.

Running an experiment

python run.py --intrinsic acb --env breakout
runs an experiment using ACB in the Atari game Breakout.

python run.py --intrinsic rnd --env seaquest --extrinsic
runs an experiment using RND in the atari game Seaquest. The extrinsic flag allows the agent to be trained jointly on intrinsic and extrinsic rewards; by default only the specified intrinsic rewards are used.

About

A PyTorch implementation of the Anti-concentrated Confidence Bonus (ACB) for promoting exploration in deep reinforcement learning.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published