Interactive Visual Reinforcement Learning on Atari's Breakout

A CS-8395 Project

This project uses visual aids to make complex reinforcement learning agents easier to understand on complicated tasks such as Atari games. An A3C model is trained to play Breakout, and the results are analyzed with an interactive dashboard built on Dash.

Baseline

The baseline work for this project comes from Sam Greydanus' paper Visualizing and Understanding Atari Agents.

Environment Install

pip3 install https://download.pytorch.org/whl/cpu/torch-1.0.1.post2-cp37-cp37m-linux_x86_64.whl
pip3 install torchvision
# or, if you're using conda:
conda install pytorch-cpu torchvision-cpu -c pytorch

pip install dash==0.42.0  
pip install dash-daq==0.1.0
pip install h5py==2.8.0

pip install pandas
pip install matplotlib
pip install scipy

Instructions:

To run the dashboard:

Download the Generated Data and place it in a folder called static, creating the folder first if it does not exist.

mkdir static
python app.py # then open localhost:8050 in a browser

To train a new model:

cd baby-a3c
python baby-a3c.py --load_model <model name> 

python baby-a3c.py --load_model <old model name> # to continue training on an older model

To generate data used for the dashboard:

The dashboard reads two files: a training log (csv) and an h5 file of data generated by rolling out models saved at different epochs. Running generate_data.py (commands below) creates the h5 file in static.

Data captured consists of:

Environment state (image as np array)
Hidden state of GRU cell
Logits
Values
Output (probabilities over actions)
Actor/Critic saliency
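
The logits and the output in the list above are related by a softmax, assuming the stored output is the softmax of the logits (the usual choice for an A3C policy head; this is not confirmed here). A minimal sketch in plain Python, with made-up logit values:

```python
import math

def softmax(logits):
    """Convert raw action logits into a probability distribution."""
    # Subtracting the max logit avoids overflow and does not change the result.
    shift = max(logits)
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for a 4-action game; the values are illustrative only.
example_logits = [2.0, 1.0, 0.1, -1.0]
probs = softmax(example_logits)
print(probs)
```

The probabilities sum to 1, and the action with the largest logit gets the largest probability.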

cd visualize_atari
python generate_data.py
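
The training log (csv) that the dashboard reads alongside the h5 file can be inspected with the standard library. The sketch below smooths a reward column with a trailing moving average. The column names episode and reward, and the sample rows, are assumptions made for illustration; check the header of the real log before reusing this.

```python
import csv
import io

def load_column(csv_file, column):
    """Read one numeric column from a CSV file object that has a header row."""
    return [float(row[column]) for row in csv.DictReader(csv_file)]

def moving_average(values, window):
    """Trailing moving average; early entries average over what is available."""
    averaged = []
    running = 0.0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            # Drop the value that just left the window.
            running -= values[i - window]
        averaged.append(running / min(i + 1, window))
    return averaged

# Made-up rows standing in for a real training log (hypothetical columns).
sample_log = io.StringIO("episode,reward\n1,0\n2,2\n3,4\n4,6\n")
rewards = load_column(sample_log, "reward")
smoothed = moving_average(rewards, window=2)
print(smoothed)
```

For a real log, pass an open file (for example, open(path, newline="")) in place of the StringIO object.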

To play around with saliency maps and data:

jupyter notebook
-> visualize_atari/jacobian-vs-perturbation.ipynb
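
For orientation, the idea behind perturbation-based saliency can be sketched in plain Python: perturb one location of the input frame, rerun the policy, and score that location by how much the output changed. This is a simplified illustration and not the notebook's code. It replaces a single pixel with its 3x3 neighborhood mean instead of applying a Gaussian-blurred mask, and toy_policy is a hypothetical stand-in for the trained agent.

```python
def blur_pixel(frame, row, col):
    """Return a copy of frame with one pixel replaced by its 3x3 neighborhood mean."""
    height, width = len(frame), len(frame[0])
    neighbors = [
        frame[r][c]
        for r in range(max(0, row - 1), min(height, row + 2))
        for c in range(max(0, col - 1), min(width, col + 2))
    ]
    perturbed = [list(line) for line in frame]
    perturbed[row][col] = sum(neighbors) / len(neighbors)
    return perturbed

def perturbation_saliency(frame, policy):
    """Score each pixel by half the squared change in the policy output."""
    baseline = policy(frame)
    saliency = []
    for row in range(len(frame)):
        scores = []
        for col in range(len(frame[0])):
            changed = policy(blur_pixel(frame, row, col))
            scores.append(0.5 * sum((a - b) ** 2 for a, b in zip(baseline, changed)))
        saliency.append(scores)
    return saliency

def toy_policy(frame):
    """Hypothetical stand-in for the agent: it only looks at the top-left pixel."""
    return [frame[0][0], -frame[0][0]]

# A tiny 3x3 "frame" with a single bright pixel; values are illustrative only.
frame = [[1.0, 0.0, 0.0],
         [0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0]]
saliency = perturbation_saliency(frame, toy_policy)
print(saliency)
```

Only the top-left entry is nonzero here, because that is the only pixel the toy policy depends on. With a real agent, policy would wrap a forward pass of the model.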

References/Source Material

Code adapted from Sam Greydanus' work:

https://github.com/greydanus/visualize_atari - generates saliency maps of agent playthroughs
https://github.com/greydanus/baby-a3c - A3C model training
https://arxiv.org/abs/1711.00138 - Visualizing and Understanding Atari Agents (paper)

MIT License

slowjazz/interactive-atari-RL
