Skip to content

jcwleo/random-network-distillation-pytorch

Folders and files

NameName
Last commit message
Last commit date
Nov 25, 2018
Jan 3, 2019
Nov 12, 2018
Nov 12, 2018
Jan 5, 2019
Dec 1, 2018
Nov 12, 2018
Mar 4, 2019
Nov 23, 2018
Mar 4, 2019
Nov 23, 2018
Nov 23, 2018
Jan 13, 2019
Mar 4, 2019
Nov 12, 2018
Nov 30, 2018

Repository files navigation

Random Network Distillation

Intrinsic Reward Graph with play

Venture Montezuma's Revenge
Video Label
~ New model for Montezuma
  • Advantage Actor critic [1]
  • Parallel Advantage Actor critic [2]
  • Exploration by Random Network Distillation [3]
  • Proximal Policy Optimization Algorithms [4]

1. Setup

Requirements


2. How to Train

Modify the parameters in config.conf as you like.

python train.py

3. How to Eval

python eval.py

4. Loss/Reward Graph

  • Montezuma's Revenge Env image
  • Venture Env image

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Exploration by Random Network Distillation
[4] Proximal Policy Optimization Algorithms