Name		Name	Last commit message	Last commit date
parent directory ..
a3c		a3c
bootstrapped_dqn		bootstrapped_dqn
clipped_ppo		clipped_ppo
ddpg		ddpg
ddpg_her		ddpg_her
dfp		dfp
dqn		dqn
dueling_ddqn		dueling_ddqn
dueling_ddqn_with_per		dueling_ddqn_with_per
qr_dqn		qr_dqn
README.md		README.md

README.md

Coach Benchmarks

The following table represents the current status of algorithms implemented in Coach relative to the results reported in the original papers. The detailed results for each algorithm can be seen by clicking on its name.

The X axis in all the figures is the total steps (for multi-threaded runs, this is the number of steps per worker). The Y axis in all the figures is the average episode reward with an averaging window of 100 timesteps.

For each algorithm, there is a command line for reproducing the results of each graph. These are the results you can expect to get when running the pre-defined presets in Coach.

The environments that were used for testing include:

Atari - Breakout, Pong and Space Invaders
Mujoco - Inverted Pendulum, Inverted Double Pendulum, Reacher, Hopper, Half Cheetah, Walker 2D, Ant, Swimmer and Humanoid.
Doom - Basic, Health Gathering (D1: Basic), Health Gathering Supreme (D2: Navigation), Battle (D3: Battle)
Fetch - Reach, Slide, Push, Pick-and-Place

Summary

Reproducing paper's results

Reproducing paper's results for some of the environments

Training but not reproducing paper's results

Not training

	Environments	Comments
DQN	Atari
Dueling DDQN	Atari
Dueling DDQN with PER	Atari
Bootstrapped DQN	Atari
QR-DQN	Atari
A3C	Atari, Mujoco
Clipped PPO	Mujoco
DDPG	Mujoco
NEC	Atari
HER	Fetch
DFP	Doom	Doom Battle was not verified

Click on each algorithm to see detailed benchmarking results

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks

benchmarks

README.md

Coach Benchmarks

Summary

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Coach Benchmarks

Summary