Releases · DLR-RM/rl-baselines3-zoo

25 Mar 14:21

araffin

v1.5.0

2fe4418

SB3 v1.5.0: Support for Weight and Biases experiment tracking

Release 1.5.0 (2022-03-25)

Support for Weight and Biases experiment tracking

Breaking Changes

Upgrade to Stable-Baselines3 (SB3) >= 1.5.0
Upgrade to sb3-contrib >= 1.5.0
Upgraded to gym 0.21

New Features

Verbose mode for each trial (when doing hyperparam optimization) can now be activated using the debug mode (verbose == 2)
Support experiment tracking via Weights and Biases via the --track flag (@vwxyzjn)
Support tracking raw episodic stats via RawStatisticsCallback (@vwxyzjn, see #216)

Bug fixes

Policies saved during during optimization with distributed Optuna load on new systems (@JKTerry)
Fixed script for recording video that was not up to date with the enjoy script

Contributors

JKTerry and vwxyzjn

Assets 2

19 Jan 14:01

araffin

v1.4.0

41983ab

SB3 v1.4.0: TRPO, ARS and multi env training for off-policy algorithms

Breaking Changes

Dropped python 3.6 support
Upgrade to Stable-Baselines3 (SB3) >= 1.4.0
Upgrade to sb3-contrib >= 1.4.0

New Features

Added mujoco hyperparameters
Added MuJoCo pre-trained agents
Added script to parse best hyperparameters of an optuna study
Added TRPO support
Added ARS support and pre-trained agents

Documentation

Replace front image

Assets 2

23 Oct 16:00

araffin

v1.3.0

8607c67

SB3 v1.3.0: rliable plots and bug fixes

WARNING: This version will be the last one supporting Python 3.6 (end of life in Dec 2021). We highly recommended you to upgrade to Python >= 3.7.

Breaking Changes

Upgrade to panda-gym 1.1.1
Upgrade to Stable-Baselines3 (SB3) >= 1.3.0
Upgrade to sb3-contrib >= 1.3.0

New Features

Added support for using rliable for performance comparison

Bug fixes

Fix training with Dict obs and channel last images

Other

Updated docker image
constrained gym version: gym>=0.17,<0.20
Better hyperparameters for A2C/PPO on Pendulum

Assets 2

08 Sep 11:40

araffin

v1.2.0

7b4465b

SB3 v1.2.0

Breaking Changes

Upgrade to Stable-Baselines3 (SB3) >= 1.2.0
Upgrade to sb3-contrib >= 1.2.0

Bug fixes

Fix --load-last-checkpoint (@SammyRamone)
Fix TypeError for gym.Env class entry points in ExperimentManager (@schuderer)
Fix usage of callbacks during hyperparameter optimization (@SammyRamone)

Other

Added python 3.9 to Github CI
Increased DQN replay buffer size for Atari games (@nikhilrayaprolu)

Contributors

schuderer, SammyRamone, and nikhilrayaprolu

Assets 2

02 Jul 10:08

araffin

v1.1.0

96f1a59

SB3 v1.1.0

Breaking Changes

Upgrade to Stable-Baselines3 (SB3) >= 1.1.0
Upgrade to sb3-contrib >= 1.1.0
Add timeout handling (cf SB3 doc)
HER is now a replay buffer class and no more an algorithm
Removed PlotNoiseRatioCallback
Removed PlotActionWrapper
Changed 'lr' key in Optuna param dict to 'learning_rate' so the dict can be directly passed to SB3 methods (@justinkterry)

New Features

Add support for recording videos of best models and checkpoints (@mcres)
Add support for recording videos of training experiments (@mcres)
Add support for dictionary observations
Added experimental parallel training (with utils.callbacks.ParallelTrainCallback)
Added support for using multiple envs for evaluation
Added --load-last-checkpoint option for the enjoy script
Save Optuna study object at the end of hyperparameter optimization and plot the results (plotly package required)
Allow to pass multiple folders to scripts/plot_train.py
Flag to save logs and optimal policies from each training run (@justinkterry)

Bug fixes

Fixed video rendering for PyBullet envs on Linux
Fixed get_latest_run_id() so it works in Windows too (@NicolasHaeffner)
Fixed video record when using HER replay buffer

Documentation

Updated README (dict obs are now supported)

Other

Added is_bullet() to ExperimentManager
Simplify close() for the enjoy script
Updated docker image to include latest black version
Updated TD3 Walker2D model (thanks @modanesh)
Fixed typo in plot title (@scottemmons)
Minimum cloudpickle version added to requirements.txt (@amy12xx)
Fixed atari-py version (ROM missing in newest release)
Updated SAC and TD3 search spaces
Cleanup eval_freq documentation and variable name changes (@justinkterry)
Add clarifying print statement when printing saved hyperparameters during optimization (@justinkterry)
Clarify n_evaluations help text (@justinkterry)
Simplified hyperparameters files making use of defaults
Added new TQC+HER agents
Add panda-gymenvironments (@qgallouedec)

Assets 2

17 Mar 14:28

araffin

v1.0

0378524

Stable-Baselines3 v1.0 - 100+ pre-trained models

Blog post: https://araffin.github.io/post/sb3/

Breaking Changes

Upgrade to SB3 >= 1.0
Upgrade to sb3-contrib >= 1.0

New Features

Added 100+ trained agents + benchmark file
Add support for loading saved model under python 3.8+ (no retraining possible)
Added Robotics pre-trained agents (@sgillen)

Bug fixes

Bug fixes for HER handling action noise
Fixed double reset bug with HER and enjoy script

Documentation

Added doc about plotting scripts

Other

Updated HER hyperparameters

Assets 2

27 Feb 19:33

araffin

v0.11.1

f71d490

Big refactor - SB3 upgrade - Last before v1.0 Pre-release

Pre-release

Breaking Changes

Removed LinearNormalActionNoise
Evaluation is now deterministic by default, except for Atari games
sb3_contrib is now required
TimeFeatureWrapper was moved to the contrib repo
Replaced old plot_train.py script with updated plot_training_success.py
Renamed n_episodes_rollout to train_freq tuple to match latest version of SB3

New Features

Added option to choose which VecEnv class to use for multiprocessing
Added hyperparameter optimization support for TQC
Added support for QR-DQN from SB3 contrib

Bug fixes

Improved detection of Atari games
Fix potential bug in plotting script when there is not enough timesteps
Fixed a bug when using HER + DQN/TQC for hyperparam optimization

Documentation

Improved documentation (@cboettig)

Other

Refactored train script, now uses a ExperimentManager class
Replaced make_env with SB3 built-in make_vec_env
Add more type hints (utils/utils.py done)
Use f-strings when possible
Changed PPO atari hyperparameters (removed vf clipping)
Changed A2C atari hyperparameters (eps value of the optimizer)
Updated benchmark script
Updated hyperparameter optim search space (commented gSDE for A2C/PPO)
Updated DQN hyperparameters for CartPole
Do not wrap channel-first image env (now natively supported by SB3)
Removed hack to log success rate
Simplify plot script

Assets 2

28 Oct 16:40

araffin

v0.10.0

a1b35ec

v0.10.0 Pre-release

Pre-release

Release v0.10.0 (#48)

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release 1.5.0 (2022-03-25)

Breaking Changes

New Features

Bug fixes

Contributors

Breaking Changes

New Features

Documentation

Breaking Changes

New Features

Bug fixes

Other

Breaking Changes

Bug fixes

Other

Contributors

Breaking Changes

New Features

Bug fixes

Documentation

Other

Breaking Changes

New Features

Bug fixes

Documentation

Other

Breaking Changes

New Features

Bug fixes

Documentation

Other

Releases: DLR-RM/rl-baselines3-zoo

SB3 v1.5.0: Support for Weight and Biases experiment tracking

Release 1.5.0 (2022-03-25)

Breaking Changes

New Features

Bug fixes

Contributors

SB3 v1.4.0: TRPO, ARS and multi env training for off-policy algorithms

Breaking Changes

New Features

Documentation

SB3 v1.3.0: rliable plots and bug fixes

Breaking Changes

New Features

Bug fixes

Other

SB3 v1.2.0

Breaking Changes

Bug fixes

Other

Contributors

SB3 v1.1.0

Breaking Changes

New Features

Bug fixes

Documentation

Other

Stable-Baselines3 v1.0 - 100+ pre-trained models

Breaking Changes

New Features

Bug fixes

Documentation

Other

Big refactor - SB3 upgrade - Last before v1.0

Breaking Changes

New Features

Bug fixes

Documentation

Other

v0.10.0