rlberry-v0.4.0

TimotheeMathieu released this 18 Jan 10:17

Release of version 0.4.0 of rlberry.

New in 0.4.0

Changed the default behavior of plot_writer_data: when the installed seaborn version is >= 0.12.0, a 90% percentile interval is drawn instead of the standard deviation (sd).
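To illustrate the difference between the two error bands, here is a minimal standard-library sketch. It is not rlberry's or seaborn's code: the helper names and the sample data are hypothetical, and a 90% percentile interval is taken to mean the range between the 5th and 95th percentiles.

```python
import statistics


def percentile(values, q):
    """q-th percentile (0 <= q <= 100), linearly interpolated between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def percentile_interval(values, width=90):
    """Central interval covering `width` percent of the values."""
    tail = (100 - width) / 2
    return percentile(values, tail), percentile(values, 100 - tail)


def sd_band(values):
    """Mean minus/plus one sample standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return mean - sd, mean + sd


# Hypothetical rewards from five seeds at one training step (skewed on purpose).
rewards = [1.0, 1.0, 1.0, 1.0, 10.0]
pi_low, pi_high = percentile_interval(rewards)
sd_low, sd_high = sd_band(rewards)
```

On skewed data such as this, the sd band extends below the smallest observed value, while the percentile interval stays within the observed range.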

Added rlberry.envs.PipelineEnv, a simple way to define a pipeline of wrappers around an environment.
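The general idea of a wrapper pipeline can be sketched as follows. This is not the PipelineEnv API: every class and function name below (ToyEnv, ScaleReward, ClipObservation, build_pipeline) is hypothetical and only shows how a list of wrappers can be applied in order.

```python
from functools import reduce


class ToyEnv:
    """Hypothetical environment: the observation echoes the action, the reward is 1."""

    def step(self, action):
        return float(action), 1.0  # (observation, reward)


class ScaleReward:
    """Hypothetical wrapper that multiplies the reward by a constant factor."""

    def __init__(self, env, factor=1.0):
        self.env = env
        self.factor = factor

    def step(self, action):
        obs, reward = self.env.step(action)
        return obs, reward * self.factor


class ClipObservation:
    """Hypothetical wrapper that clips the observation to [low, high]."""

    def __init__(self, env, low=-1.0, high=1.0):
        self.env = env
        self.low = low
        self.high = high

    def step(self, action):
        obs, reward = self.env.step(action)
        return min(max(obs, self.low), self.high), reward


def build_pipeline(env, wrappers):
    """Apply (wrapper_class, kwargs) pairs in order, innermost first."""
    return reduce(lambda inner, spec: spec[0](inner, **spec[1]), wrappers, env)


env = build_pipeline(
    ToyEnv(),
    [(ScaleReward, {"factor": 10.0}), (ClipObservation, {"low": -1.0, "high": 1.0})],
)
```

Declaring the wrappers as data rather than nesting constructor calls keeps the order of application explicit and easy to change.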

PPO can now handle continuous actions.

Implementation of Munchausen DQN in rlberry.agents.torch.MDQNAgent.
Comparison of MDQN with DQN agent in the long tests.

PR #244, #250, #253
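For readers unfamiliar with Munchausen DQN, the sketch below computes its regression target for a single transition with the standard library only. It follows the published algorithm (a softmax policy over Q-values, a clipped scaled log-policy bonus, and a soft next-state value) and is not taken from rlberry.agents.torch.MDQNAgent; the function names and default hyperparameters are illustrative assumptions.

```python
import math


def log_softmax(q_values, tau):
    """Log-probabilities of the softmax policy pi = softmax(q / tau)."""
    scaled = [q / tau for q in q_values]
    m = max(scaled)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_z for s in scaled]


def munchausen_target(reward, action, q_s, q_next, done,
                      gamma=0.99, tau=0.03, alpha=0.9, clip_min=-1.0):
    """Munchausen DQN target for one transition (s, a, r, s').

    q_s and q_next are target-network Q-values for s and s'.
    The hyperparameter defaults are illustrative assumptions.
    """
    # Munchausen bonus: alpha * clip(tau * log pi(a|s), clip_min, 0).
    # tau * log pi is never positive, so only the lower clip is needed.
    log_pi = log_softmax(q_s, tau)
    bonus = alpha * max(tau * log_pi[action], clip_min)

    # Soft value of s': sum_a' pi(a'|s') * (q(s', a') - tau * log pi(a'|s')).
    log_pi_next = log_softmax(q_next, tau)
    soft_value = sum(
        math.exp(lp) * (q - tau * lp) for q, lp in zip(q_next, log_pi_next)
    )

    return reward + bonus + (0.0 if done else gamma * soft_value)
```

Setting alpha to 0 removes the bonus and leaves a soft (entropy-regularized) DQN target, which makes the contribution of the Munchausen term easy to isolate.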

Compress the pickles used to save the trained agents.
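As an illustration of the technique, a pickle can be compressed on the fly with the standard library. The release note does not say which compression scheme rlberry uses, so bz2 is an assumption here, and the helper names and the saved object are hypothetical.

```python
import bz2
import pickle
import tempfile
from pathlib import Path


def save_compressed(obj, path):
    """Pickle obj and write it through a bz2 compressor."""
    with bz2.open(path, "wb") as f:
        pickle.dump(obj, f, protocol=pickle.HIGHEST_PROTOCOL)


def load_compressed(path):
    """Read a bz2-compressed pickle back into a Python object."""
    with bz2.open(path, "rb") as f:
        return pickle.load(f)


# Hypothetical stand-in for a trained agent's state: highly repetitive data.
agent_state = {"weights": [0.0] * 10_000, "n_steps": 1234}

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "agent.pickle.bz2"
    save_compressed(agent_state, path)
    compressed_size = path.stat().st_size
    restored = load_compressed(path)

raw_size = len(pickle.dumps(agent_state, protocol=pickle.HIGHEST_PROTOCOL))
```

The gain depends on how redundant the saved data is; the repetitive toy state above compresses far better than typical trained weights would.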

Implementation of the rlberry.envs.SpringCartPole environment, an RL environment featuring two cartpoles linked by a spring.

Improved logging: the logging level can now be changed with rlberry.utils.logging.set_level().
Introduced smoothing of the curves drawn by plot_writer_data when only one seed is used.
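With a single seed there is no spread to average over, so a raw learning curve is noisy. The release note does not specify which smoothing method plot_writer_data applies; the sketch below uses a trailing moving average purely as an example, with a hypothetical function name and data.

```python
def moving_average(values, window):
    """Trailing moving average; early points average over the samples seen so far."""
    if window < 1:
        raise ValueError("window must be at least 1")
    smoothed = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            running -= values[i - window]  # drop the sample that left the window
        smoothed.append(running / min(i + 1, window))
    return smoothed


# Hypothetical noisy episode rewards from a single seed.
noisy_rewards = [1.0, 2.0, 3.0, 4.0, 5.0]
smooth_rewards = moving_average(noisy_rewards, window=3)
```

A larger window gives a smoother curve at the cost of lagging further behind sudden changes in performance.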

Moved PPO from the experimental agents to the torch agents. It is now tested and benchmarked.
