Skip to content

Release 0.6.0

Compare
Choose a tag to compare
@yannbouteiller yannbouteiller released this 17 Jan 19:52
· 20 commits to master since this release

Major release 0.6.0

This release introduces support for non-real-time environments in the TMRL library, and support for Linux in the TrackMania pipeline.

Version 0.6.0 is backward-incompatible and requires a clean installation.

Major changes

  • TrackMania example pipeline (see config.json):
    • Support for Linux
    • Support for saving replays automatically
    • Support for reward shaping
  • TMRL library:
    • Support for non-real-time environments and Trainer/Worker synchronization
    • Generic training pipeline (in particular, introduced a generic Memory class for lazy developers, compatible with random sampling in 1-step TD learning)

Minor changes

  • TrackMania example pipeline (see config.json):
    • More fine-tuning options for SAC
    • Set the default Adam Betas to the RL-compatible setting described by Mahmood et al. 2023 in an attempt to avoid policy collapse
    • The default hyperparameters changed for full vision-based training instead of LIDAR training