Release 0.6.0
Major release 0.6.0
This release introduces support for non-real-time environments in the TMRL library, and support for Linux in the TrackMania pipeline.
Version 0.6.0
is backward-incompatible and requires a clean installation.
Major changes
- TrackMania example pipeline (see
config.json
):- Support for Linux
- Support for saving replays automatically
- Support for reward shaping
- TMRL library:
- Support for non-real-time environments and Trainer/Worker synchronization
- Generic training pipeline (in particular, introduced a generic
Memory
class for lazy developers, compatible with random sampling in 1-step TD learning)
Minor changes
- TrackMania example pipeline (see
config.json
):- More fine-tuning options for SAC
- Set the default Adam Betas to the RL-compatible setting described by Mahmood et al. 2023 in an attempt to avoid policy collapse
- The default hyperparameters changed for full vision-based training instead of LIDAR training