Custom SFT Trainer with KL divergence loss

This repo extends Hugging Face's trl.SFTTrainer with a KL divergence loss between a LoRA-adapted model and its base counterpart. Regularizing the adapted model's predictions toward the base model's distribution makes fine-tuning more stable and conservative.

Setup

conda env create -f environment.yml
conda activate custom_sft_loss

Custom Loss

The custom loss is implemented in custom_trainer.py. The trainer there extends the SFTTrainer class and overrides its compute_loss method to add a KL divergence term to the loss.
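To make the arithmetic concrete, here is a minimal, framework-free sketch of a cross-entropy loss with an added KL term. It is an illustration under assumptions, not the code in custom_trainer.py: the function names, the `kl_weight` parameter and its default, the KL direction (adapted model relative to base), and the averaging over token positions are all hypothetical choices made for this example.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one list of vocabulary logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def kl_divergence(adapted_logits, base_logits):
    """KL(adapted || base) for a single token position (assumed direction)."""
    log_p = log_softmax(adapted_logits)
    log_q = log_softmax(base_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


def sft_loss_with_kl(adapted_logits, base_logits, labels, kl_weight=0.1):
    """Mean cross-entropy plus kl_weight times the mean per-token KL.

    adapted_logits, base_logits: one list of logits per token position.
    labels: the target token index for each position.
    kl_weight: hypothetical coefficient for the KL term.
    """
    ce_total = 0.0
    kl_total = 0.0
    for adapted_row, base_row, label in zip(adapted_logits, base_logits, labels):
        ce_total += -log_softmax(adapted_row)[label]   # standard SFT term
        kl_total += kl_divergence(adapted_row, base_row)  # regularizer
    n = len(labels)
    return ce_total / n + kl_weight * kl_total / n


if __name__ == "__main__":
    adapted = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]]
    base = [[1.5, 0.7, -0.8], [0.0, 0.0, 0.0]]
    print(sft_loss_with_kl(adapted, base, labels=[0, 2]))
```

When the adapted and base logits are identical, the KL term is zero and the result reduces to the plain cross-entropy loss. A larger `kl_weight` keeps the adapted model closer to the base model at the cost of fitting the training labels less tightly.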

Training

The training script is train.py. You can use it to compare training with the custom loss against the standard SFT loss.

