Skip to content

smartliuhw/CoT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Intro

This repo is used for @smartliuhw thesis's model training. The huggingface SFT trainer is used as the training framwork with deepspeed methodology to ensure the RTX 4090 GPU can be used properly.

How to use

Install the dependency

The environment dependency is listed in the requirment file, just run the following command:

pip install -r requirements.txt

Modify data process file

The data processing code is in the utils.py file, all the data should be stored with the Dataset class. The function get_train_data is the most important part, modify it accroding to your demand.

Modify train file

The model tran code is in the train.py file, using trl framework. Modify the args, templates, special tokens accroding to your demand.

Run trainning

You can launch a training by following the train example file, with only few changes about the model and the data.

If you have any question, feel free to ask me.

About

personal code of cot training

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published