Intro

This repo is used for @smartliuhw thesis's model training. The huggingface SFT trainer is used as the training framwork with deepspeed methodology to ensure the RTX 4090 GPU can be used properly.

How to use

Install the dependency

The environment dependency is listed in the requirment file, just run the following command:

pip install -r requirements.txt

Modify data process file

The data processing code is in the utils.py file, all the data should be stored with the Dataset class. The function get_train_data is the most important part, modify it accroding to your demand.

Modify train file

The model tran code is in the train.py file, using trl framework. Modify the args, templates, special tokens accroding to your demand.

Run trainning

You can launch a training by following the train example file, with only few changes about the model and the data.

If you have any question, feel free to ask me.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
output		output
scripts		scripts
src		src
tensorboard_logs		tensorboard_logs
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intro

How to use

Install the dependency

Modify data process file

Modify train file

Run trainning

About

Releases

Packages

Languages

smartliuhw/CoT

Folders and files

Latest commit

History

Repository files navigation

Intro

How to use

Install the dependency

Modify data process file

Modify train file

Run trainning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages