Wizard_QLoRA_Finetuning

Finetuning Some Wizard Models With QLoRA

Demo

https://youtu.be/hkt5Nz0buso?si=HNmYLp_z5SGZlMbM

Pipeline

Model fine-tuning

Finetuning can be done with the finetune.py script. In this script, a model will be downloaded and finetuned on one of the datasets in 4-bit precision. As finetuning progress is being made, checkpoints are saved to the specified output directory.

Merging

After the model is trained, one of the checkpoint files should be merged so that the LoRA weights and old weights are combined into a single weight matrix, making inference more efficient than if you had them split. merge.py does the merge given a specified checkpoint file and the specified model type.

Inference

Inference has a few scripts. infer.py and infer.ipynb are similar and just run straight inference on a given model. infer_interface.ipynb has an additional interface using Gradio.

Uploading/Saving Models

upload.py can be used to upload huggingface models to the hub easily given a repo name to upload. Make sure to get a write token from huggingface to upload properly.

Data Creation

data_creation.ipynb is a simple example of data creation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wizard_QLoRA_Finetuning

Demo

Pipeline

Model fine-tuning

Merging

Inference

Uploading/Saving Models

Data Creation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.gitignore		.gitignore
README.md		README.md
data_creation.ipynb		data_creation.ipynb
data_creation.py		data_creation.py
finetune.py		finetune.py
infer.ipynb		infer.ipynb
infer.py		infer.py
infer_interface.ipynb		infer_interface.ipynb
merge.py		merge.py
requirements.txt		requirements.txt
upload.py		upload.py

gmongaras/Wizard_QLoRA_Finetuning

Folders and files

Latest commit

History

Repository files navigation

Wizard_QLoRA_Finetuning

Demo

Pipeline

Model fine-tuning

Merging

Inference

Uploading/Saving Models

Data Creation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages