LoRA fine-tuning is an adapter-based technique for fine-tuning an LLM. It changes the model architecture by adding small, learnable LoRA layers to the transformer blocks. During fine-tuning, only the LoRA weights are updated while the original LLM weights stay frozen, so it requires much less GPU memory than full fine-tuning. Based on this table, fine-tuning a 7B model in 16-bit precision takes about 16 GB of memory, which fits on an RTX 3090, 4080, or 4090. An even wider range of GPUs can handle 3.8B models like Phi-3.5-mini.
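To make the mechanism concrete, here is a minimal sketch of a LoRA-adapted linear layer written against TorchSharp (the tensor library the GenAI packages build on). The class name `LoraLinear` and the `rank`/`alpha` parameters are illustrative assumptions, not the proposed Microsoft.ML.GenAI.Lora API:

```csharp
using TorchSharp;
using TorchSharp.Modules;
using static TorchSharp.torch;

// Illustrative sketch only: wraps a frozen Linear layer and adds a trainable
// low-rank update, y = Wx + (alpha / r) * B A x.
public sealed class LoraLinear : nn.Module<Tensor, Tensor>
{
    private readonly Linear _base;     // pretrained weights, frozen
    private readonly Parameter _loraA; // (rank, inFeatures), trainable
    private readonly Parameter _loraB; // (outFeatures, rank), trainable
    private readonly double _scaling;

    public LoraLinear(Linear baseLayer, long inFeatures, long outFeatures,
                      int rank = 8, double alpha = 16.0) : base(nameof(LoraLinear))
    {
        _base = baseLayer;
        foreach (var p in _base.parameters())
            p.requires_grad_(false); // only the LoRA weights receive gradients

        // Standard LoRA init: A is small random, B is zero,
        // so the adapter starts as a no-op and training is stable.
        _loraA = nn.Parameter(randn(rank, inFeatures) * 0.01);
        _loraB = nn.Parameter(zeros(outFeatures, rank));
        _scaling = alpha / rank;
        RegisterComponents();
    }

    public override Tensor forward(Tensor x)
    {
        var frozen = _base.forward(x);                          // frozen path
        var update = x.matmul(_loraA.t()).matmul(_loraB.t());   // low-rank path
        return frozen + update * _scaling;
    }
}
```

Because only `_loraA` and `_loraB` require gradients, the optimizer state and gradient buffers cover a tiny fraction of the model's parameters, which is where the memory savings in the table above come from.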
API design (wip)
Package:
Microsoft.ML.GenAI.Lora
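Since the API surface is still WIP, the following is a purely hypothetical usage sketch; `LoraLinear` comes from the illustration above, not from the package:

```csharp
using System.Linq;
using static TorchSharp.torch;

// e.g. adapt a single attention projection of a frozen model (sizes assumed).
var proj = nn.Linear(4096, 4096);
var adapted = new LoraLinear(proj, inFeatures: 4096, outFeatures: 4096, rank: 8);

// Only the LoRA parameters are handed to the optimizer.
var trainable = adapted.parameters().Where(p => p.requires_grad);
var optimizer = optim.AdamW(trainable, lr: 1e-4);
```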