Why is it recommended to set load_in_8bit: true for LoRA finetuning?
#1611
Unanswered
rudolpheric asked this question in Q&A
Replies: 0 comments
I started experimenting with LoRA finetuning. Since I have enough memory, I don't quantize the base model, but the model still always gets worse through LoRA finetuning, and I am wondering why that is. I saw a warning in the logs that it is recommended to quantize to 8 bit. Why is this recommended? Shouldn't the model lose performance through quantization?
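For reference, my understanding of what the 8-bit LoRA setup looks like with Transformers and PEFT is roughly the following sketch (the model name, target modules, and LoRA hyperparameters are just placeholders, not my actual config):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_name = "meta-llama/Llama-2-7b-hf"  # placeholder base model

# Corresponds to load_in_8bit: true — base weights are loaded via bitsandbytes in int8
bnb_config = BitsAndBytesConfig(load_in_8bit=True)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map="auto",
)

# Makes the quantized model trainable: freezes the base weights, casts layer norms
# to fp32, and enables gradient checkpointing by default
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)

# Only the LoRA adapter weights are trainable; they stay in higher precision
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

My understanding is that in both the quantized and unquantized case the base weights are frozen and only the LoRA adapter weights are trained, so I don't see why 8-bit would be preferable beyond saving memory.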