Instruction tuning of LLama2 is significantly slower compared to documented 3 hours fine-tuning time on A10G. #35

Open
mlscientist2 opened this issue Sep 20, 2023 · 1 comment

Comments

@mlscientist2

mlscientist2 commented Sep 20, 2023

Hi,

First of all, thanks for putting together the nicely formatted code for fine-tuning LLaMA 2 in 4-bit.
I was able to follow all the steps and set up training of the model as shown in your tutorial / IPython notebook: https://www.philschmid.de/instruction-tune-llama-2

Your tutorial mentions that the training time on a g5.2xlarge without flash attention is around 3 hours. However, running your code shows an estimated training time of 40 hours! Can you help narrow down the difference/issue?

I am attaching some screenshots. At a high level, I suspect there is a bottleneck in the data loader, since the code is only using one CPU core. I tried adding the `num_workers` flag in `TrainingArguments`, but that did not help. GPU utilization seems decent.
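For reference, a minimal sketch of how dataloader workers can be set through `TrainingArguments` (assuming the standard `transformers` API; the other values below are illustrative, not copied from the tutorial or my actual script):

```python
from transformers import TrainingArguments

# Minimal sketch: the setting relevant to the suspected bottleneck is
# dataloader_num_workers, which controls how many CPU worker processes
# prepare batches for the GPU. Other values here are illustrative only.
training_args = TrainingArguments(
    output_dir="llama2-7b-int4-instruct",  # hypothetical output directory
    per_device_train_batch_size=4,
    gradient_accumulation_steps=2,
    num_train_epochs=3,
    dataloader_num_workers=4,      # >0 enables multiple CPU data-loading workers
    dataloader_pin_memory=True,    # pin host memory for faster host-to-GPU copies
)
```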

[3 screenshots attached]

Any thoughts @philschmid ?

@mlscientist2 mlscientist2 changed the title Instruction tuning of LLama2 is significantly slower compared to claimed 3 hours fine-tuning time on A10G. Instruction tuning of LLama2 is significantly slower compared to documented 3 hours fine-tuning time on A10G. Sep 21, 2023
@hassantsyed

hassantsyed commented Nov 30, 2023

Any ideas here?
I'm seeing 11 hours for a 1024 context length and 22 hours for 2048. Would love to get this down to 3!
