You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for the great repo.
Do you have any suggestions on how to finetune DeepSeek-Coder v2 236B.
Is it possible to do fsdp + qlora for this moe model? What's them minimum required to do finetuning?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Thank you for the great repo.
Do you have any suggestions on how to finetune DeepSeek-Coder v2 236B.
Is it possible to do fsdp + qlora for this moe model? What's them minimum required to do finetuning?
Beta Was this translation helpful? Give feedback.
All reactions