Adapting ALMA source code to LLaMa 3.1 #71

aiyubx · 2024-12-02T05:22:00Z

Hey there!

I am considering adapting ALMA to train LLaMa 3.1.

I was wondering whether you have tried it and, if so, whether the results were similar to or better than those with LLaMa 2.

Regarding the source code changes, at first glance only the EOS- and padding-related parts seem to need fixes. Any tips would be appreciated!
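To make the padding question concrete, here is a minimal sketch of the kind of change I have in mind. It is not taken from the ALMA code. The helper names `ensure_pad_token` and `mask_padding` are hypothetical, the tokenizer is assumed to expose the usual Hugging Face attributes (`pad_token`, `eos_token`, `get_vocab()`), and the default `<|finetune_right_pad_id|>` is my assumption about a reserved pad token in the LLaMa 3.1 vocabulary, so please check it against the tokenizer you actually load.

```python
def ensure_pad_token(tokenizer, preferred_pad="<|finetune_right_pad_id|>"):
    """Make sure the tokenizer has a pad token, ideally one distinct from EOS.

    `preferred_pad` is an assumed token name; if it is not in the vocabulary
    we fall back to reusing the EOS token.
    """
    if tokenizer.pad_token is not None:
        # Already configured (for example by the checkpoint); leave it alone.
        return tokenizer.pad_token
    if preferred_pad in tokenizer.get_vocab():
        tokenizer.pad_token = preferred_pad
    else:
        tokenizer.pad_token = tokenizer.eos_token
    return tokenizer.pad_token


def mask_padding(labels, attention_mask, ignore_index=-100):
    """Replace labels at padded positions with `ignore_index`.

    Masking by attention mask rather than by pad id keeps a real EOS token
    in the loss even when pad and EOS share the same id.
    """
    return [
        label if keep else ignore_index
        for label, keep in zip(labels, attention_mask)
    ]
```

The second helper matters mostly in the fallback case: if padding reuses EOS and labels are masked by token id, the genuine EOS at the end of each target would be masked too, and the model may never learn to stop.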

Thank you.

zhl606 · 2024-12-25T09:07:25Z

Hello, I would like to use this training approach to fine-tune LLaMa 3 as well. I look forward to more discussions.
