Hey there!
I am considering adapting ALMA to train LLaMa 3.1.
I was wondering if you have tried it and if the results were similar or better than with LLaMa 2.
Regarding the source code changes, at first glance only the EOS- and padding-related parts seem to need fixes. Any tips would be appreciated!
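For concreteness, here is a minimal sketch of the kind of tokenizer adjustment I have in mind, assuming the standard Hugging Face `transformers` tokenizer API (the model ID and the exact place this hooks into ALMA's preprocessing are just illustrative):

```python
from transformers import AutoTokenizer

# Illustrative checkpoint name; substitute the LLaMA 3.1 variant you actually use.
model_id = "meta-llama/Llama-3.1-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# LLaMA 3.1 does not define a pad token, so reuse EOS (or register a
# dedicated pad token) before building padded training batches.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
    tokenizer.pad_token_id = tokenizer.eos_token_id

# LLaMA 3.x also uses different special tokens than LLaMA 2's "</s>",
# so any hard-coded EOS string in the data preprocessing would need updating.
text = (
    "Translate this from German to English:\n"
    "German: Guten Morgen.\n"
    "English: Good morning."
)
input_ids = tokenizer(text)["input_ids"] + [tokenizer.eos_token_id]
```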
Thank you.
Hello, I would like to use this training approach to fine-tune LLaMa 3 as well. I look forward to more discussions.