-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AfroBench: How Good are Large Language Models on African Languages? #2825
base: main
Are you sure you want to change the base?
Conversation
Afrixnli task
Afrimmlu direct and few shot
add afrimgsm -direct
bash script for gpt models
update afrixnli tasks
update metrics for afrixnli
update prompt translations
update afrobench
update readme instructions
add individual dataset readmes
add link to collections
Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani |
Thanks for the comment everyone in the commit history has signed the agreement. |
It looks like a bunch of tests are failing. Can you run the pre-commit script and see about addressing those issues? |
Hi Stella, I've addressed the pertinent issues, but I’m unsure if the remaining ones are within my scope to fix:
|
Overview
AfroBench is a large scale benchmark of African languages on LLMs, covering 15 tasks and 22 datasets.
Features
The PR features 19 new tasks/datasets all in the AfroBench subfolder
It also updates the irokobench tasks - AfriXNLI, AfriMGSM and AfriMMLU to reflect recent experiments performed on these tasks
Testing
The PR has been tested locally against several open source models
Documentation
Adequate documentation has been provided on each task ReadMe pages.