Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AfroBench: How Good are Large Language Models on African Languages? #2825

Open
wants to merge 200 commits into
base: main
Choose a base branch
from

Conversation

JessicaOjo
Copy link
Contributor

@JessicaOjo JessicaOjo commented Mar 20, 2025

Overview

AfroBench is a large scale benchmark of African languages on LLMs, covering 15 tasks and 22 datasets.

Features

The PR features 19 new tasks/datasets all in the AfroBench subfolder
It also updates the irokobench tasks - AfriXNLI, AfriMGSM and AfriMMLU to reflect recent experiments performed on these tasks

Testing

The PR has been tested locally against several open source models

Documentation

Adequate documentation has been provided on each task ReadMe pages.

@CLAassistant
Copy link

CLAassistant commented Mar 20, 2025

CLA assistant check
All committers have signed the CLA.

@baberabb
Copy link
Contributor

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

@JessicaOjo
Copy link
Contributor Author

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

@StellaAthena
Copy link
Member

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

It looks like a bunch of tests are failing. Can you run the pre-commit script and see about addressing those issues?

@JessicaOjo
Copy link
Contributor Author

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

It looks like a bunch of tests are failing. Can you run the pre-commit script and see about addressing those issues?

Hi Stella,

I've addressed the pertinent issues, but I’m unsure if the remaining ones are within my scope to fix:

  • Linters: The files are being formatted, but I’m not sure what’s causing the failure.
  • External LM test: It’s failing due to an accelerate error—specifically, 'clear_device_cache' from accelerate.utils.memory.
  • Scan for changed task test: This seems to be failing at the test script level.
    Let me know if you have any suggestions on how I can resolve these. Thanks!

@baberabb baberabb self-assigned this Mar 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

7 participants