AfroBench: How Good are Large Language Models on African Languages? #2825

JessicaOjo · 2025-03-20T18:06:45Z

Overview

AfroBench is a large scale benchmark of African languages on LLMs, covering 15 tasks and 22 datasets.

Features

The PR features 19 new tasks/datasets all in the AfroBench subfolder
It also updates the irokobench tasks - AfriXNLI, AfriMGSM and AfriMMLU to reflect recent experiments performed on these tasks

Testing

The PR has been tested locally against several open source models

Documentation

Adequate documentation has been provided on each task ReadMe pages.

Afrixnli task

Afrimmlu direct and few shot

add afrimgsm -direct

bash script for gpt models

update afrixnli tasks

update metrics for afrixnli

update prompt translations

update afrobench

update readme instructions

add individual dataset readmes

add link to collections

CLAassistant · 2025-03-20T18:06:59Z

All committers have signed the CLA.

baberabb · 2025-03-21T02:51:53Z

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

JessicaOjo · 2025-03-24T18:03:01Z

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

StellaAthena · 2025-03-24T20:17:43Z

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

It looks like a bunch of tests are failing. Can you run the pre-commit script and see about addressing those issues?

JessicaOjo · 2025-03-25T17:17:17Z

Hi! Thanks for your substantial PR. I'll try to review it soon! Just need everyone in the commit history to pacify the CLI assistant by signing the agreement. cc: @theyorubayesian, @dadelani

Thanks for the comment everyone in the commit history has signed the agreement.

It looks like a bunch of tests are failing. Can you run the pre-commit script and see about addressing those issues?

Hi Stella,

I've addressed the pertinent issues, but I’m unsure if the remaining ones are within my scope to fix:

Linters: The files are being formatted, but I’m not sure what’s causing the failure.
External LM test: It’s failing due to an accelerate error—specifically, 'clear_device_cache' from accelerate.utils.memory.
Scan for changed task test: This seems to be failing at the test script level.
Let me know if you have any suggestions on how I can resolve these. Thanks!

JessicaOjo and others added 30 commits May 7, 2024 13:52

add afrixnli to task

a27ea4b

add chat completion

b432d0e

remove chat completion -untested

ee1f296

Merge pull request #1 from JessicaOjo/afrixnli

72f5f4b

Afrixnli task

Merge branch 'EleutherAI:main' into main

eea16d3

afrimmlu added

138eb56

afrimmlu folder update

64490d9

afrimmlu folder update

f64b943

updated prompt

343880a

remove print

901ad39

Merge pull request #2 from JessicaOjo/afrimmlu

816832f

Afrimmlu direct and few shot

add afrimgsm -direct

187ab73

add squad metric

6ca56ea

fix bash script

21fb0db

remove direct util, update common yaml

3eab4b9

remove print

ae6e5cb

add few show. metric fixes

c4f634c

Merge pull request #3 from JessicaOjo/afrimgsm

9f020aa

add afrimgsm -direct

fix direct path, add bash script for gpt models

bcee8f2

Merge pull request #4 from JessicaOjo/afrimgsm

cc58abe

bash script for gpt models

Merge branch 'EleutherAI:main' into main

3a8f1e4

added transate test

452d202

update afrixnli tasks

b84c8c9

update afrixnli tasks

b506793

Merge pull request #5 from JessicaOjo/africanli

f41e442

update afrixnli tasks

update metrics for afrixnli

a3953e0

Merge pull request #6 from JessicaOjo/africanli

a82623e

update metrics for afrixnli

prompt translations fix

6c8d405

prompt translations fix

b691c95

Merge pull request #7 from JessicaOjo/africanli

02f7435

update prompt translations

JessicaOjo added 11 commits March 19, 2025 09:52

update afrobench

bf1b2ca

update afrobench

91ecec7

restore metric to original script

2895fb8

Merge pull request #47 from JessicaOjo/afrobench

26caf93

update afrobench

update readme instructions

189da61

Merge pull request #48 from JessicaOjo/afrobench

f010c47

update readme instructions

add individual dataset readmes

afc2eb1

Merge pull request #49 from JessicaOjo/afrobench

3bc270d

add individual dataset readmes

add link to collections

fa753b1

correct run script

4afc70a

Merge pull request #50 from JessicaOjo/afrobench

e739c96

add link to collections

JessicaOjo requested review from baberabb and StellaAthena as code owners March 20, 2025 18:06

JessicaOjo added 9 commits March 20, 2025 14:17

align with main

cfe0e30

align with main

d164c98

align with main

a1a6a16

align with main

da4ed95

align with main

fd2f338

align with main

1ccec3a

align with main

4c08dc1

align with main

d7a2eb9

merge upstream main

0daebc0

failed run fixes

9664344

failed run fixes

9c037a9

baberabb self-assigned this Mar 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AfroBench: How Good are Large Language Models on African Languages? #2825

AfroBench: How Good are Large Language Models on African Languages? #2825

JessicaOjo commented Mar 20, 2025 •

edited

Loading

CLAassistant commented Mar 20, 2025 •

edited

Loading

baberabb commented Mar 21, 2025

JessicaOjo commented Mar 24, 2025

StellaAthena commented Mar 24, 2025

JessicaOjo commented Mar 25, 2025

AfroBench: How Good are Large Language Models on African Languages? #2825

Are you sure you want to change the base?

AfroBench: How Good are Large Language Models on African Languages? #2825

Conversation

JessicaOjo commented Mar 20, 2025 • edited Loading

Overview

Features

Testing

Documentation

CLAassistant commented Mar 20, 2025 • edited Loading

baberabb commented Mar 21, 2025

JessicaOjo commented Mar 24, 2025

StellaAthena commented Mar 24, 2025

JessicaOjo commented Mar 25, 2025

JessicaOjo commented Mar 20, 2025 •

edited

Loading

CLAassistant commented Mar 20, 2025 •

edited

Loading