Pull requests: EleutherAI/lm-evaluation-harness

Pull requests list

Add support for quantization_config
#2842 opened Mar 25, 2025 by jerryzh168
Add simple Dockerfile and instructions
#2837 opened Mar 24, 2025 by kiersten-stokes
feat: Numeric bench
#2835 opened Mar 24, 2025 by Gresham429
[leaderboard] math - sync with repo
#2817 opened Mar 19, 2025 by baberabb
E3 c v3 name entity recognition
#2812 opened Mar 18, 2025 by sfarzi
Adding ACPBench task
#2807 opened Mar 17, 2025 by harshakokel
6 tasks done
Add new task named e3c_v3_re
#2806 opened Mar 17, 2025 by sfarzi
Add GigaChat models
#2805 opened Mar 17, 2025 by seldereyy
Add GSM8K Platinum
#2771 opened Mar 7, 2025 by Qubitium
paws-x fix formatting
#2759 opened Mar 5, 2025 by baberabb
New benchmark: CaselawQA
#2739 opened Feb 26, 2025 by RicardoDominguez
Add support for sequence labeling
#2718 opened Feb 20, 2025 by jogonba2
Add AIBE task and utilities
#2712 opened Feb 18, 2025 by parimalthakre01
Add Task (Financial mmlu ko)
#2699 opened Feb 14, 2025 by choics2623
Add generation variants of some tasks
#2688 opened Feb 11, 2025 by baberabb
Convert multiple_choice to gen tasks
#2670 opened Feb 4, 2025 by baberabb Draft
Add from dataframe
#2655 opened Jan 25, 2025 by AMindToThink
Include all test files in sdist
#2634 opened Jan 19, 2025 by booxter