perf: add QwenMath accuracy example results
dacorvo committed Dec 3, 2024
1 parent fa3a92a commit 0e5e040
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions benchmark/text-generation/accuracy/README.md
@@ -21,3 +21,10 @@ You can evaluate:
| | |none | 0|acc_norm ||0.7581|± |0.0043|
|lambada_openai| 1|none | 0|acc ||0.7173|± |0.0063|
| | |none | 0|perplexity ||3.1102|± |0.0769|

### Qwen/Qwen2.5-Math-7B-Instruct

|Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k| 3|flexible-extract| 5|exact_match||0.8878|± |0.0087|
| | |strict-match | 5|exact_match||0.8870|± |0.0087|
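
As a sanity check on the `Stderr` column, the sketch below recomputes the standard error of an exact-match accuracy from the number of evaluated samples. It assumes the score is a mean of 0/1 outcomes over the 1319 problems of the GSM8K test split; that sample count is not stated in this table, and `accuracy_stderr` is a hypothetical helper, not part of the evaluation tooling.

```python
import math


def accuracy_stderr(accuracy: float, n_samples: int) -> float:
    """Standard error of a mean of 0/1 scores, using the sample variance (n - 1 divisor)."""
    # Sample variance of a 0/1 variable whose mean is `accuracy`.
    variance = accuracy * (1.0 - accuracy) * n_samples / (n_samples - 1)
    # Standard error of the mean.
    return math.sqrt(variance / n_samples)


# Assumption: GSM8K test split size; not given in the results table.
N_GSM8K_TEST = 1319

for name, value in [("flexible-extract", 0.8878), ("strict-match", 0.8870)]:
    print(f"{name}: {value:.4f} ± {accuracy_stderr(value, N_GSM8K_TEST):.4f}")
```

Under that assumption both filters give a standard error that rounds to 0.0087, consistent with the values reported above.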
