Unable to reproduce the performance of "mathematical reasoning" #223

jasonaidm · 2025-02-07T09:47:57Z

Hi,
Thanks for your effort!

I cannot reproduce the performance of "mathematical reasoning" when I using the command:
ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/zero2.yaml --num_processes=7 src/open_r1/grpo.py --config recipes/deepseek/DeepSeek-R1-Distill-Qwen-7B/grpo/config_base_math_smalllr.yaml

The training curve is as follows：

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to reproduce the performance of "mathematical reasoning" #223

Unable to reproduce the performance of "mathematical reasoning" #223

jasonaidm commented Feb 7, 2025

Unable to reproduce the performance of "mathematical reasoning" #223

Unable to reproduce the performance of "mathematical reasoning" #223

Comments

jasonaidm commented Feb 7, 2025