-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Issues: huggingface/open-r1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
how can I get the prediction using the provided evaluation script?
#625
opened Apr 25, 2025 by
CurryxIaoHu
Does the Qwen-2.5-VL model in the GRPO project currently support multi-image input?
#601
opened Apr 14, 2025 by
zby1218
model.generate
produces right-padded completions, causing incompatibility with Flash Attention 2
#599
opened Apr 14, 2025 by
PolarisHsu
GRPO config for finetuning Qwen-7B-Math-Instruct on OpenR1-Math-220k
#589
opened Apr 9, 2025 by
toslali-ibm
vllm generate n responses, some responses stop after generating </answer>, some can not stop.
#582
opened Apr 6, 2025 by
LaoWangGB
understanding GRPO code pipeline. is this fully online learning in code?
#581
opened Apr 5, 2025 by
dongje
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-03-29.