Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix eos sft
#3200 opened Mar 31, 2025 by qgallouedec Draft
5 tasks
[GRPO] Improve completion length logging
#3188 opened Mar 31, 2025 by edbeeching Loading…
GRPO: Scalable training with one LLM/node
#3186 opened Mar 31, 2025 by jglaser Draft
4 tasks
Integrate Liger GRPO Loss to GRPO Trainer
#3184 opened Mar 31, 2025 by shivam15s Draft
1 of 5 tasks
🏃 Faster CI
#3160 opened Mar 25, 2025 by qgallouedec Loading…
5 tasks
Fix: Compatibility for formatting_func returning a list
#3147 opened Mar 24, 2025 by YeFD Loading…
4 of 5 tasks
Fix length bias for Dr GRPO
#3138 opened Mar 23, 2025 by idoru Loading…
5 tasks
Extend BCO Trainer dataset format support
#3134 opened Mar 22, 2025 by reihig-ut Loading…
1 of 5 tasks
feat: Add Interleaved Trainer implementation
#3107 opened Mar 18, 2025 by ucalyptus2 Loading…
3 tasks done
Update sft trainer to include better packing
#3100 opened Mar 17, 2025 by Ishan-Kumar2 Loading…
4 tasks done
add cli dict parsing for grpo_config
#3082 opened Mar 14, 2025 by Tavish9 Draft
2 of 5 tasks
[GRPO] add vlm training capabilities to the trainer
#3072 opened Mar 13, 2025 by CompN3rd Loading…
3 of 5 tasks
[WIP] PEFT 🤝 Liger DPO
#3065 opened Mar 12, 2025 by SalmanMohammadi Draft
5 tasks
Static cache GRPO
#3023 opened Mar 7, 2025 by qgallouedec Draft
5 tasks
[WIP] Iterative training scripts for SPIN and SPPO
#3011 opened Mar 5, 2025 by jkx19 Draft
3 of 5 tasks
ProTip! no:milestone will show everything without a milestone.