Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: more numerically stable qwen custom plan
#1235 opened Sep 30, 2025 by terrykong Loading…
build: Fix ngc pytorch build with deep-ep
#1234 opened Sep 30, 2025 by chtruong814 Loading…
4 tasks
fix: fp8 rollout nightly fix check from step 100 to 40 CI:docs Run doctest
#1233 opened Sep 30, 2025 by terrykong Loading…
chore: Log the initial training master config
#1232 opened Sep 29, 2025 by pjin-nvidia Loading…
4 tasks
fix: fix github to myst-parser admonition conversion documentation Improvements or additions to documentation
#1224 opened Sep 29, 2025 by terrykong Loading…
Tk/slurm bisect documentation Improvements or additions to documentation
#1223 opened Sep 29, 2025 by terrykong Draft
docs: async doc update for importance sampling correction documentation Improvements or additions to documentation r0.4.0
#1222 opened Sep 28, 2025 by parthchadha Loading…
4 tasks
Set attention_mask to None by default. CI:L1 Run doctests, unit tests, and functional tests
#1213 opened Sep 26, 2025 by joyang-nv Draft
4 tasks
feat: Multi-turn tool calling on BFCLv3 dataset community-request documentation Improvements or additions to documentation
#1207 opened Sep 25, 2025 by slikhite-1 Loading…
feat: Compute entropy across full vocab for logging r0.4.0
#1200 opened Sep 24, 2025 by parthchadha Loading…
4 tasks
chore: Bump vllm and ray
#1199 opened Sep 24, 2025 by guyueh1 Loading…
4 tasks
feat: [do not merge] Fp8 training kitchen documentation Improvements or additions to documentation
#1197 opened Sep 24, 2025 by guyueh1 Draft
4 tasks
ci: Test runner CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI
#1196 opened Sep 23, 2025 by chtruong814 Loading…
4 tasks
feat: Adding perf metrics CI:L1 Run doctests, unit tests, and functional tests Performance Related to improving performance
#1183 opened Sep 22, 2025 by youngeunkwon0405 Loading…
4 tasks
feat: FP8 rollout in GRPO for MoE models
#1175 opened Sep 21, 2025 by guyueh1 Loading…
4 tasks
refactor: unify get_logprobs() and score() logic in dtensor CI:L1 Run doctests, unit tests, and functional tests
#1173 opened Sep 21, 2025 by RayenTian Loading…
fix: simplified megatron to hf conversion script r0.4.0
#1169 opened Sep 20, 2025 by ahmadki Loading…
4 tasks
DSV3 feat branch
#1160 opened Sep 18, 2025 by joyang-nv Draft
4 tasks
fix: Fix OOM in validation during colocated training CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1159 opened Sep 18, 2025 by jseppanen Loading…
fix: Fix gradient clipping of non-float32 params CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1158 opened Sep 18, 2025 by jseppanen Loading…
feat: Add Penguin env
#1156 opened Sep 18, 2025 by bxyu-nvidia Loading…
4 tasks
Update mcore / mbridge 0917 r0.4.0
#1150 opened Sep 17, 2025 by yaoyu-33 Loading…
4 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.