-
Notifications
You must be signed in to change notification settings - Fork 173
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
chore: rename penguin -> nemo_gym and add the gym submodule
#1587
opened Dec 2, 2025 by
terrykong
Loading…
4 tasks
refactor: Introduce BasePolicyWorker
CI
Relating to CI
documentation
Improvements or additions to documentation
fix: Fix Fp8 sequence padding for PP>1 case
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1579
opened Nov 29, 2025 by
guyueh1
Loading…
4 tasks
feat: Support top-p and top-k
CI:L1
Run doctests, unit tests, and functional tests
#1578
opened Nov 27, 2025 by
zhandaz
Loading…
3 of 4 tasks
feat: plot vllm internal metrics to the wandb log
CI:L1
Run doctests, unit tests, and functional tests
ease of use
#1567
opened Nov 25, 2025 by
youngeunkwon0405
Loading…
4 tasks
chore: Bump vllm to 0.11.2, torch to 2.9, transformers to 4.57.1
CI:L1
Run doctests, unit tests, and functional tests
#1563
opened Nov 24, 2025 by
yfw
Loading…
4 tasks
fix: fix Dtensor sharding error when bump up pytorch version
#1557
opened Nov 21, 2025 by
ZhiyuLi-Nvidia
Loading…
4 tasks
feat: LoRA SFT support for DTensorV2 path
CI:L1
Run doctests, unit tests, and functional tests
#1556
opened Nov 21, 2025 by
samodi-nv
Loading…
2 tasks done
fix: remove sft-qwen2.5-fsdp2tp8sp from nighlies
CI:L0
Run doctests and unit tests
#1555
opened Nov 20, 2025 by
ahmadki
Loading…
fix: add H200 TFLOPS
CI:L0
Run doctests and unit tests
community-request
#1543
opened Nov 19, 2025 by
clumsy
Loading…
4 tasks done
feat: refactor dtensor policy v2 into core modular functions
#1542
opened Nov 19, 2025 by
hemildesai
•
Draft
4 tasks
fix: Use Float16Module even when defer_fp32_logits=True
CI:L1
Run doctests, unit tests, and functional tests
#1537
opened Nov 18, 2025 by
yfw
Loading…
4 tasks
feat: force on-policy ratio to 1
CI:L1
Run doctests, unit tests, and functional tests
#1529
opened Nov 17, 2025 by
yfw
Loading…
4 tasks
feat: Add moe load balancing metrics
CI:L1
Run doctests, unit tests, and functional tests
#1520
opened Nov 13, 2025 by
yfw
Loading…
4 tasks
feat: Automodel init for DTensorPolicyV2
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1509
opened Nov 12, 2025 by
adil-a
Loading…
refactor: refactor env and data processor & add nemotron super 49b recipes
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1506
opened Nov 11, 2025 by
yuki-97
Loading…
feat: pipeline-rl style # of inflight prompt regulation
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1499
opened Nov 10, 2025 by
youngeunkwon0405
Loading…
4 tasks
fix: Support vLLM DP+EP in async engine via Ray-level data parallelism
community-request
#1495
opened Nov 10, 2025 by
clumsy
Loading…
4 tasks done
feat: allow uv-less execution and fingerprint the environment
CI:L1
Run doctests, unit tests, and functional tests
CI
Relating to CI
documentation
Improvements or additions to documentation
#1491
opened Nov 9, 2025 by
terrykong
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.