Pull requests: NVIDIA/Megatron-LM


Pull requests list

ci(fix): No NGC image for build-test-wheel job
#1838 opened Sep 30, 2025 by ko3n1g
ci: Add dependabot
#1837 opened Sep 30, 2025 by ko3n1g
[bugfix] fix typo
#1833 opened Sep 29, 2025 by 1195343015
Update model_parallel_config.py
#1832 opened Sep 28, 2025 by skirdey-inflection
25.09 alpha rope split concat fusion
#1826 opened Sep 24, 2025 by vasunvidia
Dist_Muon optimizer support
#1813 opened Sep 18, 2025 by BoxiangW
Fix _set_wandb_writer serialization issues (labels: bug, module: debugging)
#1806 opened Sep 11, 2025 by gakkiri
5 of 8 tasks
Add files via upload
#1801 opened Sep 10, 2025 by wenchenqian
Quant
#1794 opened Sep 6, 2025 by Charles2530
Update README.md (labels: module: documentation)
#1792 opened Sep 4, 2025 by yuyu5333
Add falcon h1 2 (labels: enhancement)
#1785 opened Sep 2, 2025 by dhiaEddineRhaiem
bugfix: raise error if eos_token is not set in tokenizer (labels: bug, module: data pipeline)
#1774 opened Aug 27, 2025 by imomayiz
Fix torch_dist checkpointing ETP replica_id (labels: bug, module: moe)
#1770 opened Aug 25, 2025 by Skylion007
Fix Context Parallel NaN Loss (labels: bug)
#1765 opened Aug 21, 2025 by leoleoasd
Fix runaway Etpt in straggler detector by resetting FLOPs accumulator (labels: bug)
#1755 opened Aug 19, 2025 by cms42
[main][feature][under updating]zero-overhead activation offload (labels: enhancement)
#1752 opened Aug 18, 2025 by GeYuhong
fix: Initialize master_weight with params_dtype directly (labels: bug)
#1748 opened Aug 15, 2025 by Mirza-Samad-Ahmed-Baig
fix loading dcp OOM (labels: bug)
#1747 opened Aug 14, 2025 by zjjott
Hongbinl/1f1b overlap mirror 0813
#1743 opened Aug 13, 2025 by lhb8125