Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Common] NVTEGroupedTensor class and helpers
#2388 opened Nov 14, 2025 by phu0ngng Draft
13 tasks
Enables specified cp rank slicing
#2387 opened Nov 14, 2025 by jomitchellnv Loading…
1 of 13 tasks
[JAX] Re-use RHT matrix constant
#2386 opened Nov 14, 2025 by jberchtold-nvidia Draft
8 of 13 tasks
[Draft] TopK Fusion to JAX
#2385 opened Nov 14, 2025 by mingxu1067 Loading…
5 of 13 tasks
Set RPATH for cuda libraries from python package
#2381 opened Nov 14, 2025 by take-cheeze Draft
4 of 13 tasks
Add num_splits support for FA3 backend 2.10.0
#2380 opened Nov 14, 2025 by cyanguwa Loading…
8 of 13 tasks
CP + THD + AG + Striped
#2379 opened Nov 13, 2025 by KshitijLakhani Draft
13 tasks
[PyTorch] Reduce CPU overheads
#2377 opened Nov 13, 2025 by ksivaman Loading…
8 of 14 tasks
[Pytorch] Fix backward_dw cuda graph order
#2376 opened Nov 13, 2025 by Wohox Loading…
1 of 13 tasks
[PyTorch] Enable reference Current Scaling recipe
#2368 opened Nov 11, 2025 by negvet Loading…
13 tasks
[JAX] NVFP4 2D 1x1x for Weight
#2365 opened Nov 10, 2025 by phu0ngng Draft
13 tasks
[JAX] cuBlasMp integration for CollectiveGemm custom op 2.10.0
#2361 opened Nov 7, 2025 by denera Loading…
5 of 13 tasks
Add device-Initiated Grouped GEMM supporting m_splits on device
#2360 opened Nov 7, 2025 by QiZhangNV Loading…
1 of 13 tasks
Add num_splits support for FA3 backend
#2357 opened Nov 6, 2025 by wdykas Loading…
13 tasks
[PyTorch][NVFP4][MOE] NVFP4 Grouped Hadamard Amax Kernel
#2351 opened Nov 6, 2025 by zhongbozhu Loading…
4 of 17 tasks
More detailed documentation for recipes
#2343 opened Nov 4, 2025 by pggPL Draft
[Core] Fix inconsistent logic in C++ tensor class
#2330 opened Nov 1, 2025 by timmoon10 Loading…
7 of 13 tasks
[Common] Added an optimized gated rowwise MXFP8 SwiGLU kernel
#2328 opened Oct 31, 2025 by Oleg-Goncharov Loading…
5 of 13 tasks
[Common] Persistent MXFP8 kernel
#2323 opened Oct 30, 2025 by Oleg-Goncharov Draft
13 tasks
[JAX] Make test_layer.py tolerances stricter
#2306 opened Oct 27, 2025 by jberchtold-nvidia Loading…
8 of 13 tasks
Docs fix 2.10.0
#2301 opened Oct 24, 2025 by pggPL Loading…
8 of 12 tasks
Fix runtime lib loading logic
#2297 opened Oct 23, 2025 by ksivaman Loading…
8 of 13 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.