perf(glm52): MLA decode arena + CUDA graph capture on top of #535 (−36%/layer) + kernel bench #533
Closed
n-WN wants to merge 8 commits into
Commits
Commits on Jul 3, 2026
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed