-
Notifications
You must be signed in to change notification settings - Fork 55
Pull requests: NVIDIA/Fuser
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Don't reuse segmented kernel runtimes if we can avoid segmentation
#2163
opened Apr 30, 2024 by
jacobhinkle
•
Draft
allow reduction/normalization schedulers to handle reshape with only split transforms
#2437
opened Jun 18, 2024 by
liqiangxl
Loading…
Perform cancellation in SimplifyingIrBuilder::addExpr
#2020
opened Apr 2, 2024 by
jacobhinkle
Loading…
Set __launch_bounds__ in kernel whenever we are able
#3794
opened Jan 29, 2025 by
jacobhinkle
Loading…
Allow FusionExecutorCache to take preallocated outputs
#2247
opened May 15, 2024 by
samnordmann
Loading…
[wgmma] Insert commit_group and wait_group after mma_async
Matmuls
#3573
opened Dec 11, 2024 by
jacobhinkle
•
Draft
ExpressionEvaluator validates allocation domain for Set.Permute.
allocation domain
issues related to allocation domain support
enhancement
New feature or request
[WIP] Accepting allocated outputs in RunFusionWithInputs, for simple and segmented fusions
#216
opened Apr 24, 2023 by
mmigdal-nv
•
Draft
Add nvfuser benchmark executor and unify test_matmul.py
#4021
opened Mar 6, 2025 by
jacobhinkle
•
Draft
Insert block sync if it doesn't exist in WarAsyncWaitInserter
#4151
opened Mar 26, 2025 by
jacobhinkle
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.