Skip to content

Pull requests: ROCm/triton

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

tune_gemm: use correct output option for rocprofv3
#887 opened Oct 3, 2025 by matthiasdiener Loading…
5 of 7 tasks
Pytorch/rocm7.1 internal testing hstu drop 1
#881 opened Sep 23, 2025 by scxiao Loading…
5 of 7 tasks
Add bf16 gemm pingpong with num_stages=3
#818 opened May 30, 2025 by jungpark-mlir Loading…
Bypass LDS for scale B operand for skinny gemms
#817 opened May 29, 2025 by plognjen Loading…
[DRAFT] Shared/aggregate load
#804 opened May 21, 2025 by alefimov-amd Draft
[AMD] Improve Scheduling for Async BF16 GEMM
#802 opened May 21, 2025 by raikonenfnu Loading…
7 tasks
add predicate mask for atomic_rmw ops
#799 opened May 19, 2025 by scxiao Loading…
4 of 7 tasks
Shaoclee/compare ck
#788 opened May 2, 2025 by k50112113 Loading…
5 of 7 tasks
[WIP] [StreamK]
#782 opened Apr 28, 2025 by zhanglx13 Draft
[AMD] Added bufferOps refinement
#776 opened Apr 14, 2025 by ravil-mobile Loading…
update scale dot assertion in plot_layout.py
#774 opened Apr 10, 2025 by jtang10 Loading…
Tjactions security issue
#773 opened Apr 3, 2025 by Cemberk Loading…
Update FlashAttention transV scripts
#766 opened Mar 21, 2025 by binarman Loading…
Add v2 test to paged_attention_decode
#764 opened Mar 20, 2025 by rahulbatra85 Loading…
MLA prefill, forward_normal benchmark
#750 opened Mar 7, 2025 by Chi-Chu319 Loading…
Cap warp count to 16 for devices with warp size 64
#747 opened Mar 5, 2025 by schung-amd Draft
4 of 7 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.