Skip to content

Commit b151b84

Browse files
nmacchionifacebook-github-bot
authored andcommitted
expand search space for hstu gemm
Summary: before we were using just a single config Reviewed By: xuzhao9 Differential Revision: D66213893 fbshipit-source-id: 6d5a149d496b24cb80e9e56a8cb1d78010da5e0b
1 parent aa67b62 commit b151b84

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

tritonbench/operators/gemm/operator.py

+6
Original file line numberDiff line numberDiff line change
@@ -40,6 +40,12 @@
4040
)
4141

4242
if IS_FBCODE:
43+
import hammer.oss.generative_recommenders.ops.triton.triton_addmm as hstu_triton_addmm
44+
45+
# without this set we can only pick a single config for AMD, Nvidia has 8
46+
# with this set AMD will pick from 256 different configs (not the actual full
47+
# tuning space, so some perf may be left on the table)
48+
hstu_triton_addmm.ENABLE_FULL_TURNING_SPACE = True
4349
from hammer.ops.triton.triton_matmul import (
4450
triton_matmul as hstu_triton_matmul_kernel,
4551
)

0 commit comments

Comments
 (0)