Skip to content

Commit 24f0957

Browse files
authored
Revert "[BE] Better compress flash attention binaries (#1744)" (#1751)
This reverts commit 8ba246f.
1 parent 34a3656 commit 24f0957

File tree

2 files changed

+0
-6
lines changed

2 files changed

+0
-6
lines changed

hopper/setup.py

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -524,9 +524,6 @@ def nvcc_threads_args():
524524
"-DCUTLASS_ENABLE_GDC_FOR_SM90", # For PDL
525525
"-DCUTLASS_DEBUG_TRACE_LEVEL=0", # Can toggle for debugging
526526
"-DNDEBUG", # Important, otherwise performance is severely impacted
527-
"-Xfatbin", # compress all binary sections
528-
"-compress-all",
529-
"-compress-mode=size", # compress with CUDA fatbin more aggressively
530527
]
531528
if get_platform() == "win_amd64":
532529
nvcc_flags.extend(

setup.py

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -206,9 +206,6 @@ def validate_and_update_archs(archs):
206206
"--expt-relaxed-constexpr",
207207
"--expt-extended-lambda",
208208
"--use_fast_math",
209-
"-Xfatbin",
210-
"-compress-all",
211-
"-compress-mode=size",
212209
# "--ptxas-options=-v",
213210
# "--ptxas-options=-O2",
214211
# "-lineinfo",

0 commit comments

Comments
 (0)