Skip to content

Commit d738303

Browse files
committed
Update Cutlass to 4.0
1 parent d417a5b commit d738303

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

.github/workflows/publish.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -144,7 +144,7 @@ jobs:
144144
export LD_LIBRARY_PATH=/usr/local/nvidia/lib64:/usr/local/cuda/lib64:$LD_LIBRARY_PATH
145145
# Limit MAX_JOBS otherwise the github runner goes OOM
146146
# nvcc 11.8 can compile with 2 jobs, but nvcc 12.3 goes OOM
147-
MAX_JOBS=$([ "$MATRIX_CUDA_VERSION" == "128" ] && echo 1 || echo 2) FLASH_ATTENTION_FORCE_BUILD="TRUE" FLASH_ATTENTION_FORCE_CXX11_ABI=${{ matrix.cxx11_abi}} python setup.py bdist_wheel --dist-dir=dist
147+
MAX_JOBS=$([ "$MATRIX_CUDA_VERSION" == "129" ] && echo 1 || echo 2) FLASH_ATTENTION_FORCE_BUILD="TRUE" FLASH_ATTENTION_FORCE_CXX11_ABI=${{ matrix.cxx11_abi}} python setup.py bdist_wheel --dist-dir=dist
148148
tmpname=cu${WHEEL_CUDA_VERSION}torch${MATRIX_TORCH_VERSION}cxx11abi${{ matrix.cxx11_abi }}
149149
wheel_name=$(ls dist/*whl | xargs -n 1 basename | sed "s/-/+$tmpname-/2")
150150
ls dist/*whl |xargs -I {} mv {} dist/${wheel_name}

csrc/cutlass

Submodule cutlass updated 687 files

0 commit comments

Comments
 (0)