Skip to content
Change the repository type filter

All

    Repositories list

    • quack

      Public
      A Quirky Assortment of CuTe Kernels
      Python
      4655381Updated Sep 18, 2025Sep 18, 2025
    • Fast and memory-efficient exact attention
      Python
      2k20k83775Updated Sep 17, 2025Sep 17, 2025
    • Fast Hadamard transform in CUDA, with a PyTorch interface
      C
      3523373Updated Sep 4, 2025Sep 4, 2025
    • Causal depthwise conv1d in CUDA, with a PyTorch interface
      Cuda
      1285873010Updated Aug 29, 2025Aug 29, 2025
    • cutlass

      Public
      CUDA Templates for Linear Algebra Subroutines
      C++
      1.4k100Updated Jun 8, 2025Jun 8, 2025
    • Python
      212640Updated May 29, 2025May 29, 2025
    • Python
      12200Updated May 5, 2025May 5, 2025