Skip to content

xytpai/kfunca

Repository files navigation

kfunca is a minimalist, high-performance GPU-based automatic differentiation framework. The operator scope is focused solely on multimodal transformers. Here are the supported features:

1. Basic infrastructure

  • GPU Launcher
  • Caching Allocator
  • Tensor Implementation
  • Tensor Iterator

2. GPU Operator

Basic operator:

  • from_numpy/to_numpy
  • add/sub/mul/div
  • permute/contiguous/copy
  • sum/mean
  • sort/topk
  • slice/view
  • concat/split

Neural network operator:

  • rms_norm
  • causal attention
  • embedding
  • matmul
  • qkv_linear

3. Floating Point Support

  • fp32/64
  • float16
  • bfloat16

Welcome to reach out for collaboration: [email protected]

About

KFunca: A minimalist, high-performance GPU-based automatic differentiation framework

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published