Skip to content
View lzx1413's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report lzx1413

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

hpc

11 repositories

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,414 123 Updated Mar 3, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,818 6,070 Updated Mar 6, 2025

VideoSys: An easy and efficient system for video generation

Python 1,938 130 Updated Jan 1, 2025

Ongoing research training transformer models at scale

Python 11,658 2,612 Updated Mar 6, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,425 1,149 Updated Mar 6, 2025

NumPy & SciPy for GPU

Python 9,963 888 Updated Mar 5, 2025

Accelerating Diffusion Transformers with Token-wise Feature Caching

Python 86 1 Updated Mar 5, 2025

Minimalist ML framework for Rust

Rust 16,716 1,044 Updated Mar 3, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,145 773 Updated Mar 1, 2025

Tile primitives for speedy kernels

Cuda 2,105 121 Updated Mar 6, 2025