🎯 Focusing
NVIDIA, Senior Performance Architect, Full-stack LLM Training Optimization.
- NVIDIA
- Hangzhou, Zhejiang
- https://fanshiqing.github.io/
Pinned
- Megatron-LM (Public, forked from NVIDIA/Megatron-LM)
  Ongoing research training transformer language models at scale, including: BERT & GPT-2
  Python
- DAPPLE (Public, forked from AlibabaPAI/DAPPLE)
  An Efficient Pipelined Data Parallel Approach for Large Model Training
  Python
- grouped_gemm (Public, forked from tgale96/grouped_gemm)
  PyTorch bindings for CUTLASS grouped GEMM.
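For context on the last entry: a grouped GEMM performs many independent matrix multiplications with differing per-group row counts in a single call, which is why it shows up in MoE-style LLM training where each expert processes a variable number of tokens. Below is a minimal NumPy sketch of those semantics only; the function name and argument layout are illustrative assumptions, not the API of the grouped_gemm repository or of CUTLASS.

```python
import numpy as np

def grouped_gemm_reference(a, b, batch_sizes):
    """Reference semantics of a grouped GEMM (illustrative, not the repo's API).

    a: (sum(batch_sizes), K) rows for all groups, packed contiguously
    b: (num_groups, K, N) one weight matrix per group
    batch_sizes: number of rows of `a` belonging to each group
    """
    outputs, start = [], 0
    for i, m in enumerate(batch_sizes):
        # Each group runs its own (m, K) @ (K, N) -> (m, N) GEMM.
        outputs.append(a[start:start + m] @ b[i])
        start += m
    # Results are re-packed in the same row order as the input.
    return np.concatenate(outputs, axis=0)
```

A fused kernel (as in CUTLASS) launches all of these per-group GEMMs at once instead of looping on the host, avoiding one kernel launch per expert.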