AI Engineering 101

Everything you need to get started with AI Engineering. In this repository, I'm documenting my AI Engineering learning notes.

Table of Contents

  • Vibe Coding
  • Research
    • Agent Infra
    • Multimodal Infra
  • Engineering

Research

Research advances in agent infra and multimodal infra

Agent Infra

Agent infra focuses on optimizing agent runtime performance rather than on building agents themselves. For more background, check out Why agent infrastructure matters and Agent Engineering: A New Discipline.

  1. Identifying the Risks of LM Agents with an LM-Emulated Sandbox. ICLR 2024.
  2. Autellix: An Efficient Serving Engine for LLM Agents as General Programs. arXiv 2025.

Multimodal Infra

If you want to learn more about multimodal models, check out Understanding Multimodal LLMs.

Training

  1. DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models. arXiv 2024.
  2. DISTMM: Accelerating Distributed Multimodal Model Training. NSDI 2024.
  3. PipeWeaver: Addressing Data Dynamicity in Large Multimodal Model Training with Dynamic Interleaved Pipeline. arXiv 2025.

Serving

  1. Approximate Caching for Efficiently Serving Text-to-Image Diffusion Models. NSDI 2024.
  2. Katz: Efficient Workflow Serving for Diffusion Models with Many Adapters. ATC 2025.
  3. Understanding Diffusion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency. SoCC 2025.

GPU Kernels

  1. Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency. arXiv 2025.
    • FlashAttention-2 JVP kernel for training
  2. SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration. ICLR 2025.
    • 8-bit attention kernel for inference
  3. SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention. arXiv 2025.
    • Sparse attention kernel for inference
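The actual SageAttention kernels involve refinements such as smoothing K and per-block scales, but the core idea in the list above — computing attention scores with 8-bit integer matmuls and dequantizing afterwards — can be illustrated with a rough NumPy sketch (the quantization scheme below is a generic symmetric per-tensor one, chosen for simplicity, not the paper's exact method):

```python
import numpy as np

def quantize_int8(x):
    # Symmetric per-tensor quantization: scale maps max|x| to 127.
    scale = np.max(np.abs(x)) / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

rng = np.random.default_rng(0)
Q = rng.standard_normal((4, 8)).astype(np.float32)
K = rng.standard_normal((4, 8)).astype(np.float32)

q_q, s_q = quantize_int8(Q)
k_q, s_k = quantize_int8(K)

# Integer matmul with int32 accumulation, then dequantize the scores
# by the product of the two scales.
scores_int32 = q_q.astype(np.int32) @ k_q.astype(np.int32).T
scores_approx = scores_int32 * (s_q * s_k)

scores_exact = Q @ K.T
err = np.max(np.abs(scores_approx - scores_exact))
print(f"max abs score error: {err:.4f}")
```

The int8 matmul is where the speedup comes from on real hardware: int8 tensor-core throughput is a multiple of fp16 throughput, and the dequantization is a cheap elementwise scale.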

Engineering

Engineering best practices for building AI systems

Kernel Best Practices

  1. OpenAI Triton Best Practices
    • A hands-on tutorial on best practices for writing efficient GPU kernels using Triton.

PyTorch Best Practices

  1. Optimize Training Performance in PyTorch
    • Model FLOPs Utilization (MFU), PyTorch Profiler & Nsight Systems for performance monitoring, Triton and Nsight Compute for kernel optimization.
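As a back-of-the-envelope illustration of MFU — not taken from the tutorial, and using hypothetical model/hardware numbers — one can estimate training FLOPs with the common ~6·N·T approximation (N parameters, T tokens, forward plus backward) and divide the achieved rate by the hardware's peak:

```python
def model_flops_utilization(num_params, tokens_per_iter, iter_time_s, peak_flops):
    """Estimate Model FLOPs Utilization (MFU) for transformer training.

    Uses the common ~6 * N * T approximation for forward+backward
    FLOPs per training iteration (N = parameters, T = tokens).
    """
    achieved_flops_per_s = 6 * num_params * tokens_per_iter / iter_time_s
    return achieved_flops_per_s / peak_flops

# Hypothetical example: a 7B-parameter model processing 4096 tokens
# per 1.1 s iteration on a GPU with 312 TFLOP/s peak (bf16).
mfu = model_flops_utilization(7e9, 4096, 1.1, 312e12)
print(f"MFU: {mfu:.1%}")  # ~50%
```

MFU is useful precisely because it is hardware-normalized: a profiler tells you where time goes, while MFU tells you how far you are from the roofline.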

Multimodal Best Practices

  1. Disaggregated Hybrid Parallelism with Ray
    • A framework for training vision-language models using disaggregated hybrid parallelism, where each model component adopts its optimal parallelization strategy independently.
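The cited framework builds on Ray; purely as an illustration of what "each component adopts its own parallelization strategy" means (the names and GPU counts below are hypothetical, not the framework's API), a minimal sketch:

```python
from dataclasses import dataclass

@dataclass
class ParallelPlan:
    """Per-component parallelization strategy (illustrative only)."""
    tensor_parallel: int
    data_parallel: int
    pipeline_parallel: int = 1

    @property
    def world_size(self):
        return self.tensor_parallel * self.data_parallel * self.pipeline_parallel

# A small vision encoder replicates cheaply via data parallelism,
# while the large LLM backbone shards with tensor parallelism —
# each component independently, rather than one global strategy.
plans = {
    "vision_encoder": ParallelPlan(tensor_parallel=1, data_parallel=8),
    "projector":      ParallelPlan(tensor_parallel=1, data_parallel=8),
    "llm_backbone":   ParallelPlan(tensor_parallel=4, data_parallel=2),
}

for name, plan in plans.items():
    print(f"{name}: TP={plan.tensor_parallel} DP={plan.data_parallel} "
          f"(GPUs={plan.world_size})")
```

The disaggregated part is exactly this decoupling: since a monolithic strategy must compromise between heterogeneous components, giving each its best-fit plan (and exchanging activations between them) can raise overall utilization.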

Ray Best Practices

  1. Use Ray for Distributed Machine Learning Apps
    • Scaling up machine learning applications using Ray.
