This repository provides various examples for Colossal-AI. For each feature of
Colossal-AI, you can find a simple example in the feature
folder and a corresponding tutorial in feature section of the documentation. For more complex examples for domain-specific models, you can find them in this repository as well. Some of them are covered in the advanced tutorials
of the documentation.
Colossal-AI | Paper | Documentation | Forum | Blog
pip install -r requirements.txt
This repository contains examples of training models with ColossalAI. These examples fall under three categories:
-
Computer Vision
- ResNet
- SimCLR
- Vision Transformer
- Data Parallel
- Pipeline Parallel
- Hybrid Parallel
- WideNet
- Mixture of experts
-
Natural Language Processing
- BERT
- Sequence Parallel
- GPT-2
- Hybrid Parallel
- GPT-3
- Hybrid Parallel
- BERT
-
Features
- Mixed Precision Training
- Gradient Accumulation
- Gradient Clipping
- Tensor Parallel
- Pipeline Parallel
- ZeRO
The image
and language
folders are for complex model applications. The features
folder is for demonstration of Colossal-AI. The features
folder aims to be simple so that users can execute in minutes. Each example in the features
folder relates to a tutorial in the Official Documentation.
If you wish to make contribution to this repository, please read the Contributing
section below.
Discussion about the Colossal-AI project and examples is always welcomed! We would love to exchange ideas with the community to better help this project grow. If you think there is a need to discuss anything, you may jump to our dicussion forum and create a topic there.
If you encounter any problem while running these examples, you may want to raise an issue in this repository.
This project welcomes constructive ideas and implementations from the community.
If you find that an example is broken (not working) or not user-friendly, you may put up a pull request to this repository and update this example.
If you wish to add an example for a specific application, please follow the steps below.
- create a folder in the
image
,language
orfeatures
folders. Generally we do not accept new examples forfeatures
as one example is often enough. We encourage contribution with hybrid parallel or models of different domains (e.g. GAN, self-supervised, detection, video understadning, text classification, text generation) - Prepare configuration files and
train.py
- Prepare a detailed readme on envirionment setup, dataset preparation, code execution, etc. in your example folder
- Update the table of content (first section above) in this readme file
If your PR is accepted, we may invite you to put up a tutorial or blog in ColossalAI Documentation.