Everything Quantization
Botchkarev edited this page Oct 17, 2023 · 6 revisions
FINN-hls4ml Guidelines for Quantization-Aware Training
Medium Blog about GPT-2 Optimization
[Repo transfomers-silicon-research, with lots of links](https://github.com/alimpk/transfomers-silicon-research)
PyTorch Quantization API Reference
PyTorch Quantized Inference on GPU
Deploying Int8 QAT Models on GPU with TensorRT
quantization-and-training-of-neural-networks
fasttextzip-compressing-text-classification