From f87fb563f451cd0d869775009667f59ea610e593 Mon Sep 17 00:00:00 2001 From: Driss Guessous <32754868+drisspg@users.noreply.github.com> Date: Wed, 20 Nov 2024 08:28:02 -0800 Subject: [PATCH] Update README.md (#1319) --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 158cfb7562..5385f486a7 100644 --- a/README.md +++ b/README.md @@ -177,7 +177,7 @@ We're also fortunate to be integrated into some of the leading open-source libra 2. Hugging Face diffusers best practices with torch.compile and torchao in a standalone repo [diffusers-torchao](https://github.com/sayakpaul/diffusers-torchao) 3. Mobius HQQ backend leveraged our int4 kernels to get [195 tok/s on a 4090](https://github.com/mobiusml/hqq#faster-inference) 4. [TorchTune](https://github.com/pytorch/torchtune) for our QLoRA and QAT recipes -5. [torchchat](https://github.com/pytorch/torchtune) for post training quantization +5. [torchchat](https://github.com/pytorch/torchchat) for post training quantization 6. [SGLang](https://github.com/sgl-project/sglang/pull/1341) for LLM inference quantization ## Videos