
[Usage]: Does DeepSeek-R1 1.58-bit Dynamic Quant work on VLLM? #12573

Open
shimmyshimmer opened this issue Jan 30, 2025 · 4 comments
Labels
usage How to use vllm

Comments

@shimmyshimmer

Your current environment

Hey guys! Recently in our blog post we wrote that vLLM supports GGUFs; however, we've been getting many reports that the R1 GGUFs don't actually work in vLLM at the moment and that people are hitting errors.

I'm guessing it's not supported at the moment? Thank you! :)

Blog: https://unsloth.ai/blog/deepseekr1-dynamic
Model: https://huggingface.co/unsloth/DeepSeek-R1-GGUF

How would you like to use vllm

Run the DeepSeek-R1 1.58-bit dynamic quant.
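For context, here is a minimal sketch (not part of the original report) of fetching the 1.58-bit dynamic-quant shards from the Hugging Face repo linked above; the `*UD-IQ1_S*` pattern is an assumption about the repo's file naming and should be checked against the repo listing:

```python
# Hedged sketch: download only the 1.58-bit dynamic quant shards from the
# unsloth/DeepSeek-R1-GGUF repo. The "*UD-IQ1_S*" pattern is an assumption
# about the shard naming; verify it against the repo file listing.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="unsloth/DeepSeek-R1-GGUF",
    local_dir="DeepSeek-R1-GGUF",
    allow_patterns=["*UD-IQ1_S*"],  # assumed pattern for the 1.58-bit quant files
)
```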

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
shimmyshimmer added the usage (How to use vllm) label on Jan 30, 2025
@robertgshaw2-redhat
Collaborator

robertgshaw2-redhat commented Jan 30, 2025

I'm not sure of the state. Can you try it?

@Isotr0py
Collaborator

Isotr0py commented Jan 31, 2025

I'm afraid not. GGUF support in vLLM depends on the GGUF interoperability in transformers (we rely on it to extract the hf_config from the GGUF file), and DeepSeek's GGUF interoperability isn't supported in transformers yet: https://huggingface.co/docs/transformers/v4.48.2/en/gguf#supported-model-architectures
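To illustrate the dependency described above, here is a minimal sketch (not from the original comment) of extracting an hf_config from a GGUF file through transformers; the shard file name is hypothetical, and at the time of this thread the call fails for DeepSeek GGUFs because transformers has no mapping for the deepseek2 architecture:

```python
# Hedged sketch of the transformers GGUF interoperability that vLLM relies on.
# The gguf_file name below is hypothetical; for DeepSeek GGUFs this currently
# fails because transformers does not map the deepseek2 architecture yet.
from transformers import AutoConfig

config = AutoConfig.from_pretrained(
    "unsloth/DeepSeek-R1-GGUF",                            # repo hosting the GGUF
    gguf_file="DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf",  # hypothetical shard name
)
print(config)
```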

@shimmyshimmer
Author

> I'm not sure of the state. Can you try it?

Oh yes, we tested it for a few hours; unfortunately it doesn't work.

> I'm afraid not. GGUF support in vLLM depends on the GGUF interoperability in transformers (we rely on it to extract the hf_config from the GGUF file), and DeepSeek's GGUF interoperability isn't supported in transformers yet: https://huggingface.co/docs/transformers/v4.48.2/en/gguf#supported-model-architectures

Alright, thanks so much for letting me know. Once it's supported we'll let others know as well. :)

@dannydabbles

dannydabbles commented Jan 31, 2025

Steps to reproduce the vLLM issue I'm seeing with the standard vllm/vllm-openai:latest Docker image produce the following error:

ValueError: GGUF model with architecture deepseek2 is not supported yet.
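For reference, a minimal sketch (assumed, not the exact reproduction from this comment) of how the same error surfaces through vLLM's offline Python API; the local GGUF path is hypothetical:

```python
# Hedged sketch: attempting to load a DeepSeek-R1 GGUF with vLLM's offline API.
# The file path is hypothetical; the load fails while vLLM/transformers try to
# build an hf_config from the GGUF metadata for the deepseek2 architecture.
from vllm import LLM

llm = LLM(model="/models/DeepSeek-R1-UD-IQ1_S.gguf")
# ValueError: GGUF model with architecture deepseek2 is not supported yet.
```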
