[Bug]: [V1] New v1 engine does not support n>1? #12584

m-harmonic · 2025-01-30T18:24:17Z

vLLM version 0.7.0


When using the V1 engine, LLM.generate() returns only 1 CompletionOutput even when SamplingParams sets n>1.

Is this expected to work, or is n>1 not yet supported for V1? If it is not yet supported, are there plans to support it?
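A minimal reproduction sketch of the behavior described above, not taken from the original report. It assumes the V1 engine is selected with the VLLM_USE_V1=1 environment variable; the model name, prompt, and sampling values are placeholders, and the helper names completions_per_prompt and run_repro are hypothetical.

```python
import os


def completions_per_prompt(request_outputs):
    """Return how many completions each RequestOutput-like object carries."""
    return [len(request_output.outputs) for request_output in request_outputs]


def run_repro(model="facebook/opt-125m", n=4):
    """Generate n samples for one prompt and count the completions returned.

    The model name and sampling values are placeholders, not from the report.
    """
    # Assumption: V1 is opted into via this environment variable, which
    # has to be set before vllm is imported.
    os.environ["VLLM_USE_V1"] = "1"
    # Imported lazily so the helper above works without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model)
    params = SamplingParams(n=n, temperature=0.8, max_tokens=16)
    outputs = llm.generate(["Hello, my name is"], params)
    return completions_per_prompt(outputs)
```

Calling run_repro() should return a single count for the single prompt. A count equal to n would mean n>1 is honored; a count of 1 would match the behavior reported here.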


robertgshaw2-redhat · 2025-01-30T18:46:13Z

Thanks, we are aware and working on it.

m-harmonic added the bug label Jan 30, 2025

m-harmonic mentioned this issue Jan 30, 2025

[V1] Feedback Thread #12568

Open
