-
-
Notifications
You must be signed in to change notification settings - Fork 4.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] 500 Internal Server Error when tool_choice is incorrect.
frontend
#10567
opened Nov 22, 2024 by
shenoyvvarun
Loading…
[Hardware][Intel-Gaudi] Enable LoRA support for Intel Gaudi (HPU)
#10565
opened Nov 22, 2024 by
SanjuCSudhakaran
Loading…
[V1] EngineCore supports profiling
ready
ONLY add when PR is ready to merge/full CI is needed
#10564
opened Nov 22, 2024 by
Abatom
Loading…
[Model] Added GLM-4 series model support vllm==0.6.4
#10561
opened Nov 22, 2024 by
sixsixcoder
Loading…
[Benchmark] Benchmark structured output with datasets
#10557
opened Nov 22, 2024 by
xuechendi
Loading…
[Docs] Add dedicated tool calling page to docs
documentation
Improvements or additions to documentation
#10554
opened Nov 21, 2024 by
mgoin
Loading…
support bitsandbytes quantization with qwen model
#10549
opened Nov 21, 2024 by
zixuanzhang226
Loading…
[Misc] Enable vLLM to Dynamically Load LoRA from a Remote Server
frontend
#10546
opened Nov 21, 2024 by
angkywilliam
Loading…
Update default max_num_batch_tokens for chunked prefill to 2048
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#10544
opened Nov 21, 2024 by
mgoin
Loading…
[Bugfix][Hardware][CPU] Fix ONLY add when PR is ready to merge/full CI is needed
multi_modal_kwargs
broadcast for CPU tensor parallel
ready
#10541
opened Nov 21, 2024 by
Isotr0py
Loading…
For ppc64le, disabled tests for now and addressed space issues
ci/build
#10538
opened Nov 21, 2024 by
npanpaliya
Loading…
[Model]: Add support for Aria model
documentation
Improvements or additions to documentation
#10514
opened Nov 21, 2024 by
xffxff
Loading…
[core] overhaul memory profiling and fix backward compatibility
#10511
opened Nov 21, 2024 by
youkaichao
Loading…
[v1] Refactor KVCacheManager for more hash input than token ids
ready
ONLY add when PR is ready to merge/full CI is needed
#10507
opened Nov 21, 2024 by
rickyyx
Loading…
[Model] Add OLMo November 2024 model
documentation
Improvements or additions to documentation
#10503
opened Nov 20, 2024 by
2015aroras
Loading…
[Core] Implement disagg prefill by StatelessProcessGroup
ci/build
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#10502
opened Nov 20, 2024 by
KuntaiDu
Loading…
[CI/Build] Dockerfile build for ARM64 / GH200
ci/build
documentation
Improvements or additions to documentation
#10499
opened Nov 20, 2024 by
drikster80
Loading…
[Bugfix] GPU memory profiling should be per LLM instance
#10498
opened Nov 20, 2024 by
tjohnson31415
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2024-11-19.