Pull requests: vllm-project/vllm

Pull requests list

[Bugfix] Fix Baichuan BNB online quantization
#10572 opened Nov 22, 2024 by CNTRYROA
[V1] EngineCore supports profiling (labels: ready)
#10564 opened Nov 22, 2024 by Abatom
[torch.compile] support all attention backends
#10558 opened Nov 22, 2024 by youkaichao
[Docs] Add dedicated tool calling page to docs (labels: documentation)
#10554 opened Nov 21, 2024 by mgoin
Update default max_num_batch_tokens for chunked prefill to 2048 (labels: needs-rebase, ready)
#10544 opened Nov 21, 2024 by mgoin
[Bugfix][Hardware][CPU] Fix multi_modal_kwargs broadcast for CPU tensor parallel (labels: ready)
#10541 opened Nov 21, 2024 by Isotr0py
Add Sageattention backend
#10532 opened Nov 21, 2024 by flozi00
[Model]: Add support for Aria model (labels: documentation)
#10514 opened Nov 21, 2024 by xffxff
[v1] Refactor KVCacheManager for more hash input than token ids (labels: ready)
#10507 opened Nov 21, 2024 by rickyyx
[Model] Add OLMo November 2024 model (labels: documentation)
#10503 opened Nov 20, 2024 by 2015aroras
[Core] Implement disagg prefill by StatelessProcessGroup (labels: ci/build, needs-rebase, ready)
#10502 opened Nov 20, 2024 by KuntaiDu
Support softcap in ROCm Flash Attention
#10500 opened Nov 20, 2024 by hliuca
[CI/Build] Dockerfile build for ARM64 / GH200 (labels: ci/build, documentation)
#10499 opened Nov 20, 2024 by drikster80