Pull requests: vllm-project/vllm-ascend
Pull requests list
[V1] Add v0 style schedule into v1 engine.
module:core
#512
opened Apr 13, 2025 by
whx-sjtu
support AscendW8A8 quantization
module:quantization
#511
opened Apr 12, 2025 by
dingdingchaomian
[Build] Update doc
documentation
Improvements or additions to documentation
#509
opened Apr 12, 2025 by
wangxiyuan
[Doc] Update FAQ doc
documentation
#504
opened Apr 11, 2025 by
shen-shanshan
[SpecDecode] Add spec decode support
module:tests
#500
opened Apr 10, 2025 by
MengqingCao
1 task
[4/N][CI/UT] Add Qwen2.5VL-3B and Qwen2.5-7B test
module:tests
#499
opened Apr 10, 2025 by
Potabk
[Misc]Use the es command line tool to process benchmark results
#497
opened Apr 10, 2025 by
Potabk
[Doc] Add branch info for v0.8.x
documentation
#493
opened Apr 9, 2025 by
Yikun
[SpecDecode][MiniCPM] pick certain feature to main
module:core
#484
opened Apr 8, 2025 by
MengqingCao
Draft
2 tasks
[CI]Add model basic accuracy test(Qwen2-1.5B-Instruct)
module:tests
#460
opened Apr 2, 2025 by
hfadzxy
port deepseekv2 and mtp to main branch
module:core
module:ops
module:quantization
#429
opened Mar 29, 2025 by
ganyi1996ppo
[CI]Add model basic correctness test(Qwen2.5_7B)
module:tests
#387
opened Mar 25, 2025 by
Potabk
Multi step main
documentation
module:tests
#350
opened Mar 18, 2025 by
new-TonyWang