Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

for test
#2724 opened Jan 3, 2025 by zhyncs Draft
3 tasks
fix end check
#2723 opened Jan 3, 2025 by jjjjohnson Loading…
3 tasks
test: add test_block_fp8 in CI
#2715 opened Jan 2, 2025 by zhyncs Draft
3 tasks
chore: bump v0.4.1.post4
#2713 opened Jan 2, 2025 by zhyncs Loading…
3 tasks
WIP: Feature/function calling update
#2700 opened Jan 2, 2025 by YAMY1234 Loading…
[Feature] Support regex as a stopping condition
#2699 opened Jan 2, 2025 by MickQian Loading…
3 tasks done
Hierarchical Caching for SGLang enhancement New feature or request
#2693 opened Jan 1, 2025 by xiezhq-hermann Loading…
3 tasks
Support twoshot kernel
#2688 opened Dec 31, 2024 by yizhang2077 Loading…
3 tasks
Support InternVL2 Series
#2629 opened Dec 28, 2024 by amosyou Draft
3 of 7 tasks
Refactor Scheduler to improve code organization
#2593 opened Dec 26, 2024 by libratiger Loading…
3 tasks done
[Docs] add quantization docs dependencies Pull requests that update a dependency file
#2572 opened Dec 25, 2024 by JamesSand Loading…
3 tasks done
Refactor SchedulePolicy to improve code organization
#2571 opened Dec 25, 2024 by libratiger Loading…
3 tasks done
Enable Nvidia's ModelOpt fp8 quantized models high priority quant LLM Quantization
#2535 opened Dec 21, 2024 by Edwardf0t1 Loading…
1 of 3 tasks
[Cache Offload] Remove device sync overhead
#2533 opened Dec 20, 2024 by Edenzzzz Loading…
3 tasks
adapt custom allreduce for tensorrt llm high priority
#2511 opened Dec 18, 2024 by yizhang2077 Loading…
3 tasks
Add InfiniteBench for long context benchmarking high priority
#2421 opened Dec 9, 2024 by iankur Loading…
2 of 3 tasks
ProTip! Adding no:label will show everything without a label.