Pull requests: sgl-project/sglang
[10/n] decouple quantization impl from vllm dependency - fix import
format
Auto Format Code
run-ci
#13524
opened Nov 18, 2025 by
FlamingoPg
1 of 6 tasks
Fixing hooks test by adding more realistic hook module to import
#13523
opened Nov 18, 2025 by
Carlomus
fix double Unicode escape issue in streaming tool_calls parameters
#13518
opened Nov 18, 2025 by
lw9527
6 tasks
add speculative args checking
run-ci
speculative-decoding
#13517
opened Nov 18, 2025 by
QsingHuan
Shortest-extend-length scheduling policy
documentation
Improvements or additions to documentation
#13507
opened Nov 18, 2025 by
totktospit
3 of 5 tasks
modularize gsm8k and mmmu test classes
Multi-modal
multi-modal language model
run-ci
#13506
opened Nov 18, 2025 by
netanel-haber
2 tasks done
[Docs] Add doc for expert parallelism (EP)
documentation
Improvements or additions to documentation
#13504
opened Nov 18, 2025 by
YinglingWang
2 of 5 tasks
update model_test
deepseek
Multi-modal
multi-modal language model
#13502
opened Nov 18, 2025 by
hhhh1252023
5 tasks
debug testcase
Multi-modal
multi-modal language model
#13500
opened Nov 18, 2025 by
htj827
5 tasks
fix: check HF_HUB_OFFLINE before fetching remote hf_quant_config.json
#13499
opened Nov 18, 2025 by
lwabish
1 of 5 tasks
[HiCache] fix unit test with changed new APIs
run-ci
#13498
opened Nov 18, 2025 by
stmatengss
5 tasks
[Bug] Fixes accuracy issues caused by incorrect use of rope
#13495
opened Nov 18, 2025 by
Baidu-AIAK
5 tasks
[Tiny ci fix] Fix test isolation issues in test_model_hooks.py
run-ci
#13490
opened Nov 18, 2025 by
BBuf
5 tasks
fix: malformed KV events for NVIDIA Dynamo
#13488
opened Nov 18, 2025 by
PeaBrane
5 tasks done
[Performance] Replace preprocess_video logic from GLM and Qwen-VL multimodal processor with transformer impl for speed up (up to 27% faster) and addressing OOM (up to 50x improvements)
dependencies
Pull requests that update a dependency file
run-ci
#13487
opened Nov 18, 2025 by
byjiang1996
5 tasks done
Fix global scaling factor loading hang
quant
LLM Quantization
run-ci
#13484
opened Nov 18, 2025 by
wenscarl
5 tasks
purge unnecessary env variable set in deterministic test
run-ci
#13481
opened Nov 18, 2025 by
zminglei
5 tasks
[Piecewise CUDA Graph] Support Triton Block W8A8
piecewise-cuda-graph
run-ci
#13480
opened Nov 18, 2025 by
b8zhong
[Spec v2] delay seq_lens sync and apply new memory management
run-ci
#13478
opened Nov 18, 2025 by
hnyls2002