-
Notifications
You must be signed in to change notification settings - Fork 13
Pull requests: hw-native-sys/pypto-serving
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(platform): add configuration types, module layer, and host control plane
#53
opened Jul 2, 2026 by
lterrac
Collaborator
Loading…
3 of 4 tasks
Qwen3-14B serving: dynamic KV sizing, vLLM-style admission, chunked p…
#45
opened Jun 26, 2026 by
sunghajung6688
Loading…
Add TurboQuant KV cache compression to serving pipeline
#33
opened Jun 11, 2026 by
sunghajung6688
Loading…
Rewrite non-L3 Qwen3 kernels through L3 worker
#22
opened Jun 3, 2026 by
ndleslx
Collaborator
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.