Pull requests: microsoft/DeepSpeed

Pull requests list

Add Windows scripts (deepspeed, ds_report).
#5699 opened Jun 27, 2024 by costin-eseanu
sequence parallel with communication overlap
#5691 opened Jun 21, 2024 by inkcherry
Add and Remove ZeRO 3 Hooks
#5658 opened Jun 13, 2024 by jomayeri
Unpin transformers version
#5650 opened Jun 12, 2024 by loadams
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
fix: quantization with DeepSpeed HE
#5624 opened Jun 6, 2024 by Atry
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
[INF] Enable torch compile for inference
#5612 opened Jun 4, 2024 by oelayan7
Upgrade HPU image to v1.16.2.
#5610 opened Jun 4, 2024 by vshekhawat-hlab
Update profiler.py
#5584 opened May 29, 2024 by gameofdimension
reduce cpu host overhead when using moe
#5578 opened May 29, 2024 by ranzhejiang
Reuse KV cache of prefixes
#5572 opened May 27, 2024 by tohtana Draft
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9
Add chatglm2 & chatglm3 autotp
#5540 opened May 16, 2024 by Yejing-Lai
Fix deadlock in PipeEngine._exec_recv_grads
#5518 opened May 10, 2024 by i4never
inference: remove unused _validate_args function
#5505 opened May 8, 2024 by nelyahu