Pull requests: deepspeedai/DeepSpeed
Open pull requests, newest first:
#7847 · [BUG] Fix gradient norm calculation and dynamic shape blocking in PP+ZeRO1 collective communication · opened Feb 12, 2026 by Thinksky5124
#7846 · Fix ROCm BF16 conversion intrinsics in inference v2 (#7843) · opened Feb 12, 2026 by tohtana
#7841 · Fix no-grad grad-fn lookup in ZeRO hook counting on PyTorch 2.3 (#7830) · opened Feb 10, 2026 by tohtana
#7832 · Throw error when parameter is modified in GatheredParameters (see the usage sketch after this list) · opened Feb 5, 2026 by tohtana
#7808 · fix: Ensure full gradient reduction for Muon with reduce_scatter · opened Jan 23, 2026 by nathon-lee
#7792 · Fix bf16 dtype mismatch in ZeRO-3 with zero_quantized_weights · opened Jan 18, 2026 by juyterman1000
#7776 · Fix Muon optimizer conflict with gradient clipping in ZeRO 1/2 · opened Jan 12, 2026 by fy817
#7771 · Fix: ZenFlow Adam integration for updated PyTorch backward flow (#7759) · opened Jan 11, 2026 by Antlera
#7764 · Introduce all_reduce_hook to support gradient aggregation across replica groups · opened Jan 7, 2026 by zhengchenyu
#7750 · feat: add parameter-level precision control for BF16 training · opened Dec 30, 2025 by nathon-lee
#7748 · Fix Muon optimizer checkpoint resume with bf16 mode · opened Dec 28, 2025 by yurekami · 2 tasks done
#7726 · Introduce Megatron-style parallel state management · opened Dec 15, 2025 by eternalNight · 4 of 5 tasks done
#7723 · Let allgather and alltoall execute in parallel when both attention and MoE use TP · opened Dec 11, 2025 by taozhiwei
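As background for #7832: deepspeed.zero.GatheredParameters is DeepSpeed's context manager for temporarily all-gathering ZeRO stage-3 partitioned parameters so they can be read or modified in place. Below is a minimal sketch of the canonical usage pattern, assuming torch.distributed is already initialized and using `model` as a placeholder for a module managed by a ZeRO-3 engine; per its title, the PR would turn contract-violating modifications into an explicit error rather than letting them be silently discarded.

```python
import torch
import deepspeed

# Assumptions: torch.distributed is initialized, and `model` is a placeholder
# for a module wrapped by a ZeRO stage-3 engine (deepspeed.initialize elided).

# On entry, GatheredParameters all-gathers the listed partitioned parameter(s)
# so every rank sees the full tensor. On exit, the parameter is re-partitioned;
# only edits made on the declared `modifier_rank` are broadcast and kept.
with deepspeed.zero.GatheredParameters(model.linear.weight, modifier_rank=0):
    if torch.distributed.get_rank() == 0:
        # This in-place edit survives the context because rank 0 is the
        # declared modifier_rank; edits made without a modifier_rank are
        # dropped, the misuse PR #7832 proposes to surface as an error.
        model.linear.weight.zero_()
```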