Pull requests: microsoft/DeepSpeed

Pull requests list

Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
Enabled Qwen2-MoE Tensor Parallelism (TP) inference
#6551 opened Sep 18, 2024 by gyou2021
Fix gradient accumulation for Z2+offload
#6550 opened Sep 18, 2024 by tohtana
Use msgpack for p2p comm
#6547 opened Sep 17, 2024 by tohtana
Fix expert grad scaling problem with ZeRO optimizer
#6546 opened Sep 17, 2024 by wyooyw
[XPU] Support DeepNVMe new code structure
#6532 opened Sep 13, 2024 by Liangliang-Ma
Set shuffle=True by default in data_sampler
#6531 opened Sep 13, 2024 by ranzhejiang
Fix device selection using CUDA_VISIBLE_DEVICES
#6530 opened Sep 12, 2024 by tohtana
add bfloat16 to inference support dtypes
#6528 opened Sep 12, 2024 by nelyahu
Fix dynamo issue
#6527 opened Sep 12, 2024 by oraluben
Handle when backend is also in compile_kwargs
#6502 opened Sep 7, 2024 by oraluben
[Accelerator] Cambricon MLU support
#6472 opened Sep 2, 2024 by Andy666G
Adding the new feature of FPDT
#6462 opened Aug 29, 2024 by YJHMITWEB
sequence parallel for uneven heads
#6392 opened Aug 21, 2024 by inkcherry
Add weights_only=True in torch.load
#6094 opened Aug 17, 2024 by terry-for-github
[NaN check] Add NaN check to support bfloat16.
#5879 opened Aug 8, 2024 by ys950902
Fix circular import in ds_transformer.py
#5804 opened Jul 28, 2024 by sznmelvin