microsoft / DeepSpeed Public

Pull requests: microsoft/DeepSpeed

Labels 32 Milestones 0

130 Open 2,813 Closed

Pull requests list

Improve consistency of zero_grad

#6554 opened Sep 18, 2024 by tohtana • Draft

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

#6553 opened Sep 18, 2024 by gyou2021

Enabled Qwen2-MoE Tensor Parallelism (TP) inference

#6551 opened Sep 18, 2024 by gyou2021

Fix gradient accumulation for Z2+offload

#6550 opened Sep 18, 2024 by tohtana

Use msgpack for p2p comm

#6547 opened Sep 17, 2024 by tohtana

Fix expert grad scaling problem with ZeRO optimizer

#6546 opened Sep 17, 2024 by wyooyw

reduce setting global variables to reduce torch compile graph breaks

#6541 opened Sep 15, 2024 by NirSonnenschein

[XPU] Support DeepNVMe new code structure

#6532 opened Sep 13, 2024 by Liangliang-Ma

Set shuffle=True by default in data_sampler

#6531 opened Sep 13, 2024 by ranzhejiang

Fix device selection using CUDA_VISIBLE_DEVICES

#6530 opened Sep 12, 2024 by tohtana

add bfloat16 to inference support dtypes

#6528 opened Sep 12, 2024 by nelyahu

Fix dynamo issue

#6527 opened Sep 12, 2024 by oraluben

Handle when backend is also in compile_kwargs

#6502 opened Sep 7, 2024 by oraluben

add option to disable logger while compiling to avoid graph breaks

#6496 opened Sep 5, 2024 by ShellyNR

Change compile for pipeline module torch.compile

#6478 opened Sep 2, 2024 by NirSonnenschein

[Accelerator] Cambricon MLU support

#6472 opened Sep 2, 2024 by Andy666G

Adding the new feature of FPDT

#6462 opened Aug 29, 2024 by YJHMITWEB

sequence parallel for uneven heads

#6392 opened Aug 21, 2024 by inkcherry

Unpin tests that previously used a pinned version of transformers

#6387 opened Aug 20, 2024 by loadams

Add weights_only=True in torch.load

#6094 opened Aug 17, 2024 by terry-for-github

Add APIs to offload states of model, optimizer, and engine

#6011 opened Aug 16, 2024 by tohtana

[NaN check] Add NaN check to support bfloat16.

#5879 opened Aug 8, 2024 by ys950902

Fix circular import in ds_transformer.py

#5804 opened Jul 28, 2024 by sznmelvin

Add DataStates-LLM: Asynchronous Checkpointing Engine Support

#5763 opened Jul 10, 2024 by mauryaavinash95 • Draft

Switch what versions of python are supported

#5676 opened Jun 17, 2024 by loadams • Draft
