Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add H100 workflow
#3754 opened Jun 14, 2023 by loadams Draft
[squash] styoun/triton fp16 transformer (#530)
#3748 opened Jun 14, 2023 by stephen-youn Loading…
Adding assertion for mp_group in HE.
#3740 opened Jun 12, 2023 by jomayeri Loading…
Zero3 Fix allreduce optimization for extra large tensor
#3739 opened Jun 12, 2023 by hablb Loading…
change model_parameter hint according to document
#3726 opened Jun 9, 2023 by techkang Loading…
Clean opt
#3703 opened Jun 7, 2023 by awan-10 Draft
Asymmetric quant algorithm update
#3696 opened Jun 6, 2023 by cmikeh2 Loading…
Add FALCON-40B Inference-Kernel Support
#3656 opened Jun 1, 2023 by RezaYazdaniAminabadi Loading…
1 task
Add FALCON Auto-TP Support
#3640 opened May 30, 2023 by RezaYazdaniAminabadi Loading…
[CPU] Skip CPU support unimplemented error
#3633 opened May 30, 2023 by Yejing-Lai Loading…
enable pipeline checkpoint loading mode
#3629 opened May 30, 2023 by leiwen83 Loading…
Re-enable GPT-J unit tests
#3618 opened May 26, 2023 by mrwyattii Loading…
3 tasks
fix opt-350m shard loading issue in AutoTP
#3600 opened May 24, 2023 by sywangyi Loading…
Support model declaration in zero.Init context
#3592 opened May 22, 2023 by tohtana Loading…
Test cuda 11.7
#3586 opened May 22, 2023 by RezaYazdaniAminabadi Draft
[profiling]add show_straggler argument to log_summary()
#3579 opened May 19, 2023 by delock Loading…
ProTip! Exclude everything labeled bug with -label:bug.