Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix for Stable Diffusion
#3218 opened Apr 13, 2023 by mrwyattii Loading…
[update] reference in cifar-10
#3212 opened Apr 13, 2023 by dogukanutuna Loading…
zero3 checkpoint frozen params
#3205 opened Apr 12, 2023 by tjruwase Loading…
Additional changes to support MI200
#3204 opened Apr 12, 2023 by loadams Draft
Make deepspeed.zero.Init() idempotent
#3203 opened Apr 12, 2023 by eisene Loading…
Update automatic-tensor-parallelism.md
#3198 opened Apr 12, 2023 by sywangyi Loading…
improving int4 asymmetric quantization accuracy
#3190 opened Apr 11, 2023 by HeyangQin Loading…
AMD Kernel Compatibility Fixes
#3180 opened Apr 11, 2023 by cmikeh2 Draft
Enable auto TP policy for llama model
#3170 opened Apr 10, 2023 by jianan-gu Loading…
Fix handling of (CUDA,ROCR)_VISIBLE_DEVICES
#3165 opened Apr 8, 2023 by jglaser Loading…
Dev/fp32
#3149 opened Apr 5, 2023 by ShijieZZZZ Draft
CPP PYBIND Module for Tensor Map
#3129 opened Apr 3, 2023 by DannyBruno Loading…
Disable ZeRO loading when load_module_only=True
#3116 opened Mar 30, 2023 by ploshkin Loading…
Token length type fixing
#3110 opened Mar 29, 2023 by tgergo1 Loading…
add bf16 cuda kernel support
#3092 opened Mar 24, 2023 by dc3671 Loading…
fix mpich launcher issue in multi-node
#3078 opened Mar 22, 2023 by sywangyi Loading…
fix pop off grad mistakenly
#3064 opened Mar 21, 2023 by HuangLK Loading…
ProTip! What’s not been updated in a month: updated:<2023-03-13.