feat: Support for training with MoE module in PipelineEngine
#1942
opened May 7, 2022 by shjwudp
fix: remove layer_past storing in DeepSpeedTransformerInference
#1930
opened May 1, 2022 by codertimo
remove force-multi and fix None val check in base tuner in autotuning
#1657
opened Dec 22, 2021 by cli99
[zero-3] add support for new params added during fwd pass
#1606
opened Dec 1, 2021 by jeffra
support batch size dimension in 2D sparse attention mask
#1597
opened Nov 29, 2021 by jglaser
Optimizer state loading fix for bitsandbytes 8-bit optimizers.
#1582
opened Nov 22, 2021 by TimDettmers
Refine quantizer for supporting larger hidden-dim and group size
#1544
opened Nov 9, 2021 by RezaYazdaniAminabadi
Add some improvements for pipeline module, engine and assertion into ds engine
#1529
opened Nov 6, 2021 by hyunwoongko
parallelize writing of layer checkpoint files across data parallel instances
#1419
opened Sep 30, 2021 by adammoody