Pull requests: microsoft/DeepSpeed
Pull requests list
fix [BUG] 'DeepSpeedGPTInference' object has no attribute 'dtype' for…
#4814
opened Dec 14, 2023 by
jxysoft
mv DeepSpeedEngine param_names dict init post _configure_distributed_model
#4803
opened Dec 12, 2023 by
nelyahu
Revert "Fix for when prompt contains an odd num of apostrophes (#4660)"
#4797
opened Dec 11, 2023 by
loadams
support autoTP with weight only quantization in DS inference path
#4750
opened Nov 29, 2023 by
ftian1
SP Comm-optimization: fuse query, key, and value all-2-all for better SP performance
#4735
opened Nov 28, 2023 by
RezaYazdaniAminabadi
fix num_kv_heads sharding in uneven autoTP for Falcon-40b
#4712
opened Nov 21, 2023 by
Yejing-Lai
fix bias when calling qkv_func in DeepSpeedSelfAttention
#4711
opened Nov 21, 2023 by
zmzhang2000