New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
PipelineModule inflated checkpoints when using FP16 param flattening
#549
opened Nov 22, 2020 by
opherlieber
Commenting out loss=None causes much higher GPU memory usage in bing_bert.
#547
opened Nov 21, 2020 by
szhengac
Why is CPU Checkpointing only available with partitioned activations?
#541
opened Nov 19, 2020 by
sshleifer
PipelineParallelGrid' object has no attribute 'slice_parallel_size'
#530
opened Nov 17, 2020 by
gongwei-130
How to generate text with the Megatron-LM model trained with DeepSpeed
#507
opened Nov 7, 2020 by
msmolyak
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.