data-parallelism
Here are 29 public repositories matching this topic...
Dear Colossal-AI team,
There are a few features in my mind that I thought would be helpful to the project, and I wanted to ask if there is any of them which might be more useful so I could start implementing them.
Loki-Promtail is a tool for monitoring distributed logs with Grafana. Connecting the Distributed Logger to it and extracting labels from the log structure would be a user-friendly sys
-
Updated
Jul 25, 2018 - Python
-
Updated
Mar 27, 2022 - Nim
-
Updated
Mar 14, 2022 - Shell
-
Updated
Oct 3, 2019 - Python
-
Updated
Nov 19, 2018 - Python
-
Updated
Feb 1, 2021 - Rust
-
Updated
Apr 8, 2022 - Python
-
Updated
Apr 12, 2022 - Python
-
Updated
Mar 25, 2022 - Python
-
Updated
Jan 28, 2019 - C
-
Updated
Jul 27, 2021 - Python
-
Updated
Mar 26, 2021
-
Updated
Dec 8, 2018 - C++
-
Updated
Feb 28, 2018 - Lua
-
Updated
Apr 11, 2022 - C++
-
Updated
Aug 25, 2019 - Python
-
Updated
Jan 22, 2022 - C
-
Updated
Apr 9, 2022 - C
-
Updated
Apr 2, 2018 - C#
-
Updated
Jan 20, 2022 - Python
-
Updated
Mar 23, 2019 - Python
-
Updated
Dec 20, 2021 - Python
-
Updated
Jan 30, 2022 - Erlang
-
Updated
Jan 12, 2022 - Python
-
Updated
Oct 28, 2020 - Java
Improve this page
Add a description, image, and links to the data-parallelism topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-parallelism topic, visit your repo's landing page and select "manage topics."
Hi ,
I have tried out both loss.backward() and model_engine.backward(loss) for my code. There are several subtle differences that I have observed , for one retain_graph = True does not work for model_engine.backward(loss) . This is creating a problem since buffers are not being retained every time I run the code for some reason.
Please look into this if you could.