-
Updated
Jul 9, 2021 - Python
distributed-training
Here are 55 public repositories matching this topic...
-
Updated
Jul 8, 2021 - Python
I have the same hardware envs, same network, but I could not get the result as you, almost half as you. Any best practices and experience? thanks very much! for bytePS with 1 instance and 8 GPU, I have similar testing result.
-
Updated
Jul 9, 2021 - Python
-
Updated
Jul 7, 2021 - Python
proposed by @mryab from #126
Virtual batching and LR scheduling are popular techniques with many applications. It would be nice to have an example of how to implement them with hivemind.
As of 0.8.15, ExpertBackend supports two ways of doing so:
- hacky way: by wrapping over optimizer and implementing LR schedule in opt.step
- orthodox way: by subclassing ExpertBackend and implementing ap
-
Updated
Mar 12, 2020 - Python
-
Updated
Jul 6, 2021 - Go
-
Updated
May 13, 2019
-
Updated
Nov 19, 2018 - Python
-
Updated
Aug 7, 2020 - Python
-
Updated
Jun 30, 2021 - Python
torchtext (as of 0.4.0) adopts torch.utils.data.DataLoader, and the older iterator interface is deprecated. Ensure AdaptDL's AdaptiveDataLoader supports this new torchtext interface for data loading, and port the example transformer code to the new interface. Then, adaptdl.data.iterator can be deprecated/removed.
-
Updated
Jun 22, 2020 - Python
-
Updated
Jul 7, 2021 - Jupyter Notebook
-
Updated
May 8, 2021 - C++
-
Updated
Sep 7, 2020 - Python
-
Updated
Jun 11, 2020 - Python
-
Updated
Jun 13, 2020 - Jupyter Notebook
-
Updated
Apr 28, 2021 - Python
-
Updated
Jun 30, 2021 - C++
-
Updated
Jul 8, 2021 - Java
-
Updated
May 27, 2021 - JavaScript
-
Updated
Sep 23, 2020 - Jupyter Notebook
-
Updated
Dec 18, 2018 - Shell
-
Updated
Jul 8, 2019 - HCL
-
Updated
Feb 15, 2021 - Jupyter Notebook
-
Updated
Dec 28, 2020 - Jupyter Notebook
Improve this page
Add a description, image, and links to the distributed-training topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the distributed-training topic, visit your repo's landing page and select "manage topics."
We would like to forward a particular 'key' column which is part of the features to appear alongside the predictions - this is to be able to identify to which set of features a particular prediction belongs to. Here is an example of predictions output using the tensorflow.contrib.estimator.multi_class_head: