-
Updated
Sep 17, 2021 - Python
rocm
Here are 64 public repositories matching this topic...
-
Updated
Sep 11, 2018 - C++
-
Updated
Jul 26, 2021 - C++
Please go to https://github.com/deepmodeling/deepmd-kit/discussions to ask questions. Before asking questions, you can
- search the previous discussions
- check the document, especially training parameters
Please provide necessary information including the version of software and installation way,
Bug summary
There is evidence that sub_group::get_group_id() does not return the same value as threadIdx.x / warpSize (assuming 1D kernel), as expected on CUDA. We should check the implementation of this function. Our implementation of this function performs bit manipulation magic, presumably the optimization went to far...
To Reproduce
Compare sub_group{}.get_group_id() or `sub
-
Updated
Sep 16, 2021 - Python
-
Updated
Feb 5, 2019 - C++
Since arrays may not actually be modified by a given operation, or might only be partially modified (or the user has some other way to ensure correctness).
-
Updated
Sep 17, 2021 - C++
-
Updated
Aug 4, 2021 - Go
-
Updated
Feb 5, 2019 - C++
-
Updated
Sep 9, 2021 - C++
-
Updated
Jun 16, 2020 - C++
-
Updated
Sep 17, 2021 - C++
-
Updated
Jun 26, 2019 - C++
-
Updated
Sep 16, 2021 - C++
-
Updated
Jul 31, 2020 - Shell
-
Updated
Sep 14, 2021
Improve this page
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."
I see comments suggesting adding this to understand how loops are being handled by numba, and in the their own FAQ (https://numba.pydata.org/numba-doc/latest/user/faq.html)
You would then create your njit function and run it, and I believe the idea is that it prints debug information about whether