-
Updated
Jun 19, 2021 - Python
#
rocm
Here are 59 public repositories matching this topic...
Open deep learning compiler stack for cpu, gpu and specialized accelerators
javascript
machine-learning
performance
deep-learning
metal
compiler
gpu
vulkan
opencl
tensor
spirv
rocm
tvm
-
Updated
Sep 11, 2018 - C++
stdgpu: Efficient STL-like Data Structures on the GPU
cpp
gpu
modern-cpp
cpp14
openmp
cuda
stl
data-structures
gpgpu
gpu-acceleration
cpp17
stl-containers
hip
gpu-computing
rocm
cpp20
stl-like
-
Updated
May 3, 2021 - C++
tomdeakin
commented
Jun 8, 2020
Just an FYI whilst I was trawling through the ROCm GitHub page:
https://rocmdocs.amd.com/en/latest/Programming_Guides/Programming-Guides.html#
Agenium Scale vectorization library for CPUs and GPUs
hpc
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
simd-library
sse42
rocm
cpp20
sve
neon128
cpp20-library
vectorization-library
-
Updated
Jun 9, 2021 - Python
AMD OpenVX Core -- a sub-module of amdovx-modules:
linux
cmake
cpu
opencl
range
vcxproj
amdgpu
rocm
radeon-open-compute
openvx
radeon-instinct-mi-series
radeon-vega-series
amd-openvx
khronos-openvx
vx-loomsl
-
Updated
Feb 5, 2019 - C++
jpsamaroo
commented
Apr 6, 2021
Since arrays may not actually be modified by a given operation, or might only be partially modified (or the user has some other way to ensure correctness).
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
machine-learning
computer-vision
neural-network
opencl
inference
amd-opencl
virtual-reality
rocm
inference-engine
openvx
ryzen
onnx
windows-machine-learning
amd-openvx
khronos-openvx
openvx-neural-network
amd-opencv
nnef
winml
openvx-extensions
-
Updated
Jun 18, 2021 - C++
AMD OpenVX modules: such as, neural network inference, 360 video stitching, etc.
video-stitching
rocm
radeon-open-compute
openvx
onnx
neural-network-inference
radeon-instinct-mi-series
radeon-vega-series
-
Updated
Feb 5, 2019 - C++
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
linear-algebra
mpi
cuda
scalapack
matrix-multiplication
gpu-acceleration
rocm
communication-optimal
pdgemm
-
Updated
Jun 18, 2021 - C++
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
-
Updated
May 4, 2021 - Go
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.
-
Updated
Jun 16, 2020 - C++
Domain specific library for electronic structure calculations
cmake
gpu
mpi
cuda
density-functional-theory
hdf5
sirius
hdfs
spack
fftw
libxc
gsl
rocm
electronic-structure-calculations
pseudopotential
planewave
full-potential
lapw
spglib
piz-daint
-
Updated
Jun 19, 2021 - C++
MIVisionX toolkit is a comprehensive computer vision and machine intelligence libraries, utilities and applications bundled into a single toolkit.
machine-learning
opencl
amd-opencl
artificial-intelligence
artificial-neural-networks
convolutional-neural-networks
object-detection
amd-modules
machine-intelligence
rocm
openvx
tiny-yolo
yolov2
tiny-yolo-network
openvx-nn-extension
amd-gpu
mivisionx
mivision
amd-openvx
-
Updated
Jun 26, 2019 - C++
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
-
Updated
May 25, 2021 - C++
ROCm Machine Learning and HPC Stack installer
-
Updated
Jul 31, 2020 - Shell
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
-
Updated
Feb 4, 2021
Radeon Performance Primitives (RPP) library is a comprehensive high performance computer vision library for AMD (CPU and GPU) with HIP and OpenCL back-ends.
computer-vision
hpc
amd
gpu
histogram
contrast
bitwise
hip
rocm
comprehensive
openvx
rpp
mivisionx
radeon-performance-primitives
warp-affine
channel-extract
agumentation
-
Updated
Jun 14, 2021 - C++
Improve this page
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."
In numba/stencils/stencil.py, there are various places (like line 552, "if isinstance(kernel_size[i][0], int):") where we check for "int" in relation to neighborhoods. I ran across a case where I was creating a neighborhood tuple by extracting values from a Numpy array. This causes a problem because those Numpy values will not match in these isinstance int checks. I worked around it by conver