NumPy aware dynamic Python compiler using LLVM
#
rocm
Repositories 26
Open deep learning compiler stack for cpu, gpu and specialized accelerators
compiler
tensor
deep-learning
gpu
opencl
metal
performance
javascript
rocm
tvm
vulkan
spirv
machine-learning
Python
Updated Mar 22, 2019
move to https://github.com/dmlc/tvm/
C++
Updated Sep 11, 2018
AMD OpenVX Core -- a sub-module of amdovx-modules:
openvx
radeon-instinct-mi-series
radeon-vega-series
rocm
radeon-open-compute
amd-openvx
opencl
khronos-openvx
amdgpu
cpu
linux
vx-loomsl
C++
Updated Feb 5, 2019
Dockerfiles for the various software layers defined in the Radeon Open Compute Platform
Shell
Updated Feb 12, 2019
GPU Performance API for AMD GPUs
C++
Updated Mar 18, 2019
AMD OpenVX modules: such as, neural network inference, 360 video stitching, etc.
openvx
neural-network-inference
onnx
video-stitching
radeon-instinct-mi-series
radeon-vega-series
rocm
radeon-open-compute
C++
Updated Feb 5, 2019
Next generation BLAS implementation for ROCm platform
Shell
Updated Mar 21, 2019
Implementation of SYCL 1.2.1 over AMD HIP/NVIDIA CUDA
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for …
C++
Updated Feb 13, 2019
Next generation FFT implementation for ROCm
C++
Updated Mar 22, 2019
RAND library for HIP programming language
C
Updated Mar 20, 2019
ROCm Parallel Primitives
C++
Updated Mar 21, 2019
A Benchmark Suite for Heterogeneous System Computation
Jupyter Notebook
Updated Jan 17, 2019
MIVisionX toolkit is a comprehensive computer vision and machine intelligence libraries, utilities and applications b…
amd
radeon
computer-vision
openvx
neural-network
opencl
mivisionx
rocm
inference
ai
inference-engine
opencv
onnx
nnef
caffe
winml
windows-machine-learning
openvx-modules
winmltools
vr
C++
Updated Mar 21, 2019
The repo is obsolete. Use at your own risk.
C++
Updated Aug 1, 2018
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
rocm
cpp17
astronomy
astrophyics
astrophysical-simulation
n-body-simulator
n-body
nbody-simulation
nbody
nbody-gravity-simulation
nbody-sim
nbody-problem
simd
avx512
avx
avx2
vectorization
cuda
C++
Updated Mar 14, 2019
MIVisionX toolkit is a comprehensive computer vision and machine intelligence libraries, utilities and applications b…
yolov2
tiny-yolo
tiny-yolo-network
mivisionx
mivision
rocm
opencl
openvx
openvx-nn-extension
amd-opencl
amd-openvx
amd-modules
amd-gpu
artificial-intelligence
artificial-neural-networks
machine-learning
machine-intelligence
convolutional-neural-networks
object-detection
C++
Updated Feb 22, 2019
This project has scripts to set up, build and test installation of Radeon MIVisionX
profile
rocm
mivision-profile
radeon-mivision
platfrom-report
benchmark-reports
amdovx-modules
mivision-setup
deps-folder
openvx
openvx-nn-extension
open-source
opencl
opencv
mivisionx
amd
radeon-mivisionx
radeon
gpu
amd-gpu
Python
Updated Mar 11, 2019
Benchmark for both NVIDIA and AMD GPU
Python
Updated Jan 23, 2019
a Productive Parallel Programming Language
source code for slide deck presented at parallel2017
JavaScript
Updated Jun 25, 2017
Heterogeneous Compute Device Infomation
C++
Updated Jan 11, 2017
HTML
Updated Mar 19, 2019
ITESO GPU programming Hwks and reads
C++
Updated Feb 18, 2018
OpenCL Code
C++
Updated Jan 19, 2018