-
Updated
Apr 14, 2022 - Python
#
tvm
Here are 68 public repositories matching this topic...
Open deep learning compiler stack for cpu, gpu and specialized accelerators
javascript
machine-learning
performance
deep-learning
metal
compiler
gpu
vulkan
opencl
tensor
spirv
rocm
tvm
-
Updated
Sep 11, 2018 - C++
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
reinforcement-learning
deep-learning
tensorflow
optimization
pytorch
tengine
halide
tensor
auto
autosearch
tvm
-
Updated
Sep 27, 2021 - C++
zhiqwang
commented
Apr 7, 2022
I used following commad to export the ONNX models, and I use the 5.0 tag of ultralytics/yolov5 to train the model.pt. It raises an AttributeError: conv object has no attribute weight. How can I do for this error?
python tools/export_model.py --checkpoint_path model.pt --size_divisible 32_Originally posted by @Deronjey in zhiqwang/yolov5-rt-stack#159 (comment)
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
pytorch
quantization
hessian
8-bit
model-compression
distillation
tvm
4-bit
mixed-precision
tensorcore
quantized-neural-networks
hardware-aware
efficient-neural-networks
-
Updated
May 8, 2021 - Python
Optimizing Mobile Deep Learning on ARM GPU with TVM
-
Updated
Oct 15, 2018 - C
-
Updated
Oct 24, 2021 - Python
Open, Modular, Deep Learning Accelerator
-
Updated
Jan 27, 2022 - Scala
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
-
Updated
May 22, 2018 - Jupyter Notebook
A home for the final text of all TVM RFCs.
-
Updated
Apr 13, 2022
Benchmark scripts for TVM
-
Updated
Mar 15, 2022 - Python
动手学习TVM核心原理教程
-
Updated
Dec 4, 2020 - Python
Large input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inference. 480P Over 30FPS on CPU
lightweight
cpp
face-recognition
face-detection
mtcnn
tvm
arcface
insightface
retinaface
real-tim
retinaface-detector
-
Updated
Oct 15, 2020 - C++
-
Updated
Sep 2, 2019 - C++
(MERGED) Rust bindings for TVM runtime
-
Updated
Feb 3, 2019 - Rust
This project contains a code generator that produces static C NN inference deployment code targeting tiny micro-controllers (TinyML) as replacement for other µTVM runtimes. This tools generates a runtime, which statically executes the compiled model. This reduces the overhead in terms of code size and execution time compared to having a dynamic on-device runtime.
-
Updated
Sep 22, 2021 - C
ANT framework's model database that provides DNN models for the various range of IoT devices
-
Updated
Dec 21, 2020 - Python
-
Updated
Sep 2, 2020 - Python
DEPRECATED: this repo has been moved into https://github.com/dmlc/tvm/blob/master/rust
-
Updated
Oct 5, 2018 - Rust
-
Updated
Aug 9, 2021 - Dockerfile
Benchmarks for popular classification and object detection models on CPUs and GPUs
machine-learning
computer-vision
deep-learning
mxnet
benchmarks
image-classification
object-detection
tvm
-
Updated
Apr 8, 2019 - Python
Canopy is a machine learning learning compiler stack with the capability of adopting high-end FPGAs. As a part of OpenAIOS project, Canopy is an evolved version of Apache TVM. Canopy is able to support a variety of hardware backends such as PCIE-based cloud FPGAs, CPUs and GPUs.
-
Updated
May 7, 2021 - Python
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
Updated
Dec 9, 2021 - Python
各类深度模型的优化器,包括tvm、tf-tensorrt、tensorrt等。
-
Updated
Nov 30, 2021 - Python
Improve this page
Add a description, image, and links to the tvm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tvm topic, visit your repo's landing page and select "manage topics."
Auto-Installer is currently not supported on Windows platforms. TVM and TensorRT in particular would need special care.