《李宏毅深度学习教程》 (Lee Hung-yi's Deep Learning Tutorial); PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and low-bit (≤2b) ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, reg…
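To make the post-training quantization entry above concrete, here is a minimal sketch of 8-bit affine (scale/zero-point) quantization, the general idea behind the PTQ paths these tools implement. The helper names are illustrative and not any library's API:

```python
# Minimal post-training 8-bit affine quantization sketch (illustrative,
# not micronet's or TensorRT's API): map floats onto [0, 255] with a
# per-tensor scale and zero point, then recover approximate floats.

def quantize(values, num_bits=8):
    """Quantize floats to unsigned integers with an affine scale/zero-point."""
    qmax = 2 ** num_bits - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / qmax if hi != lo else 1.0
    zero_point = round(-lo / scale)
    # clamp so rounding never escapes the representable integer range
    q = [min(qmax, max(0, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q_values, scale, zero_point):
    """Recover approximate floats from the quantized integers."""
    return [(q - zero_point) * scale for q in q_values]

weights = [-1.0, -0.25, 0.0, 0.5, 1.0]
q, scale, zero_point = quantize(weights)
recovered = dequantize(q, scale, zero_point)
```

Round-trip error is bounded by roughly one quantization step (`scale`), which is why 8-bit PTQ usually preserves accuracy while quartering weight storage relative to float32.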
A curated list of neural network pruning resources.
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
PaddleSlim is an open-source library for deep model compression and architecture search.
Inference runtime offering GPU-class performance on CPUs and APIs to integrate ML into your application
OpenMMLab Model Compression Toolbox and Benchmark.
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool), which aims to provide unified APIs for network compression technologies, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
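The criterion in that paper ranks feature maps by a first-order Taylor expansion of the loss: a map matters if removing it would change the loss, which is approximated by the product of its activation and gradient. A toy pure-Python sketch of that saliency score (not the paper's implementation):

```python
# Toy sketch of the first-order Taylor pruning criterion from
# "Pruning Convolutional Neural Networks for Resource Efficient
# Inference": importance ~ |mean(activation * gradient)|.

def taylor_importance(activations, gradients):
    """First-order Taylor saliency of a feature map: |mean(a * g)|."""
    n = len(activations)
    return abs(sum(a * g for a, g in zip(activations, gradients)) / n)

# a map whose activations barely affect the loss scores near zero...
low = taylor_importance([0.5, 0.8, 0.2], [0.01, -0.01, 0.0])
# ...while one with large activation-gradient products scores high
high = taylor_importance([0.5, 0.8, 0.2], [1.0, 1.0, 1.0])
```

In practice the scores are accumulated over a calibration set and the lowest-ranked filters are pruned iteratively, with fine-tuning between pruning steps.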
[Preprint] Towards Any Structural Pruning
Config-driven, easy backup CLI for restic.
mobilev2-yolov5s pruning and distillation, with support for ncnn and TensorRT deployment. Ultra-light but better performance!
Embedded and mobile deep learning research resources
Neural Network Compression Framework for enhanced OpenVINO™ inference
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
A PyTorch knowledge distillation library for benchmarking and extending work in the domains of knowledge distillation, pruning, and quantization.
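Knowledge distillation, which several of the libraries above implement, trains a small student to match a large teacher's temperature-softened class probabilities. A minimal pure-Python sketch of the classic Hinton-style distillation loss (illustrative, not any of these libraries' APIs):

```python
# Minimal sketch of the Hinton et al. distillation loss: KL divergence
# between teacher and student distributions softened by a temperature T.
import math

def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of raw logits."""
    exps = [math.exp(l / temperature) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=4.0):
    """KL(teacher || student) on temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)   # soft targets
    q = softmax(student_logits, temperature)   # student predictions
    # scale by T^2 to keep gradient magnitudes comparable across temperatures
    return temperature ** 2 * sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

loss_same = distillation_loss([3.0, 1.0, 0.2], [3.0, 1.0, 0.2])  # → 0.0
loss_diff = distillation_loss([0.1, 2.0, 0.5], [3.0, 1.0, 0.2])
```

In full training this soft-target term is combined with the ordinary cross-entropy against the hard labels, weighted by a mixing coefficient.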
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)
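The FPGM criterion above prunes the filters closest to the geometric median of a layer, on the grounds that they are the most replaceable by the remaining filters. A toy pure-Python sketch of that selection rule, using flattened filters and summed pairwise distances as the standard approximation (not the authors' code):

```python
# Toy sketch of the FPGM selection rule: a filter's redundancy score is
# its total Euclidean distance to all other filters in the layer; the
# smallest totals lie nearest the geometric median and get pruned.
import math

def fpgm_select(filters, num_prune):
    """Return indices of the `num_prune` most redundant (prunable) filters."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    scores = [sum(dist(f, g) for g in filters) for f in filters]
    # smallest summed distance = closest to the geometric median
    return sorted(range(len(filters)), key=lambda i: scores[i])[:num_prune]

filters = [[1.0, 0.0], [0.9, 0.1], [0.0, 5.0], [1.1, -0.1]]
pruned = fpgm_select(filters, 1)  # → [0]
# the outlier filter [0.0, 5.0] is far from the median, so it is kept
```

Unlike magnitude-based criteria, this rule can prune a filter with a large norm if near-duplicates of it remain in the layer, which is the paper's central argument.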