quantization

UFund-Me / Qbot

[🔥 updating ...] AI automated quantitative trading bot: an AI-powered Quantitative Investment Research Platform. 📃 Online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant

machine-learning deep-learning bitcoin blockchain fintech quantitative-finance trademarks quantization funds strategies quantitative-trading pytrade qlib quant-trade trade-bot quant-trader

Updated Nov 14, 2023
Jupyter Notebook

kornelski / pngquant

Lossy PNG compressor — pngquant command based on libimagequant library

c palette quality png png-compression conversion smaller stdin image-optimization quantization pngquant

Updated Oct 29, 2023
C

IntelLabs / distiller

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

deep-neural-networks jupyter-notebook pytorch regularization pruning quantization group-lasso distillation onnx truncated-svd network-compression pruning-structures early-exit automl-for-compression

Updated Apr 24, 2023
Jupyter Notebook

IntelLabs / nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

nlp deep-learning tensorflow nlu transformers pytorch deeplearning quantization bert dynet

Updated Nov 7, 2022
Python

huawei-noah / Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

pretrained-models quantization knowledge-distillation model-compression large-scale-distributed

Updated May 21, 2023
Python

PanQiWei / AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

nlp deep-learning transformers inference pytorch transformer quantization large-language-models llms

Updated Nov 24, 2023
Python

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

nlp performance computer-vision inference machinelearning pruning object-detection pretrained-models quantization cpus onnx sparsification llm-inference deepsparse

Updated Nov 24, 2023
Python

aaron-xichen / pytorch-playground

Base pretrained models and datasets in PyTorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

pytorch quantization pytorch-tutorial pytorch-tutorials

Updated Nov 22, 2022
Python

OpenNMT / CTranslate2

Fast inference engine for Transformer models

Updated Nov 22, 2023
C++

666DZY666 / micronet

micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b) / ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); 2. pruning: normal, reg…

Updated Oct 6, 2021
Python

huggingface / optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools

training optimization intel transformers inference pytorch quantization onnx tflite onnxruntime graphcore habana

Updated Nov 24, 2023
Python

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

open-source machine-learning opensource deep-neural-networks compression deep-learning pruning quantization auto-ml network-quantization network-compression

Updated Nov 21, 2023
Python

intel / neural-compressor

Provides unified APIs for SOTA model compression techniques, such as low-precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation, on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.

sparsity pruning quantization knowledge-distillation auto-tuning low-precision quantization-aware-training post-training-quantization large-language-models smoothquant

Updated Nov 25, 2023
Python

PaddlePaddle / PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

sparsity compression detection transformer segmentation pruning quantization nas bert tensorrt distillation ernie yolov5 yolov6 yolov7

Updated Nov 24, 2023
Python

tensorflow / model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

machine-learning sparsity compression deep-learning tensorflow optimization keras ml pruning quantization model-compression quantized-training quantized-neural-networks quantized-networks

Updated Nov 17, 2023
Python

htqin / awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs adding works (papers, repositories) that the repo has missed are welcome.

awesome deep-learning quantization binarization model-compression model-acceleration binary-network binarized-neural-networks lightweight-neural-network model-quantization efficient-deep-learning

Updated Nov 23, 2023

open-mmlab / mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

detection pytorch classification segmentation pruning darts quantization nas knowledge-distillation spos autoslim

Updated Sep 22, 2023
Python


Here are 493 public repositories matching this topic...

ymcui / Chinese-LLaMA-Alpaca

hiyouga / LLaMA-Factory

SYSTRAN / faster-whisper

UFund-Me / Qbot

kornelski / pngquant

IntelLabs / distiller

IntelLabs / nlp-architect

huawei-noah / Pretrained-Language-Model

PanQiWei / AutoGPTQ

neuralmagic / deepsparse

aaron-xichen / pytorch-playground

OpenNMT / CTranslate2

666DZY666 / micronet

huggingface / optimum

quic / aimet

intel / neural-compressor

PaddlePaddle / PaddleSlim

tensorflow / model-optimization

htqin / awesome-model-quantization

open-mmlab / mmrazor
