simd-instructions

Star

Here are 52 public repositories matching this topic...

lakshayg / tensorflow-build-archived

Star

TensorFlow binaries supporting AVX, FMA, SSE

machine-learning tensorflow simd-instructions

Updated Feb 4, 2020
Shell

xtensor-stack / xsimd

Star

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, NEON, AVX512)

cpp neon c-plus-plus-11 avx sse simd vectorization avx512 mathematical-functions simd-instructions simd-intrinsics

Updated Jul 26, 2021
C++

VcDevel / Vc

Star

SIMD Vector Classes for C++

c-plus-plus cpp portable neon cpp14 parallel parallel-computing avx sse cpp11 simd cpp17 avx2 simd-programming vectorization avx512 simd-instructions simd-vector data-parallel

Updated Jul 8, 2021
C++

google / highway

Star

Performance-portable, length-agnostic SIMD with runtime dispatch

neon wasm simd intrinsics avx2 simd-programming avx512 simd-instructions

Updated Jul 26, 2021
C++

lemire / simdcomp

Star

A simple C library for compressing lists of integers using binary packing

c compression simd simd-instructions

Updated Jul 23, 2021
C

lemire / SIMDCompressionAndIntersection

Star

A C++ library to compress and intersect sorted lists of integers using SIMD instructions

compression algorithms simd integer-compression intersection simd-instructions

Updated Dec 11, 2020
C++

lakshayg / tensorflow-build

Star

TensorFlow binaries supporting AVX, FMA, SSE

machine-learning tensorflow simd-instructions

Updated Feb 15, 2021

agenium-scale / nsimd

Star

Agenium Scale vectorization library for CPUs and GPUs

hpc neon cuda avx simd avx2 sse2 simd-programming aarch64 avx512 simd-instructions simd-library sse42 rocm cpp20 sve neon128 cpp20-library vectorization-library

Updated Jul 26, 2021
Python

lemire / MaskedVByte

Star

Fast decoder for VByte-compressed integers

compression simd integer-compression simd-instructions vbyte vbyte-compressed-integers

Updated Jul 22, 2019
C

DragonSpit / HPCsharp

Star

High performance algorithms in C#: SIMD/SSE, multi-core and faster

linq sorting algorithm csharp high-performance fill sum parallel sse generic sort simd high-performance-computing sorting-algorithms simd-instructions radix-sort summation c-shap algorithm-performance

Updated Apr 10, 2021
C#

lemire / dictionary

Star

High-performance dictionary coding

simd integer-compression simd-instructions

Updated Apr 5, 2017
C++

edanor / umesimd

Star

UME::SIMD A library for explicit simd vectorization.

Updated Jan 19, 2018
C++

lemire / SIMDxorshift

Star

Fast random number generators: Vectorized (SIMD) version of xorshift128+

simd prng xorshift simd-instructions

Updated Jul 22, 2020
C

mklarqvist / positional-popcount

Star

Fast C functions for the computing the positional popcount (pospopcnt).

simd avx2 avx512 simd-instructions popcnt popcount sse4 pospopcnt

Updated Jan 23, 2020
C

cloudflare / sliceslice-rs

Star

A fast implementation of single-pattern substring search using SIMD acceleration.

search-in-text simd avx2 simd-programming text-processing simd-instructions substring-search

Updated May 18, 2021
Rust

lemire / FastDifferentialCoding

Star

Fast differential coding functions (using SIMD instructions)

simd prefix-sum integer-compression simd-instructions compressed

Updated Dec 8, 2017
C

mklarqvist / libalgebra

Star

Fast C header-only library for popcnt, pospopcnt, and set algebraic operations

bitset simd avx2 avx512 bitset-library simd-instructions popcnt popcount set-operations pospopcnt positional-popcount

Updated Dec 16, 2019
C

PatwinchIR / ultra-sort

Star

DSL for SIMD Sorting on AVX2 & AVX512

fast sorting parallel intel sort simd sorting-algorithms gtest avx2 simd-programming vectorization avx512 simd-parallelism simd-instructions simd-transpose

Updated Jan 11, 2019
C++

frtru / GemParticles

Star

Particle engine built on OpenGL used to produce various visual effects.

cmake opengl shaders particles gpgpu rendering-engine glm stb glfw3 simd-instructions glew

Updated Feb 21, 2021
C++

badamczewski / SimpleIntrinsics

Star

This project aims to rename all C# intrinsic names to their more compact C/C++ counterparts that the industry uses.

dotnet simd dotnet-core intrinsics simd-instructions

Updated Nov 25, 2020
C#

jermp / mutable_rank_select

Star

A SIMD-based C++ library providing rank/select queries over mutable bitmaps.

bitmap simd simd-instructions segment-tree rank-select mutable-bitmaps

Updated Feb 23, 2021
C++

sadko4u / lsp-dsp-lib

Star

DSP library for signal processing

algorithms dsp assembly x86-64 simd armv7 fft aarch64 simd-instructions simd-library x86-32 architectures dsp-library processing-algorithms lsp-dsp-lib convolution-algorithms fma3

Updated May 16, 2021
C++

Technologicat / cython-sse-example

Star

Simple example for embedding SSE2 assembly in Cython projects

python example assembly x86-64 cython sse python3 simd x86 intrinsics sse2 python2 python27 python34 simd-instructions

Updated May 2, 2017
Python

andrelrt / litesimd

Star

Litesimd is a no overhead, header only, C++ library for SIMD processing, specialized on SIMD comparison and data shuffle.

cpp avx sse simd simd-programming vectorization simd-instructions

Updated May 23, 2019
C++

jermp / psds

Star

Efficient Prefix-Sum data structures in C++.

simd data-structures prefix-sum simd-instructions segment-tree fenwick-tree

Updated Oct 15, 2020
C++

lemire / vectorclass

Star

Random number generator for large applications using vector instructions

performance simd prng simd-instructions

Updated Feb 17, 2016
C++

zamronypj / simd

Star

Simple pascal demo project to show how to use Single Instruction Multiple Data (SIMD) using Intel SSE instruction

lazarus assembly sse freepascal simd simd-programming simd-instructions lazarus-ide

Updated Feb 13, 2017
Pascal

nulidangxueshen / ALBUS

Star

A Method for efficiently processing SpMV using SIMD and load balancing

simd csr spmv simd-instructions load-balancing sparse-matrix-vector-multiplication albus

Updated May 17, 2021
C++

minio / go-cv

Star

Golang wrapper for https://github.com/ermig1979/Simd

deep-neural-networks deep-learning image-processing convolutional-neural-networks opencv-library simd-instructions

Updated Jun 4, 2020
Go

sshekh / fast-lda

Star

Course project in 'How to write Fast Numerical Code' on optimized implementation of latent dirichlet allocation

avx sse topic-modeling avx2 optimized-functions simd-instructions latent-dirichlet-allocation

Updated Jul 22, 2017
C

Improve this page

Add a description, image, and links to the simd-instructions topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the simd-instructions topic, visit your repo's landing page and select "manage topics."

Learn more