fma
Here are 34 public repositories matching this topic...
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Updated Jul 15, 2019
Music genre classification model using CRNN
Updated Sep 27, 2018 - Python
Recommending Music using a Convolutional Neural Network.
Updated May 4, 2019 - Python
This package contains a macro for converting expressions to use muladd calls and fused multiply-add (FMA) operations for high performance in the SciML scientific machine learning ecosystem.
Updated Nov 27, 2022 - Julia
IEEE 754 standard floating point unit fpu single double precision verilog vhdl riscv
Updated Oct 17, 2022 - VHDL
FMA Transmutation Circles
Updated Feb 13, 2019 - TypeScript
IEEE 754 standard floating point unit fpu single precision verilog vhdl riscv
Updated Oct 17, 2022 - VHDL
Data pipeline and training pipeline for
Updated Dec 8, 2022 - Jupyter Notebook
A collection of highly optimized, SIMD-accelerated (SSE, AVX, FMA, NEON) functions written in C
Updated Oct 19, 2021 - C
Software implementation of fused multiply-add (FMA) for 64-bit floats
Updated Aug 6, 2020 - Go
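For readers unfamiliar with what a software fused multiply-add has to guarantee, the following is a minimal Python sketch. It is not the implementation used by the repository listed above; the function name `fma_exact` is hypothetical, and the approach (exact rational arithmetic followed by a single rounding) is chosen for clarity rather than speed. It assumes finite inputs and does not attempt to reproduce IEEE 754 behavior for infinities, NaNs, or the sign of a zero result.

```python
from fractions import Fraction


def fma_exact(a: float, b: float, c: float) -> float:
    """Return a*b + c with a single rounding (finite inputs only)."""
    # Fraction represents each double exactly, so the product and the sum
    # below are computed with no intermediate rounding.
    exact = Fraction(a) * Fraction(b) + Fraction(c)
    # Converting back to float performs the one and only rounding step.
    return float(exact)


a = 1.0 + 2.0**-30
b = 1.0 - 2.0**-30
c = -1.0

# Unfused: a*b is rounded to a double before c is added.
unfused = a * b + c
# Fused: the exact product 1 - 2**-60 survives until the final rounding.
fused = fma_exact(a, b, c)

print(unfused, fused)
```

With these inputs the exact product is 1 - 2**-60, which rounds to 1.0 as a double, so the unfused expression yields 0.0 while the fused result is -2**-60. That difference is the reason fused multiply-add is treated as a distinct operation rather than a shorthand for a multiply followed by an add.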
x86-64 bilateral instruction tokenizer implemented in C. Supports the following processor extensions: AES, AVX, AVX2, AVX512, FMA, MMX, SSE, SSE2, SSE3, SSE4, x87 (FPU), VMX. To ease testing, a disassembler that transforms tokens into compilable assembly (for the NASM assembler) has been implemented.
Updated Oct 2, 2022 - C
SIMD-accelerated xoshiro128 that generates a 256-bit 'number' at once
Updated Oct 24, 2021 - C++