matrix-multiplication
Here are 328 public repositories matching this topic...
-
Updated
Apr 14, 2020 - C
-
Updated
May 18, 2020 - Clojure
In,
https://github.com/flame/blis/blob/master/docs/Performance.md
I noticed that MKL on EPYC was abysmal, but that's just because it uses SSE2 on AMD by default.
You can set the following (undocumented!) environment variable to tell MKL to use AVX2 CPU instructions:
export MKL_DEBUG_CPU_TYPE=5
-
Updated
Jun 18, 2018 - C
-
Updated
Jun 7, 2020 - C++
-
Updated
Jun 11, 2020 - C
The fp_reduction_latency benchmarks were the very first benchmark, optimization and primitive code tested in Laser.
Unfortunately it is currently very confusing.
It should be reorganized:
-
Updated
May 28, 2020 - C
-
Updated
Aug 6, 2019 - C
-
Updated
Jun 10, 2020 - Go
Hi,
Your glas library is very interesting, and the benchmark for your gemm is impressive. I think users and potential contributors would benefit greatly from a series of blogs or documentation explaining the design approach you have taken for the whole library and for the gemm algorithm.
One of the issues about D is that the advanced methods, idioms, and techniques are not particularly well
-
Updated
Jun 10, 2020 - C++
-
Updated
Jun 10, 2020 - Python
-
Updated
Sep 23, 2016 - Rust
-
Updated
Feb 28, 2019 - C#
Machine learning libraries have found a nice API for tensor contractions, which is based on the Einstein notation. Examples are PyTorch, Tensorflow, and NumPy. I guess, it would be rather strai
-
Updated
Jan 22, 2020 - Python
-
Updated
May 29, 2020 - C++
-
Updated
Nov 1, 2017 - C
-
Updated
Jun 8, 2020 - PHP
-
Updated
Jan 25, 2020 - C
-
Updated
Aug 20, 2016 - Swift
-
Updated
Dec 12, 2017 - TeX
-
Updated
Aug 12, 2017 - Verilog
-
Updated
Nov 18, 2016 - C
-
Updated
May 19, 2020 - C
-
Updated
Feb 3, 2020 - Python
Improve this page
Add a description, image, and links to the matrix-multiplication topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the matrix-multiplication topic, visit your repo's landing page and select "manage topics."
Add quickselect algorithm. Quickselect is a selection algorithm to find the kth smallest element in an unordered list.