xsimd
MandelbrotSSE
xsimd | MandelbrotSSE | |
---|---|---|
3 | 4 | |
2,043 | 83 | |
1.4% | - | |
8.7 | 2.0 | |
2 days ago | 11 months ago | |
C++ | C++ | |
BSD 3-clause "New" or "Revised" License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
xsimd
-
GDlog: A GPU-Accelerated Deductive Engine
https://github.com/xtensor-stack/xsimd
GH topics > HashMap:
-
SIMD intrinsics and the possibility of a standard library solution
xsimd - 1.6K GH stars
-
SPO600 project part 1
I've decided to switch to something better, and after a few hours of searching, I found this repository: NSIMD https://github.com/agenium-scale/nsimd FastDifferentialCoding https://github.com/lemire/FastDifferentialCoding VS https://github.com/VcDevel/Vc XSIMD https://github.com/xtensor-stack/xsimd
MandelbrotSSE
-
1000x speedup on interactive Mandelbrot zooms: from C, to inline SSE assembly, to OpenMP for multiple cores, to CUDA, to pixel-reuse from previous frames, to inline AVX assembly...
I verified that the limiting factor is memory bandwidth - and that once we switch to a fully CPU-bound mode (with option -p 100) the computation speed scales linearly with more cores.
What are some alternatives?
highway - Performance-portable, length-agnostic SIMD with runtime dispatch
futhark - :boom::computer::boom: A data-parallel functional programming language
Vc - SIMD Vector Classes for C++
GAPFixFractal - A program to generate pictures of Mandelbrot or Julia sets, using arbitrary precision arithmetic on CUDA GPUs.
libsimdpp - Portable header-only C++ low level SIMD library
FractalAsm-Linux - x86 assembly language (with GUI in C) Mandelbrot and Julia fractal program (Linux)
nsimd - Agenium Scale vectorization library for CPUs and GPUs
CTranslate2 - Fast inference engine for Transformer models
FastDifferentialCoding - Fast differential coding functions (using SIMD instructions)
FractalAsm - x86 inline assembly language Mandelbrot and Julia fractal program (Windows)
optuna - A hyperparameter optimization framework
VectorizedKernel - Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures