Top 23 Simd Open-Source Projects

ncnn

12 19,176 9.4 C++

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Project mention: AMD Funded a Drop-In CUDA Implementation Built on ROCm: It's Open-Source | news.ycombinator.com | 2024-02-12

ncnn uses Vulkan for GPU acceleration, I've seen it used in a few projects to get AMD hardware support.
https://github.com/Tencent/ncnn

simdjson

65 18,362 9.2 C++

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

Project mention: Tips on adding JSON output to your command line utility. (2021) | news.ycombinator.com | 2024-04-20

It's also supported by simdjson [0] (which has a lot of language bindings [1]):
> Multithreaded processing of gigantic Newline-Delimited JSON (ndjson) and related formats at 3.5 GB/s
[0] https://simdjson.org/
[0] https://github.com/simdjson/simdjson?tab=readme-ov-file#bind...

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
QuestDB

311 13,448 9.7 Java

An open source time-series database for fast ingest and SQL queries

Project mention: How to Forecast Air Temperatures with AI + IoT Sensor Data | dev.to | 2024-03-24

If your data lacks uniform time intervals between consecutive entries, QuestDB offers a solution by allowing you to sample your data. After that, MindsDB facilitates creating, training, and deploying your time-series models.

john

77 9,267 9.3 C

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

Project mention: Best Hacking Tools for Beginners 2024 | dev.to | 2024-02-01

John The Ripper

GLM

36 8,671 9.0 C++

OpenGL Mathematics (GLM)

Project mention: Release of GLM 1.0.0 | news.ycombinator.com | 2024-01-24

EntityComponentSystemSamples

23 6,821 4.8 C#

Project mention: Added Physics to my enemies - Unity DOTS. Pretty happy with the result! Time to tweak | /r/Unity3D | 2023-05-25

Sure. This is the repo. It contains more than Physics

sonic

23 6,277 8.1 Assembly

A blazingly fast JSON serializing & deserializing library (by bytedance)

Project mention: How to Visualize and Analyze Data in Open Source Communities | dev.to | 2024-04-21

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
highway

66 3,645 9.8 C++

Performance-portable, length-agnostic SIMD with runtime dispatch

Project mention: Llamafile 0.7 Brings AVX-512 Support: 10x Faster Prompt Eval Times for AMD Zen 4 | news.ycombinator.com | 2024-03-31

The bf16 dot instruction replaces 6 instructions: https://github.com/google/highway/blob/master/hwy/ops/x86_12...

hora

9 2,554 0.0 Rust

🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .

Project mention: Building a Vector Database with Rust to Make Use of Vector Embeddings | /r/rust | 2023-06-01

We have been playing around with Hora as a replacement for the Rust-CV implementation as we want PQ as well. I'll check out instanct-distance, looks very interesting!

ispc

4 2,402 9.5 C++

Intel® Implicit SPMD Program Compiler

Project mention: Implementing a GPU's Programming Model on a CPU | news.ycombinator.com | 2023-10-14

This so-called GPU programming model has existed many decades before the appearance of the first GPUs, but at that time the compilers were not so good like the CUDA compilers, so the burden for a programmer was greater.
As another poster has already mentioned, there exists a compiler for CPUs which has been inspired by CUDA and which has been available for many years: ISPC (Implicit SPMD Program Compiler), at https://github.com/ispc/ispc .
NVIDIA has the very annoying habit of using a lot of terms that are different from those that have been previously used in computer science for decades. The worst is that NVIDIA has not invented new words, but they have frequently reused words that have been widely used with other meanings.
SIMT (Single-Instruction Multiple Thread) is not the worst term coined by NVIDIA, but there was no need for yet another acronym. For instance they could have used SPMD (Single Program, Multiple Data Stream), which dates from 1988, two decades before CUDA.
Moreover, SIMT is the same thing that was called "array of processes" by C.A.R. Hoare in August 1978 (in "Communicating Sequential Processes"), or "replicated parallel" by Occam in 1985 or "PARALLEL DO" by "OpenMP Fortran" in 1997-10 or "parallel for" by "OpenMP C and C++" in 1998-10.
The only (but extremely important) innovation brought by CUDA is that the compiler is smart enough so that the programmer does not need to know the structure of the processor, i.e. how many cores it has and how many SIMD lanes has each core. The CUDA compiler distributes automatically the work over the available SIMD lanes and available cores and in most cases the programmer does not care whether two executions of the function that must be executed for each data item are done on two different cores or on two different SIMD lanes of the same core.

ozz-animation

4 2,257 5.6 C++

Open source c++ skeletal animation library and toolset
simde

7 2,167 9.2 C

Implementations of SIMD instruction sets for systems which don't natively support them.

Project mention: The Case of the Missing SIMD Code | news.ycombinator.com | 2023-06-08

I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef

cglm

12 2,043 9.2 C

📽 Highly Optimized 2D / 3D Graphics Math (glm) for C
xsimd

3 2,036 8.7 C++

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))

Project mention: GDlog: A GPU-Accelerated Deductive Engine | news.ycombinator.com | 2023-12-03

https://github.com/xtensor-stack/xsimd
GH topics > HashMap:

zig-gamedev

55 1,976 9.7 C

Main monorepo for @zig-gamedev libs and example applications
Simd

1 1,971 9.6 C++

C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM. (by ermig1979)

Project mention: The Case of the Missing SIMD Code | news.ycombinator.com | 2023-06-08

I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef

StringZilla

14 1,776 9.8 C++

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖

Project mention: Measuring energy usage: regular code vs. SIMD code | news.ycombinator.com | 2024-02-19

The 3.5x energy-efficiency gap between serial and SIMD code becomes even larger when
A. you do byte-level processing instead of float words;
B. you use embedded, IoT, and other low-energy devices.
A few years ago I've compared Nvidia Jetson Xavier (long before the Orin release), Intel-based MacBook Pro with Core i9, and AVX-512 capable CPUs on substring search benchmarks.
On Xavier one can quite easily disable/enable cores and reconfigure power usage. At peak I got to 4.2 GB/J which was an 8.3x improvement in inefficiency over LibC in substring search operations. The comparison table is still available in the older README: https://github.com/ashvardanian/StringZilla/tree/v2.0.2?tab=...

XNNPACK

8 1,697 9.9 C

High-efficiency floating-point neural network inference operators for mobile, server, and Web

Project mention: Xnnpack: High-efficiency floating-point neural network inference operators | news.ycombinator.com | 2023-12-25

usearch

20 1,629 9.8 C++

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Project mention: USearch SQLite Extensions for Vector and Text Search | news.ycombinator.com | 2024-02-22

kfr

2 1,582 9.2 C++

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
faster

1 1,548 0.0 Rust

SIMD for humans (by AdamNiederer)
42_CheatSheet

3 1,542 0.0 C

A comprehensive guide to 50 years of evolution of strict C programming, a tribute to Dennis Ritchie's language
DirectXMath

13 1,481 6.8 C++

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Simd related posts

Glibc Buffer Overflow in Iconv
1 project | news.ycombinator.com | 21 Apr 2024
Tips on adding JSON output to your command line utility. (2021)
2 projects | news.ycombinator.com | 20 Apr 2024
Llamafile 0.7 Brings AVX-512 Support: 10x Faster Prompt Eval Times for AMD Zen 4
3 projects | news.ycombinator.com | 31 Mar 2024
1BRC Merykitty's Magic SWAR: 8 Lines of Code Explained in 3k Words
4 projects | news.ycombinator.com | 9 Mar 2024
Training great LLMs from ground zero in the wilderness as a startup
3 projects | news.ycombinator.com | 6 Mar 2024
JPEG XL and the Pareto Front
9 projects | news.ycombinator.com | 1 Mar 2024
Singeli: High-level interface for low-level programming
1 project | news.ycombinator.com | 27 Feb 2024
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Simd projects? This list will help you:

	Project	Stars
1	ncnn	19,176
2	simdjson	18,362
3	QuestDB	13,448
4	john	9,267
5	GLM	8,671
6	EntityComponentSystemSamples	6,821
7	sonic	6,277
8	highway	3,645
9	hora	2,554
10	ispc	2,402
11	ozz-animation	2,257
12	simde	2,167
13	cglm	2,043
14	xsimd	2,036
15	zig-gamedev	1,976
16	Simd	1,971
17	StringZilla	1,776
18	XNNPACK	1,697
19	usearch	1,629
20	kfr	1,582
21	faster	1,548
22	42_CheatSheet	1,542
23	DirectXMath	1,481