sse2neon vs simde

sse2neon

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation (by DLTcollab)

Source Code

Suggest alternative

Edit details

simde

Implementations of SIMD instruction sets for systems which don't natively support them. (by simd-everywhere)

simd-intrinsics Sse Neon Arm Avx Simd sse2 sse3 ssse3 Sse41 sse42 Avx2 Avx512 fma gfni mmx altivec powerpc Arm64 Vectorization

Source Code

simd-everywhere.github.io

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

sse2neon		simde
	Project
7	Mentions	7
1,224	Stars	2,171
1.2%	Growth	1.5%
7.3	Activity	9.1
14 days ago	Latest Commit	7 days ago
C++	Language	C
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

sse2neon

Posts with mentions or reviews of sse2neon. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-02-11.

sse2neon - A C/C++ header file that converts Intel SSE intrinsics to Aarch64 NEON intrinsic
1 project | /r/CKsTechNews | 26 Dec 2022
A C/C++ header file that converts Intel SSE intrinsics to Aarch64 NEON intrinsic
1 project | news.ycombinator.com | 26 Dec 2022
Porting Architecture Specific C/C++ Intrinsics to Graviton
4 projects | dev.to | 11 Feb 2022

The sse2neon project is a quick way to get C/C++ applications compiling and running on Graviton. The sse2neon header file provides NEON implementations for x64 intrinsics so no source code changes are needed. Each function call (intrinsic) is simply replaced with NEON instructions and will just work on Graviton.
An AWS Community Builder Story
3 projects | dev.to | 11 Jan 2022

To continue our collaboration I contributed some small changes to KasmVNC on GitHub to use sse2neon for a performance critical part of the application which uses SSE intrinsics and needed to be changed to NEON intrinsics.
Deserializing JSON Fast
3 projects | news.ycombinator.com | 1 Jan 2022

I think the talk is very clearly laid out as an incremental journey, and each stepping stone involves contextual decision-making. I don't think Andreas is saying "you must end up with the SSE2 implementation at the end". Using machine-specific intrinsics is another dependency decision very similar to deciding to use a given library. I would have loved the talk and probably still thought of it and posted it, even if it ended before the intrinsics (but I think he does an excellent job at that part too).
And porting SSE2 to Neon is actually pretty easy -- if you use https://github.com/DLTcollab/sse2neon, IME it's very easy to do incrementally (or avoid or postpone indefinitely, depending on your needs).
PortableGL: An MIT licensed implementation of OpenGL 3.x-ish in clean C
4 projects | /r/GraphicsProgramming | 1 Oct 2021

I have a private cross-platform port, I’m waiting on the resolution of his latest GitHub issue to submit my changes. sse2neon (https://github.com/DLTcollab/sse2neon) was a big help - I also wrote a very primitive sse2scalar for raspbian builds where neon is unavailable. Honestly SIMD doesn’t help much, as you’re usually memory bound under SWGL. The biggest perf win is any amount of asynchronous execution - running off the main thread is good enough and could be applied to your library externally through a command buffer without any changes to your code.
Success porting VCV into aarch64 linux! (Usable on Android Devices)
3 projects | /r/vcvrack | 13 Mar 2021

You should go to /include/simd and download sse2neon.h into the folder. Replace appearing in any source files in that directory with "sse2neon.h". You will still encounter errors; remove the lines causing problems, typically containing the phrase ZERO_MODE. ARM processors does not require it.

simde

Posts with mentions or reviews of simde. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-08.

The Case of the Missing SIMD Code
7 projects | news.ycombinator.com | 8 Jun 2023

I was curious about these libraries a few weeks ago and did some searching. Is there one that's got a clearly dominating set of users or contributors?
I don't know what a good way to compare these might be, other than perhaps activity/contributor count.
[1] https://github.com/simd-everywhere/simde
[2] https://github.com/ermig1979/Simd
[3] https://github.com/google/highway
[4] https://gitlab.com/libeigen/eigen
[5] https://github.com/shibatch/sleef
Rise: Accelerate the Development of Open Source Software for RISC-V
5 projects | news.ycombinator.com | 31 May 2023

I note that SIMDe doesn't have RISC-V support yet (but it does support Loongson LoongArch):
https://github.com/simd-everywhere/simde/
There are still a ton of things to do to get the Debian riscv64 port going too:
https://wiki.debian.org/PortsDocs/New
SIMD intrinsics and the possibility of a standard library solution
16 projects | /r/cpp | 8 Jan 2023
Portable SIMD library
3 projects | /r/C_Programming | 15 Nov 2022

SIMDe is everything you're after: https://github.com/simd-everywhere/simde
SIMD Everywhere – SIMD intrinsics on hardware which doesn't support them
1 project | news.ycombinator.com | 5 Sep 2022
Making Your Own Tools
2 projects | news.ycombinator.com | 15 May 2021

> low level code that can run on multiple hardware architectures
I thought SIMD Everywhere was a pretty interesting project for that, lets you write x86 SSE/AVX code and run it on non-x86 architectures:
https://github.com/simd-everywhere/simde
Adobe Photoshop Ships on Macs Apple Silicon/M1 – 50% Faster
3 projects | news.ycombinator.com | 12 Mar 2021

> architecture-specific features such as SSE/AVX which is not portable.
I don’t have hands-on experience, but somewhere on HN I saw this: https://github.com/simd-everywhere/simde If starting a new cross-platform project today, I would try that library first, before doing the usual intrinsics.

What are some alternatives?

When comparing sse2neon and simde you can also consider the following projects:

yenten-arm-miner-yespowerr16 - ARM 64 CPU miner for Yespower variant algorithms

nsimd - Agenium Scale vectorization library for CPUs and GPUs

KasmVNC - Modern VNC Server and client, web based and secure

android-inline-hook - :fire: ShadowHook is an Android inline hook library which supports thumb, arm32 and arm64.

Tow-Boot - An opinionated distribution of U-Boot. — https://matrix.to/#/#Tow-Boot:matrix.org?via=matrix.org

libsimdpp - Portable header-only C++ low level SIMD library

libsamplerate - An audio Sample Rate Conversion library

Sparkle - A software update framework for macOS

cglm - 📽 Highly Optimized 2D / 3D Graphics Math (glm) for C

picoRTOS - Very small, lightning fast, yet portable RTOS with SMP suppport

simdutf - Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension. Part of Node.js and Bun.

sse2neon vs yenten-arm-miner-yespowerr16 simde vs nsimd sse2neon vs KasmVNC simde vs android-inline-hook sse2neon vs Tow-Boot simde vs libsimdpp sse2neon vs libsamplerate simde vs Sparkle sse2neon vs cglm simde vs picoRTOS sse2neon vs android-inline-hook simde vs simdutf

Compare sse2neon vs simde and see what are their differences.

sse2neon

simde

sse2neon

simde

What are some alternatives?