sse2neon
libsamplerate
sse2neon | libsamplerate | |
---|---|---|
7 | 1 | |
1,224 | 555 | |
1.2% | 0.2% | |
7.3 | 2.9 | |
14 days ago | 6 days ago | |
C++ | C | |
MIT License | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sse2neon
- sse2neon - A C/C++ header file that converts Intel SSE intrinsics to Aarch64 NEON intrinsic
- A C/C++ header file that converts Intel SSE intrinsics to Aarch64 NEON intrinsic
-
Porting Architecture Specific C/C++ Intrinsics to Graviton
The sse2neon project is a quick way to get C/C++ applications compiling and running on Graviton. The sse2neon header file provides NEON implementations for x64 intrinsics so no source code changes are needed. Each function call (intrinsic) is simply replaced with NEON instructions and will just work on Graviton.
-
An AWS Community Builder Story
To continue our collaboration I contributed some small changes to KasmVNC on GitHub to use sse2neon for a performance critical part of the application which uses SSE intrinsics and needed to be changed to NEON intrinsics.
-
Deserializing JSON Fast
I think the talk is very clearly laid out as an incremental journey, and each stepping stone involves contextual decision-making. I don't think Andreas is saying "you must end up with the SSE2 implementation at the end". Using machine-specific intrinsics is another dependency decision very similar to deciding to use a given library. I would have loved the talk and probably still thought of it and posted it, even if it ended before the intrinsics (but I think he does an excellent job at that part too).
And porting SSE2 to Neon is actually pretty easy -- if you use https://github.com/DLTcollab/sse2neon, IME it's very easy to do incrementally (or avoid or postpone indefinitely, depending on your needs).
-
PortableGL: An MIT licensed implementation of OpenGL 3.x-ish in clean C
I have a private cross-platform port, I’m waiting on the resolution of his latest GitHub issue to submit my changes. sse2neon (https://github.com/DLTcollab/sse2neon) was a big help - I also wrote a very primitive sse2scalar for raspbian builds where neon is unavailable. Honestly SIMD doesn’t help much, as you’re usually memory bound under SWGL. The biggest perf win is any amount of asynchronous execution - running off the main thread is good enough and could be applied to your library externally through a command buffer without any changes to your code.
-
Success porting VCV into aarch64 linux! (Usable on Android Devices)
You should go to /include/simd and download sse2neon.h into the folder. Replace appearing in any source files in that directory with "sse2neon.h". You will still encounter errors; remove the lines causing problems, typically containing the phrase ZERO_MODE. ARM processors does not require it.
libsamplerate
-
Success porting VCV into aarch64 linux! (Usable on Android Devices)
dep/libsamplerate is a problem because Rack is using the 0.1.9 version, which is very old and might not build very well depending on your system. I downloaded the 0.2.1 version here, built it by cmake, and copied the builtlibsamplerate.a into dep/lib. Note that you should remove the corresponding command lines in dep/Makefile if you're using this method. This step should be skipped if you are using armv7 device.
What are some alternatives?
yenten-arm-miner-yespowerr16 - ARM 64 CPU miner for Yespower variant algorithms
muon - GPU based Electron on a diet
KasmVNC - Modern VNC Server and client, web based and secure
libspng - Simple, modern libpng alternative
simde - Implementations of SIMD instruction sets for systems which don't natively support them.
Muon - A Micro (1k lines of code) Unit Test Framework for C/C++
Tow-Boot - An opinionated distribution of U-Boot. — https://matrix.to/#/#Tow-Boot:matrix.org?via=matrix.org
libsndfile - A C library for reading and writing sound files containing sampled audio data.
cglm - 📽 Highly Optimized 2D / 3D Graphics Math (glm) for C
C_Project - Friendly Template for C Projects
android-inline-hook - :fire: ShadowHook is an Android inline hook library which supports thumb, arm32 and arm64.
FFmpeg - Mirror of https://git.ffmpeg.org/ffmpeg.git with CMake support in the feature/cmake-build branch.