The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 18 C++ Cpu Projects
-
I don't know how often it's a problem, but I work for a company doing software video encoding, and we always fill up all the dimm slots on servers to have as much bandwidth as possible, even if we have only really use maybe 1/4 of the RAM.
I'm not sure any of the standard Linux tools can show you memory bandwidth usage easily (maybe perf), I know we use Intel PCM (https://github.com/intel/pcm) and AMDuProfPCM (https://www.amd.com/en/developer/uprof.html)
-
Looks like something that's still to be added to https://github.com/google/cpu_features/blob/main/src/impl_riscv_linux.c
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖
Project mention: Measuring energy usage: regular code vs. SIMD code | news.ycombinator.com | 2024-02-19The 3.5x energy-efficiency gap between serial and SIMD code becomes even larger when
A. you do byte-level processing instead of float words;
B. you use embedded, IoT, and other low-energy devices.
A few years ago I've compared Nvidia Jetson Xavier (long before the Orin release), Intel-based MacBook Pro with Core i9, and AVX-512 capable CPUs on substring search benchmarks.
On Xavier one can quite easily disable/enable cores and reconfigure power usage. At peak I got to 4.2 GB/J which was an 8.3x improvement in inefficiency over LibC in substring search operations. The comparison table is still available in the older README: https://github.com/ashvardanian/StringZilla/tree/v2.0.2?tab=...
-
-
Project mention: For those interested in learning how to build a Language Identification solution using PyTorch, check out my article. | /r/learnmachinelearning | 2023-04-28
Link to code sample: https://github.com/oneapi-src/oneAPI-samples/tree/master/AI-and-Analytics/End-to-end-Workloads/LanguageIdentification
-
-
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
-
Project mention: Watch out AMD: Intel Arc A580 could be the next great affordable GPU | news.ycombinator.com | 2023-08-06
Intel already has a working GPGPU stack, using oneAPI/SYCL.
They also have arguably pretty good OpenCL support, as well as downstream support for PyTorch and Tensorflow using their custom extensions https://github.com/intel/intel-extension-for-tensorflow and https://github.com/intel/intel-extension-for-pytorch which are actively developed and just recently brought up-to-date with upstream releases.
-
-
-
Project mention: I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now. | /r/FlutterDev | 2023-12-08
And more stuff I’m often checking back on: - https://github.com/staghado/vit.cpp - https://github.com/serp-ai/bark-with-voice-clone - https://github.com/leejet/stable-diffusion.cpp (generate images) - etc … there’s too much fun stuff out there. Wish I had more free time haha.
-
-
-
-
One of the reasons I was able to do it is a technique of hardware validation. I used an Arduino controlling an 8088 to validate that my CPU core was accurate on a cycle by cycle basis. Without that and the recent decoding of the 8088 microcode by reenigne, this wouldn't have been possible.
-
-
VectorizedKernel
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
C++ Cpu related posts
- Measuring energy usage: regular code vs. SIMD code
- 500 Lines or Less – Writing a useful program in fewer than 500 line code – AOSA
- Stringzilla: 10x Faster SIMD-accelerated String Class
- Stringzilla: 10x faster SIMD-accelerated Python `str` class
- Vision Processing Platform-JH7110
- Is x86 really that bad?
- What performance monitors do you use?
-
A note from our sponsor - WorkOS
workos.com | 28 Mar 2024
Index
What are some of the best open-source Cpu projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | pcm | 2,494 |
2 | cpu_features | 2,361 |
3 | StringZilla | 1,660 |
4 | thor-os | 1,599 |
5 | oneAPI-samples | 810 |
6 | Astro8-Computer | 660 |
7 | oneMKL | 558 |
8 | hwinfo | 357 |
9 | intel-extension-for-tensorflow | 302 |
10 | etl | 210 |
11 | monolish | 189 |
12 | vit.cpp | 163 |
13 | CPURasterizer | 155 |
14 | c2clat | 104 |
15 | peakperf | 56 |
16 | arduino_8088 | 26 |
17 | 8-bit-computer-emulator | 18 |
18 | VectorizedKernel | 7 |