The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 Openmp Open-Source Projects
-
john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
-
Kratos
Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface. (by KratosMultiphysics)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
weave
A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead (by mratsim)
-
Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
-
mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
-
laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers (by mratsim)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
John The Ripper
If you are talking about non-small matrix multiplication in MKL, is now in opensource as a part of oneDNN. It literally has exactly the same code, as in MKL (you can see this by inspecting constants or doing high-precision benchmarks).
For small matmul there is libxsmm. It may take tremendous efforts make something faster than oneDNN and libxsmm, as jit-based approach of https://github.com/oneapi-src/oneDNN/blob/main/src/gpu/jit/g... is too flexible: if someone finds a better sequence, oneDNN can reuse it without major change of design.
But MKL is not limited to matmul, I understand it...
Project mention: Creando Subtítulos Automáticos para Vídeos con Python, Faster-Whisper, FFmpeg, Streamlit, Pillow | dev.to | 2024-04-29
Nice article. We did a demo for booting to LLM and also as Kernel Module: https://github.com/trholding/llama2.c The whole things was funny and buggy, but since then we have been developing in stealth, even trying to raise VC capital. Our goal is to make computers like a buddy to whom you can talk to and explain things and get work done, kinda like a Jarvis. The way we interact with computers haven't changed for decades, its time to disrupt that to get more productivity. I also believe with this approach one can avoid installing different applications, when the computer (models) emulate activities done through applications. For example, cutting and pasting a dog from a dog photo onto a banner for a dog racing competition would not require you to be a graphics artist nor use tools like photshop / gimp. You could tell the computer and it would use segment anything to cut the dog, use Text and SD for banner text and bg paste the dog, seek your approval, search for the fastest, best and cheapest banner printing service and submit it. 10 years ago this could have been sci-fi, but now it is a possibility. Just need to connect the dots, package and polish it to make it a good product.
It is a small DSL written using macros at https://github.com/mratsim/Arraymancer/blob/master/src/array....
Nim has pretty great meta-programming capabilities and arraymancer employs some cool features like emitting cuda-kernels on the fly using standard templates depending on backend !
Project mention: XFaaS: Hyperscale and Low Cost Serverless Functions at Meta | news.ycombinator.com | 2024-01-31
Project mention: The GIL can now be disabled in Python's main branch | news.ycombinator.com | 2024-03-11
P.S. Im not trying some incredible debate here. Its just that I payed 60$ a month on hosted tabs since 2014, and Im trying to do the same type of advertisement but where should I advertise my server if open.mp is dead?
Project mention: Portable and vendor neutral parallel programming on heterogeneous platforms | news.ycombinator.com | 2024-04-11
It works fine in the latest stable release of Darktable. You just need the definition for the camera in cameras.xml. Support for the A6700 was already added in the development branch but there hasn't been a new stable release yet since. Fortunately, simply downloading and replacing the file (which you can get from the dev branch on GitHub) suffices.
It depends.
You need 2~3 accumulators to saturate instruction-level parallelism with a parallel sum reduction. But the compiler won't do it because it only creates those when the operation is associative, i.e. (a+b)+c = a+(b+c), which is true for integers but not for floats.
There is an escape hatch in -ffast-math.
I have extensive benches on this here: https://github.com/mratsim/laser/blob/master/benchmarks%2Ffp...
Openmp related posts
- Benchmarking 20 programming languages on N-queens and matrix multiplication
- Support for a6700?
- Support for Nikon Z8
- A7R V RAW Support
- Anyone know a more efficient method to use for cracking this zip?
- bkcrack - Crack legacy zip encryption with Biham and Kocher's known plaintext attack.
- How does one crack a ZIP file password when you have access to one of the files from the ZIP file?
-
A note from our sponsor - WorkOS
workos.com | 29 Apr 2024
Index
What are some of the best open-source Openmp projects? This list will help you:
Project | Stars | |
---|---|---|
1 | john | 9,267 |
2 | oneDNN | 3,461 |
3 | CTranslate2 | 2,799 |
4 | llama2.c | 1,382 |
5 | Arraymancer | 1,307 |
6 | stdgpu | 1,085 |
7 | Kratos | 958 |
8 | faasm | 837 |
9 | amgcl | 701 |
10 | weave | 523 |
11 | Kernels | 401 |
12 | open.mp | 398 |
13 | muparser | 391 |
14 | occa | 379 |
15 | ginkgo | 373 |
16 | mixbench | 339 |
17 | rawspeed | 338 |
18 | pyccel | 331 |
19 | alpaka | 324 |
20 | BabelStream | 310 |
21 | primecount | 302 |
22 | mcmap | 300 |
23 | laser | 261 |
Sponsored