Top 23 high-performance-computing Open-Source Projects
-
FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
-
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
-
kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
-
drake
An R-focused pipeline toolkit for reproducibility and high-performance computing (by ropensci)
-
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
-
AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
-
envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
-
preCICE
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
-
prometeo
An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing
For parallel programming nowadays, personally I reach for C++ Taskflow when I really care about performance, or a mix of core.async and running multiple load-balanced instances when I'm doing more traditional web backend stuff in Clojure.
Project mention: tf-quant-finance: NEW Derivatives and Hedging - star count:3911.0 | /r/algoprojects | 2023-06-10
> So long as Pytorch only practically works with Nvidia GPUs, everything else is little more than a rounding error.
This is changing.
https://github.com/merrymercy/awesome-tensor-compilers
There are more and better projects that can compile an existing PyTorch codebase into a more optimized format for a range of devices. Triton (which is part of PyTorch), TVM, and the MLIR-based efforts (like torch-MLIR or IREE) are big ones, but there are smaller fish like GGML and Tinygrad, or more narrowly focused projects like Meta's AITemplate (which works on AMD datacenter GPUs).
Hardware is in a strange place now... It feels like everyone but Cerebras and AMD/Intel was squeezed out, but with all the money pouring in, I think this is temporary.
Project mention: Faer-rs: Linear algebra foundation for the Rust programming language | news.ycombinator.com | 2024-04-24
BLIS is an interesting new direction in that regard: https://github.com/flame/blis
>The BLAS-like Library Instantiation Software (BLIS) framework is a new infrastructure for rapidly instantiating Basic Linear Algebra Subprograms (BLAS) functionality. Its fundamental innovation is that virtually all computation within level-2 (matrix-vector) and level-3 (matrix-matrix) BLAS operations can be expressed and optimized in terms of very simple kernels.
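To make the "very simple kernels" idea in the quote above concrete, here is a minimal sketch in pure Python. It is not BLIS code and does not use any BLIS API: the names `micro_kernel`, `gemm`, `MR` and `NR` are hypothetical, and the tile sizes are arbitrary. It only illustrates how a level-3 operation (matrix-matrix multiply) can be written as an outer blocking loop around one small kernel that updates a tile of the result with a sequence of rank-1 updates.

```python
# Toy illustration only. Real BLIS micro-kernels are hand-tuned C/assembly;
# every name and tile size here is an assumption made for this sketch.
MR, NR = 2, 2  # hypothetical micro-tile dimensions


def micro_kernel(A, B, C, i0, j0, k_len, mr, nr):
    """Update the mr x nr tile of C at (i0, j0) via k_len rank-1 updates."""
    for p in range(k_len):
        for i in range(mr):
            a = A[i0 + i][p]
            for j in range(nr):
                C[i0 + i][j0 + j] += a * B[p][j0 + j]


def gemm(A, B):
    """Compute C = A * B by tiling C and calling the micro-kernel per tile."""
    m, k, n = len(A), len(B), len(B[0])
    C = [[0.0] * n for _ in range(m)]
    for i0 in range(0, m, MR):
        for j0 in range(0, n, NR):
            # Edge tiles may be smaller than MR x NR.
            micro_kernel(A, B, C, i0, j0, k, min(MR, m - i0), min(NR, n - j0))
    return C


if __name__ == "__main__":
    A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    B = [[1, 0], [0, 1], [1, 1]]
    for row in gemm(A, B):
        print(row)
```

In this arrangement only `micro_kernel` would need to be specialized per architecture, while the blocking loops stay generic. That is the general shape of the design the quote describes, shown here without any of the packing or cache blocking a real implementation relies on.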
The only way I can foresee a cryptocoin actually holding value is if spending the coin meant spending processing cycles and RAM doing things like this:
https://en.wikipedia.org/wiki/List_of_volunteer_computing_pr...
But in a more general sense, less like https://boinc.berkeley.edu/ and more like AWS...
It's the only way to have value, actually holding computing power in a distributed network.
Project mention: Mirage - A programming framework for building type-safe, modular systems | news.ycombinator.com | 2023-11-23
The rough Rust equivalent of this is https://github.com/hermit-os/hermit-rs
Though last I looked it wasn't nearly as mature as MirageOS.
It is a small DSL written using macros at https://github.com/mratsim/Arraymancer/blob/master/src/array....
Nim has pretty great meta-programming capabilities, and Arraymancer employs some cool features like emitting CUDA kernels on the fly using standard templates, depending on the backend!
Project mention: Build Your Own Docker with Linux Namespaces, Cgroups, and Chroot | news.ycombinator.com | 2023-06-26
Project mention: What Every Developer Should Know About GPU Computing | news.ycombinator.com | 2023-10-21
Sapphire Rapids is a CPU.
AMD's primary focus for a GPU software ecosystem these days seems to be implementing CUDA with s/cuda/hip, so AMD directly supports and encourages running GPU software written in CUDA on AMD GPUs.
The only implementation for sycl on AMD GPUs that I can find is a hobby project that apparently is not allowed to use either the 'hip' or 'sycl' names. https://github.com/AdaptiveCpp/AdaptiveCpp
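The `s/cuda/hip` remark above refers to the fact that many HIP runtime calls mirror their CUDA counterparts with a different prefix. As a rough sketch of that idea only, the hypothetical `toy_hipify` helper below renames identifiers such as `cudaMalloc` to `hipMalloc`. It is not AMD's porting tooling and covers only the prefix rename: header names, kernel launch syntax and library calls are untouched.

```python
import re

# Matches the "cuda" prefix of camel-case runtime identifiers such as
# cudaMalloc or cudaMemcpyHostToDevice. Headers like cuda_runtime.h are
# deliberately left alone, since they do not map by a simple rename.
_CUDA_PREFIX = re.compile(r"\bcuda(?=[A-Z])")


def toy_hipify(source: str) -> str:
    """Rename cudaXxx identifiers to hipXxx (hypothetical helper, toy only)."""
    return _CUDA_PREFIX.sub("hip", source)


if __name__ == "__main__":
    snippet = "cudaMalloc(&d, n); cudaMemcpy(d, h, n, cudaMemcpyHostToDevice);"
    print(toy_hipify(snippet))
```

A real port needs more than this textual pass, so treat the snippet as an illustration of the naming convention the comment alludes to, not as a migration tool.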
Project mention: How do I improve my SB3 PPO on an EnvPool environment | /r/reinforcementlearning | 2023-09-21
I am looking to improve the overall performance as well as optimize the wall clock time. I slightly modified the code to develop a SB3 wrapper for envpool from here.
Project mention: Vectorious: BLAS-accelerated linear algebra for TypeScript | /r/hypeurls | 2023-06-02
Project mention: The Law of Large Numbers, or Why It Is a Bad Idea to Go to the Casino | news.ycombinator.com | 2023-05-14
It was actually invented for this.
Open source radiation transport Monte Carlo code here if you'd like to play around:
https://github.com/openmc-dev/openmc
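For readers who want to see the law of large numbers at work in a transport setting before opening a full code, here is a minimal Monte Carlo sketch. It does not use OpenMC or its API. It assumes a purely absorbing slab with a constant interaction rate `sigma` (a hypothetical parameter name), samples each particle's distance to first collision from an exponential distribution, and estimates the fraction that crosses the slab without colliding. The exact answer for this toy problem is `exp(-sigma * thickness)`.

```python
import math
import random


def transmission_fraction(sigma, thickness, n_particles, seed=0):
    """Estimate the uncollided fraction crossing a slab (toy model)."""
    rng = random.Random(seed)
    passed = 0
    for _ in range(n_particles):
        # Distance to first collision is exponential with rate sigma.
        if rng.expovariate(sigma) > thickness:
            passed += 1
    return passed / n_particles


def exact_transmission(sigma, thickness):
    """Analytic uncollided transmission for the same toy slab."""
    return math.exp(-sigma * thickness)


if __name__ == "__main__":
    sigma, thickness = 2.0, 1.0  # arbitrary illustrative values
    exact = exact_transmission(sigma, thickness)
    for n in (100, 10_000, 1_000_000):
        estimate = transmission_fraction(sigma, thickness, n)
        print(f"n={n:>9}  estimate={estimate:.5f}  error={abs(estimate - exact):.5f}")
```

The statistical error of the estimate shrinks roughly in proportion to 1/sqrt(n) as the particle count grows, which is the same convergence behaviour the linked article discusses for casino bets.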
Project mention: Looking for HLS frameworks to start deploying DL algorithms on FPGAs | /r/FPGA | 2023-06-20
high-performance-computing related posts
- Bitcoin Block 840000
- Benchmarking 20 programming languages on N-queens and matrix multiplication
- Metal-flash-attention: Faster alternative to Metal Performance Shaders
- Gemini is only 1x Chinchilla, so it undertrained for production
- Hermit: a Rust-based, lightweight unikernel
- Have you ever donated your computing power with BOINC? Take 5 minutes to fill out the 2023 BOINC Census!
- Berkeley Open Infrastructure for Network Computing
www.saashub.com | 26 Apr 2024
Index
What are some of the best open-source high-performance-computing projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | Taskflow | 9,552 |
2 | metaflow | 7,586 |
3 | tf-quant-finance | 4,265 |
4 | FluidX3D | 3,193 |
5 | alpa | 2,983 |
6 | awesome-tensor-compilers | 2,167 |
7 | blis | 2,091 |
8 | boinc | 1,915 |
9 | kokkos | 1,718 |
10 | hermit-rs | 1,589 |
11 | mfem | 1,537 |
12 | drake | 1,330 |
13 | Arraymancer | 1,304 |
14 | kernel | 1,057 |
15 | neanderthal | 1,043 |
16 | AdaptiveCpp | 1,040 |
17 | envpool | 1,012 |
18 | vectorious | 911 |
19 | targets | 866 |
20 | openmc | 696 |
21 | preCICE | 669 |
22 | prometeo | 610 |
23 | dace | 465 |