SaaSHub helps you find the best software and product alternatives Learn more β
Top 23 GPU Open-Source Projects
-
I tried to do something similar with 'first-class' dimension objects in PyTorch https://github.com/pytorch/pytorch/blob/main/functorch/dim/R... .
-
InfluxDB
InfluxDB β Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Terminal emulator Alacritty, for instance. The version in the Debian Stable repo is datedβ Alacritty of this version uses a .yml config file, but newer versions have switched to .toml.
-
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Project mention: DeepSpeed-Domino: Communication-Free LLM Training Engine | news.ycombinator.com | 2024-11-26 -
Project mention: Stats β macOS system monitor in your menu bar | news.ycombinator.com | 2025-01-30
-
Project mention: Taichi: Productive, portable, and performant GPU programming in Python | news.ycombinator.com | 2024-08-20
-
Project mention: Integrate Hugging Face Spaces & Gradio with a React application | dev.to | 2024-10-12
We are going to use fast.ai to create a simple classification which predicts whether a given image is a cat or not.
-
FanControl.Releases
This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
Project mention: Ask HN: Resources for General Purpose GPU development on Apple's M* chips? | news.ycombinator.com | 2024-12-25
People have already mentioned Metal, but if you want cross platform, https://github.com/gfx-rs/wgpu has a vulkan-like API and cross compiles to all the various GPU frameworks. I believe it uses https://github.com/KhronosGroup/MoltenVK to run on Macs. You can also see the metal shader transpilation results for debugging.
-
Have you looked at Neovide? It's very similar to this, a cross-platform Rust based front end for neovim. Neovide is really quite nice it's beautifully rendered.
https://neovide.dev
My primary use case for Neovide (vs neovim running in Wezterm/kitty/Alacritty) is simply so I can Alt/Cmd-TAB to it quickly as I usually run with many terminal windows open and it's somewhat of a chore to locate the one with nvim in it, whereas with Neovide I have a separate app with an icon in the dock/sysmenu to switch to.
-
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
-
scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
This has been a feature of the Scalene Python profiler (https://github.com/plasma-umass/scalene) for some time (at this point, about 1.5 years) - bring your own API key for OpenAI / Azure / Bedrock, also works with Ollama. Optimizing Python code to use NumPy or other similar native libraries can easily yield multiple order of magnitude improvements in real-world settings. We tried it on several of the success stories of Scalene (before the integration with LLMs); see https://github.com/plasma-umass/scalene/issues/58 - and found that it often automatically yielded the same or better optimizations - see https://github.com/plasma-umass/scalene/issues/554. (Full disclosure: I am one of the principal designers of Scalene.)
-
-
-
john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Project mention: π‘οΈ Top 10 Free Penetration Testing Tools Every Security Team Should Use in 2025 | dev.to | 2025-04-28An old-school tool that's still going strong. John the Ripper is a fast and flexible password cracker.
-
The plethora of packages, including DSLs for compute and MLIR.
https://developer.nvidia.com/how-to-cuda-python
https://cupy.dev/
-
g-helper
Lightweight Armoury Crate alternative for Asus laptops and ROG Ally. Control tool for ROG Zephyrus G14, G15, G16, M16, Flow X13, Flow X16, TUF, Strix, Scar and other models
Project mention: G-Helper: Open-Source Armoury Crate Alternative for Asus Laptops and ROG Ally | news.ycombinator.com | 2024-08-25 -
-
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)
This is very interesting but many of the motivations listed are far better served with alternate approaches.
For "remote" model training there is NCCL + Deepspeed/FSDP/etc. For remote inferencing there are solutions like Triton Inference Server[0] that can do very high-performance hosting of any model for inference. For LLMs specifically there are nearly countless implementations.
That said, the ability to use this for testing is interesting but I wonder about GPU contention and as others have noted the performance of such a solution will be terrible even with relatively high speed interconnect (100/400gb ethernet, etc).
NCCL has been optimized to support DMA directly between network interfaces and GPUs which is of course considerably faster than solutions like this. Triton can also make use of shared memory, mmap, NCCL, MPI, etc which is one of the many tricks it uses for very performant inference - even across multiple chassis over another network layer.
[0] - https://github.com/triton-inference-server/server
-
-
Project mention: Unleashing GPU Power: Supercharge Your Data Processing with cuDF | dev.to | 2024-06-21
cuDF Documentation
-
ImageAI
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
GPU discussion
GPU related posts
-
GPU Glossary: GPU documentation for humans
-
GPU Glossary: GPU documentation for humans
-
FlashMoE: DeepSeek-R1 671B and Qwen3MoE 235B with 1~2 Intel B580 GPU in IPEX-LLM
-
Launch HN: ParaQuery (YC X25) β GPU Accelerated Spark/SQL
-
DeepLearning4j Blockchain Integration: Convergence of AI, Blockchain, and Open Source Funding
-
Implementing a Struct of Arrays
-
AMD GPU Programming in Julia
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 May 2025
Index
What are some of the best open-source GPU projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | Pytorch | 89,871 |
2 | alacritty | 58,847 |
3 | DeepSpeed | 38,430 |
4 | stats | 31,364 |
5 | taichi | 27,101 |
6 | fastai | 26,948 |
7 | FanControl.Releases | 16,125 |
8 | WebGL-Fluid-Simulation | 15,420 |
9 | gpu.js | 15,190 |
10 | wgpu | 14,156 |
11 | neovide | 13,994 |
12 | Deeplearning4j | 13,959 |
13 | scalene | 12,653 |
14 | Open3D | 12,300 |
15 | tvm | 12,286 |
16 | john | 11,321 |
17 | cupy | 10,204 |
18 | g-helper | 9,689 |
19 | pycaret | 9,326 |
20 | server | 9,188 |
21 | nvtop | 9,101 |
22 | cudf | 8,917 |
23 | ImageAI | 8,788 |