SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 GPU Open-Source Projects
-
Project mention: Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go? | dev.to | 2026-05-28
pytorch/pytorch#173382 - a user calls torch.cuda.empty_cache() after deleting tensors, but GPU memory stays allocated. The caching allocator's empty_cache() only releases blocks it has marked as free, but the user sees a persistent gap between "allocated" and "reserved" memory. We traced what happens when torch cuda empty cache runs on an RTX 4090 and measured exactly how much GPU memory it reclaims.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Ligatures are a renderer issue, so using alacritty as a lib wouldn't have this issue (it does demonstrate their hardline stance). Another example that would translate is how long it took them to support disambiguation of key combinations: https://github.com/alacritty/alacritty/issues/6378 (2019-2023). Of course, the maintainers are free to do whatever they want with the project - but such things do make alacritty-as-a-lib an exceptionally bad choice for situations where you want things to just work.
-
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Project mention: AWS SageMaker HyperPod: Distributed Training for Foundation Models at Scale | dev.to | 2026-01-16DeepSpeed Optimization Library - An open-source library compatible with HyperPod that offers advanced pipeline and system optimizations for LLM training.
-
Project mention: Ask HN: Is your Mac's menu bar throwing errors? | news.ycombinator.com | 2026-04-19
https://github.com/exelban/stats/issues/2829
The log seems to be thrown from somewhere deep inside MacOS' windowserver. Crack open your console app and see if you also see it anywhere in the logs if you have an app that has a menu bar icon on macos.
-
Project mention: Ask HN: Why has Taichi-Lang's development frequency declined? | news.ycombinator.com | 2026-04-28
-
Project mention: GSoC 2026 Predictions: 30 NEW AI/ML/Security Organizations You Should Start Contributing to NOW! | dev.to | 2026-02-06
fastai: https://github.com/fastai/fastai ⭐ 26k+
-
FanControl.Releases
This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.
-
Project mention: We Built the First Pure Go DXIL Generator — Because Optimizing the Wrong Path Wasn't Enough | dev.to | 2026-04-05
Rust's naga shader compiler has had an open issue for DXIL backend since 2020. Six years later, it's still marked as future work. The Rust team called it "a lot of work and a long way away."
-
-
-
Project mention: Electron-based apps cause system-wide lag on macOS 26 Tahoe | news.ycombinator.com | 2025-09-25
-
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
-
The most popular library for point cloud processing in Python is Open3D.
-
g-helper
Lightweight Armoury Crate alternative for Asus laptops with nearly the same functionality. Works with ROG Zephyrus, Flow, TUF, Strix, Scar, ProArt, Vivobook, Zenbook, Expertbook, ROG Ally, and many more.
Have your friend grab https://github.com/seerge/g-helper which can disable armory crate. It’s also a lot lighter on your system - I was having constant gradual frame drops (games would start find and performance would slowly degrade) until I tried this and used the option to disable the AC processes.
-
-
scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Project mention: Show HN: Similarity = cosine(your_GitHub_stars, Karpathy) Client-side | news.ycombinator.com | 2026-01-06 -
john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Project mention: John the Ripper jumbo – advanced offline password cracker | news.ycombinator.com | 2025-09-09 -
-
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)
-
Project mention: Zml-smi: universal monitoring tool for GPUs, TPUs and NPUs | news.ycombinator.com | 2026-04-04
-
skypilot
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
Project mention: Skypilot: Run, manage, and scale AI workloads on any AI infrastructure | news.ycombinator.com | 2026-01-30 -
Project mention: Dell's version of the DGX Spark fixes pain points | news.ycombinator.com | 2026-01-01
I'm telling your it works now. It's just not called `tcgen05`.
Put this in nsight compute: https://github.com/NVIDIA/cutlass/blob/main/examples/79_blac...
(I said 83, it's 79).
If you want to know what NVIDIA really thinks, watch this repo: https://github.com/nVIDIA/fuser. The Polyhedral Wizards at play. All the big not-quite-Fields players are splashing around there. I'm doing lean4 proofs of a bunch of their stuff. https://v0-straylight-papers-touchups.vercel.app
It works now. It's just not the PTX mnemonic that you want to see.
-
GPU discussion
GPU related posts
-
Show HN: Piqc – An open-source GPU waste scanner for LLM inference clusters
-
Show HN: I embedded 685M public texts in 32 minutes (on 8x A100, Rust, TensorRT)
-
Use your Nvidia GPU's VRAM as swap space on Linux
-
Notes on CUDA Tensor Core GEMM (WMMA)
-
Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?
-
Show HN: Shaderbang – Shebang for Shaders
-
Erin Brockovich made a map to track data centers around the country
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 Jun 2026
Index
What are some of the best open-source GPU projects? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | Pytorch | 100,659 |
| 2 | alacritty | 64,514 |
| 3 | DeepSpeed | 42,495 |
| 4 | stats | 39,365 |
| 5 | taichi | 28,244 |
| 6 | fastai | 28,021 |
| 7 | FanControl.Releases | 19,747 |
| 8 | wgpu | 17,298 |
| 9 | WebGL-Fluid-Simulation | 16,303 |
| 10 | gpu.js | 15,369 |
| 11 | neovide | 15,057 |
| 12 | Deeplearning4j | 14,235 |
| 13 | Open3D | 13,686 |
| 14 | g-helper | 13,625 |
| 15 | tvm | 13,460 |
| 16 | scalene | 13,451 |
| 17 | john | 13,249 |
| 18 | cupy | 10,995 |
| 19 | server | 10,749 |
| 20 | nvtop | 10,716 |
| 21 | skypilot | 10,066 |
| 22 | cutlass | 9,882 |
| 23 | pycaret | 9,744 |