Top 23 GPU Open-Source Projects

  • Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

  • Project mention: The Programmer's Brain | | 2024-06-17

    It's not just a problem when you are an amateur. This is sth that every project should provide.

    But there are also many projects which do. Sometimes you need to search a bit for it. Actually I would expect that most big projects have such documentation somewhere in some form.





    And then for some popular projects you will also find some independent overviews:

    - (and many more on



    One problem is of course that those documents can be outdated and don't go into much details. But they still will give you important insights and should be a good starting point.

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • alacritty

    A cross-platform, OpenGL terminal emulator.

  • Project mention: Alacritty – A fast, cross-platform, OpenGL terminal emulator | | 2024-05-21

    Wezterm gives you basic stuff like scrollbars that Alacritty refuses to do:

  • DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

  • Project mention: Can we discuss MLOps, Deployment, Optimizations, and Speed? | /r/LocalLLaMA | 2023-12-06

    DeepSpeed can handle parallelism concerns, and even offload data/model to RAM, or even NVMe (!?) . I'm surprised I don't see this project used more.

  • fastai

    The fastai deep learning library

  • Project mention: Notebooks Are McDonalds of Code | | 2024-06-13

    I'd say the fastai library itself[0] is a pretty good example of how maintainable/scalable practices can come to life in notebook flows. There's something to be said IMO for an active project with 25.8k stars, 238 contributers, 2.7k commits, and 199 open vs 1.5k closed issues.


  • taichi

    Productive, portable, and performant GPU programming in Python.

  • Project mention: CERN Root | | 2024-06-01

    The haughtiness is not for nothing. Since Dec 2023, they made a lame excuse that Pytorch didn't support 3.12:

    Later, even when Pytorch added support for 3.12, nothing changed (so far) in Taichi.

  • stats

    macOS system monitor in your menu bar

  • Project mention: Ask HN: What macOS apps/programs do you use daily and recommend? | | 2024-06-09

    * MacPorts: Everything you need to make Apple Unix equivalent to a Linux box, plus more. Works with the Apple OS, not against it. Doesn't put things in weird places or expect to disable SIP etc. Updates the old versions of CLI stuff that is in the standard MacOS (eg bash, GNU utilities etc).

    * iTerm2: Awesome terminal.

    In terms of MacOS stuff to enhance the out-of-the-box:

    * Bartender to control what shows on the menu bar (note, it has recently changed hands and people are suggesting to use "Ice" instead)

    * Rectangle: To control windows and screens via the keyboard

    * Arq: Better than Time Machine, allows selective backups and supports multiple destinations (OneDrive, NAS, etc)

    * Monitor Control: Controls brightness across multi-screen setups

    * Stats: Puts various meters in your menu bar (CPU, fans, network etc)

  • gpu.js

    GPU Accelerated JavaScript

  • Project mention: Deep Learning in JavaScript | | 2024-03-28

    You might already be familiar, but a GPU.js backend can provide some speedups via good old WebGL -- no need for WebGPU just yet!


  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • WebGL-Fluid-Simulation

    Play with fluids in your browser (works even on mobile)

  • Project mention: That is some extremely impressive water physics, especialy for a place you only visit once. How did they do it? (MAJOR SPOILERS FOR 4.2 WORLD QUEST) | /r/Genshin_Impact | 2023-12-02

    It kinda reminds me of this fluid simulation website. The site is about a computer graphics technique that simulates the motion and appearance of fluids such as water, smoke or fire. You can use your mouse to move around the screen to see the simulation, also can change how the fluid simulation works by adjusting the sliders in the control panel next to it.

  • ivy

    The Unified AI Framework

  • Project mention: Keras 3.0 | | 2023-11-28

    See also which I have not tried but seems along the lines of what you are describing, working with all the major frameworks

  • Deeplearning4j

    Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.

  • Project mention: Deeplearning4j Suite Overview | | 2024-03-29
  • FanControl.Releases

    This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.

  • Project mention: Better PC Cooling with Python and Grafana | | 2024-03-03

    You don't really need PID, just a decent fan curve with

  • neovide

    No Nonsense Neovim Client in Rust

  • Project mention: Helix-gpui: helix gpui front end | | 2024-05-27

    Have you looked at Neovide? It's very similar to this, a cross-platform Rust based front end for neovim. Neovide is really quite nice it's beautifully rendered.

    My primary use case for Neovide (vs neovim running in Wezterm/kitty/Alacritty) is simply so I can Alt/Cmd-TAB to it quickly as I usually run with many terminal windows open and it's somewhat of a chore to locate the one with nvim in it, whereas with Neovide I have a separate app with an icon in the dock/sysmenu to switch to.

  • tvm

    Open deep learning compiler stack for cpu, gpu and specialized accelerators

  • Project mention: Show HN: I built a free in-browser Llama 3 chatbot powered by WebGPU | | 2024-05-03

    Yes. Web-llm is a wrapper of tvmjs:

    Just wrappers all the way down

  • wgpu

    A cross-platform, safe, pure-Rust graphics API.

  • Project mention: I learned Vulkan and wrote a small game engine with it (in 3 months) | | 2024-06-06

  • scalene

    Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

  • Project mention: Memray – A Memory Profiler for Python | | 2024-02-10

    I collected a list of profilers (also memory profilers, also specifically for Python) here:

    Currently I actually need a Python memory profiler, because I want to figure out whether there is some memory leak in my application (PyTorch based training script), and where exactly (in this case, it's not a problem of GPU memory, but CPU memory).

    I tried Scalene (, which seems to be powerful, but somehow the output it gives me is not useful at all? It doesn't really give me a flamegraph, or a list of the top lines with memory allocations, but instead it gives me a listing of all source code lines, and prints some (very sparse) information on each line. So I need to search through that listing now by hand to find the spots? Maybe I just don't know how to use it properly.

    I tried Memray, but first ran into an issue (, but after using some workaround, it worked now. I get a flamegraph out, but it doesn't really seem accurate? After a while, there don't seem to be any new memory allocations at all anymore, and I don't quite trust that this is correct.

    There is also Austin (, which I also wanted to try (have not yet).

    Somehow this experience so far was very disappointing.

    (Side node, I debugged some very strange memory allocation behavior of Python before, where all local variables were kept around after an exception, even though I made sure there is no reference anymore to the exception object, to the traceback, etc, and I even called frame.clear() for all frames to really clear it. It turns out, frame.f_locals will create another copy of all the local variables, and the exception object and all the locals in the other frame still stay alive until you access frame.f_locals again. At that point, it will sync the f_locals again with the real (fast) locals, and then it can finally free everything. It was quite annoying to find the source of this problem and to find workarounds for it.

  • Open3D

    Open3D: A Modern Library for 3D Data Processing

  • Project mention: Does anyone else agree that the links to the latest development version of Open3D don't work? | /r/cscareerquestions | 2023-07-10

    I was going to file a bug about another issue, but I have to download the development version. This is why I want this solved quickly. None of the links seem to work:

  • john

    John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

  • Project mention: Best Hacking Tools for Beginners 2024 | | 2024-02-01

    John The Ripper

  • pycaret

    An open-source, low-code machine learning library in Python

  • ImageAI

    A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

  • Project mention: Photo gallery frontend with encryption and search | /r/selfhosted | 2023-11-27

    Hi. I want to implement an image server similar to Photoprism using ImageAI to tag images based on objects and context. However I don't want to spend to much time working on the frontend, at first I were thinking about using Danbooru and use Flexbooru or the web interface on my phone. But it doesn't have any encryption or password protection (since the purpose of it is to be used as a public image board).

  • cudf

    cuDF - GPU DataFrame Library

  • Project mention: This Week In Python | | 2024-06-14

    cudf – GPU DataFrame Library

  • cupy

    NumPy & SciPy for GPU

  • Project mention: NumPy 2.0.0 | | 2024-06-16


    You may want to check out cupy

  • catboost

    A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

  • Project mention: CatBoost: Open-source gradient boosting library | | 2024-03-05
  • AlphaPose

    Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

GPU discussion

Log in or Post with

GPU related posts

  • This Week In Python

    5 projects | | 14 Jun 2024
  • AMD's MI300X Outperforms Nvidia's H100 for LLM Inference

    1 project | | 13 Jun 2024
  • I learned Vulkan and wrote a small game engine with it (in 3 months)

    17 projects | | 6 Jun 2024
  • cuDF – GPU DataFrame Library

    2 projects | | 2 Jun 2024
  • CuDF – GPU DataFrame Library

    1 project | | 1 Jun 2024
  • Understanding GPT: How To Implement a Simple GPT Model with PyTorch

    2 projects | | 31 May 2024
  • Building a Simple Chatbot using GPT model - part 2

    1 project | | 31 May 2024
  • A note from our sponsor - SaaSHub | 19 Jun 2024
    SaaSHub helps you find the best software and product alternatives Learn more →


What are some of the best open-source GPU projects? This list will help you:

Project Stars
1 Pytorch 79,497
2 alacritty 54,086
3 DeepSpeed 33,399
4 fastai 25,784
5 taichi 24,930
6 stats 22,803
7 gpu.js 15,005
8 WebGL-Fluid-Simulation 14,377
9 ivy 14,031
10 Deeplearning4j 13,483
11 FanControl.Releases 13,024
12 neovide 12,262
13 tvm 11,357
14 wgpu 11,336
15 scalene 11,307
16 Open3D 10,682
17 john 9,546
18 pycaret 8,587
19 ImageAI 8,463
20 cudf 7,923
21 cupy 7,900
22 catboost 7,838
23 AlphaPose 7,792

Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in is all you need to start monitoring your apps. Sign up for our free tier today.