Python reinforcement-learning

Open-source Python projects categorized as reinforcement-learning

Top 23 Python reinforcement-learning Projects

reinforcement-learning
  1. nn

    🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

  2. Nutrient

    Nutrient - The #1 PDF SDK Library. Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.

    Nutrient logo
  3. Ray

    Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Project mention: Ask HN: What Open Source Projects Need Help? | news.ycombinator.com | 2024-11-16

    I'm guessing this comment is some kind of "if you know, you know." Likely starting from https://docs.ray.io/en/latest/cluster/vms/user-guides/launch... and then trawling through one of these I guess https://github.com/ray-project/ray/issues?q=is%3Aissue+prem+...

  4. d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

  5. reinforcement-learning-an-introduction

    Python Implementation of Reinforcement Learning: An Introduction

  6. stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

  7. wandb

    The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

    Project mention: Bullish on AI infrastructure, bearish on AI developer frameworks | dev.to | 2025-01-31

    Experiment tracking and reproducibility: Tools like Weights & Biases solve the hard problem of managing hundreds of experiments with varying hyperparameters, dataset splits, and evaluation results. This is critical for teams working collaboratively on model improvements.

  8. machine_learning_examples

    A collection of machine learning examples and tutorials.

  9. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  10. Gymnasium

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

    Project mention: Gymnasium Release v1.0 | news.ycombinator.com | 2024-10-08
  11. trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23

    Is t5x an encoder/decoder architecture?

    Some more general options.

    The Flax ecosystem

    https://github.com/google/flax?tab=readme-ov-file

    or dm-haiku

    https://github.com/google-deepmind/dm-haiku

    were some of the best developed communities in the Jax AI field

    Perhaps the “trax” repo? https://github.com/google/trax

    Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...

    Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py

  12. pysc2

    StarCraft II Learning Environment

  13. PaLM-rlhf-pytorch

    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

  14. TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

  15. cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

  16. keras-rl

    Deep Reinforcement Learning for Keras.

  17. trlx

    A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

    Project mention: Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024 | dev.to | 2024-05-02

    Transformer Reinforcement Learning X on GitHub

  18. OpenRLHF

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    Project mention: AIM Weekly 27 May 2024 | dev.to | 2024-05-28
  19. stable-baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

  20. dm_control

    Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

  21. ElegantRL

    Massively Parallel Deep Reinforcement Learning. 🔥

  22. pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  23. polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

  24. acme

    A library of reinforcement learning components and agents

  25. football

    Check out the new game server:

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python reinforcement-learning discussion

Log in or Post with

Python reinforcement-learning related posts

  • AgiBot X1, a modular humanoid robot with high dof

    1 project | news.ycombinator.com | 28 Oct 2024
  • Gymnasium Release v1.0

    1 project | news.ycombinator.com | 8 Oct 2024
  • Ask HN: Best way to learn robotics with a 10 year old?

    6 projects | news.ycombinator.com | 25 Jul 2024
  • Deep Reinforcement Learning: Zero to Hero

    3 projects | news.ycombinator.com | 5 May 2024
  • Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024

    2 projects | dev.to | 2 May 2024
  • Bayesianbandits: A Pythonic microframework for multi-armed bandit problems

    1 project | news.ycombinator.com | 14 Mar 2024
  • Adding Weapons

    1 project | dev.to | 24 Jan 2024
  • A note from our sponsor - Nutrient
    nutrient.io | 16 Feb 2025
    Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free. Learn more →

Index

What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

# Project Stars
1 nn 58,489
2 Ray 35,405
3 d2l-en 24,927
4 reinforcement-learning-an-introduction 13,802
5 stable-baselines3 9,731
6 wandb 9,457
7 machine_learning_examples 8,489
8 Gymnasium 8,160
9 trax 8,159
10 pysc2 8,064
11 PaLM-rlhf-pytorch 7,747
12 TensorLayer 7,338
13 cleanrl 6,285
14 keras-rl 5,529
15 trlx 4,563
16 OpenRLHF 4,534
17 stable-baselines 4,169
18 dm_control 3,914
19 ElegantRL 3,847
20 pytorch-a2c-ppo-acktr-gail 3,612
21 polyaxon 3,604
22 acme 3,582
23 football 3,377

Sponsored
Nutrient - The #1 PDF SDK Library
Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.
nutrient.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?