Top 23 Python reinforcement-learning Projects

Ray

44 31,718 10.0 Python

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Project mention: Comparison: Dask vs. Ray | news.ycombinator.com | 2024-06-14

d2l-en

6 22,174 8.0 Python

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
reinforcement-learning-an-introduction

2 13,272 2.7 Python

Python Implementation of Reinforcement Learning: An Introduction
wandb

17 8,482 9.9 Python

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Project mention: 10 Open Source Tools for Building MLOps Pipelines | dev.to | 2024-06-06

Weights and Biases (W&B) is a tool for visualizing and tracking machine learning experiments. It supports major machine learning frameworks such as TensorFlow and PyTorch. Its key features include:

stable-baselines3

46 8,200 8.0 Python

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Project mention: Sim-to-real RL pipeline for open-source wheeled bipeds | /r/robotics | 2023-12-09

The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. Here is a policy running on an Upkie:

machine_learning_examples

3 8,161 7.1 Python

A collection of machine learning examples and tutorials.
trax

7 7,988 4.3 Python

Trax — Deep Learning with Clear Code and Speed

Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23

Is t5x an encoder/decoder architecture?
Some more general options:
The Flax ecosystem (https://github.com/google/flax?tab=readme-ov-file) and dm-haiku (https://github.com/google-deepmind/dm-haiku) were some of the best-developed communities in the JAX AI field.
Perhaps the "trax" repo? https://github.com/google/trax
Some HF examples: https://github.com/huggingface/transformers/tree/main/exampl...
Sadly, it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details: https://github.com/xai-org/grok-1/blob/main/run.py

pysc2

6 7,946 0.0 Python

StarCraft II Learning Environment
PaLM-rlhf-pytorch

25 7,620 4.6 Python

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
TensorLayer

1 7,296 0.0 Python

Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Gymnasium

12 6,054 9.3 Python

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Project mention: NASA JPL Open Source Rover That Runs ROS 2 | news.ycombinator.com | 2023-09-22

"Show HN: Ghidra Plays Mario" (2023) https://news.ycombinator.com/item?id=37475761 :
[RL, MuZero reduxxxx ]
> Farama-Foundation/Gymnasium is a fork of OpenAI/gym and it has support for additional Environments like MuJoCo: https://github.com/Farama-Foundation/Gymnasium#environments
> Farama-Foundation/MO-Gymnasium: "Multi-objective Gymnasium environments for reinforcement learning": https://github.com/Farama-Foundation/MO-Gymnasium
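
The "API standard" in the Gymnasium description above comes down to a reset/step loop. What follows is a minimal sketch of that loop, assuming the Gymnasium convention that reset() returns (observation, info) and step() returns (observation, reward, terminated, truncated, info). CountdownEnv and run_episode are hypothetical names written in plain Python so the sketch runs without installing anything. A real environment would subclass gymnasium.Env and declare its observation and action spaces.

```python
from typing import Any, Callable, Dict, Optional, Tuple


class CountdownEnv:
    """Hypothetical toy environment that mimics the Gymnasium reset/step convention."""

    def __init__(self, start: int = 3, max_steps: int = 10) -> None:
        self.start = start
        self.max_steps = max_steps
        self.state = start
        self.steps = 0

    def reset(self, seed: Optional[int] = None) -> Tuple[int, Dict[str, Any]]:
        # The seed is accepted only for interface parity; this toy is deterministic.
        self.state = self.start
        self.steps = 0
        return self.state, {}

    def step(self, action: int) -> Tuple[int, float, bool, bool, Dict[str, Any]]:
        # Action 1 decrements the counter; any other action leaves it unchanged.
        if action == 1:
            self.state -= 1
        self.steps += 1
        terminated = self.state == 0  # the task itself has ended
        truncated = (not terminated) and self.steps >= self.max_steps  # step limit hit
        reward = 1.0 if terminated else 0.0
        return self.state, reward, terminated, truncated, {}


def run_episode(env: CountdownEnv, policy: Callable[[int], int]) -> Tuple[float, int]:
    """Run one episode and return (total reward, number of steps taken)."""
    obs, _info = env.reset()
    total_reward, n_steps = 0.0, 0
    done = False
    while not done:
        obs, reward, terminated, truncated, _info = env.step(policy(obs))
        total_reward += reward
        n_steps += 1
        done = terminated or truncated
    return total_reward, n_steps


if __name__ == "__main__":
    # A policy that always decrements reaches zero in three steps.
    print(run_episode(CountdownEnv(start=3), lambda obs: 1))
```

Keeping terminated and truncated as separate flags lets training code distinguish an episode that reached its goal from one that simply ran out of steps.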

keras-rl

1 5,497 0.0 Python

Deep Reinforcement Learning for Keras.
cleanrl

41 4,725 5.7 Python

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24

PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.

trlx

6 4,367 7.9 Python

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Project mention: Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024 | dev.to | 2024-05-02

Transformer Reinforcement Learning X on GitHub

stable-baselines

10 4,068 0.0 Python

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
dm_control

7 3,603 7.7 Python

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
ElegantRL

6 3,520 7.1 Python

Massively Parallel Deep Reinforcement Learning. 🔥
polyaxon

9 3,502 8.6 Python

MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
pytorch-a2c-ppo-acktr-gail

3 3,491 0.0 Python

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
acme

11 3,412 6.1 Python

A library of reinforcement learning components and agents
tensorforce

1 3,286 4.5 Python

Tensorforce: a TensorFlow library for applied reinforcement learning
football

2 3,267 0.0 Python

Check out the new game server:
catalyst

1 3,243 0.0 Python

Accelerated deep learning R&D (by catalyst-team)

Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python reinforcement-learning related posts

Deep Reinforcement Learning: Zero to Hero
3 projects | news.ycombinator.com | 5 May 2024

Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024
2 projects | dev.to | 2 May 2024

Bayesianbandits: A Pythonic microframework for multi-armed bandit problems
1 project | news.ycombinator.com | 14 Mar 2024

Adding Weapons
1 project | dev.to | 24 Jan 2024

Understand how transformers work by demystifying all the math behind them
1 project | news.ycombinator.com | 4 Jan 2024

Show HN: An end-to-end reinforcement learning library for infinite horizon tasks
1 project | news.ycombinator.com | 29 Dec 2023

Show HN: Easily train AlphaZero-like agents on any environment you want
2 projects | news.ycombinator.com | 20 Dec 2023

Index

What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

#	Project	Stars
1	Ray	31,718
2	d2l-en	22,174
3	reinforcement-learning-an-introduction	13,272
4	wandb	8,482
5	stable-baselines3	8,200
6	machine_learning_examples	8,161
7	trax	7,988
8	pysc2	7,946
9	PaLM-rlhf-pytorch	7,620
10	TensorLayer	7,296
11	Gymnasium	6,054
12	keras-rl	5,497
13	cleanrl	4,725
14	trlx	4,367
15	stable-baselines	4,068
16	dm_control	3,603
17	ElegantRL	3,520
18	polyaxon	3,502
19	pytorch-a2c-ppo-acktr-gail	3,491
20	acme	3,412
21	tensorforce	3,286
22	football	3,267
23	catalyst	3,243