Top 23 Python deep-reinforcement-learning Projects

ML-From-Scratch

3 23,131 0.0 Python

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
trax

6 7,948 4.7 Python

Trax — Deep Learning with Clear Code and Speed

Project mention: Replit's new Code LLM was trained in 1 week | news.ycombinator.com | 2023-05-03

and the implementation https://github.com/google/trax/blob/master/trax/models/resea... if you are interested.
Hope you get to look into this!

cleanrl

41 4,414 6.7 Python

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24

PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.

pytorch-a2c-ppo-acktr-gail

3 3,423 0.0 Python

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
tensorforce

1 3,278 3.0 Python

Tensorforce: a TensorFlow library for applied reinforcement learning
minimalRL

5 2,725 1.6 Python

Implementations of basic RL algorithms in minimal lines of code (PyTorch-based).
rlcard

5 2,689 6.2 Python

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Project mention: [P] Looking for RL or rules-based No-Limit Hold 'Em Work | /r/MachineLearning | 2023-06-03

muzero-general

14 2,373 0.0 Python

MuZero

Project mention: Open source rules engine for Magic: The Gathering | news.ycombinator.com | 2023-12-14

I went looking for MuZero implementations in order to see how, exactly, they interact with the game space. Based on this one, which had the most stars in the muzero topic, it appears that it needs to be able to discern legal next steps from the current game state https://github.com/werner-duvaud/muzero-general/blob/master/...
So, I guess for the cards Forge has implemented one could MuZero it, but I believe it's a bit chicken and egg with a "free text" game like M:TG -- in order to train one would need to know legal steps for any random game state, but in order to have legal steps one would need to be able to read and interpret English rules and card text
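
The point in the comment above, that the agent needs the set of legal moves for the current state, is often handled by masking the policy so that illegal actions get zero probability. The sketch below is a minimal, stdlib-only illustration of that idea and is not taken from muzero-general; the function name `masked_softmax` and its arguments are hypothetical.

```python
import math


def masked_softmax(logits, legal_actions):
    """Turn raw policy scores into probabilities over legal actions only.

    logits: list of floats, one score per action index (hypothetical input).
    legal_actions: iterable of action indices that are legal in this state.
    Illegal actions receive probability 0.0.
    """
    legal = set(legal_actions)
    if not legal:
        raise ValueError("at least one legal action is required")
    if any(a < 0 or a >= len(logits) for a in legal):
        raise ValueError("legal action index out of range")

    # Subtract the largest legal logit for numerical stability.
    max_logit = max(logits[a] for a in legal)
    weights = [
        math.exp(logits[a] - max_logit) if a in legal else 0.0
        for a in range(len(logits))
    ]
    total = sum(weights)
    return [w / total for w in weights]


# Example: three possible actions, but only actions 0 and 2 are legal.
probs = masked_softmax([1.0, 2.0, 3.0], legal_actions=[0, 2])
```

The mask has to come from somewhere: a rules engine that can enumerate legal actions for any state is exactly the prerequisite the comment describes.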

rl-baselines3-zoo

11 1,764 6.3 Python

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Project mention: Can't solve MountainCar-v0 with A2C algorithm (stable-baselines3) | /r/reinforcementlearning | 2023-06-27

I'm trying to solve the MountainCar-v0 environment from gymnasium with the A2C algorithm, but the agent doesn't find a solution. I checked this so I added import stable_baselines3.common.sb2_compat.rmsprop_tf_like as RMSpropTFLike. Also checked the rl-baselines3-zoo for the hyperparameter tuning. So my code is:

habitat-lab

3 1,692 9.1 Python

A modular high-level library to train embodied AI agents across a variety of tasks and environments.
PPO-PyTorch

2 1,441 2.8 Python

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
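
For readers unfamiliar with the term, the clipped objective named above can be sketched in a few lines. This is a minimal pure-Python illustration, not code from the PPO-PyTorch repository; the function name `ppo_clip_objective` and the default `clip_eps=0.2` are assumptions made for the example. It takes per-sample probability ratios (new policy over old policy) and advantage estimates, and returns the batch mean of min(r * A, clip(r, 1 - eps, 1 + eps) * A).

```python
def ppo_clip_objective(ratios, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (to be maximized).

    ratios: list of pi_new(a|s) / pi_old(a|s) values, one per sample.
    advantages: list of advantage estimates, one per sample.
    clip_eps: clipping range; 0.2 is an assumed default for this sketch.
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("batch must not be empty")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)


# Example batch: one sample moved too far in a good direction,
# one moved too far in a bad direction.
objective = ppo_clip_objective([1.5, 0.5], [1.0, -1.0])
```

With a positive advantage, a ratio above 1 + eps earns no extra credit, so the update has no incentive to move the policy further from the old one on that sample.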
d3rlpy

2 1,197 9.1 Python

An offline deep reinforcement learning library
DI-star

9 1,159 3.3 Python

An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.
softlearning

4 1,150 0.0 Python

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Project mention: Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm. | /r/reinforcementlearning | 2023-12-09

# see https://github.com/rail-berkeley/softlearning/issues/60

FinRL-Meta

64 1,115 7.4 Python

FinRL-Meta: Dynamic datasets and market environments for FinRL.

Project mention: FinRL-Meta: NEW Data - star count:937.0 | /r/algoprojects | 2023-09-25

PyGame-Learning-Environment

1 989 0.0 Python

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
deepdrive

1 872 0.0 Python

Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
crypto-rl

10 799 0.0 Python

Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent

Project mention: crypto-rl: Retrieve limit order book level data from coinbase pro and bitfinex -> record in [arctic](https://github.com/man-group/arctic) timeseries database then implemented trend following strategies (market orders) and market making (limit orders) | /r/algoprojects | 2023-12-10

autonomous-learning-library

2 638 7.6 Python

A PyTorch library for building deep reinforcement learning agents.
DeepRL-TensorFlow2

2 573 0.0 Python

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
phillip

2 539 0.0 Python

The SSBM "Phillip" AI.
AgileRL

12 488 9.8 Python

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Project mention: [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework | /r/MachineLearning | 2023-10-15

DRL-robot-navigation

1 420 2.1 Python

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-12-14.

Python deep-reinforcement-learning related posts

Making Synthesized Sounds More Acoustic
2 projects | news.ycombinator.com | 20 Dec 2023
The Power of Reinforcement Learning: look how this DeepRL Sektor model found a smart, super-cool exploit for Ultimate Mortal Kombat 3 in the video of a submission on DIAMBRA competition platform!
1 project | /r/reinforcementlearning | 9 Dec 2023
[P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
1 project | /r/MachineLearning | 15 Oct 2023
Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
1 project | /r/reinforcementlearning | 15 Oct 2023
FinRL-Meta: NEW Data - star count:937.0
1 project | /r/algoprojects | 25 Sep 2023
FinRL-Meta: NEW Data - star count:937.0
1 project | /r/algoprojects | 22 Sep 2023

Index

What are some of the best open-source deep-reinforcement-learning projects in Python? This list will help you:

#	Project	Stars
1	ML-From-Scratch	23,131
2	trax	7,948
3	cleanrl	4,414
4	pytorch-a2c-ppo-acktr-gail	3,423
5	tensorforce	3,278
6	minimalRL	2,725
7	rlcard	2,689
8	muzero-general	2,373
9	rl-baselines3-zoo	1,764
10	habitat-lab	1,692
11	PPO-PyTorch	1,441
12	d3rlpy	1,197
13	DI-star	1,159
14	softlearning	1,150
15	FinRL-Meta	1,115
16	PyGame-Learning-Environment	989
17	deepdrive	872
18	crypto-rl	799
19	autonomous-learning-library	638
20	DeepRL-TensorFlow2	573
21	phillip	539
22	AgileRL	488
23	DRL-robot-navigation	420