SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python deep-reinforcement-learning Projects
-
ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
-
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
-
rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
-
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
-
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
-
DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
-
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
-
PyGame-Learning-Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
-
deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
-
crypto-rl
Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent
-
DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
and the implementation https://github.com/google/trax/blob/master/trax/models/resea... if you are interested.
Hope you get to look into this!
Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.
Project mention: [P] Looking for RL or rules-based No-Limit Hold 'Em Work | /r/MachineLearning | 2023-06-03
Project mention: Open source rules engine for Magic: The Gathering | news.ycombinator.com | 2023-12-14I went looking for MuZero implementations in order to see how, exactly, they interact with the game space. Based on this one, which had the most stars in the muzero topic, it appears that it needs to be able to discern legal next steps from the current game state https://github.com/werner-duvaud/muzero-general/blob/master/...
So, I guess for the cards Forge has implemented one could MuZero it, but I believe it's a bit chicken and egg with a "free text" game like M:TG -- in order to train one would need to know legal steps for any random game state, but in order to have legal steps one would need to be able to read and interpret English rules and card text
Project mention: Can't solve MountainCar-v0 with A2C algorithm (stable-baselines3) | /r/reinforcementlearning | 2023-06-27I'm trying to solve MountainCar-v0 enviroment from gymnasium with the A2C algorithm and the agent doesn't find a solution. I checked this so I added import stable_baselines3.common.sb2_compat.rmsprop_tf_like as RMSpropTFLike. Also checked the rl-baselines3-zoo for the hyperparameter tuning. So my code is:
Project mention: Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm. | /r/reinforcementlearning | 2023-12-09# see https://github.com/rail-berkeley/softlearning/issues/60
Project mention: crypto-rl: Retrieve limit order book level data from coinbase pro and bitfinex -> record in [arctic](https://github.com/man-group/arctic) timeseries database then implemented trend following strategies (market orders) and market making (limit orders) | /r/algoprojects | 2023-12-10
Project mention: [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework | /r/MachineLearning | 2023-10-15
Python deep-reinforcement-learning related posts
- Making Synthesized Sounds More Acoustic
- The Power of Reinforcement Learning: look how this DeepRL Sektor model found a smart, super-cool exploit for Ultimate Mortal Kombat 3 in the video of a submission on DIAMBRA competition platform!
- [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- FinRL-Meta: NEW Data - star count:937.0
- FinRL-Meta: NEW Data - star count:937.0
- FinRL-Meta: NEW Data - star count:937.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 Apr 2024
Index
What are some of the best open-source deep-reinforcement-learning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | ML-From-Scratch | 23,131 |
2 | trax | 7,948 |
3 | cleanrl | 4,414 |
4 | pytorch-a2c-ppo-acktr-gail | 3,423 |
5 | tensorforce | 3,278 |
6 | minimalRL | 2,725 |
7 | rlcard | 2,689 |
8 | muzero-general | 2,373 |
9 | rl-baselines3-zoo | 1,764 |
10 | habitat-lab | 1,692 |
11 | PPO-PyTorch | 1,441 |
12 | d3rlpy | 1,197 |
13 | DI-star | 1,159 |
14 | softlearning | 1,150 |
15 | FinRL-Meta | 1,115 |
16 | PyGame-Learning-Environment | 989 |
17 | deepdrive | 872 |
18 | crypto-rl | 799 |
19 | autonomous-learning-library | 638 |
20 | DeepRL-TensorFlow2 | 573 |
21 | phillip | 539 |
22 | AgileRL | 488 |
23 | DRL-robot-navigation | 420 |