Top 23 Python deep-reinforcement-learning Projects
-
ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
-
Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23
Is t5x an encoder/decoder architecture?
Some more general options.
The Flax ecosystem
https://github.com/google/flax?tab=readme-ov-file
or dm-haiku
https://github.com/google-deepmind/dm-haiku
were some of the best developed communities in the Jax AI field
Perhaps the “trax” repo? https://github.com/google/trax
Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...
Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py
-
Project mention: Pwnagotchi: Deep Reinforcement Learning for WiFi Hacking | news.ycombinator.com | 2024-01-27
-
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
-
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
-
rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
-
Project mention: Open source rules engine for Magic: The Gathering | news.ycombinator.com | 2023-12-14
I went looking for MuZero implementations in order to see how, exactly, they interact with the game space. Based on this one, which had the most stars in the muzero topic, it appears that it needs to be able to discern legal next steps from the current game state https://github.com/werner-duvaud/muzero-general/blob/master/...
So, I guess for the cards Forge has implemented one could MuZero it, but I believe it's a bit chicken and egg with a "free text" game like M:TG -- in order to train one would need to know legal steps for any random game state, but in order to have legal steps one would need to be able to read and interpret English rules and card text
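The requirement described in the comment above, that the planner must know which moves are legal in the current state, can be illustrated with a small sketch. This is not code from muzero-general; the function name `masked_policy` and the toy logits are hypothetical, and it only shows the general idea of restricting a policy to legal actions.

```python
import math


def masked_policy(logits, legal_actions):
    """Softmax over `logits`, restricted to `legal_actions`.

    Illegal actions receive probability 0, so a search or sampling
    step built on top of this can never pick them.
    (Hypothetical helper for illustration only.)
    """
    if not legal_actions:
        raise ValueError("at least one legal action is required")
    legal = set(legal_actions)
    # Subtract the largest legal logit for numerical stability.
    max_logit = max(logits[a] for a in legal)
    exps = [
        math.exp(logit - max_logit) if action in legal else 0.0
        for action, logit in enumerate(logits)
    ]
    total = sum(exps)
    return [e / total for e in exps]


# Toy state with four actions, of which only 0 and 2 are legal.
probs = masked_policy([2.0, 1.0, 0.5, 3.0], legal_actions=[0, 2])
```

For a game like M:TG, producing `legal_actions` is the hard part the comment points to: it requires a rules engine that can enumerate valid moves for any state.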
-
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
-
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
-
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
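As a rough illustration of what "clipped objective" means, here is a minimal pure-Python sketch of the PPO clipped surrogate. It is not taken from the PPO-PyTorch repository; the function name `ppo_clip_objective` and the default `clip_eps=0.2` are assumptions chosen for the example.

```python
def ppo_clip_objective(ratios, advantages, clip_eps=0.2):
    """Average PPO clipped surrogate objective over a batch.

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    For each sample the objective is
        min(r * A, clip(r, 1 - eps, 1 + eps) * A),
    which removes the incentive to move the ratio outside the clip range.
    (Hypothetical helper for illustration only.)
    """
    if len(ratios) != len(advantages) or not ratios:
        raise ValueError("ratios and advantages must be non-empty and equal in length")
    terms = []
    for r, adv in zip(ratios, advantages):
        clipped_r = max(1.0 - clip_eps, min(r, 1.0 + clip_eps))
        terms.append(min(r * adv, clipped_r * adv))
    return sum(terms) / len(terms)
```

In training, this quantity is maximized (or its negative minimized) with respect to the policy parameters that determine the ratios.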
-
DI-star
An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.
-
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Project mention: Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm. | /r/reinforcementlearning | 2023-12-09
# see https://github.com/rail-berkeley/softlearning/issues/60
-
PyGame-Learning-Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
-
deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
-
crypto-rl
Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent
Project mention: crypto-rl: Retrieve limit order book data from Coinbase Pro and Bitfinex, record it in the arctic (https://github.com/man-group/arctic) time-series database, then implement trend-following strategies (market orders) and market-making strategies (limit orders) | /r/algoprojects | 2023-12-10
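Assuming "DDQN" in the description above refers to Double DQN, the core idea can be sketched in a few lines: the online network chooses the next action and the target network evaluates it. This is not code from crypto-rl; the function name `double_dqn_target` and its arguments are hypothetical.

```python
def double_dqn_target(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Bootstrapped target for one transition under Double DQN.

    next_q_online: Q-values of the next state from the online network
    next_q_target: Q-values of the next state from the target network
    The action is selected with the online values but scored with the
    target values, which reduces the overestimation seen in plain DQN.
    (Hypothetical helper for illustration only.)
    """
    if done:
        # Terminal transition: no bootstrapping from the next state.
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]
```

With `next_q_online=[0.1, 0.9]` the online network picks action 1, so the target uses `next_q_target[1]` even if the target network rates action 0 higher.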
-
DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
-
Python deep-reinforcement-learning discussion
Python deep-reinforcement-learning related posts
-
Deep Reinforcement Learning: Zero to Hero
-
Making Synthesized Sounds More Acoustic
-
The Power of Reinforcement Learning: look how this DeepRL Sektor model found a smart, super-cool exploit for Ultimate Mortal Kombat 3 in the video of a submission on DIAMBRA competition platform!
-
[P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
-
FinRL-Meta: NEW Data - star count:937.0
Index
What are some of the best open-source deep-reinforcement-learning projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ML-From-Scratch | 23,893 |
2 | trax | 8,110 |
3 | pwnagotchi | 7,808 |
4 | cleanrl | 5,773 |
5 | pytorch-a2c-ppo-acktr-gail | 3,612 |
6 | tensorforce | 3,297 |
7 | rlcard | 2,928 |
8 | minimalRL | 2,858 |
9 | muzero-general | 2,520 |
10 | rl-baselines3-zoo | 2,105 |
11 | habitat-lab | 2,014 |
12 | PPO-PyTorch | 1,755 |
13 | d3rlpy | 1,336 |
14 | FinRL-Meta | 1,301 |
15 | DI-star | 1,234 |
16 | softlearning | 1,227 |
17 | PyGame-Learning-Environment | 1,015 |
18 | deepdrive | 893 |
19 | crypto-rl | 851 |
20 | autonomous-learning-library | 646 |
21 | DRL-robot-navigation | 638 |
22 | DeepRL-TensorFlow2 | 603 |
23 | AgileRL | 597 |