SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python reinforcement-learning Projects
-
Ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Project mention: Is dynamic action masking possible in Rllib? | reddit.com/r/reinforcementlearning | 2023-01-23 -
tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Project mention: [P] Why I quit my lucrative job at Google to start Vectara? (neural search as a service for developers everywhere). | reddit.com/r/MachineLearning | 2022-10-17Found relevant code at https://github.com/tensorflow/tensor2tensor + all code implementations here
-
InfluxDB
Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.
-
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Project mention: Reinforcement Learning - looking for some resources | reddit.com/r/reinforcementlearning | 2022-04-09 -
Project mention: [D] What tool do you use for reinforcement learning experimentation? | reddit.com/r/MachineLearning | 2022-08-10
Good evening, guys. I currently use StarCraft 2 as a tool for experimenting with my deep reinforcement learning projects, I have also used OpenAI Gym.
-
Project mention: The founder of Gmail claims that ChatGPT can “kill” Google in two years. | reddit.com/r/Futurology | 2023-01-31
But a couple years later they came out with open source implementations yeah: https://github.com/google/trax/tree/master/trax/models/reformer
-
-
Project mention: Introduction to ‘TensorLayer’: A Python-based Versatile Deep Learning Library Designed for Machine Learning Researchers | reddit.com/r/Python | 2022-02-20
Github: https://github.com/tensorlayer/TensorLayer
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
Project mention: How can we model an observation space of an env with different features and sizes. | reddit.com/r/reinforcementlearning | 2022-12-20
-
wandb
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Project mention: Efficient way to tune a network by changing hyperparameters? | reddit.com/r/deeplearning | 2023-01-25Wandb is the best! https://wandb.ai/
-
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Project mention: Is Stable Baselines 3 no longer compatible with PettingZoo? | reddit.com/r/reinforcementlearning | 2023-01-11I was able to get Stable Baselines 3 to work with gymnasium by following the details in this work-in-progress PR: https://github.com/DLR-RM/stable-baselines3/pull/780. I have not used PettingZoo, though.
-
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Project mention: Gather up great coders and make a better Character.Ai | reddit.com/r/CharacterAI | 2023-01-22Well... Not necessarily. Actually, if you want to be extra thrifty, you could even go without an ML expert. Just use an open-source one, like LaMDA or PaLM. After that, use chatGPT to build you a basic front end (which would still be better than CAI lol).
-
Found relevant code at https://github.com/hill-a/stable-baselines + all code implementations here
-
Project mention: [D] Kubernetes for ML - how are y'all doing it? | reddit.com/r/MachineLearning | 2022-04-14
We use Polyaxon and it’s pretty good
-
-
dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Project mention: Installing & Using MuJoCo 2.1.5 with OpenAi Gym | reddit.com/r/reinforcementlearning | 2022-05-18Deepmind Control Suite is a good alternative to Open AI Gym for continuous control tasks. It contains many of the environments present in Gym and also a few extra ones. Deepmind Control Suite also uses Mujoco. I found the installation to be straightforward. Check out https://github.com/deepmind/dm_control
-
-
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Project mention: How does advantage estimation is done when episodes are of variable length in PPO? | reddit.com/r/reinforcementlearning | 2022-04-07As an example look at "compute_returns" function here (and pay attention to how self.masks is used): https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/blob/master/a2c_ppo_acktr/storage.py
-
Project mention: How much of a MuJoCo simulation or real life robot can you train on a 3090? | reddit.com/r/reinforcementlearning | 2022-12-01
I'm training a few algorithms from Deepmind's acme library on some MuJoCo models and I'm wondering how long this will take to train and what it's going to do to my electric bill. Is a 3090 or two enough to train something to keep its balance, or do a task, or do I need to wait for the 8090 to come out?
-
Project mention: Does “massively parallel simulation” help advance Reinforcement Learning? | news.ycombinator.com | 2022-12-09
-
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Project mention: cannot import name 'binary_weighted_focal_crossentropy' from 'keras.backend' | reddit.com/r/learnmachinelearning | 2022-09-05im trying to follow this tutorial = https://github.com/tensorflow/agents/blob/master/docs/tutorials/9_c51_tutorial.ipynb
-
Project mention: Does anyone know good python sources hardcoded of RL? | reddit.com/r/reinforcementlearning | 2022-05-15
-
Project mention: Struggling with algorithm generality? Try DI engine; here is the solution | news.ycombinator.com | 2022-04-29
-
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Project mention: Model-based hierarchical reinforcement learning | reddit.com/r/reinforcementlearning | 2023-01-27Shameless self-plug: as far as implementation is concerned, I am working on a (hopefully) easier to understand Dreamer architecture under the CleanRL library, toward also re-implementing Director, Dreamer-v3, and and JAX variant for faster training.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python reinforcement-learning related posts
- Best recurrent RL library?
- An Idea for Integration
- Model-based hierarchical reinforcement learning
- What is the limit on parallel environments?
- [P] Robust Policy Optimization is now in CleanRL 🔥!
- Gather up great coders and make a better Character.Ai
- An orb learns to dodge obstacles, collect other orbs and reach the end platform by itself using PPO RL AI
-
A note from our sponsor - #<SponsorshipServiceOld:0x00007fea5b8a2378>
www.saashub.com | 2 Feb 2023
Index
What are some of the best open-source reinforcement-learning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | Ray | 23,792 |
2 | tensor2tensor | 12,926 |
3 | reinforcement-learning-an-introduction | 12,035 |
4 | pysc2 | 7,672 |
5 | trax | 7,319 |
6 | machine_learning_examples | 7,210 |
7 | TensorLayer | 7,135 |
8 | keras-rl | 5,368 |
9 | wandb | 5,368 |
10 | stable-baselines3 | 4,833 |
11 | PaLM-rlhf-pytorch | 4,718 |
12 | stable-baselines | 3,713 |
13 | polyaxon | 3,239 |
14 | tensorforce | 3,212 |
15 | dm_control | 3,049 |
16 | football | 3,047 |
17 | pytorch-a2c-ppo-acktr-gail | 3,045 |
18 | acme | 2,999 |
19 | ElegantRL | 2,570 |
20 | agents | 2,424 |
21 | minimalRL | 2,301 |
22 | DI-engine | 2,196 |
23 | cleanrl | 2,016 |