Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 13 Python reinforcement-learning-algorithm Projects
-
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
-
PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
-
Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
-
pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. Here is a policy running on an Upkie:
I posted a while back asking people what frameworks they were using for RL research. Recently i stumbled upon DI-Engine which looks promising! Actively maintained, with a diverse set of algorithms already implemented.
Project mention: Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm. | /r/reinforcementlearning | 2023-12-09# https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/sb3_contrib/tqc/tqc.py :
Python reinforcement-learning-algorithms related posts
- Sim-to-real RL pipeline for open-source wheeled bipeds
- [Question] Why there is so few algorithms implemented in SB3?
- Stable baselines! Where my people at?
- Exporting an A2C model created with stable-baselines3 to PyTorch
- Stable-Baselines3 v1.8 Release
- Distributed implementation tips
- Is stable-baselines3 compatible with gymnasium/gymnasium-robotics?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 24 Apr 2024
Index
What are some of the best open-source reinforcement-learning-algorithm projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | stable-baselines3 | 7,894 |
2 | stable-baselines | 4,000 |
3 | DI-engine | 2,517 |
4 | PPO-PyTorch | 1,453 |
5 | rex-gym | 957 |
6 | autonomous-learning-library | 638 |
7 | PPO-for-Beginners | 637 |
8 | stable-baselines3-contrib | 422 |
9 | Safe-Policy-Optimization | 287 |
10 | pytorch-learn-reinforcement-learning | 139 |
11 | l2rpn-baselines | 74 |
12 | R-NaD | 30 |
13 | soft-actor-critic | 6 |
Sponsored