Python deep-reinforcement-learning

Open-source Python projects categorized as deep-reinforcement-learning

Top 23 Python deep-reinforcement-learning Projects

  • ML-From-Scratch

    Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

  • trax

    Trax — Deep Learning with Clear Code and Speed

  • Project mention: Replit's new Code LLM was trained in 1 week | news.ycombinator.com | 2023-05-03

    and the implementation https://github.com/google/trax/blob/master/trax/models/resea... if you are interested.

    Hope you get to look into this!

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

  • Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24

    PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

  • minimalRL

    Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

  • rlcard

    Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

  • Project mention: [P] Looking for RL or rules-based No-Limit Hold 'Em Work | /r/MachineLearning | 2023-06-03
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • muzero-general

    MuZero

  • Project mention: Open source rules engine for Magic: The Gathering | news.ycombinator.com | 2023-12-14

    I went looking for MuZero implementations in order to see how, exactly, they interact with the game space. Based on this one, which had the most stars in the muzero topic, it appears that it needs to be able to discern legal next steps from the current game state https://github.com/werner-duvaud/muzero-general/blob/master/...

    So, I guess for the cards Forge has implemented one could MuZero it, but I believe it's a bit chicken and egg with a "free text" game like M:TG -- in order to train one would need to know legal steps for any random game state, but in order to have legal steps one would need to be able to read and interpret English rules and card text

  • rl-baselines3-zoo

    A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

  • Project mention: Can't solve MountainCar-v0 with A2C algorithm (stable-baselines3) | /r/reinforcementlearning | 2023-06-27

    I'm trying to solve MountainCar-v0 enviroment from gymnasium with the A2C algorithm and the agent doesn't find a solution. I checked this so I added import stable_baselines3.common.sb2_compat.rmsprop_tf_like as RMSpropTFLike. Also checked the rl-baselines3-zoo for the hyperparameter tuning. So my code is:

  • habitat-lab

    A modular high-level library to train embodied AI agents across a variety of tasks and environments.

  • PPO-PyTorch

    Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

  • d3rlpy

    An offline deep reinforcement learning library

  • DI-star

    An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

  • softlearning

    Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

  • Project mention: Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm. | /r/reinforcementlearning | 2023-12-09

    # see https://github.com/rail-berkeley/softlearning/issues/60

  • FinRL-Meta

    FinRL­-Meta: Dynamic datasets and market environments for FinRL.

  • Project mention: FinRL-Meta: NEW Data - star count:937.0 | /r/algoprojects | 2023-09-25
  • PyGame-Learning-Environment

    PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

  • deepdrive

    Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

  • crypto-rl

    Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent

  • Project mention: crypto-rl: Retrieve limit order book level data from coinbase pro and bitfinex -> record in [arctic](https://github.com/man-group/arctic) timeseries database then implemented trend following strategies (market orders) and market making (limit orders) | /r/algoprojects | 2023-12-10
  • autonomous-learning-library

    A PyTorch library for building deep reinforcement learning agents.

  • DeepRL-TensorFlow2

    🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

  • phillip

    The SSBM "Phillip" AI.

  • AgileRL

    Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

  • Project mention: [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework | /r/MachineLearning | 2023-10-15
  • DRL-robot-navigation

    Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-12-14.

Python deep-reinforcement-learning related posts

Index

What are some of the best open-source deep-reinforcement-learning projects in Python? This list will help you:

Project Stars
1 ML-From-Scratch 23,131
2 trax 7,948
3 cleanrl 4,414
4 pytorch-a2c-ppo-acktr-gail 3,423
5 tensorforce 3,278
6 minimalRL 2,725
7 rlcard 2,689
8 muzero-general 2,373
9 rl-baselines3-zoo 1,764
10 habitat-lab 1,692
11 PPO-PyTorch 1,441
12 d3rlpy 1,197
13 DI-star 1,159
14 softlearning 1,150
15 FinRL-Meta 1,115
16 PyGame-Learning-Environment 989
17 deepdrive 872
18 crypto-rl 799
19 autonomous-learning-library 638
20 DeepRL-TensorFlow2 573
21 phillip 539
22 AgileRL 488
23 DRL-robot-navigation 420
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com