Python deep-reinforcement-learning

Open-source Python projects categorized as deep-reinforcement-learning Edit details

Top 23 Python deep-reinforcement-learning Projects

  • ML-From-Scratch

    Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

    Project mention: Coding K-Means Clustering using Python and NumPy | dev.to | 2022-09-22

    ML From Scratch - An excellent Github repository containing implementations of many machine learning models and algorithms. Easy to understand and highly recommended.

  • trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: [D] Paper Explained - Sparse is Enough in Scaling Transformers (aka Terraformer) | Video Walkthrough | reddit.com/r/MachineLearning | 2021-12-01

    Code: https://github.com/google/trax/blob/master/trax/examples/Terraformer_from_scratch.ipynb

  • InfluxDB

    Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Data Platform where developers build real-time applications for analytics, IoT and cloud-native services in less time with less code.

  • tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

    Project mention: How does advantage estimation is done when episodes are of variable length in PPO? | reddit.com/r/reinforcementlearning | 2022-04-07

    As an example look at "compute_returns" function here (and pay attention to how self.masks is used): https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/blob/master/a2c_ppo_acktr/storage.py

  • minimalRL

    Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

    Project mention: Does anyone know good python sources hardcoded of RL? | reddit.com/r/reinforcementlearning | 2022-05-15
  • muzero-general

    MuZero

    Project mention: Ask HN: What interesting problems are you working on? ( 2022 Edition) | news.ycombinator.com | 2022-09-16
  • rlcard

    Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

    Project mention: Self play environments | reddit.com/r/reinforcementlearning | 2021-11-26

    Hi. I’ve decided to do a project to adapt an rl library to support self-play. This is a project so I can teach myself more about building rl systems. I’ve been considering working with the environment system from rlcard https://github.com/datamllab/rlcard/ but wonder if there are other more widely-used self play environment libraries. Thanks.

  • Scout APM

    Truly a developer’s best friend. Scout APM is great for developers who want to find and fix performance issues in their applications. With Scout, we'll take care of the bugs so you can focus on building great things 🚀.

  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Project mention: CartPole Swing RL task | reddit.com/r/reinforcementlearning | 2022-08-18
  • deep-q-learning

    Minimal Deep Q Learning (DQN & DDQN) implementations in Keras (by keon)

  • softlearning

    Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

    Project mention: SAC: Enforcing Action Bounds formula derivation | reddit.com/r/reinforcementlearning | 2022-01-26

    Code for https://arxiv.org/abs/1812.05905 found: https://github.com/rail-berkeley/softlearning

  • PyGame-Learning-Environment

    PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

    Project mention: How do I run a Pygame Image visually? | reddit.com/r/docker | 2022-06-12

    Accessing host graphics with docker is tricky. If you’re running the container on a unix system, you could try this medium article. This Dockerfile also has setup instructions for unix and macos.

  • habitat-lab

    A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.

    Project mention: [D] Looking for open source projects to contribute | reddit.com/r/MachineLearning | 2022-01-09

    There are plenty of them out there. I spend a lot of time contributing to open source projects like Habitat-Sim https://github.com/facebookresearch/habitat-sim and Habitat-Lab https://github.com/facebookresearch/habitat-lab which have a ton of open issues and code maintaince stuff that we would welcome contributions of.

  • ai-economist

    Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).

    Project mention: Agent-based modeling in applied economics? | reddit.com/r/AskEconomics | 2022-07-06

    3 Area of Reinforcement learning, in particular, has demonstrated impressive breakthroughs recently. There were attempts to apply it to economic policy planning and finance

  • deepdrive

    Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

  • PPO-PyTorch

    Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

    Project mention: A2C/PPO with continuous action space | reddit.com/r/reinforcementlearning | 2021-12-19

    In some methods, like the one here, the actor network has two heads, one for the mean and one for the variance. In other methods, like the one here, the network only outputs the mean, while the variance is pre-defined and is decaying throughout the training.

  • d3rlpy

    An offline deep reinforcement learning library

    Project mention: Python libraries for solving reinforcement learning problems implemented in OpenAI gym | reddit.com/r/reinforcementlearning | 2022-01-20
  • crypto-rl

    Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent

    Project mention: crypto-rl: Retrieve limit order book level data from coinbase pro and bitfinex -> record in [arctic](https://github.com/man-group/arctic) timeseries database then implemented trend following strategies (market orders) and market making (limit orders) | reddit.com/r/algoprojects | 2022-07-16
  • autonomous-learning-library

    A PyTorch library for building deep reinforcement learning agents.

  • phillip

    The SSBM "Phillip" AI.

    Project mention: [P] Imitation Learning (+RL) in Super Smash Bros Melee for Humanlike Agents | reddit.com/r/MachineLearning | 2022-08-22

    Project Nabla is an AI trained with deep neural networks using behavioral cloning and deep reinforcement learning self-play, similar to AlphaStar. It is enabled by the recent launch of a suite of software tools for the game known as "Slippi" which allow for us to save human replays. We train on a subset of ~100k tournament games. It is similar to the older Phillip project, which did not have the benefit of Slippi when it was created (and doesn't use any human replays).

  • DeepRL-TensorFlow2

    🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

  • deep-RL-trading

    playing idealized trading games with deep reinforcement learning

    Project mention: deep-RL-trading: trading game comparing RNN vs CNN vs MLP based on [paper](https://arxiv.org/abs/1803.03916) Deep Learning And Reinforcement Learning - star count:301.0 | reddit.com/r/algoprojects | 2022-04-09
  • or-gym

    Environments for OR and RL Research

    Project mention: Gym like frameworks for combinatorial optimization on Graphs? | reddit.com/r/reinforcementlearning | 2022-06-15

    How about ORGym: https://github.com/hubbs5/or-gym ?

  • PCGrad

    Code for "Gradient Surgery for Multi-Task Learning"

  • talent.io

    Download talent.io’s Tech Salary Report. Median salaries, most in-demand technologies, state of the remote work... all you need to know your worth on the market by tech recruitment platform talent.io

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-09-22.

Python deep-reinforcement-learning related posts

Index

What are some of the best open-source deep-reinforcement-learning projects in Python? This list will help you:

Project Stars
1 ML-From-Scratch 21,537
2 trax 7,112
3 tensorforce 3,169
4 pytorch-a2c-ppo-acktr-gail 2,929
5 minimalRL 2,253
6 muzero-general 1,898
7 rlcard 1,853
8 cleanrl 1,327
9 deep-q-learning 1,155
10 softlearning 958
11 PyGame-Learning-Environment 912
12 habitat-lab 902
13 ai-economist 891
14 deepdrive 789
15 PPO-PyTorch 786
16 d3rlpy 728
17 crypto-rl 610
18 autonomous-learning-library 570
19 phillip 520
20 DeepRL-TensorFlow2 505
21 deep-RL-trading 311
22 or-gym 229
23 PCGrad 217
Find remote jobs at our new job board 99remotejobs.com. There are 8 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
Static code analysis for 29 languages.
Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
www.sonarqube.org