Python reinforcement-learning

Open-source Python projects categorized as reinforcement-learning | Edit details

Top 23 Python reinforcement-learning Projects

  • GitHub repo Ray

    An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

    Project mention: Writing your First Distributed Python Application with Ray (without multiprocessing) | | 2021-08-23

    Here is an older discussion on dask vs ray from the creators of both projects:

  • GitHub repo tensor2tensor

    Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

    Project mention: [D] Resources for Understanding The Original Transformer Paper | | 2021-09-08

    Code for found:

  • Nanos

    Run Linux Software Faster and Safer than Linux with Unikernels.

  • GitHub repo pysc2

    StarCraft II Learning Environment

    Project mention: Tips for a beginner | | 2021-07-22

    If you are looking to develop a machine-learning based bot you can go with pysc2:

  • GitHub repo trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: Why would I want to develop yet another deep learning framework? | | 2021-09-16
  • GitHub repo machine_learning_examples

    A collection of machine learning examples and tutorials.

    Project mention: How to save an attention model for deployment/exposing to an API? | | 2021-08-17

    I've been following a course teaching how to make an attention model for neural machine translation, This is the file inside the repo. I know that I'll have to use certain functions to make the textual input be processed in encodings and tokens, but those functions use certain instances of the model, which I don't know if I should keep or not. If anyone can please take a look and help me out here, it'd be really really appreciated.

  • GitHub repo client

    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API. (by wandb)

    Project mention: Should I take a second attempt at GRE with so many universities making it optional? | | 2021-07-24

    Strong ML background with almost 3 years of experience in the field. I am frequently publishing articles as an ML author for reputed organizations such as Weights and Biases.

  • GitHub repo stable-baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

    Project mention: Nvidia ISAAC gym/RL | | 2021-08-28

    Code for found:

  • Scout APM

    Scout APM: A developer's best friend. Try free for 14-days. Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.

  • GitHub repo tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

    Project mention: Advice on doing RL for Settlers of Catan? | | 2021-07-11

    The most promising approach has been using the TensorForce framework ( with a custom environment that represents a simpler game (1v1 against a bot that chooses actions randomly, no trading between players, and fixing discarding to be done automatically and at random).

  • GitHub repo polyaxon

    Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)

    Project mention: [D] Productionalizing machine learning pipelines for small teams | | 2021-08-08

    For running experiments, is a really good free open-source package that has lots of nice integrations so you can quickly run experiments in k8s but it might be overkill in some cases.

  • GitHub repo football

    Check out the new game server:

    Project mention: Creating a new football game | | 2021-07-26

    For fun, merging such an idea with Google's open source football research project and its AI could result in a very interesting game!

  • GitHub repo pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

    Project mention: How to pretrain a model on expert data? | | 2021-09-12

    Try using an imitation learning algorithm. Two popular options are MaxEnt IRL and GAIL. This repository has GAIL implementation and this repository has MaxEnt IRL and GAIL implementation. There are other implementations too that you can check out.

  • GitHub repo dm_control

    DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

    Project mention: Any beginner resources for RL in Robotics? | | 2021-04-19

    DeepMind's dm control:

  • GitHub repo acme

    A library of reinforcement learning components and agents

    Project mention: Applied resources in Pytorch? | | 2021-07-04
  • GitHub repo stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    Project mention: I need suggestions to improve my project | | 2021-09-06

    Hello everyone, I published my python project a month ago, it's a command line interface for training, tuning and reusing reinforcement learning algorithms in tensorflow 2.x. It's similar to stable-baselines, tf-agents, and not so many others. It seems like it's not getting enough attention despite the README, license, and everything else.

  • GitHub repo agents

    TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

    Project mention: Help understanding PPO training performance | | 2021-09-19

    I'm using a simple training loop, based on this.

  • GitHub repo minimalRL

    Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

    Project mention: Rl algorithm implemented | | 2021-07-18
  • GitHub repo muzero-general


    Project mention: MuZero unable to solve non-slippery FrozenLake environment? | | 2021-08-09

    I have used this implementation from MuZero:

  • GitHub repo rlcard

    Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

    Project mention: What sort of algorithm should I use ? Incomplete information, card game. (Flowchart for reference) | | 2021-01-12

    Probably the easiest way for you to get started is to implement your game on an open source RL framework that has working implementations of some basic CFR variations as well as some other self-play algorithms such as NFSP. OpenSpiel and RLCard are two that I am aware of. Depending on the complexity of your game and how strong your agent needs to play, you might be satisfied with the performance you get using by one of these frameworks.

  • GitHub repo gym-minigrid

    Minimalistic gridworld package for OpenAI Gym

    Project mention: How to train an agent in custom mini-grid environment using stable baselines3? | | 2021-07-20

    Hello guys I tried to build a custom environment using maxicymeb repo

  • GitHub repo ElegantRL

    Lightweight and scalable deep reinforcement learning using PyTorch. 🔥

    Project mention: ElegantRL: A Lightweight and Stable Deep Reinforcement Learning Library | | 2021-03-15
  • GitHub repo Advanced-Deep-Learning-with-Keras

    Advanced Deep Learning with Keras, published by Packt

    Project mention: Cannot understand how REINFORCE model is trained | | 2021-03-04

    I have understood the concept of REINFORCE algorithm and what policy gradient is. However, when I see the code published by PacktPublishing, I was stuck with it.

  • GitHub repo deep-q-learning

    Minimal Deep Q Learning (DQN & DDQN) implementations in Keras (by keon)

    Project mention: Deep Q Network knapsack problem | | 2021-05-22

    So go online on GitHub and find a DQN implementation that has options for using a feedforward net as input (instead of conv net as your input isn’t pixel based). Any remotely modular piece of code will take in state space size and action space as parameters to their NN. This is essentially setting input layer to be equal to state space (so 4) and output layer to be action space (201). ( this repo seems helpful i a cursory glance

  • GitHub repo Hypernets

    A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

    Project mention: [N][R] A Brief Tutorial for Developing AutoML Tools with Hypernets | | 2021-06-28

    Please see here for the Hypernets library.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2021-09-19.


What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

Project Stars
1 Ray 17,498
2 tensor2tensor 11,573
3 pysc2 7,294
4 trax 6,467
5 machine_learning_examples 6,320
6 client 3,340
7 stable-baselines 3,272
8 tensorforce 3,018
9 polyaxon 2,908
10 football 2,781
11 pytorch-a2c-ppo-acktr-gail 2,530
12 dm_control 2,366
13 acme 2,313
14 stable-baselines3 2,101
15 agents 2,052
16 minimalRL 1,984
17 muzero-general 1,440
18 rlcard 1,419
19 gym-minigrid 1,263
20 ElegantRL 1,179
21 Advanced-Deep-Learning-with-Keras 1,072
22 deep-q-learning 1,060
23 Hypernets 1,012
Find remote jobs at our new job board There are 25 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives