Python reinforcement-learning

Open-source Python projects categorized as reinforcement-learning

Top 23 Python reinforcement-learning Projects

  • Ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

    Project mention: Is dynamic action masking possible in Rllib? | reddit.com/r/reinforcementlearning | 2023-01-23

  • tensor2tensor

    Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

    Project mention: [P] Why I quit my lucrative job at Google to start Vectara? (neural search as a service for developers everywhere). | reddit.com/r/MachineLearning | 2022-10-17

    Found relevant code at https://github.com/tensorflow/tensor2tensor

  • reinforcement-learning-an-introduction

    Python Implementation of Reinforcement Learning: An Introduction

    Project mention: Reinforcement Learning - looking for some resources | reddit.com/r/reinforcementlearning | 2022-04-09

  • pysc2

    StarCraft II Learning Environment

    Project mention: [D] What tool do you use for reinforcement learning experimentation? | reddit.com/r/MachineLearning | 2022-08-10

    Good evening, guys. I currently use StarCraft 2 as a tool for experimenting with my deep reinforcement learning projects, I have also used OpenAI Gym.

  • trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: The founder of Gmail claims that ChatGPT can “kill” Google in two years. | reddit.com/r/Futurology | 2023-01-31

    But a couple years later they came out with open source implementations yeah: https://github.com/google/trax/tree/master/trax/models/reformer

  • machine_learning_examples

    A collection of machine learning examples and tutorials.

  • TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

    Project mention: Introduction to ‘TensorLayer’: A Python-based Versatile Deep Learning Library Designed for Machine Learning Researchers | reddit.com/r/Python | 2022-02-20

    Github: https://github.com/tensorlayer/TensorLayer

  • keras-rl

    Deep Reinforcement Learning for Keras.

    Project mention: How can we model an observation space of an env with different features and sizes. | reddit.com/r/reinforcementlearning | 2022-12-20

  • wandb

    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

    Project mention: Efficient way to tune a network by changing hyperparameters? | reddit.com/r/deeplearning | 2023-01-25

    Wandb is the best! https://wandb.ai/

  • stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    Project mention: Is Stable Baselines 3 no longer compatible with PettingZoo? | reddit.com/r/reinforcementlearning | 2023-01-11

    I was able to get Stable Baselines 3 to work with gymnasium by following the details in this work-in-progress PR: https://github.com/DLR-RM/stable-baselines3/pull/780. I have not used PettingZoo, though.

  • PaLM-rlhf-pytorch

    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

    Project mention: Gather up great coders and make a better Character.Ai | reddit.com/r/CharacterAI | 2023-01-22

    Well... Not necessarily. Actually, if you want to be extra thrifty, you could even go without an ML expert. Just use an open-source one, like LaMDA or PaLM. After that, use chatGPT to build you a basic front end (which would still be better than CAI lol).

  • stable-baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

    Project mention: GAIL without actions? | reddit.com/r/reinforcementlearning | 2022-09-29

    Found relevant code at https://github.com/hill-a/stable-baselines

  • polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

    Project mention: [D] Kubernetes for ML - how are y'all doing it? | reddit.com/r/MachineLearning | 2022-04-14

    We use Polyaxon and it’s pretty good

  • tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

  • dm_control

    DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

    Project mention: Installing & Using MuJoCo 2.1.5 with OpenAi Gym | reddit.com/r/reinforcementlearning | 2022-05-18

    Deepmind Control Suite is a good alternative to Open AI Gym for continuous control tasks. It contains many of the environments present in Gym and also a few extra ones. Deepmind Control Suite also uses Mujoco. I found the installation to be straightforward. Check out https://github.com/deepmind/dm_control

  • football

    Check out the new game server:

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

    Project mention: How does advantage estimation is done when episodes are of variable length in PPO? | reddit.com/r/reinforcementlearning | 2022-04-07

    As an example look at "compute_returns" function here (and pay attention to how self.masks is used): https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/blob/master/a2c_ppo_acktr/storage.py

  • acme

    A library of reinforcement learning components and agents

    Project mention: How much of a MuJoCo simulation or real life robot can you train on a 3090? | reddit.com/r/reinforcementlearning | 2022-12-01

    I'm training a few algorithms from Deepmind's acme library on some MuJoCo models and I'm wondering how long this will take to train and what it's going to do to my electric bill. Is a 3090 or two enough to train something to keep its balance, or do a task, or do I need to wait for the 8090 to come out?

  • ElegantRL

    Cloud-native Deep Reinforcement Learning. 🔥

    Project mention: Does “massively parallel simulation” help advance Reinforcement Learning? | news.ycombinator.com | 2022-12-09

  • agents

    TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

    Project mention: cannot import name 'binary_weighted_focal_crossentropy' from 'keras.backend' | reddit.com/r/learnmachinelearning | 2022-09-05

    im trying to follow this tutorial = https://github.com/tensorflow/agents/blob/master/docs/tutorials/9_c51_tutorial.ipynb

  • minimalRL

    Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

    Project mention: Does anyone know good python sources hardcoded of RL? | reddit.com/r/reinforcementlearning | 2022-05-15

  • DI-engine

    OpenDILab Decision AI Engine

    Project mention: Struggling with algorithm generality? Try DI engine; here is the solution | news.ycombinator.com | 2022-04-29

  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Project mention: Model-based hierarchical reinforcement learning | reddit.com/r/reinforcementlearning | 2023-01-27

    Shameless self-plug: as far as implementation is concerned, I am working on a (hopefully) easier to understand Dreamer architecture under the CleanRL library, toward also re-implementing Director, Dreamer-v3, and a JAX variant for faster training.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-01-31.

Index

What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

#   Project                                  Stars
1   Ray                                     23,792
2   tensor2tensor                           12,926
3   reinforcement-learning-an-introduction  12,035
4   pysc2                                    7,672
5   trax                                     7,319
6   machine_learning_examples                7,210
7   TensorLayer                              7,135
8   keras-rl                                 5,368
9   wandb                                    5,368
10  stable-baselines3                        4,833
11  PaLM-rlhf-pytorch                        4,718
12  stable-baselines                         3,713
13  polyaxon                                 3,239
14  tensorforce                              3,212
15  dm_control                               3,049
16  football                                 3,047
17  pytorch-a2c-ppo-acktr-gail               3,045
18  acme                                     2,999
19  ElegantRL                                2,570
20  agents                                   2,424
21  minimalRL                                2,301
22  DI-engine                                2,196
23  cleanrl                                  2,016