Python reinforcement-learning

Open-source Python projects categorized as reinforcement-learning

Top 23 Python reinforcement-learning Projects

  • Ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Project mention: Open Source Advent Fun Wraps Up! | dev.to | 2024-01-05

    22. Ray | Github | tutorial

  • d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

    Project mention: which book to chose for deep learning :lan Goodfellow or francois chollet | /r/learnmachinelearning | 2023-04-07
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • reinforcement-learning-an-introduction

    Python Implementation of Reinforcement Learning: An Introduction

    Project mention: Help request: Are the results of Sutton and Barto's Example 6.6 Cliff walking believable? What's likely the problem if my SARSA implementation can't replicate? | /r/reinforcementlearning | 2023-04-10

    The python code to generate any figure in this textbook is reproduced in a repo, and you can find the file for the figure in question here: https://github.com/ShangtongZhang/reinforcement-learning-an-introduction/blob/master/chapter06/cliff_walking.py

  • machine_learning_examples

    A collection of machine learning examples and tutorials.

    Project mention: Doubt about numpy's eigen calculation | /r/learnmachinelearning | 2023-05-25

    Does that mean that the example I found on the internet is wrong (I think it comes from a DL Course, so I'd imagine it is not wrong)? or does it mean that I am comparing two different things? I guess this has to deal with right and left eigen vectors as u/JanneJM pointed out in her comment?

  • trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: Replit's new Code LLM was trained in 1 week | news.ycombinator.com | 2023-05-03

    and the implementation https://github.com/google/trax/blob/master/trax/models/resea... if you are interested.

    Hope you get to look into this!

  • pysc2

    StarCraft II Learning Environment

    Project mention: Project For Beginners [StarCraft 2 AI] | /r/learnmachinelearning | 2023-03-05
  • wandb

    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

    Project mention: A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev | dev.to | 2024-02-05

    Weights & Biases — The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management. Free tier for personal projects only, with 100 GB of storage included.

  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • PaLM-rlhf-pytorch

    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

    Project mention: How should I get an in-depth mathematical understanding of generative AI? | /r/datascience | 2023-05-18

    ChatGPT isn't open sourced so we don't know what the actual implementation is. I think you can read Open Assistant's source code for application design. If that is too much, try Open Chat Toolkit's source code for developer tools . If you need very bare implementation, you should go for lucidrains/PaLM-rlhf-pytorch.

  • stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    Project mention: Sim-to-real RL pipeline for open-source wheeled bipeds | /r/robotics | 2023-12-09

    The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. Here is a policy running on an Upkie:

  • TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

  • keras-rl

    Deep Reinforcement Learning for Keras.

  • Gymnasium

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

    Project mention: NASA JPL Open Source Rover That Runs ROS 2 | news.ycombinator.com | 2023-09-22

    "Show HN: Ghidra Plays Mario" (2023) https://news.ycombinator.com/item?id=37475761 :

    [RL, MuZero reduxxxx ]

    > Farama-Foundation/Gymnasium is a fork of OpenAI/gym and it has support for additional Environments like MuJoCo: https://github.com/Farama-Foundation/Gymnasium#environments

    > Farama-Foundatiom/MO-Gymnasiun: "Multi-objective Gymnasium environments for reinforcement learning": https://github.com/Farama-Foundation/MO-Gymnasium

  • trlx

    A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

    Project mention: Why did Stability not copy Midjourney's RLHF process? And what's the future of Stable Diffusion? | /r/StableDiffusion | 2023-04-09

    We drove and released the top RLHF framework TRLX for example from our Carper AI lab used by some of the biggest companies in the world: https://github.com/CarperAI/trlx

  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24

    PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.

  • stable-baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

    Project mention: Distributed implementation tips | /r/reinforcementlearning | 2023-03-15

    As underlined by gold-panda, you can give a try with multiprocessing. I once implemented a version based on what is done in stable_baselines v1 (https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/common/vec_env/subproc_vec_env.py)

  • polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

    Project mention: Any MLOps platform you use? | /r/selfhosted | 2023-02-25

    If you're not concerned about self-hosting, WandB is one of the more fully featured training monitoring tools (I've used it in the past without any issues but the lack of data and training privacy and lack of self-hosting possibilities makes it a hard no for anything that isn't scholastic). Polyaxon is an alternative but rewriting all your variable logging to conform to their requirements makes it very difficult to switch to it in the middle of a project so you have to commit to it from the get-go.

  • dm_control

    Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

    Project mention: Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments | /r/farama | 2023-04-25

    This includes single-agent Gymnasium wrappers for DM Control, DM Lab, Behavior Suite, Arcade Learning Environment, OpenAI Gym V21 & V26. Multi-agent PettingZoo wrappers support DM Control Soccer, OpenSpiel and Melting Pot. For more information, read the release notes here:

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • acme

    A library of reinforcement learning components and agents

    Project mention: Fast and hackable frameworks for RL research | /r/reinforcementlearning | 2023-03-08

    I'm tired of having my 200m frames of Atari take 5 days to run with dopamine, so I'm looking for another framework to use. I haven't been able to find one that's fast and hackable, preferably distributed or with vectorized environments. Anybody have suggestions? seed-rl seems promising but is archived (and in TF2). sample-factory seems super fast but to the best of my knowledge doesn't work with replay buffers. I've been trying to get acme working but documentation is sparse and many of the features are broken.

  • ElegantRL

    Massively Parallel Deep Reinforcement Learning. 🔥

  • tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

  • football

    Check out the new game server:

  • catalyst

    Accelerated deep learning R&D (by catalyst-team)

    Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09
  • LearnThisRepo.com

    Learn 300+ open source libraries for free using AI. LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-05.

Python reinforcement-learning related posts

Index

What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

Project Stars
1 Ray 30,029
2 d2l-en 20,954
3 reinforcement-learning-an-introduction 13,032
4 machine_learning_examples 7,964
5 trax 7,884
6 pysc2 7,881
7 wandb 7,846
8 PaLM-rlhf-pytorch 7,546
9 stable-baselines3 7,448
10 TensorLayer 7,275
11 keras-rl 5,473
12 Gymnasium 5,037
13 trlx 4,233
14 cleanrl 4,078
15 stable-baselines 3,943
16 polyaxon 3,452
17 dm_control 3,446
18 pytorch-a2c-ppo-acktr-gail 3,349
19 acme 3,325
20 ElegantRL 3,315
21 tensorforce 3,273
22 football 3,226
23 catalyst 3,201
Learn 300+ open source libraries for free using AI.
LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!
learnthisrepo.com