PettingZoo
stable-baselines3
Our great sponsors
PettingZoo | stable-baselines3 | |
---|---|---|
19 | 46 | |
2,370 | 7,894 | |
4.1% | 5.2% | |
8.8 | 8.2 | |
7 days ago | 7 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PettingZoo
- CartPole equivalent enviroments in MARL?
-
[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
Release notes: https://github.com/Farama-Foundation/PettingZoo/releases/tag/1.24.0
-
Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments
To address this issue, we are excited to announce the release of Shimmy as a mature Farama Foundation project. Shimmy is an API compatibility tool for converting external RL environments to the Gymnasium and PettingZoo APIs. This allows users to access a wide range of single and multi-agent environments, all under a single standard API.
-
Petting Zoo Classic Environment
I am currently trying to implement my own version of a Connect Four Environment based on the version available on the PettingZoo Library github (https://github.com/Farama-Foundation/PettingZoo/blob/master/pettingzoo/classic/connect_four/connect_four.py).
-
Ideas for MARL project?
The Farama Foundation is always looking for contributors to PettingZoo, the largest open source MARL library out there (https://github.com/Farama-Foundation/PettingZoo, farama.org). If you might be interested in that you can message them in their discord server
-
Is Stable Baselines 3 no longer compatible with PettingZoo?
I am trying to implement a custom PettingZoo environment, and a shared policy with Stable Baselines 3. I am running into trouble with the action spaces not being compatible, since PettingZoo has started using gymnasium instead of gym. Does anyone know if these libraries no longer work together, and perhaps if there is a work-around?
-
New to reinforcement learning.
I'd say this is a great path but I'd also look at the basic on-policy gradient actor critic methods like A2C and eventually PPO. Someone recommended SAC which also really good. There are tons of environments in the https://github.com/Farama-Foundation/PettingZoo as well if you want to mess with those. You can also check out stable baselines https://github.com/DLR-RM/stable-baselines3 which is pretty popular. If you want to get into the theory more I recommend reading the Sutton and Barto book on reinforcement learning.
-
[Stable Baselines3] How do I train 3 model simultaneously?
Might want to check out petting zoo: https://github.com/Farama-Foundation/PettingZoo
-
Are there any other high-quality pre-built Python training environments other than Open AI's GYM?
Not that I know of, most I've seen are based on gym. I did see PettingZoo which is for multi-agent RL. https://github.com/Farama-Foundation/PettingZoo
-
Training with multiple agents.
Thank you for your comment. This is certainly helpful. I have since read more about this domain and turns out mathematically it is modeled using Markov/Stochastic game representations. I also found PettingZoo which can be used with RayLib for MARL problems.
stable-baselines3
-
Sim-to-real RL pipeline for open-source wheeled bipeds
The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. Here is a policy running on an Upkie:
-
[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.
-
[Question] Why there is so few algorithms implemented in SB3?
I am wondering why there is so few algorithms in Stable Baselines 3 (SB3, https://github.com/DLR-RM/stable-baselines3/tree/master)? I was expecting some algorithms like ICM, HIRO, DIAYN, ... Why there is no model-based, skill-chaining, hierarchical-RL, ... algorithms implemented there?
-
Stable baselines! Where my people at?
Discord is more focused, and they have a page for people who wants to contribute https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md
-
SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported
Therefore, I debugged this error to the ReplayBuffer that was imported from `SB3`. This is the problem function -
- Exporting an A2C model created with stable-baselines3 to PyTorch
-
Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments
Have you ever wanted to use dm-control with stable-baselines3? Within Reinforcement learning (RL), a number of APIs are used to implement environments, with limited ability to convert between them. This makes training agents across different APIs highly difficult, and has resulted in a fractured ecosystem.
-
Stable-Baselines3 v1.8 Release
Changelog: https://github.com/DLR-RM/stable-baselines3/releases/tag/v1.8.0
-
[P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
Great project! One question though, is there any reason why you are not using existing RL models instead of creating your own, such as stable baselines?
- Is stable-baselines3 compatible with gymnasium/gymnasium-robotics?
What are some alternatives?
open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
gym-macm - 2D, physics based gym environment for multi-agent combat & movement tasks
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
SuperSuit - A collection of wrappers for Gymnasium and PettingZoo environments (being merged into gymnasium.wrappers and pettingzoo.wrappers
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
gym - A toolkit for developing and comparing reinforcement learning algorithms.
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Mava - 🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
tianshou - An elegant PyTorch deep reinforcement learning library.
tmrl - Reinforcement Learning for real-time applications - host of the TrackMania Roborace League
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros