stable-baselines
SuperSuit
| | stable-baselines | SuperSuit |
|---|---|---|
| Mentions | 10 | 4 |
| Stars | 4,000 | 430 |
| Growth | - | 1.4% |
| Activity | 0.0 | 8.0 |
| Latest commit | over 1 year ago | about 1 month ago |
| Language | Python | Python |
| License | MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stable-baselines
- Distributed implementation tips
As underlined by gold-panda, you can give it a try with multiprocessing. I once implemented a version based on what is done in stable_baselines v1 (https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/common/vec_env/subproc_vec_env.py)
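The idea behind the linked subproc_vec_env.py is to run each environment in its own worker process and talk to it over a pipe. A minimal stand-alone sketch of that pattern, using only the standard library (the `ToyEnv` and class names here are illustrative placeholders, not stable-baselines' actual code):

```python
# Sketch of a SubprocVecEnv-style vectorized environment using only the
# standard library. The real stable_baselines implementation handles
# spaces, auto-reset, and more; this only shows the process/pipe pattern.
import multiprocessing as mp

# The fork start method keeps the sketch simple (POSIX-only assumption).
_CTX = mp.get_context("fork")

class ToyEnv:
    """Stand-in environment: observation is a step counter, done after 3 steps."""
    def reset(self):
        self.t = 0
        return self.t
    def step(self, action):
        self.t += 1
        return self.t, float(action), self.t >= 3, {}

def _worker(conn, env_fn):
    # Each worker owns one environment and serves commands from the pipe.
    env = env_fn()
    while True:
        cmd, data = conn.recv()
        if cmd == "reset":
            conn.send(env.reset())
        elif cmd == "step":
            conn.send(env.step(data))
        elif cmd == "close":
            conn.close()
            break

class SubprocVecEnvSketch:
    def __init__(self, env_fns):
        self.conns, self.procs = [], []
        for fn in env_fns:
            parent, child = _CTX.Pipe()
            p = _CTX.Process(target=_worker, args=(child, fn), daemon=True)
            p.start()
            self.conns.append(parent)
            self.procs.append(p)

    def reset(self):
        for c in self.conns:
            c.send(("reset", None))
        return [c.recv() for c in self.conns]

    def step(self, actions):
        # Send all actions first so the environments step in parallel.
        for c, a in zip(self.conns, actions):
            c.send(("step", a))
        obs, rews, dones, infos = zip(*[c.recv() for c in self.conns])
        return list(obs), list(rews), list(dones), list(infos)

    def close(self):
        for c in self.conns:
            c.send(("close", None))
        for p in self.procs:
            p.join()

venv = SubprocVecEnvSketch([ToyEnv, ToyEnv])
first_obs = venv.reset()   # one observation per worker: [0, 0]
venv.close()
```

The key design point is that `step` sends every action before receiving any result, so the workers run their environment steps concurrently rather than one after another.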
- GAIL without actions?
Found relevant code at https://github.com/hill-a/stable-baselines + all code implementations here
- Best framework to use if learning today
Depends on what you want to do. The universal answer would be https://stable-baselines.readthedocs.io/
- weird mean reward graph
As you will see here, it is recommended to augment this safety measure with a target KL divergence, which ensures even smoother learning and enforces early stopping to prevent learning collapse.
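The early-stopping idea mentioned above can be shown in a few lines: stop a PPO update's inner epochs once the approximate KL divergence between the old and new policies drifts past a threshold. This is a hedged sketch in plain Python; the helper names, the `1.5 *` margin, and the `target_kl=0.03` default are illustrative, not taken from any specific library:

```python
# Sketch of target-KL early stopping for PPO's inner epoch loop.
# Assumption: log-probabilities are plain Python floats; real code
# would use tensors and take a gradient step per epoch.

def approx_kl(old_logprobs, new_logprobs):
    # Common first-order KL estimate: mean(old_logp - new_logp).
    return sum(o - n for o, n in zip(old_logprobs, new_logprobs)) / len(old_logprobs)

def ppo_epochs_with_early_stop(old_logprobs, new_logprobs_per_epoch,
                               target_kl=0.03):
    """Run scheduled epochs, stopping when the policy drifts too far."""
    completed = 0
    for new_logprobs in new_logprobs_per_epoch:
        kl = approx_kl(old_logprobs, new_logprobs)
        if kl > 1.5 * target_kl:   # margin before stopping (illustrative)
            break                  # abort remaining epochs for this batch
        completed += 1             # (the gradient step would happen here)
    return completed

# Small drift passes the check, large drift triggers the stop:
n = ppo_epochs_with_early_stop([0.0, 0.0], [[-0.01, -0.01], [-0.1, -0.1]])
# n == 1: the second epoch's KL of 0.1 exceeds 1.5 * 0.03
```

The point of the margin is to let the policy move a little each epoch while still cutting the update short before a single large batch pushes it into the collapse regime the answer above describes.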
- Nvidia ISAAC gym/RL
Code for https://arxiv.org/abs/1707.06347 found: https://github.com/hill-a/stable-baselines
- Bounds for observation
- Understanding multi agent learning in OpenAI gym and stable-baselines
I haven't read the code, but stable-baselines doesn't support multi-agent environments (https://github.com/hill-a/stable-baselines/issues/423), so I think they're trying to make multi-agent learning easier with Environment.train().
- Using Reinforcement Learning to beat the first boss in Dark Souls 3 with Proximal Policy Optimization
- Reinforcement Learning Crash Course (Free)
- https://github.com/hill-a/stable-baselines (TensorFlow)
- JAX Implementations of Actor-Critic Algorithms
- tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715
SuperSuit
- What is a wrapper in RL?
"SuperSuit is a library that includes all commonly used wrappers in RL (frame stacking, observation normalization, etc.) for PettingZoo and Gym environments with a nice API. We developed it in lieu of wrappers built into PettingZoo. https://github.com/Farama-Foundation/SuperSuit "
- Simple (few states) two-agent environments?
+1 on PettingZoo, and the wrappers they provide as SuperSuit come in handy as well! Also check out OpenSpiel.
- Take a look at SuperSuit - it contains mature versions of all common preprocessing wrappers for Gym environments, including ones that accept lambda functions for observations/actions/rewards.
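To make the two wrapper styles mentioned above concrete, here is a minimal stand-alone sketch of a frame-stacking wrapper and an observation-lambda wrapper. `ToyEnv` and the class names are illustrative stand-ins, not SuperSuit's actual implementation (SuperSuit's versions also handle observation spaces, vector observations, and so on):

```python
# Illustrative sketch of two common RL wrapper styles: frame stacking and
# an observation-lambda wrapper. Assumption: a Gym-like env API with
# reset() -> obs and step(action) -> (obs, reward, done, info).
from collections import deque

class ToyEnv:
    """Stand-in environment whose observation is a step counter."""
    def reset(self):
        self.t = 0
        return self.t
    def step(self, action):
        self.t += 1
        return self.t, 0.0, False, {}

class FrameStack:
    """Return a tuple of the last `k` observations."""
    def __init__(self, env, k):
        self.env, self.k = env, k
        self.frames = deque(maxlen=k)
    def reset(self):
        obs = self.env.reset()
        self.frames.extend([obs] * self.k)   # pad history with first frame
        return tuple(self.frames)
    def step(self, action):
        obs, rew, done, info = self.env.step(action)
        self.frames.append(obs)              # maxlen drops the oldest frame
        return tuple(self.frames), rew, done, info

class ObservationLambda:
    """Apply an arbitrary function to every observation."""
    def __init__(self, env, fn):
        self.env, self.fn = env, fn
    def reset(self):
        return self.fn(self.env.reset())
    def step(self, action):
        obs, rew, done, info = self.env.step(action)
        return self.fn(obs), rew, done, info

# Wrappers compose: stack 3 frames, then reduce the stack with a lambda.
env = ObservationLambda(FrameStack(ToyEnv(), 3), lambda frames: sum(frames))
stacked = env.reset()   # frames are (0, 0, 0), so the lambda returns 0
```

Composition is the point: because each wrapper exposes the same `reset`/`step` interface as the environment it wraps, preprocessing steps can be chained in any order.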
- Understanding multi agent learning in OpenAI gym and stable-baselines
Multi-agent isn’t supported by default in stable baselines, but you can make it work with PettingZoo. This example trains a single policy to control every agent in an environment (Parameter sharing). You could use these SuperSuit wrappers to work with other methods (self-play, independent learning, etc) but you would probably need to write some custom training code. https://github.com/PettingZoo-Team/SuperSuit#parallel-environment-vectorization
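The parameter-sharing idea in the answer above reduces to this: one set of policy parameters produces the action for every agent, so a single-agent trainer can learn a multi-agent task. A toy sketch with a stand-in policy and a dict of per-agent observations (everything here is illustrative; with the real libraries, SuperSuit's vectorization is what feeds the per-agent transitions to a stable-baselines-style trainer):

```python
# Minimal sketch of parameter sharing in a multi-agent step: every agent's
# action comes from the SAME policy object, so only one parameter set is
# ever trained. SharedPolicy and run_joint_step are hypothetical names.

class SharedPolicy:
    def __init__(self, weight=2):
        self.weight = weight              # the single shared parameter set

    def act(self, obs):
        # Trivial "policy": scale the observation by the shared weight.
        return obs * self.weight

def run_joint_step(observations, policy):
    """Map each agent's observation to an action via the shared policy."""
    return {agent: policy.act(obs) for agent, obs in observations.items()}

policy = SharedPolicy()
actions = run_joint_step({"agent_0": 1, "agent_1": 3}, policy)
# Both actions come from the same weight, so updating it from any agent's
# experience improves the behavior of every agent at once.
```

Self-play and independent learning differ exactly here: independent learning would hold one `SharedPolicy`-like object per agent, which is why the answer notes those setups need custom training code.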
What are some alternatives?
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
stable-baselines - Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
PettingZoo - An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Tic-Tac-Toe-Gym - This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
kaggle-environments
DI-engine - OpenDILab Decision AI Engine
gym