agents vs stable-baselines3

|  | agents | stable-baselines3 |
| --- | --- | --- |
| Mentions | 11 | 46 |
| Stars | 2,731 | 7,953 |
| Growth | 0.4% | 3.1% |
| Activity | 8.0 | 8.2 |
| Last commit | about 1 month ago | 7 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
agents
-
cannot import name 'binary_weighted_focal_crossentropy' from 'keras.backend'
I'm trying to follow this tutorial: https://github.com/tensorflow/agents/blob/master/docs/tutorials/9_c51_tutorial.ipynb
-
Trying to apply the TensorFlow agents from the examples to a custom environment
I followed the TensorFlow tutorial for agents and the multi-armed bandit tutorial, and now I'm trying to make one of the already implemented agents from the examples work on my own environment. Basically, my environment consists of 5 actions and 5 observations; applying action i results in the same state i. Each action also involves sending that action number to a different program via a socket, and the answer from the program is interpreted as the reward. My environment seems to be working: I used a little test script like the one below to test the observe and action functions. I know this is not full proof, but it showed it's at least working.
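The poster's actual script isn't included, so here is a minimal sketch of such a smoke test, assuming a tf-agents PyEnvironment with the 5-action/5-state structure described; the constant reward stands in for the socket reply:

```python
# Hypothetical reconstruction of the smoke test; the socket-based reward
# is replaced by a constant placeholder.
import numpy as np
from tf_agents.environments import py_environment, utils
from tf_agents.specs import array_spec
from tf_agents.trajectories import time_step as ts

class FiveStateEnv(py_environment.PyEnvironment):
    """Toy env: 5 actions, 5 observations, action i leads to state i."""

    def __init__(self):
        self._action_spec = array_spec.BoundedArraySpec(
            shape=(), dtype=np.int32, minimum=0, maximum=4, name='action')
        self._observation_spec = array_spec.BoundedArraySpec(
            shape=(1,), dtype=np.int32, minimum=0, maximum=4, name='observation')
        self._state = 0
        self._steps = 0

    def action_spec(self):
        return self._action_spec

    def observation_spec(self):
        return self._observation_spec

    def _reset(self):
        self._state = 0
        self._steps = 0
        return ts.restart(np.array([self._state], dtype=np.int32))

    def _step(self, action):
        self._state = int(action)      # action i -> state i
        self._steps += 1
        reward = 1.0                   # placeholder for the socket-based reward
        obs = np.array([self._state], dtype=np.int32)
        if self._steps >= 10:          # arbitrary episode length for testing
            return ts.termination(obs, reward)
        return ts.transition(obs, reward=reward)

# tf-agents ships a consistency check for specs vs. actual time steps:
utils.validate_py_environment(FiveStateEnv(), episodes=3)
```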
-
DD-PPO, TD3, SAC: which is the best?
Depending on the task you pick, the "best" algo will vary. There are also a bunch of variations and tricks for each of them, some of which have been given new names over time. If you are working on a project, I would suggest whichever one has the simplest and most extensible implementation. If you really want to compare all of them, you can use libraries that have them all implemented, such as tf-agents.
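For instance, Stable-Baselines3 (the other library on this page) has matched implementations of several of these; a quick, illustrative comparison sketch, where the environment choice and timestep budgets are arbitrary:

```python
# Illustrative benchmark of SAC vs. TD3 on one task with Stable-Baselines3.
from stable_baselines3 import SAC, TD3
from stable_baselines3.common.evaluation import evaluate_policy

for Algo in (SAC, TD3):
    model = Algo("MlpPolicy", "Pendulum-v1", verbose=0)
    model.learn(total_timesteps=20_000)
    mean_r, std_r = evaluate_policy(model, model.get_env(), n_eval_episodes=10)
    print(f"{Algo.__name__}: {mean_r:.1f} +/- {std_r:.1f}")
```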
-
Help understanding PPO training performance
I'm using a simple training loop, based on this.
-
I need suggestions to improve my project
Hello everyone, I published my Python project a month ago. It's a command-line interface for training, tuning, and reusing reinforcement learning algorithms in TensorFlow 2.x, similar to stable-baselines and tf-agents (there aren't many others like it). It doesn't seem to be getting much attention despite the README, license, and everything else.
- xagents, a new reinforcement learning library in TF2
-
tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym
Well, it seems it doesn't flatten anything; it just passes the OrderedDict as input to the dense layer. I'm not sure, but apparently it's Keras that turns that into a list of tensors. You can dig around places like https://github.com/tensorflow/agents/blob/v0.8.0/tf_agents/networks/network.py, https://github.com/tensorflow/agents/blob/v0.8.0/tf_agents/agents/dqn/dqn_agent.py, and https://github.com/openai/gym/blob/master/gym/spaces/dict.py if you want to be really sure.
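One common workaround, assuming the custom env uses a gym.spaces.Dict observation (the post doesn't show the env, so this is a guess): flatten the observation before wrapping the env for tf-agents, so the network receives a single tensor instead of an OrderedDict.

```python
# Flatten a Dict observation space before handing the env to tf-agents.
# "MyDictObsEnv-v0" is a hypothetical env id standing in for the poster's.
import gym
from gym.wrappers import FlattenObservation
from tf_agents.environments import suite_gym

gym_env = FlattenObservation(gym.make("MyDictObsEnv-v0"))
tf_env = suite_gym.wrap_env(gym_env)  # tf-agents PyEnvironment wrapper
```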
-
[D] Choosing best parameters from an optimization
2. You could take the reinforcement learning approach and control these parameters with an agent. This would mean the parameters have to change on the fly, which I'm not sure is appropriate. If so, creating a Gym environment is not that hard; you could then use something like tf-agents, rlax, or any other RL framework of your liking.
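A minimal sketch of such a Gym environment; everything concrete here (3 parameters, the quadratic stand-in objective) is illustrative, not from the thread:

```python
# Toy env where the agent nudges the parameters being optimized and the
# reward is the negated objective value.
import numpy as np
import gym
from gym import spaces

class ParamTuningEnv(gym.Env):
    def __init__(self):
        self.action_space = spaces.Box(-1.0, 1.0, shape=(3,), dtype=np.float32)
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(3,), dtype=np.float32)
        self.params = np.zeros(3, dtype=np.float32)

    def reset(self):
        self.params = np.zeros(3, dtype=np.float32)
        return self.params.copy()

    def step(self, action):
        self.params += 0.1 * action                # change parameters on the fly
        reward = -float(np.sum(self.params ** 2))  # stand-in for the real objective
        return self.params.copy(), reward, False, {}
```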
-
"Modern" version of OpenAI's spinning up?
It's not the same style and looks somewhat more complicated, but I want to mention tf-agents if you don't know about it already.
- Can somebody give me a reinforcement learning code example?
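Since that last thread is a bare request, here is a minimal, hedged sketch of the kind of example being asked for, using tf-agents; all hyperparameters are illustrative, and the official DQN tutorial covers the same ground in depth.

```python
# Minimal DQN-on-CartPole sketch with tf-agents.
import tensorflow as tf
from tf_agents.agents.dqn import dqn_agent
from tf_agents.drivers import dynamic_step_driver
from tf_agents.environments import suite_gym, tf_py_environment
from tf_agents.networks import q_network
from tf_agents.replay_buffers import tf_uniform_replay_buffer
from tf_agents.utils import common

env = tf_py_environment.TFPyEnvironment(suite_gym.load("CartPole-v0"))

q_net = q_network.QNetwork(
    env.observation_spec(), env.action_spec(), fc_layer_params=(100,))

agent = dqn_agent.DqnAgent(
    env.time_step_spec(), env.action_spec(), q_network=q_net,
    optimizer=tf.compat.v1.train.AdamOptimizer(learning_rate=1e-3),
    td_errors_loss_fn=common.element_wise_squared_loss)
agent.initialize()

buffer = tf_uniform_replay_buffer.TFUniformReplayBuffer(
    agent.collect_data_spec, batch_size=env.batch_size, max_length=10_000)

driver = dynamic_step_driver.DynamicStepDriver(
    env, agent.collect_policy, observers=[buffer.add_batch], num_steps=1)

for _ in range(100):          # seed the buffer before sampling from it
    driver.run()

dataset = buffer.as_dataset(sample_batch_size=64, num_steps=2).prefetch(3)
iterator = iter(dataset)

for step in range(1_000):
    driver.run()                            # collect one environment step
    experience, _ = next(iterator)          # sample a training batch
    loss = agent.train(experience).loss
    if step % 200 == 0:
        print(f"step {step}: loss {loss:.3f}")
```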
stable-baselines3
-
Sim-to-real RL pipeline for open-source wheeled bipeds
The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable-Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. The original post includes a video of a policy running on an Upkie.
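A hedged sketch of that pipeline end to end; the registration helper and environment id below follow the Upkie docs as I understand them and may differ between versions:

```python
# Train PPO on an Upkie Gymnasium environment with Stable-Baselines3.
import gymnasium as gym
import upkie.envs
from stable_baselines3 import PPO

upkie.envs.register()                     # make the Upkie envs visible to gym.make
env = gym.make("UpkieGroundVelocity-v3")  # illustrative env id; check upkie.envs
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=100_000)
```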
-
[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.
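For reference, a hedged sketch of the pattern those Stable-Baselines3 tutorials follow, using SuperSuit to turn a PettingZoo parallel environment into an SB3-compatible vector env; the preprocessing choices here are typical, not prescriptive:

```python
# Multi-agent PettingZoo env -> SB3 vector env via SuperSuit.
import supersuit as ss
from pettingzoo.butterfly import pistonball_v6
from stable_baselines3 import PPO

env = pistonball_v6.parallel_env()
env = ss.color_reduction_v0(env, mode="B")      # grayscale
env = ss.resize_v1(env, x_size=84, y_size=84)   # shrink frames
env = ss.frame_stack_v1(env, 3)
env = ss.pettingzoo_env_to_vec_env_v1(env)
env = ss.concat_vec_envs_v1(env, 2, base_class="stable_baselines3")

model = PPO("CnnPolicy", env, verbose=1)
model.learn(total_timesteps=10_000)
```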
-
[Question] Why are there so few algorithms implemented in SB3?
I am wondering why there are so few algorithms in Stable Baselines 3 (SB3, https://github.com/DLR-RM/stable-baselines3/tree/master). I was expecting algorithms like ICM, HIRO, DIAYN, ... Why are there no model-based, skill-chaining, or hierarchical-RL algorithms implemented there?
-
Stable baselines! Where my people at?
Discord is more focused, and they have a page for people who want to contribute: https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md
-
SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported
Therefore, I debugged this error down to the ReplayBuffer that was imported from `SB3`. This is the problem function:
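The Box bounds in the title match Pendulum's observation space, and one frequent cause of this exact error (an assumption about the poster's setup, but a common one) is mixing the old gym package with an SB3 version that type-checks against gymnasium spaces; the isinstance check then fails even though the space looks right. A quick diagnostic:

```python
# Check for a gym/gymnasium mismatch: SB3 >= 2.0 validates spaces against
# gymnasium, so envs built with the old gym package fail its isinstance test.
import gym
import gymnasium

old_env = gym.make("Pendulum-v1")
print(isinstance(old_env.observation_space, gymnasium.spaces.Box))  # False
new_env = gymnasium.make("Pendulum-v1")
print(isinstance(new_env.observation_space, gymnasium.spaces.Box))  # True
```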
- Exporting an A2C model created with stable-baselines3 to PyTorch
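There is no further detail in the thread, but one hedged sketch of how this can work: an SB3 policy is already a torch.nn.Module, so plain PyTorch serialization applies.

```python
# Save an SB3 policy's weights with standard PyTorch tooling.
import torch as th
from stable_baselines3 import A2C

model = A2C("MlpPolicy", "CartPole-v1").learn(total_timesteps=1_000)
th.save(model.policy.state_dict(), "a2c_policy.pth")   # pure PyTorch weights

# Reload into a fresh model with the same architecture:
restored = A2C("MlpPolicy", "CartPole-v1")
restored.policy.load_state_dict(th.load("a2c_policy.pth"))
```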
-
Shimmy 1.0: Gymnasium & PettingZoo bindings for popular external RL environments
Have you ever wanted to use dm-control with stable-baselines3? Within reinforcement learning (RL), a number of APIs are used to implement environments, with limited ability to convert between them. This makes training agents across different APIs highly difficult and has resulted in a fractured ecosystem.
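A hedged sketch of what this looks like in practice; the env id follows Shimmy's dm_control/<domain>-<task> naming and should be checked against your installed version:

```python
# dm-control task exposed to SB3 through Shimmy's Gymnasium bindings.
# Requires: pip install "shimmy[dm-control]" stable-baselines3
import gymnasium as gym
from stable_baselines3 import SAC

env = gym.make("dm_control/cartpole-balance-v0")  # illustrative id
# dm-control exposes Dict observations, hence MultiInputPolicy.
model = SAC("MultiInputPolicy", env)
model.learn(total_timesteps=10_000)
```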
-
Stable-Baselines3 v1.8 Release
Changelog: https://github.com/DLR-RM/stable-baselines3/releases/tag/v1.8.0
-
[P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
Great project! One question, though: is there any reason you are not using existing RL implementations, such as Stable Baselines, instead of creating your own?
- Is stable-baselines3 compatible with gymnasium/gymnasium-robotics?
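The short answer is yes for SB3 2.x, which targets Gymnasium natively; a hedged sketch follows (the env id is illustrative, and registration behavior varies between gymnasium-robotics versions):

```python
# gymnasium-robotics envs are goal-conditioned with Dict observations,
# hence MultiInputPolicy. Depending on versions, importing
# gymnasium_robotics may be needed to register the env ids.
import gymnasium as gym
import gymnasium_robotics  # noqa: F401  (registers the robotics envs)
from stable_baselines3 import SAC

env = gym.make("FetchReach-v2")  # illustrative id
model = SAC("MultiInputPolicy", env)
model.learn(total_timesteps=5_000)
```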
What are some alternatives?
gym - A toolkit for developing and comparing reinforcement learning algorithms.
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
tensorforce - Tensorforce: a TensorFlow library for applied reinforcement learning
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
VMAgent - Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
Gymnasium - An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
habitat-api - A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators. [Moved to: https://github.com/facebookresearch/habitat-lab]
tianshou - An elegant PyTorch deep reinforcement learning library.
GPflowOpt - Bayesian Optimization using GPflow
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros