stable-baselines3 VS tianshou

Compare stable-baselines3 vs tianshou and see what are their differences.

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
stable-baselines3 tianshou
46 8
7,850 7,356
4.0% 2.5%
8.2 9.5
8 days ago 4 days ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

stable-baselines3

Posts with mentions or reviews of stable-baselines3. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-09.

tianshou

Posts with mentions or reviews of tianshou. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-02.
  • Multi-Agent Stable Baselines
    2 projects | /r/reinforcementlearning | 2 Feb 2023
    https://github.com/thu-ml/tianshou Imho there isn't a library that has it all, RLlib is quite good too, but I think that Tianshou is more similar to Pytorch and that helps to change the internals more intuitively and know what you are doing.
  • Question about the old policy and new policy in TRPO code
    4 projects | /r/reinforcementlearning | 6 Jul 2022
    Good point...I'll check in more detail when I get a chance later today! I would suggest looking at a more recent implementation like https://github.com/DLR-RM/stable-baselines3 or https://github.com/thu-ml/tianshou if you're trying to build. https://spinningup.openai.com/en/latest/algorithms/trpo.html is particularly good for understanding
  • Tensorflow vs PyTorch for A3C
    4 projects | /r/reinforcementlearning | 17 Nov 2021
    Do you absolutely need A3C? A2C has become more widely used (see, e.g., the comment in https://github.com/ikostrikov/pytorch-a3c, and the fact that both https://github.com/thu-ml/tianshou and https://github.com/facebookresearch/salina have A2C implementations, but no A3C at first glance).
  • Best PyTorch RL library for doing research
    9 projects | /r/reinforcementlearning | 30 Apr 2021
    I tried tianshou and thought it was well-designed for modularity, but it was early in development when I tried and missing some basic features

What are some alternatives?

When comparing stable-baselines3 and tianshou you can also consider the following projects:

Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration

cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

ElegantRL - Massively Parallel Deep Reinforcement Learning. đŸ”„

SuperSuit - A collection of wrappers for Gymnasium and PettingZoo environments (being merged into gymnasium.wrappers and pettingzoo.wrappers

Tic-Tac-Toe-Gym - This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning

rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

RL-Adventure - Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

agents - TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.