Ray vs stable-baselines

Ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (by ray-project)

Source Code

ray.io

Docs

Suggest alternative

Edit details

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms (by hill-a)

reinforcement-learning-algorithms reinforcement-learning Machine Learning Gym openai baselines Toolbox Python Data Science

Source Code

stable-baselines.readthedocs.io

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

Ray		stable-baselines
	Project
42	Mentions	10
30,988	Stars	4,000
2.8%	Growth	-
10.0	Activity	0.0
about 15 hours ago	Latest Commit	over 1 year ago
Python	Language	Python
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Ray

Posts with mentions or reviews of Ray. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-05.

Open Source Advent Fun Wraps Up!
10 projects | dev.to | 5 Jan 2024

22. Ray | Github | tutorial
Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models
1 project | news.ycombinator.com | 11 Aug 2023

Training times for GSM8k are mentioned here: https://github.com/ray-project/ray/tree/master/doc/source/te...
Ray – an open source project for scaling AI workloads
1 project | news.ycombinator.com | 11 Aug 2023
Methods to keep agents inside grid world.
1 project | /r/reinforcementlearning | 30 Jun 2023

Here's a reference from RLlib that points to docs and an example, and here's one from one of my projects that includes all my own implementations
TransformerXL + PPO Baseline + MemoryGym
10 projects | /r/reinforcementlearning | 15 Feb 2023

RLlib
Is dynamic action masking possible in Rllib?
1 project | /r/reinforcementlearning | 23 Jan 2023
AWS re:Invent 2022 Recap | Data & Analytics services
1 project | dev.to | 3 Jan 2023

⦿ AWS Glue Data Quality - Automatic data quality rule recommendations based on your data AWS Glue for Ray - Data integration with Ray (ray.io), a popular new open-source compute framework that helps you scale Python workloads
Think about it for a second
1 project | /r/mathmemes | 19 Oct 2022

https://ray.io (just dropping the link)
Elixir Livebook now as a desktop app
12 projects | news.ycombinator.com | 2 Aug 2022

I've wondered whether it's easier to add data analyst stuff to Elixir that Python seems to have, or add features to Python that Erlang (and by extension Elixir) provides out of the box.
By what I can see, if you want multiprocessing on Python in an easier way (let's say running async), you have to use something like ray core[0], then if you want multiple machines you need redis(?). Elixir/Erlang supports this out of the box.
Explorer[1] is an interesting approach, where it uses Rust via Rustler (Elixir library to call Rust code) and uses Polars as its dataframe library. I think Rustler needs to be reworked for this usecase, as it can be slow to return data. I made initial improvements which drastically improves encoding (https://github.com/elixir-nx/explorer/pull/282 and https://github.com/elixir-nx/explorer/pull/286, tldr 20+ seconds down to 3).
[0] https://github.com/ray-project/ray
Learn various techniques to reduce data processing time by using multiprocessing, joblib, and tqdm concurrent
1 project | /r/Python | 13 Jul 2022

Adding these for anyone who had a similar question about Ray vs dask 1, 2, 3

stable-baselines

Posts with mentions or reviews of stable-baselines. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-08-28.

Distributed implementation tips
1 project | /r/reinforcementlearning | 15 Mar 2023

As underlined by gold-panda, you can give a try with multiprocessing. I once implemented a version based on what is done in stable_baselines v1 (https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/common/vec_env/subproc_vec_env.py)
GAIL without actions?
1 project | /r/reinforcementlearning | 29 Sep 2022

Found relevant code at https://github.com/hill-a/stable-baselines + all code implementations here
Best framework to use if learning today
1 project | /r/reinforcementlearning | 12 Aug 2022

Depends what you wanna do. Universal answer would be https://stable-baselines.readthedocs.io/
weird mean reward graph
1 project | /r/reinforcementlearning | 10 Mar 2022

As you will see here it is recommended to augment this safety measure with target kl_divergence, that will ensure even smoother learning and enforce early stopping to prevent learning collapses.
Nvidia ISAAC gym/RL
2 projects | /r/reinforcementlearning | 28 Aug 2021

Code for https://arxiv.org/abs/1707.06347 found: https://github.com/hill-a/stable-baselines
Bounds for observation
1 project | /r/reinforcementlearning | 22 Mar 2021
Understanding multi agent learning in OpenAI gym and stable-baselines
4 projects | /r/reinforcementlearning | 17 Mar 2021

I haven't read the code, but stable-baselines doesn't support multi-agent environments (https://github.com/hill-a/stable-baselines/issues/423), so I think they're trying to make learning multi-agent easier with Environment.train().
Using Reinforment Learning to beat the first boss in Dark souls 3 with Proximal Policy Optimization
1 project | /r/learnmachinelearning | 18 Feb 2021
Reinforcement Learning Crash Course (Free)
1 project | /r/reinforcementlearning | 15 Jan 2021

- https://github.com/hill-a/stable-baselines (Tensorflow)
JAX Implementations of Actor-Critic Algorithms
5 projects | /r/reinforcementlearning | 10 Jan 2021

- tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715

What are some alternatives?

When comparing Ray and stable-baselines you can also consider the following projects:

optuna - A hyperparameter optimization framework

stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Faust - Python Stream Processing

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

gevent - Coroutine-based concurrency library for Python

Tic-Tac-Toe-Gym - This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning

SCOOP (Scalable COncurrent Operations in Python) - SCOOP (Scalable COncurrent Operations in Python)

DI-engine - OpenDILab Decision AI Engine

Thespian Actor Library - Python Actor concurrency library

gym

Ray vs optuna stable-baselines vs stable-baselines3 Ray vs stable-baselines3 stable-baselines vs rl-baselines3-zoo Ray vs Faust stable-baselines vs Super-mario-bros-PPO-pytorch Ray vs gevent stable-baselines vs Tic-Tac-Toe-Gym Ray vs SCOOP (Scalable COncurrent Operations in Python) stable-baselines vs DI-engine Ray vs Thespian Actor Library stable-baselines vs gym

Compare Ray vs stable-baselines and see what are their differences.

Ray

stable-baselines

Ray

stable-baselines

What are some alternatives?