q-learning-algorithms vs Ray

q-learning-algorithms

This repository aims to provide implementations of Q-learning algorithms (DQN, Double-DQN, ...) using PyTorch. (by thomashirtz)
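
For readers unfamiliar with the two algorithms named above, the following is a minimal sketch of how their bootstrap targets differ. It is not taken from the repository: the function names, the plain-list representation of Q-values, and the example numbers are all assumptions made for illustration, and a real implementation would operate on PyTorch tensors in batches.

```python
from typing import Sequence


def dqn_target(reward: float, done: bool,
               next_q_target: Sequence[float], gamma: float = 0.99) -> float:
    """Standard DQN target: the target network both selects and evaluates
    the next action (a max over its own Q-values)."""
    if done:
        return reward  # no bootstrapping from a terminal state
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward: float, done: bool,
                      next_q_online: Sequence[float],
                      next_q_target: Sequence[float],
                      gamma: float = 0.99) -> float:
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    # Hypothetical helper logic: argmax over the online network's Q-values.
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Illustrative numbers only (not measured results).
next_q_online = [1.0, 3.0, 2.0]   # online network prefers action 1
next_q_target = [5.0, 2.0, 4.0]   # target network prefers action 0

print(dqn_target(1.0, False, next_q_target, gamma=0.5))
print(double_dqn_target(1.0, False, next_q_online, next_q_target, gamma=0.5))
```

With these made-up values the Double DQN target is lower than the DQN target, because the action is chosen by one network and scored by the other. Decoupling selection from evaluation in this way is what Double DQN uses to reduce overestimation of Q-values.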

Ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (by ray-project)

ray.io

Metric	q-learning-algorithms	Ray
Mentions	1	42
Stars	4	30,879
Growth	-	2.8%
Activity	0.0	10.0
Latest Commit	almost 3 years ago	7 days ago
Language	Python	Python
License	-	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
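
The example above can be read as a percentile. The sketch below generalizes it under a loudly stated assumption: that the activity score maps linearly to a percentile rank on a 0 to 10 scale. The page only confirms the single data point (9.0 corresponds to the top 10%); the linear rule and the function name `top_percent` are hypothetical, not the site's actual formula.

```python
def top_percent(activity: float) -> float:
    """Return the 'top X%' bracket implied by an activity score.

    Assumption (not confirmed by the source): scores run from 0 to 10
    and map linearly to percentile rank, so 9.0 means top 10%.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is assumed to lie between 0 and 10")
    return round((10.0 - activity) * 10.0, 1)


# The one data point the page states: 9.0 -> top 10%.
print(top_percent(9.0))
```

Under this reading, Ray's score of 10.0 places it at the very top of the tracked projects and a score of 0.0 places a project at the bottom, which matches the "almost 3 years ago" latest commit shown for q-learning-algorithms.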

q-learning-algorithms

Posts with mentions or reviews of q-learning-algorithms. We have used some of these posts to build our list of alternatives and similar projects.

actor-critic algorithms
1 project | /r/reinforcementlearning | 11 Apr 2021

I have learned quite a few things about reinforcement learning in the last few months, and I feel like I understand deep Q-learning algorithms much better (if you want, you can check my [repo](https://github.com/thomashirtz/q-learning-algorithms)). I would like to shift my focus a little towards actor-critic algorithms now. The only thing is, I feel like the explanations in the papers are not as precise as for Q-learning, and explanations on the internet diverge greatly (e.g. the original paper does not give the A2C but only the A3C for one learner).

Ray

Posts with mentions or reviews of Ray. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-05.

Open Source Advent Fun Wraps Up!
10 projects | dev.to | 5 Jan 2024

22. Ray | GitHub | tutorial

Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models
1 project | news.ycombinator.com | 11 Aug 2023

Training times for GSM8k are mentioned here: https://github.com/ray-project/ray/tree/master/doc/source/te...

Ray – an open source project for scaling AI workloads
1 project | news.ycombinator.com | 11 Aug 2023

Methods to keep agents inside grid world.
1 project | /r/reinforcementlearning | 30 Jun 2023

Here's a reference from RLlib that points to docs and an example, and here's one from one of my projects that includes all my own implementations

TransformerXL + PPO Baseline + MemoryGym
10 projects | /r/reinforcementlearning | 15 Feb 2023

RLlib

Is dynamic action masking possible in RLlib?
1 project | /r/reinforcementlearning | 23 Jan 2023

AWS re:Invent 2022 Recap | Data & Analytics services
1 project | dev.to | 3 Jan 2023

⦿ AWS Glue Data Quality - Automatic data quality rule recommendations based on your data
⦿ AWS Glue for Ray - Data integration with Ray (ray.io), a popular new open-source compute framework that helps you scale Python workloads

Think about it for a second
1 project | /r/mathmemes | 19 Oct 2022

https://ray.io (just dropping the link)

Elixir Livebook now as a desktop app
12 projects | news.ycombinator.com | 2 Aug 2022

I've wondered whether it's easier to add data analyst stuff to Elixir that Python seems to have, or add features to Python that Erlang (and by extension Elixir) provides out of the box.
By what I can see, if you want multiprocessing on Python in an easier way (let's say running async), you have to use something like ray core[0], then if you want multiple machines you need redis(?). Elixir/Erlang supports this out of the box.
Explorer[1] is an interesting approach, where it uses Rust via Rustler (Elixir library to call Rust code) and uses Polars as its dataframe library. I think Rustler needs to be reworked for this use case, as it can be slow to return data. I made initial improvements which drastically improve encoding (https://github.com/elixir-nx/explorer/pull/282 and https://github.com/elixir-nx/explorer/pull/286, tldr 20+ seconds down to 3).
[0] https://github.com/ray-project/ray

Learn various techniques to reduce data processing time by using multiprocessing, joblib, and tqdm concurrent
1 project | /r/Python | 13 Jul 2022

Adding these for anyone who had a similar question about Ray vs dask 1, 2, 3

What are some alternatives?

When comparing q-learning-algorithms and Ray you can also consider the following projects:

bomberland - Bomberland: a multi-agent AI competition based on Bomberman. This repository contains both starter / hello world kits + the engine source code

optuna - A hyperparameter optimization framework

chess - Program for playing chess in the console against AI or human opponents

stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

AgileRL - Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Faust - Python Stream Processing

fragile - Framework for building algorithms based on FractalAI

gevent - Coroutine-based concurrency library for Python

stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

SCOOP (Scalable COncurrent Operations in Python) - SCOOP (Scalable COncurrent Operations in Python)

Thespian Actor Library - Python Actor concurrency library

Dask - Parallel computing with task scheduling
