q-learning-algorithms vs cleanrl

q-learning-algorithms

This repository will aim to provide implementations of q-learning algorithms (DQN, Double-DQN, ...) using Pytorch. (by thomashirtz)

Source Code

Suggest alternative

Edit details

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG) (by vwxyzjn)

Wandb reinforcement-learning Pytorch Python Gym Machine Learning deep-reinforcement-learning Deep Learning atari ale A2c proximal-policy-optimization Ppo advantage-actor-critic actor-critic phasic-policy-gradient

Source Code

docs.cleanrl.dev

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

q-learning-algorithms		cleanrl
	Project
1	Mentions	41
4	Stars	4,353
-	Growth	-
0.0	Activity	6.7
almost 3 years ago	Latest Commit	about 1 month ago
Python	Language	Python
-	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

q-learning-algorithms

Posts with mentions or reviews of q-learning-algorithms. We have used some of these posts to build our list of alternatives and similar projects.

We haven't tracked posts mentioning q-learning-algorithms yet.
Tracking mentions began in Dec 2020.

cleanrl

Posts with mentions or reviews of cleanrl. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-08-24.

[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
4 projects | /r/reinforcementlearning | 24 Aug 2023

PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.
[P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
2 projects | /r/MachineLearning | 7 Jul 2023
SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported
2 projects | /r/reinforcementlearning | 19 Jun 2023

I am trying to run cleanrl on the `Pendulum-v1` environment. I did that by going here and changing the default `env-id` to ` parser.add_argument("--env-id", type=str, default="Pendulum-v1",
[P] Robust Policy Optimization is now in CleanRL 🔥!
2 projects | /r/MachineLearning | 23 Jan 2023

💾code: https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/rpo_continuous_action.py

2 projects | /r/MachineLearning | 23 Jan 2023

Happy to share that CleanRL now has a new algorithm called Robust Policy Optimization — 5 lines of code change to PPO to get better performance in 57 out of 61 continuous action envs 🚀 (e.g., dm_control)
What's the best "Non-Black Box" framework for SOTA algorithms?
3 projects | /r/reinforcementlearning | 17 Jan 2023

CleanRL is the gold standard for "approachable implementations" of the most popular RL algorithms, imo. Can't really beat single-file implementations in <= 200 lines of Python.
I could use some basic help
2 projects | /r/reinforcementlearning | 22 Nov 2022

If you're interested in the theoretical foundations of RL, OpenAI's Spinning Up is an amazing resource that goes a bit easier on the math. For the practical side of things, I can't recommend Costa's CleanRL repo more. It has single file (~200ish lines of Python) implementations of most relevant RL algorithms, so it makes it really easy to grasp.
[P] 🔥 CleanRL has reached v1.0.0; Reworked documentation, JAX support, and more!
2 projects | /r/MachineLearning | 14 Nov 2022

GitHub Release: https://github.com/vwxyzjn/cleanrl/releases/tag/v1.0.0
RL review
2 projects | /r/reinforcementlearning | 24 Oct 2022

You can also reference the source code for some of the popular implementations from open source RL libraries like stablebaselines3, RLlib, CleanRL, or Dopamine. These can help you if you’re trying to compare your implementation to a “standard”.
What are sota hyperparameter optimization methods?
3 projects | /r/reinforcementlearning | 19 Oct 2022

As far as I know CleanRL implements TPE. Also, I'm wondering if F-Race is considerable.

What are some alternatives?

When comparing q-learning-algorithms and cleanrl you can also consider the following projects:

stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

tianshou - An elegant PyTorch deep reinforcement learning library.

d3rlpy - An offline deep reinforcement learning library

reinforcement-learning-discord-wiki - The RL discord wiki

mbrl-lib - Library for Model Based RL

machin - Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

bomberland - Bomberland: a multi-agent AI competition based on Bomberman. This repository contains both starter / hello world kits + the engine source code

sample-factory - High throughput synchronous and asynchronous reinforcement learning

Deep-Reinforcement-Learning-Algorithms-with-PyTorch - PyTorch implementations of deep reinforcement learning algorithms and environments

wandb - 🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

spinningup - An educational resource to help anyone learn deep reinforcement learning.

deep_rl_zoo - A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

cleanrl vs stable-baselines3 cleanrl vs tianshou cleanrl vs d3rlpy cleanrl vs reinforcement-learning-discord-wiki cleanrl vs mbrl-lib cleanrl vs machin q-learning-algorithms vs bomberland cleanrl vs sample-factory cleanrl vs Deep-Reinforcement-Learning-Algorithms-with-PyTorch cleanrl vs wandb cleanrl vs spinningup cleanrl vs deep_rl_zoo

Compare q-learning-algorithms vs cleanrl and see what are their differences.

q-learning-algorithms

cleanrl

q-learning-algorithms

cleanrl

What are some alternatives?