rl_games vs alpha-zero-general

rl_games

RL implementations (by Denys88)

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more (by suragnair)

Tensorflow Pytorch Keras gobang gomoku alpha-zero alphago-zero Alphago reinforcement-learning self-play mcts monte-carlo-tree-search othello tf Deep Learning Alphazero neural-network

Project	rl_games	alpha-zero-general
Mentions	2	4
Stars	744	3,683
Growth	-	-
Activity	5.3	4.7
Latest Commit	9 days ago	8 days ago
Language	Jupyter Notebook	Jupyter Notebook
License	MIT License	MIT License

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
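
The page does not publish the formulas behind these metrics, so the following is only a rough sketch of how such numbers could be computed. The function names, the 30-day half-life, and the percentile scaling are all assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def month_over_month_growth(stars_now: int, stars_last_month: int) -> float:
    """Relative change in stars over one month, as a fraction (0.1 means +10%)."""
    if stars_last_month <= 0:
        raise ValueError("stars_last_month must be positive")
    return (stars_now - stars_last_month) / stars_last_month


def recency_weighted_commits(commit_dates, today: date, half_life_days: float = 30.0) -> float:
    """Sum of per-commit weights; a commit's weight halves every half_life_days.

    The exponential decay and the 30-day half-life are assumptions.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_score(raw: float, all_raw) -> float:
    """Percentile rank scaled to 0-10 (assumed scaling).

    A score of 9.0 means the project's raw value is higher than that of
    90% of the tracked projects, i.e. it sits in the top 10%.
    """
    below = sum(1 for r in all_raw if r < raw)
    return round(10.0 * below / len(all_raw), 1)
```

Under these assumptions, a project would call `recency_weighted_commits` on its commit history and pass the result to `activity_score` together with the raw values of every tracked project.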

rl_games

Posts with mentions or reviews of rl_games. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-17.

Exporting a RL Policy From Isaac Gym for Dofbot
3 projects | /r/MLQuestions | 17 Feb 2023

Source Example: https://github.com/Denys88/rl_games/blob/master/notebooks/train_and_export_onnx_example_continuous.ipynb

V-MPO - what do you think
2 projects | /r/reinforcementlearning | 20 Jun 2022

I tried to reproduce it in my library; you can take a look at the implementation (https://github.com/Denys88/rl_games/pull/177). You can even find a few configs: moonlander and cartpole work well.

alpha-zero-general

Posts with mentions or reviews of alpha-zero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-26.

Competitive reinforcement learning for turn-based games
2 projects | /r/reinforcementlearning | 26 May 2023

This is a good intro to AlphaZero and Monte Carlo tree search, followed by this repo.

Looking for deeper understanding of AlphaZero algorithm
4 projects | /r/baduk | 1 Mar 2021

Any interest in a strong Santorini (no powers) AI?
2 projects | /r/boardgames | 10 Feb 2021

I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of game logic (about 100 lines). So for you to use the AI you really need the weights for the neural network. I plan on releasing a better version than the current version in say two months or so.

What are some alternatives?

When comparing rl_games and alpha-zero-general you can also consider the following projects:

OmniIsaacGymEnvs-DofbotReacher - Dofbot Reacher Reinforcement Learning Sim2Real Environment for Omniverse Isaac Gym/Sim

muzero-general - MuZero

Practical_RL - A course in reinforcement learning in the wild

minigo - An open-source implementation of the AlphaGoZero algorithm

Excolligere - Forked Repo of J3soon's OmniIsaacGym-DofbotReacher

tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

nn - 🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

a3c_trading - Trading with recurrent actor-critic reinforcement learning

seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

reversatile - Reversatile: Reversi for Android

Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
