muzero-general VS alpha-zero-general

Compare muzero-general vs alpha-zero-general and see what are their differences.

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
muzero-general alpha-zero-general
14 4
2,373 3,667
- -
0.0 3.1
4 months ago about 2 months ago
Python Jupyter Notebook
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

muzero-general

Posts with mentions or reviews of muzero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-14.

alpha-zero-general

Posts with mentions or reviews of alpha-zero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-26.

What are some alternatives?

When comparing muzero-general and alpha-zero-general you can also consider the following projects:

deep-RL-trading - playing idealized trading games with deep reinforcement learning

minigo - An open-source implementation of the AlphaGoZero algorithm

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

a3c_trading - Trading with recurrent actor-critic reinforcement learning

stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Practical_RL - A course in reinforcement learning in the wild

pytorch-ddpg - Deep deterministic policy gradient (DDPG) in PyTorch 🚀

reversatile - Reversatile: Reversi for Android

seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..