alpha-zero-general VS muzero-general

Compare alpha-zero-general vs muzero-general and see what are their differences.

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
WorkOS - The modern identity platform for B2B SaaS
The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
workos.com
featured
alpha-zero-general muzero-general
4 14
3,667 2,379
- -
3.1 0.0
2 months ago 4 months ago
Jupyter Notebook Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

alpha-zero-general

Posts with mentions or reviews of alpha-zero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-26.

muzero-general

Posts with mentions or reviews of muzero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-14.

What are some alternatives?

When comparing alpha-zero-general and muzero-general you can also consider the following projects:

minigo - An open-source implementation of the AlphaGoZero algorithm

deep-RL-trading - playing idealized trading games with deep reinforcement learning

tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

a3c_trading - Trading with recurrent actor-critic reinforcement learning

open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Practical_RL - A course in reinforcement learning in the wild

stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

reversatile - Reversatile: Reversi for Android

pytorch-ddpg - Deep deterministic policy gradient (DDPG) in PyTorch 🚀

Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.