muzero-general VS es_pytorch

Compare muzero-general vs es_pytorch and see what are their differences.

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
muzero-general es_pytorch
14 1
2,373 23
- -
0.0 0.0
4 months ago about 2 years ago
Python Python
MIT License -
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

muzero-general

Posts with mentions or reviews of muzero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-14.

es_pytorch

Posts with mentions or reviews of es_pytorch. We have used some of these posts to build our list of alternatives and similar projects.
  • What is the greatest achievement of Genetic Algorithms[D]?
    1 project | /r/MachineLearning | 29 Dec 2020
    ES, specifically OpenAI's ES (and to an extent CMA-ES). This has been shown to be very competitive with modern state of the art RL algorithms. A huge benefit of it is that it's incredibly easy to implement (I'm gonna shamelessly plug my implementation if you want to see the inner workings)

What are some alternatives?

When comparing muzero-general and es_pytorch you can also consider the following projects:

deep-RL-trading - playing idealized trading games with deep reinforcement learning

pureples - Pure Python Library for ES-HyperNEAT. Contains implementations of HyperNEAT and ES-HyperNEAT.

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

neat-python - Python implementation of the NEAT neuroevolution algorithm

alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

policy-adaptation-during-deployment - Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

pytorch-ddpg - Deep deterministic policy gradient (DDPG) in PyTorch 🚀

seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Pytorch-UNet - PyTorch implementation of the U-Net for image semantic segmentation with high quality images