es_pytorch VS muzero-general

Compare es_pytorch vs muzero-general and see what are their differences.

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
es_pytorch muzero-general
1 14
23 2,379
- -
0.0 0.0
over 2 years ago 4 months ago
Python Python
- MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

es_pytorch

Posts with mentions or reviews of es_pytorch. We have used some of these posts to build our list of alternatives and similar projects.
  • What is the greatest achievement of Genetic Algorithms[D]?
    1 project | /r/MachineLearning | 29 Dec 2020
    ES, specifically OpenAI's ES (and to an extent CMA-ES). This has been shown to be very competitive with modern state of the art RL algorithms. A huge benefit of it is that it's incredibly easy to implement (I'm gonna shamelessly plug my implementation if you want to see the inner workings)

muzero-general

Posts with mentions or reviews of muzero-general. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-14.

What are some alternatives?

When comparing es_pytorch and muzero-general you can also consider the following projects:

pureples - Pure Python Library for ES-HyperNEAT. Contains implementations of HyperNEAT and ES-HyperNEAT.

deep-RL-trading - playing idealized trading games with deep reinforcement learning

neat-python - Python implementation of the NEAT neuroevolution algorithm

Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

policy-adaptation-during-deployment - Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

pytorch-ddpg - Deep deterministic policy gradient (DDPG) in PyTorch 🚀

seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Pytorch-UNet - PyTorch implementation of the U-Net for image semantic segmentation with high quality images