Muzero-unplugged
Stochastic-muzero
Our great sponsors
Muzero-unplugged | Stochastic-muzero | |
---|---|---|
3 | 1 | |
20 | 44 | |
- | - | |
10.0 | 4.5 | |
about 1 year ago | 6 months ago | |
Python | Python | |
GNU General Public License v3.0 only | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Muzero-unplugged
-
Show HN: Ghidra Plays Mario
https://github.com/DHDev0/Muzero-unplugged
Gym is now gymnasium and it has support for additional Environments like Mujoco:
- Implementation of MuZero, MuZero Unplugged and Stochastic MuZero
Stochastic-muzero
What are some alternatives?
Muzero - Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
LightZero - [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
pytorch-A3C - Simple A3C implementation with pytorch + multiprocessing
nn-morse - Decode morse using a neural network
muzero-general - MuZero
ghidra-tlcs900h - Ghidra processor module for Toshiba TLCS-900/H
retro - Retro Games in Gym
neural-network-scratch - build a neural network to show as a demonstration on inner workings of a neural network
ghidra-plays-mario - Playing NES ROMs with Ghidra's PCode Emulator
DeepCubeA - Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)