|7 months ago||2 months ago|
|MIT License||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Multi-task learning: How's that done?
2 projects | reddit.com/r/learnmachinelearning | 2 Dec 2021
I use the first approach in a chess bot of mine, here's a link to the article: https://towardsdatascience.com/building-a-chess-engine-part2-db4784e843d5. The code for this can be found here: https://github.com/bellerb/chess/blob/main/ai_ben/train.py.
Building a Chess AI
1 project | reddit.com/r/Python | 5 Nov 2021
How to "fit" the output of the Critic to the dimension of the reward?
1 project | reddit.com/r/reinforcementlearning | 8 Feb 2022
You may want to use the trick described in https://arxiv.org/pdf/1805.11593.pdf as a Transformed Bellman Operator. Its efficiency is proved in MuZero original paper https://arxiv.org/pdf/1911.08265.pdf Appendix F. The implementation of that method you can find here: https://github.com/werner-duvaud/muzero-general Usage: muzero/models.py:649 (def support_to_scalar)
MuZero unable to solve non-slippery FrozenLake environment?
2 projects | reddit.com/r/reinforcementlearning | 9 Aug 2021
I have used this implementation from MuZero: https://github.com/werner-duvaud/muzero-general
RL for chess
2 projects | reddit.com/r/reinforcementlearning | 5 Jun 2021
+1 to taking a look at OpenSpiel. It has AlphaZero in C++ and Python, and there is even a PR open that allows running UCI (e.g. Stockfish) bot. You can also load chess via the OpenSpiel wrapper in muzero-general: https://github.com/werner-duvaud/muzero-general
The future of MuZero, and where to go for news
1 project | reddit.com/r/MuZero | 9 Mar 2021
When I looked up some community implementations, like that of Werner Duvaud on GitHub and Discord, hoping to make my own contributions to this effect, I soon found that I was hopelessly out of my depth as an amateur programmer, even with the help of some other sources like this walkthrough series. However, from what I could tell, most of the people working on this sort of thing seemed to be tackling relatively simple games. At first I thought this might be largely due to limitations in hobby time or computing power available to these users, but then I also noticed that, unless I have misunderstood something, it seems like the games are required to be rebuilt entirely in the engine of (this implementation of) MuZero, which would also obviously be a limit on the complexity of games chosen.
Is MuZero currently the best RL algo that we have now?
1 project | reddit.com/r/reinforcementlearning | 6 Mar 2021
"muzero-general", PyTorch/Ray code for Gym/Atari/board-games (reasonable results + checkpoints for small tasks)
4 projects | reddit.com/r/reinforcementlearning | 3 Feb 2021
Windows support (Experimental / Workaround: Use the notebook in Google Colab)4 projects | reddit.com/r/reinforcementlearning | 3 Feb 2021
Muzero code implementation
1 project | reddit.com/r/reinforcementlearning | 24 Jan 2021
There are several if you google "muzero github", e.g. https://github.com/werner-duvaud/muzero-general
DeepMind's MuZero conquers and learns the rules as it does
2 projects | reddit.com/r/singularity | 27 Dec 2020
As you can see here for the Atari games; https://github.com/werner-duvaud/muzero-general/blob/master/games/atari.py2 projects | reddit.com/r/singularity | 27 Dec 2020
What are some alternatives?
alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
deep-RL-trading - playing idealized trading games with deep reinforcement learning
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Pytorch-UNet - PyTorch implementation of the U-Net for image semantic segmentation with high quality images
muzero-general - MuZero
pytorch-ddpg - Deep deterministic policy gradient (DDPG) in PyTorch 🚀
Gym-Booking-Discord-Bot - Book Me Bot™ | www.bookmebot.com | A Facebook + Discord Chatbot that guarantees you a workout booking time at your local Fit4Less, LA Fitness, or CrunchFitness gym!
trax - Trax — Deep Learning with Clear Code and Speed
es_pytorch - High performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters