alpha-zero-general
minigo
Our great sponsors
alpha-zero-general | minigo | |
---|---|---|
4 | 1 | |
3,667 | 3,234 | |
- | - | |
3.1 | 3.0 | |
2 months ago | about 3 years ago | |
Jupyter Notebook | C++ | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
alpha-zero-general
-
Competitive reinforcement learning for turn-based games
This is a good intro to alphazero and montecarlo treesearch , Followed by This repo.
- Looking for deeper understanding of AlphaZero algorithm
-
Any interest in a strong Santorini (no powers) AI?
I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of game logic (about 100 lines). So for you to use the AI you really need the weights for the neural network. I plan on releasing a better version than the current version in say two months or so.
minigo
-
Looking for deeper understanding of AlphaZero algorithm
If you want to see some real code, and are comfortable with Python, here's a minimal open source implementation of the ideas: https://github.com/tensorflow/minigo
What are some alternatives?
muzero-general - MuZero
deeplearning-notes - Notes for Deep Learning Specialization Courses led by Andrew Ng.
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
a3c_trading - Trading with recurrent actor-critic reinforcement learning
Practical_RL - A course in reinforcement learning in the wild
reversatile - Reversatile: Reversi for Android
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
SelfplayLab - Implementation of the alphago zero algorithm with some small games for experimenting with reinforcement learning
Hands-On-Meta-Learning-With-Python - Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
rl-trading - Using Reinforcement Learning agents as Algorithmic Traders
Siren-fastai2 - Unofficial implementation of 'Implicit Neural Representations with Periodic Activation Functions'
rl_games - RL implementations