SpectralEmbeddings vs alpha-zero-general
Metric | SpectralEmbeddings | alpha-zero-general
---|---|---
Mentions | 1 | 4
Stars | 62 | 3,674
Growth | - | -
Activity | 2.6 | 3.1
Last commit | over 2 years ago | 2 months ago
Language | HTML | Jupyter Notebook
License | GNU General Public License v3.0 or later | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
alpha-zero-general
- Competitive reinforcement learning for turn-based games
  This is a good intro to AlphaZero and Monte Carlo tree search, followed by this repo.
- Looking for deeper understanding of AlphaZero algorithm
- Any interest in a strong Santorini (no powers) AI?
  I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of game logic (about 100 lines). So for you to use the AI you really need the weights for the neural network. I plan on releasing a better version than the current version in, say, two months or so.
What are some alternatives?
gpt-mini - Yet another minimalistic Tensorflow (re-)re-implementation of Karpathy's Pytorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer).
muzero-general - MuZero
minigo - An open-source implementation of the AlphaGoZero algorithm
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
a3c_trading - Trading with recurrent actor-critic reinforcement learning
Practical_RL - A course in reinforcement learning in the wild
reversatile - Reversatile: Reversi for Android
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
SelfplayLab - Implementation of the alphago zero algorithm with some small games for experimenting with reinforcement learning
Hands-On-Meta-Learning-With-Python - Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
rl-trading - Using Reinforcement Learning agents as Algorithmic Traders
Siren-fastai2 - Unofficial implementation of 'Implicit Neural Representations with Periodic Activation Functions'