Popular-RL-Algorithms
alpha-zero-general
 | Popular-RL-Algorithms | alpha-zero-general
---|---|---
Mentions | 1 | 4
Stars | 983 | 3,667
Growth | - | -
Activity | 4.7 | 3.1
Last commit | 5 months ago | 2 months ago
Language | Jupyter Notebook | Jupyter Notebook
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Popular-RL-Algorithms
- What does LSTM do (rather than FC Layers) to SAC and TD3 and when to use them?
Here is the example: https://github.com/quantumiracle/Popular-RL-Algorithms
alpha-zero-general
- Competitive reinforcement learning for turn-based games
This is a good intro to AlphaZero and Monte Carlo tree search, followed by this repo.
- Looking for deeper understanding of AlphaZero algorithm
- Any interest in a strong Santorini (no powers) AI?
I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of game logic (about 100 lines). So for you to use the AI you really need the weights for the neural network. I plan on releasing a better version than the current version in say two months or so.
What are some alternatives?
amazon-sagemaker-examples - Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠Amazon SageMaker.
muzero-general - MuZero
Deep-Reinforcement-Learning-Algorithms - 32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
minigo - An open-source implementation of the AlphaGoZero algorithm
jaxrl - JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
a3c_trading - Trading with recurrent actor-critic reinforcement learning
Practical_RL - A course in reinforcement learning in the wild
reversatile - Reversatile: Reversi for Android
SelfplayLab - Implementation of the alphago zero algorithm with some small games for experimenting with reinforcement learning
Hands-On-Meta-Learning-With-Python - Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
rl-trading - Using Reinforcement Learning agents as Algorithmic Traders