rl_games vs alpha-zero-general
Metric | rl_games | alpha-zero-general |
---|---|---|
Mentions | 2 | 4 |
Stars | 744 | 3,683 |
Growth | - | - |
Activity | 5.3 | 4.7 |
Last commit | 9 days ago | 8 days ago |
Language | Jupyter Notebook | Jupyter Notebook |
License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rl_games
- Exporting a RL Policy From Isaac Gym for Dofbot
  Source Example: https://github.com/Denys88/rl_games/blob/master/notebooks/train_and_export_onnx_example_continuous.ipynb
- V-MPO - what do you think
  I tried to reproduce it in my library. You can take a look at the implementation (https://github.com/Denys88/rl_games/pull/177); you can even find a few configs. Moonlander and cartpole work well.
alpha-zero-general
- Competitive reinforcement learning for turn-based games
  This is a good intro to AlphaZero and Monte Carlo tree search, followed by this repo.
- Looking for deeper understanding of AlphaZero algorithm
- Any interest in a strong Santorini (no powers) AI?
  I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of the game logic (about 100 lines). So to use the AI you really need the weights for the neural network. I plan on releasing a better version than the current one in, say, two months or so.
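
The last post describes adding about 100 lines of game logic on top of alpha-zero-general. As a rough illustration of what such game logic might look like, here is a minimal tic-tac-toe sketch. The method names mirror the Game interface as we understand it from that repository; the class name `TicTacToeGame`, the tuple board encoding, the `play` helper, and the draw value `1e-4` are assumptions made for illustration, and this is not the poster's Santorini code.

```python
from typing import List, Sequence, Tuple

# Board encoding (an assumption for this sketch): 9 cells in row-major order,
# 1 = first player, -1 = second player, 0 = empty.
Board = Tuple[int, ...]

# The eight winning lines on a 3x3 board.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
         (0, 3, 6), (1, 4, 7), (2, 5, 8),
         (0, 4, 8), (2, 4, 6)]


class TicTacToeGame:
    """Hypothetical game class; method names assumed to match the repo's Game interface."""

    def getInitBoard(self) -> Board:
        return (0,) * 9

    def getBoardSize(self) -> Tuple[int, int]:
        return (3, 3)

    def getActionSize(self) -> int:
        return 9  # one action per cell

    def getNextState(self, board: Board, player: int, action: int) -> Tuple[Board, int]:
        cells = list(board)
        cells[action] = player
        return tuple(cells), -player  # new board and the player to move next

    def getValidMoves(self, board: Board, player: int) -> List[int]:
        # Binary mask over actions: 1 where the cell is empty.
        return [1 if cell == 0 else 0 for cell in board]

    def getGameEnded(self, board: Board, player: int) -> float:
        # 0 = not finished, 1 = `player` won, -1 = `player` lost,
        # small non-zero value = draw (the exact draw value is an assumption).
        for a, b, c in LINES:
            total = board[a] + board[b] + board[c]
            if total == 3 * player:
                return 1
            if total == -3 * player:
                return -1
        return 1e-4 if 0 not in board else 0

    def getCanonicalForm(self, board: Board, player: int) -> Board:
        # View the board from `player`'s perspective (own pieces become 1).
        return tuple(player * cell for cell in board)

    def stringRepresentation(self, board: Board) -> str:
        return ",".join(str(cell) for cell in board)


def play(game: TicTacToeGame, actions: Sequence[int]) -> Tuple[Board, int]:
    """Apply a fixed list of moves, alternating players, starting with player 1."""
    board, player = game.getInitBoard(), 1
    for action in actions:
        if not game.getValidMoves(board, player)[action]:
            raise ValueError(f"illegal move: {action}")
        board, player = game.getNextState(board, player, action)
    return board, player
```

In a setup like this, the search and the network would only touch the game through these methods, which may be why a new game can be added with little code while the trained weights remain the hard part to reproduce.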
What are some alternatives?
OmniIsaacGymEnvs-DofbotReacher - Dofbot Reacher Reinforcement Learning Sim2Real Environment for Omniverse Isaac Gym/Sim
muzero-general - MuZero
Practical_RL - A course in reinforcement learning in the wild
minigo - An open-source implementation of the AlphaGoZero algorithm
Excolligere - Forked Repo of J3soon's OmniIsaacGym-DofbotReacher
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
nn - 🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
a3c_trading - Trading with recurrent actor-critic reinforcement learning
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
reversatile - Reversatile: Reversi for Android
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..