Metric | alpha-zero-general | rl-trading
---|---|---
Mentions | 4 | 1
Stars | 3,687 | 4
Growth | - | -
Activity | 4.7 | 0.0
Last commit | 5 days ago | over 3 years ago
Language | Jupyter Notebook | Jupyter Notebook
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
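The page does not give the formula behind the activity number, so the following is only a sketch of one way such a score could work, not the site's actual method. It assumes an exponential decay with a hypothetical 30-day half-life for weighting commits, and a percentile rank scaled to 0-10 for the final rating; the function names and parameters are illustrative.

```python
from bisect import bisect_left


def recency_weighted_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    The decay shape and the 30-day half-life are assumptions made
    for illustration; recent commits simply count more than old ones.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rating(score, all_scores):
    """Map a raw score to a 0-10 rating by percentile rank.

    A rating of 9.0 means 90% of tracked projects have a strictly
    lower score, i.e. the project is in the top 10%.
    """
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # projects with a strictly lower score
    return round(10.0 * below / len(ranked), 1)


# Ten hypothetical tracked projects with raw scores 0..9.
tracked = list(range(10))
top_rating = activity_rating(9, tracked)
```

Under these assumptions, a project whose last commit was years ago gets a raw score near zero, which is consistent with rl-trading showing an activity of 0.0.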
alpha-zero-general
- Competitive reinforcement learning for turn-based games
  This is a good intro to AlphaZero and Monte Carlo tree search, followed by this repo.
- Looking for deeper understanding of AlphaZero algorithm
- Any interest in a strong Santorini (no powers) AI?
  I'm not planning on sharing code at the moment as I'm still working on improving it. The main part of the code is simply from https://github.com/suragnair/alpha-zero-general plus my implementation of the game logic (about 100 lines). So to use the AI, you really need the weights for the neural network. I plan on releasing a better version than the current one in, say, two months or so.
rl-trading
- Recruiters representing Citadel have been aggressively attempting to recruit me as a software developer since mid-November, offering to pay $100-150k more than the median for early/mid-career developers
  In the summer, I did something somewhat similar to what OP suggests above. Here's a link to the corresponding GitHub repository.
What are some alternatives?
muzero-general - MuZero
Lean - Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
minigo - An open-source implementation of the AlphaGoZero algorithm
alphalens - Performance analysis of predictive (alpha) stock factors
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
nn - 🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
a3c_trading - Trading with recurrent actor-critic reinforcement learning
Practical_RL - A course in reinforcement learning in the wild
Deep-Learning-Computer-Vision - My assignment solutions for Stanford’s CS231n (CNNs for Visual Recognition) and Michigan’s EECS 498-007/598-005 (Deep Learning for Computer Vision), version 2020.
reversatile - Reversatile: Reversi for Android
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..