minimalRL
Pytorch-PCGrad
minimalRL | Pytorch-PCGrad | |
---|---|---|
5 | 1 | |
2,725 | 265 | |
- | - | |
1.6 | 1.8 | |
about 1 year ago | almost 3 years ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
minimalRL
- Does anyone know good python sources hardcoded of RL?
-
Question about pseudocodes
Did you try minimalRL?
- Rl algorithm implemented
-
RL agent for simple games?
This github is great.
-
PPO+LSTM Implementation
Maybe this implementation helps: https://github.com/seungeunrho/minimalRL/blob/master/ppo-lstm.py
Pytorch-PCGrad
What are some alternatives?
ElegantRL - Massively Parallel Deep Reinforcement Learning. 🔥
pytorch-grad-norm - Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning adjustable weight coefficients.
DeepRL-TensorFlow2 - 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
PPO-PyTorch - Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
rlpyt - Reinforcement Learning in PyTorch
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
pomdp-baselines - Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
muzero-general - MuZero
deep-RL-trading - playing idealized trading games with deep reinforcement learning
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ultimate-volleyball - 3D RL Volleyball environment built on Unity ML-Agents
pytorch-learn-reinforcement-learning - A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.