pytorch-learn-reinforcement-learning
PPO-for-Beginners
Our great sponsors
pytorch-learn-reinforcement-learning | PPO-for-Beginners | |
---|---|---|
3 | 1 | |
139 | 639 | |
- | - | |
0.0 | 4.2 | |
almost 3 years ago | 4 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pytorch-learn-reinforcement-learning
PPO-for-Beginners
-
Why does this PPO implementation calculate the Advantage only once per rollout?
I am looking at this PPO implementation, which follows the pseudocode given in Spinning Up. This implementation has been really easy to follow and I understand almost everything. However, I am lost in line 103, where the author computes the normalized advantage before the rollout -
What are some alternatives?
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Tetris-deep-Q-learning-pytorch - Deep Q-learning for playing tetris game
PPO-PyTorch - Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
tianshou - An elegant PyTorch deep reinforcement learning library.
R-NaD - Experimentation with Regularized Nash Dynamics on a GPU accelerated game
6DRepNet - Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Amortized-SVGD-GAN - Learning to draw samples: with application to amortized maximum likelihood estimator for generative adversarial learning
stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
nes-torch - Minimal PyTorch Library for Natural Evolution Strategies
pytorch-GAT - My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!