Top 9 policy-gradient Open-Source Projects
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
-
HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
-
pytorch-learn-reinforcement-learning
A collection of various RL algorithms such as policy gradients, DQN, and PPO. The goal of this repo is to become a go-to resource for learning about RL: how to visualize, debug, and solve RL problems. It also includes playground.py for learning more about OpenAI gym, etc.
-
episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
Project mention: Is it better to not use the Target Update Frequency in Double DQN or depends on the application? | /r/reinforcementlearning | 2023-07-05
The tianshou implementation I found at https://github.com/thu-ml/tianshou/blob/master/tianshou/policy/modelfree/dqn.py is DQN by default.
Project mention: Question about Transformer model input in RL | /r/reinforcementlearning | 2023-06-17
Check out this implementation https://github.com/MarcoMeter/episodic-transformer-memory-ppo
policy-gradient discussion
policy-gradient related posts
-
Is it better to not use the Target Update Frequency in Double DQN or depends on the application?
-
Can they come back?
-
Multi-Agent Stable Baselines
-
Question about the old policy and new policy in TRPO code
-
Tensorflow vs PyTorch for A3C
-
"Tianshou: a Highly Modularized Deep Reinforcement Learning Library", Weng et al 2021 (Python PyTorch MuJoCo; PPO, DQN, A2C, DDPG, SAC, TD3, REINFORCE, NPG, TRPO, ACKTR)
-
A note from our sponsor - Scout Monitoring
www.scoutapm.com | 12 Jun 2024
Index
What are some of the best open-source policy-gradient projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | tianshou | 7,528 |
2 | PPO-PyTorch | 1,527 |
3 | HandyRL | 282 |
4 | pytorch-learn-reinforcement-learning | 143 |
5 | episodic-transformer-memory-ppo | 120 |
6 | recurrent-ppo-truncated-bptt | 108 |
7 | nes-torch | 17 |
8 | pbo | 14 |
9 | snakeAI | 10 |