Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 7 Python policy-gradient Projects
-
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
-
pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
-
episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Project mention: Is it better to not use the Target Update Frequency in Double DQN or depends on the application? | /r/reinforcementlearning | 2023-07-05The tianshou implementation I found at https://github.com/thu-ml/tianshou/blob/master/tianshou/policy/modelfree/dqn.py is DQN by default.
Project mention: Question about Transformer model input in RL | /r/reinforcementlearning | 2023-06-17Check out this implementation https://github.com/MarcoMeter/episodic-transformer-memory-ppo
Python policy-gradient related posts
- Is it better to not use the Target Update Frequency in Double DQN or depends on the application?
- 他們能回來嗎
- Multi-Agent Stable Baselines
- Question about the old policy and new policy in TRPO code
- Tensorflow vs PyTorch for A3C
- "Tianshou: a Highly Modularized Deep Reinforcement Learning Library", Weng et al 2021 (Python PyTorch MuJuCo; PPO, DQN, A2C, DDPG, SAC, TD3, REINFORCE, NPG, TRPO, ACKTR)
- "Tianshou: a Highly Modularized Deep Reinforcement Learning Library", Weng et al 2021 (Python PyTorch MuJuCo; PPO, DQN, A2C, DDPG, SAC, TD3, REINFORCE, NPG, TRPO, ACKTR)
-
A note from our sponsor - InfluxDB
www.influxdata.com | 29 Apr 2024
Index
What are some of the best open-source policy-gradient projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | tianshou | 7,406 |
2 | PPO-PyTorch | 1,453 |
3 | HandyRL | 282 |
4 | pytorch-learn-reinforcement-learning | 139 |
5 | episodic-transformer-memory-ppo | 108 |
6 | nes-torch | 17 |
7 | pbo | 13 |
Sponsored