-
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
POPGym is based on RLlib and has two linear transformers and five or six RNN variants, including LSTM. I've found that transformers tend to perform pretty poorly in RL when compared to RNNs.
I am still working on it, but I used the ppo implementation of https://github.com/vwxyzjn/ppo-implementation-details and modifiy it. Fir transformer, i just implement with pytorch.
I provide baseline implementations on TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy-to-follow so that you can advance those implementations to the features and toolset that you need.
I provide baseline implementations on TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy-to-follow so that you can advance those implementations to the features and toolset that you need.