Our great sponsors
-
stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Simplified version of rollout collection (adapted from ppo_mask.py line 282):
-
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Algorithm to compute returns and advantages (I use MaskableDictRolloutBuffer which inherits the shown implementation from RolloutBuffer (line 371)):
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Code for https://arxiv.org/abs/1506.02438 found: https://github.com/170928/-Review-High-Dimensional-Continuous-Control-Using-Generalized-Advantage_Estimation