PPO rollout buffer for turn-based two-player game with varying turn lengths

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • stable-baselines3-contrib

    Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

    Simplified version of rollout collection (adapted from ppo_mask.py line 282):

  • stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

    Algorithm to compute returns and advantages (I use MaskableDictRolloutBuffer which inherits the shown implementation from RolloutBuffer (line 371)):

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • Code for https://arxiv.org/abs/1506.02438 found: https://github.com/170928/-Review-High-Dimensional-Continuous-Control-Using-Generalized-Advantage_Estimation

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts