TransformerXL + PPO Baseline + MemoryGym

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • episodic-transformer-memory-ppo

    Clean baseline implementation of PPO using an episodic TransformerXL memory

  • We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).

  • brain-agent

    Brain Agent for Large-Scale and Multi-Task Agent Learning

  • Brain Agent

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • DI-engine

    OpenDILab Decision AI Engine

  • DI Engine

  • Ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

  • RLlib

  • endless-memory-gym

    Challenging Memory-based Deep Reinforcement Learning Agents

  • Code: https://github.com/MarcoMeter/drl-memory-gym

  • adaptive-transformers-in-rl

    Adaptive Attention Span for Reinforcement Learning

  • Found relevant code at https://github.com/jerrodparker20/adaptive-transformers-in-rl + all code implementations here

  • popgym

    Partially Observable Process Gym

  • Have you seen this other ICLR paper, POPGym? Paper: https://openreview.net/forum?id=chDrutUTs0K Code: https://github.com/smorad/popgym

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Gymnasium

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

  • Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

  • ml-agents

    The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

  • Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

  • godot_rl_agents

    An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents

  • Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts