| | DI-engine | episodic-transformer-memory-ppo |
|---|---|---|
| Mentions | 3 | 5 |
| Stars | 2,553 | 109 |
| Growth | 5.7% | - |
| Activity | 8.7 | 2.5 |
| Last commit | 10 days ago | about 1 month ago |
| Language | Python | Python |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DI-engine
-
Anyone have experience with DI-Engine?
I posted a while back asking people what frameworks they were using for RL research. Recently I stumbled upon DI-engine, which looks promising: actively maintained, with a diverse set of algorithms already implemented.
-
TransformerXL + PPO Baseline + MemoryGym
Struggling with algorithm generality? Try DI-engine; here is the solution.
episodic-transformer-memory-ppo
-
Question about Transformer model input in RL
Check out this implementation https://github.com/MarcoMeter/episodic-transformer-memory-ppo
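For context on what "transformer model input" means here (a hypothetical sketch, not the linked repository's actual code): a common way to give a transformer policy memory is to keep a sliding window of the most recent observations and feed that window, padded to a fixed length together with an attention mask, to the model at every step. The class name and API below are illustrative assumptions:

```python
# Hypothetical sketch of episodic memory for a transformer policy:
# keep a sliding window of recent observations and return it padded
# to a fixed context length, plus a mask marking real vs. padded steps.
# This is NOT the repository's code, just an illustration of the idea.

class EpisodicMemory:
    def __init__(self, max_len):
        self.max_len = max_len  # transformer context window length
        self.buffer = []        # observations from the current episode

    def reset(self):
        """Clear the memory at episode boundaries."""
        self.buffer = []

    def append(self, obs):
        """Store one observation, dropping the oldest beyond max_len."""
        self.buffer.append(obs)
        if len(self.buffer) > self.max_len:
            self.buffer.pop(0)

    def window(self, pad_value=0.0):
        """Return (padded_window, mask); mask is 1 for real steps, 0 for padding."""
        n = len(self.buffer)
        pad = self.max_len - n
        return self.buffer + [pad_value] * pad, [1] * n + [0] * pad


mem = EpisodicMemory(max_len=4)
for obs in [10, 20, 30]:
    mem.append(obs)
window, mask = mem.window()
# window == [10, 20, 30, 0.0], mask == [1, 1, 1, 0]
```

The mask is what the transformer's attention would use to ignore padded positions, so episodes shorter than the context length still produce fixed-shape inputs.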
-
Using transformers in RL?
Maybe this easy-to-follow baseline implementation of PPO + TransformerXL can serve as inspiration for you.
-
What RL library supports custom LSTM and Transformer neural networks to use with algorithms such as PPO?
I provide baseline implementations of TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy to follow, so that you can extend them with the features and tooling you need.
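To illustrate what a recurrent PPO baseline with truncated BPTT involves (a minimal sketch under assumed conventions, not the linked code): the collected trajectory is split into fixed-length sequences, padding the final one, so gradients are only backpropagated through a bounded number of timesteps per sequence:

```python
# Hypothetical sketch: split one episode's trajectory into fixed-length
# chunks for truncated backpropagation through time (BPTT). Recurrent
# and transformer PPO variants commonly train on such chunks, carrying
# the hidden state (or memory window) across chunk boundaries.

def chunk_trajectory(steps, seq_len, pad_value=None):
    """Split `steps` into chunks of length `seq_len`, padding the last chunk."""
    chunks = []
    for start in range(0, len(steps), seq_len):
        chunk = steps[start:start + seq_len]
        if len(chunk) < seq_len:
            chunk = chunk + [pad_value] * (seq_len - len(chunk))
        chunks.append(chunk)
    return chunks


episode = list(range(7))  # 7 environment steps
print(chunk_trajectory(episode, 3))
# [[0, 1, 2], [3, 4, 5], [6, None, None]]
```

Padded positions would be masked out of the PPO loss, which is why the padding value itself is irrelevant.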
-
Trained a Transformer Decoder architecture with PPO, best way to maximize the entropy?
You can also check out my baseline implementation of PPO + TrXL.
-
TransformerXL + PPO Baseline + MemoryGym
We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).
What are some alternatives?
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
godot_rl_agents - An open-source package that gives video game creators, AI researchers, and hobbyists the opportunity to learn complex behaviors for their non-player characters or agents
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Gymnasium - An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
tianshou - An elegant PyTorch deep reinforcement learning library.
popgym - Partially Observable Process Gym
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
recurrent-ppo-truncated-bptt - Baseline implementation of recurrent PPO using truncated BPTT
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
brain-agent - Brain Agent for Large-Scale and Multi-Task Agent Learning
on-policy - This is the official implementation of Multi-Agent PPO (MAPPO).
rl8 - A high throughput, end-to-end RL library for infinite horizon tasks.