Metric | endless-memory-gym | episodic-transformer-memory-ppo |
---|---|---|
Mentions | 1 | 5 |
Stars | 67 | 109 |
Growth | - | - |
Activity | 6.7 | 2.5 |
Last commit | 21 days ago | about 1 month ago |
Language | Python | Python |
License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
endless-memory-gym
- TransformerXL + PPO Baseline + MemoryGym
  Code: https://github.com/MarcoMeter/drl-memory-gym
episodic-transformer-memory-ppo
- Question about Transformer model input in RL
  Check out this implementation: https://github.com/MarcoMeter/episodic-transformer-memory-ppo
- Using transformers in RL?
  Maybe this easy-to-follow baseline implementation of PPO + TransformerXL is an inspiration for you.
- What RL library supports custom LSTM and Transformer neural networks to use with algorithms such as PPO?
  I provide baseline implementations of TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy to follow so that you can extend them with the features and tooling that you need.
- Trained a Transformer Decoder architecture with PPO, best way to maximize the entropy?
  You can also check out my baseline implementation of PPO + TrXL.
- TransformerXL + PPO Baseline + MemoryGym
  We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).
What are some alternatives?
quantum-arch-search - Cirq/PyTorch implementation of Quantum Architecture Search via Deep Reinforcement Learning (Kuo et al., 2021)
godot_rl_agents - An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
ConvLSTM-PyTorch - ConvLSTM/ConvGRU (Encoder-Decoder) with PyTorch on Moving-MNIST
Gymnasium - An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
DI-engine - OpenDILab Decision AI Engine
popgym - Partially Observable Process Gym
recurrent-ppo-truncated-bptt - Baseline implementation of recurrent PPO using truncated BPTT
brain-agent - Brain Agent for Large-Scale and Multi-Task Agent Learning
rl8 - A high throughput, end-to-end RL library for infinite horizon tasks.
ml-agents - The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
ppo-implementation-details - The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization