deep_control vs. episodic-transformer-memory-ppo
Metric | deep_control | episodic-transformer-memory-ppo
---|---|---
Mentions | 1 | 5
Stars | 87 | 109
Growth | - | -
Activity | 1.8 | 2.5
Last commit | over 2 years ago | about 1 month ago
Language | Python | Python
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
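The page does not publish the formula behind the activity number, so the following is only a rough sketch of how a recency-weighted, relative score of this kind could be computed. The function names, the exponential decay, and the 90-day half-life are all assumptions made for illustration; only the two stated properties (recent commits weigh more, and 9.0 corresponds to the top 10% of tracked projects) come from the description above.

```python
def recency_weighted_score(commit_ages_days, half_life_days=90.0):
    """Raw score: each commit contributes a weight that halves every
    `half_life_days` (hypothetical decay; the real weighting is not published)."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(score, all_scores):
    """Relative 0-10 index: ten times the fraction of tracked projects
    whose raw score is strictly below `score`."""
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


# A commit made today counts fully; one made 90 days ago counts half.
raw = recency_weighted_score([0, 90])  # 1.5

# A project that outscores 9 of 10 tracked projects gets an index of 9.0,
# matching the "top 10%" reading of the example above.
index = activity_index(9, list(range(10)))  # 9.0
```

Under this reading, the index is a percentile rank rather than an absolute measure, which is why it is only meaningful relative to the other tracked projects.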
deep_control
- Help on what could be wrong on my TD3?
  So I am training with my own simulator from Unity connected to OpenAI Gym using TD3 adapted from this https://github.com/jakegrigsby/deep_control/blob/master/deep_control/td3.py
episodic-transformer-memory-ppo
- Question about Transformer model input in RL
  Check out this implementation https://github.com/MarcoMeter/episodic-transformer-memory-ppo
- Using transformers in RL?
  Maybe this easy-to-follow baseline implementation of PPO + TransformerXL is an inspiration for you.
- What RL library supports custom LSTM and Transformer neural networks to use with algorithms such as PPO?
  I provide baseline implementations of TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy to follow so that you can extend them with the features and tooling that you need.
- Trained a Transformer Decoder architecture with PPO, best way to maximize the entropy?
  You can also check out my baseline implementation of PPO + TrXL.
- TransformerXL + PPO Baseline + MemoryGym
  We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).
What are some alternatives?
autonomous-learning-library - A PyTorch library for building deep reinforcement learning agents.
godot_rl_agents - An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
Gymnasium - An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
popgym - Partially Observable Process Gym
recurrent-ppo-truncated-bptt - Baseline implementation of recurrent PPO using truncated BPTT
brain-agent - Brain Agent for Large-Scale and Multi-Task Agent Learning
rl8 - A high throughput, end-to-end RL library for infinite horizon tasks.
DI-engine - OpenDILab Decision AI Engine
ml-agents - The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
ppo-implementation-details - The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
endless-memory-gym - Challenging Memory-based Deep Reinforcement Learning Agents