recurrent-ppo-truncated-bptt Alternatives
Similar projects and alternatives to recurrent-ppo-truncated-bptt
- ml-agents: The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
- ppo-implementation-details: The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization".
- PPO-PyTorch: Minimal implementation of clipped-objective Proximal Policy Optimization (PPO) in PyTorch.
- episodic-transformer-memory-ppo: Clean baseline implementation of PPO using an episodic TransformerXL memory.
- pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor-Critic (A2C), Proximal Policy Optimization (PPO), ACKTR (a scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation), and Generative Adversarial Imitation Learning (GAIL).
- cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG).
recurrent-ppo-truncated-bptt reviews and mentions
- What RL library supports custom LSTM and Transformer neural networks to use with algorithms such as PPO?
  I provide baseline implementations of TransformerXL + PPO and LSTM/GRU + PPO. These are designed to be slim and easy to follow, so that you can extend them with the features and tooling you need.
- How does a recurrent generator work in PPO?
- LSTM encoder in the policy?
- What is the best approach to a POMDP environment?
  Second, when training a limited-view agent in a tabular environment, I expected the recurrent PPO agent to perform better than CNN-based PPO, but it didn't. I used this repository, which was already implemented, and observed slow learning with it.
- LSTM with SAC not learning well on tasks like Mountain Car and Lunar Lander?
- Recurrent PPO using truncated BPTT
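The last mention points at the repository's core idea: rather than backpropagating through an entire episode, the recurrent policy is trained on fixed-length sequence chunks, and gradients are truncated at chunk boundaries. A minimal sketch of the data layout in plain Python (not the repository's actual code; `seq_len` and the padding value are illustrative assumptions):

```python
def split_into_sequences(episode, seq_len, pad_value=0.0):
    """Split one episode's steps into fixed-length chunks (truncated-BPTT windows).

    Gradients of the recurrent policy flow only within each chunk; the hidden
    state at a chunk boundary is carried forward but detached from the graph,
    which truncates backpropagation through time. Only the chunking and
    padding of the data are shown here.
    """
    sequences = []
    for start in range(0, len(episode), seq_len):
        chunk = episode[start:start + seq_len]
        # Pad the last chunk so every sequence has the same length.
        chunk = chunk + [pad_value] * (seq_len - len(chunk))
        sequences.append(chunk)
    return sequences
```

During the PPO update, each padded sequence would be fed to the LSTM/GRU (or TransformerXL memory) with the hidden state from the previous chunk detached, so gradients reach at most `seq_len` steps into the past.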
Stats
MarcoMeter/recurrent-ppo-truncated-bptt is an open-source project licensed under the MIT License, an OSI-approved license.
The primary programming language of recurrent-ppo-truncated-bptt is Jupyter Notebook.
Popular Comparisons
- recurrent-ppo-truncated-bptt VS ml-agents
- recurrent-ppo-truncated-bptt VS pomdp-baselines
- recurrent-ppo-truncated-bptt VS snakeAI
- recurrent-ppo-truncated-bptt VS PPO-PyTorch
- recurrent-ppo-truncated-bptt VS neroRL
- recurrent-ppo-truncated-bptt VS pytorch-a2c-ppo-acktr-gail
- recurrent-ppo-truncated-bptt VS cleanrl
- recurrent-ppo-truncated-bptt VS ppo-implementation-details
- recurrent-ppo-truncated-bptt VS popgym
- recurrent-ppo-truncated-bptt VS episodic-transformer-memory-ppo