Agent trains great with PPO but terrible with SAC --> Advice for Hyperparameters

rl-baselines-zoo

Mentions: 2 | Stars: 1,106 | Activity: 0.0 | Language: Python

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Take a look at the tuned hyperparameter sets that rl-baselines-zoo ships for PPO and SAC across various problems. The batch sizes there are much smaller than yours, regardless of the problem. Your initial learning rate may also be too high.
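To make the advice above concrete, here is a minimal sketch of a more conservative SAC configuration. It assumes the convention used by Stable-Baselines3, where `learning_rate` may be a callable that receives `progress_remaining` (1.0 at the start of training, 0.0 at the end). The helper name `linear_schedule`, the dict name `sac_kwargs`, and the numbers are illustrative placeholders only. They are not the zoo's tuned values, so check the zoo's hyperparameter files for the settings that match your environment.

```python
from typing import Any, Callable, Dict


def linear_schedule(initial_value: float) -> Callable[[float], float]:
    """Return a schedule that decays linearly from initial_value to 0."""

    def schedule(progress_remaining: float) -> float:
        # progress_remaining runs from 1.0 (start of training) to 0.0 (end).
        return progress_remaining * initial_value

    return schedule


# Hypothetical starting point: placeholder values, NOT the zoo's tuned ones.
sac_kwargs: Dict[str, Any] = {
    "batch_size": 256,                       # a modest batch size
    "learning_rate": linear_schedule(3e-4),  # decays to 0 over training
}

if __name__ == "__main__":
    lr = sac_kwargs["learning_rate"]
    for progress in (1.0, 0.5, 0.0):
        print(f"progress_remaining={progress:.1f} -> lr={lr(progress):.2e}")
    # With stable-baselines3 installed and an environment `env` at hand,
    # the dict could be passed along like this (assumption, not run here):
    #   from stable_baselines3 import SAC
    #   model = SAC("MlpPolicy", env, **sac_kwargs)
```

If SAC still underperforms PPO with settings like these, comparing your values one by one against the zoo entry for the closest environment is probably the quickest way to spot the outlier.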

Can't solve MountainCar-v0 with A2C algorithm (stable-baselines3)
1 project | /r/reinforcementlearning | 27 Jun 2023
Stable-Baselines3 v2.0: Gymnasium Support
2 projects | /r/reinforcementlearning | 26 Jun 2023
Understanding Action Masking in RLlib
1 project | /r/reinforcementlearning | 12 Mar 2023
Tips and Tricks for RL from Experimental Data using Stable Baselines3 Zoo
1 project | /r/reinforcementlearning | 2 Jul 2022
Simple continuous environment with spaceship but yet challenging for RL algorithms (like SAC, TD3)
3 projects | /r/reinforcementlearning | 28 Jun 2022

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning
rl zoo reinforcement-learning stable-baselines openai-gym
Post date: 28 Aug 2022