- rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Take a look at these tuned sets of hyperparameters for various problems in PPO and SAC. The batch sizes are WAY smaller regardless of the problem. Your initial learning rate may also be too high.
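To make the learning-rate point concrete, here is a minimal sketch of a linearly decaying learning rate combined with a small batch size. It assumes the Stable-Baselines3 convention that `learning_rate` may be a callable of `progress_remaining` (1.0 at the start of training, 0.0 at the end). The name `ppo_kwargs` and the numbers in it are hypothetical placeholders, not values copied from any tuned configuration.

```python
from typing import Callable


def linear_schedule(initial_value: float) -> Callable[[float], float]:
    """Return a schedule that decays linearly from initial_value to 0."""

    def schedule(progress_remaining: float) -> float:
        # progress_remaining runs from 1.0 (start of training) to 0.0 (end),
        # so the learning rate shrinks as training advances.
        return progress_remaining * initial_value

    return schedule


# Hypothetical starting point for illustration only; replace these with
# the tuned values for your environment.
ppo_kwargs = {
    "batch_size": 64,                        # small minibatch
    "n_steps": 2048,                         # rollout length per environment
    "learning_rate": linear_schedule(3e-4),  # decays to 0 over training
}
```

If Stable-Baselines3 is installed, a dictionary like this could be unpacked into the constructor, for example `PPO("MlpPolicy", env, **ppo_kwargs)`, where `env` is whatever environment you are training on.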
Related posts
- Can't solve MountainCar-v0 with A2C algorithm (stable-baselines3)
- Stable-Baselines3 v2.0: Gymnasium Support
- Understanding Action Masking in RLlib
- Tips and Tricks for RL from Experimental Data using Stable Baselines3 Zoo
- Simple continuous environment with spaceship but yet challenging for RL algorithms (like SAC, TD3)