rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
In case you want to take a look, the envs are published here: https://github.com/MIMUW-RL/space-gym
Try a hyperparameter search. For stable-baselines3 it's implemented in rl-baselines3-zoo: https://github.com/DLR-RM/rl-baselines3-zoo. Hyperparameters make a huge difference in RL, much more than in supervised learning.
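To make the idea concrete, here is a rough sketch of what a hyperparameter search does, using plain random search in standard-library Python. It is not how rl-baselines3-zoo does it (the zoo relies on Optuna); the search ranges, the `toy_objective` function, and all other names below are hypothetical stand-ins. In a real run the objective would train an agent and return its mean evaluation reward.

```python
import math
import random


def sample_hyperparameters(rng):
    """Draw one candidate configuration (ranges are illustrative assumptions)."""
    return {
        "learning_rate": 10 ** rng.uniform(-5, -2),  # log-uniform in [1e-5, 1e-2]
        "gamma": rng.choice([0.9, 0.95, 0.99, 0.999]),
        "batch_size": rng.choice([32, 64, 128, 256]),
    }


def toy_objective(params):
    """Hypothetical stand-in for 'train an agent and return its mean reward'.

    The score peaks at learning_rate = 10**-3.5 and gamma = 0.99;
    batch_size is ignored by this toy function.
    """
    lr_penalty = (math.log10(params["learning_rate"]) + 3.5) ** 2
    gamma_penalty = 100 * (params["gamma"] - 0.99) ** 2
    return -(lr_penalty + gamma_penalty)


def random_search(objective, n_trials=50, seed=0):
    """Evaluate n_trials random configurations and keep the best-scoring one."""
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    for _ in range(n_trials):
        params = sample_hyperparameters(rng)
        score = objective(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


best_params, best_score = random_search(toy_objective)
print(best_params, best_score)
```

Swapping `toy_objective` for a function that actually trains and evaluates an agent gives the basic loop; dedicated tools add smarter samplers and early stopping of bad trials on top of this.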
Check out Alf examples