 | rl-trained-agents | rl-baselines-zoo
---|---|---
Mentions | 2 | 2
Stars | 93 | 1,106
Growth | - | -
Activity | 2.1 | 0.0
Last commit | about 1 year ago | over 1 year ago
Language | Python | Python
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
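To make the description above concrete, here is a minimal sketch of how such a score could be computed. It is not the site's actual formula: the exponential decay, the 90-day half-life, and the function names `recency_weighted_commits` and `activity_score` are all assumptions chosen for illustration. The only properties taken from the text are that recent commits weigh more and that 9.0 corresponds to the top 10% of tracked projects.

```python
from bisect import bisect_left
from datetime import date, timedelta


def recency_weighted_commits(commit_dates, today, half_life_days=90.0):
    """Sum commit weights, where a commit's weight halves every `half_life_days`.

    The decay shape and the half-life are assumptions made for this sketch.
    """
    total = 0.0
    for commit_date in commit_dates:
        age_days = max((today - commit_date).days, 0)
        total += 0.5 ** (age_days / half_life_days)
    return total


def activity_score(raw, all_raw_scores):
    """Map a raw score to a 0-10 scale by percentile rank among tracked projects.

    A result of 9.0 means 90% of the tracked projects have a lower raw score,
    so the project sits in the top 10%.
    """
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw)  # projects with a strictly lower raw score
    return round(10.0 * below / len(ranked), 1)


# Hypothetical data: one commit today and one commit 90 days ago.
today = date(2024, 1, 1)
raw = recency_weighted_commits([today, today - timedelta(days=90)], today)
```

Under these assumptions, a commit made today counts fully and a commit from one half-life ago counts half, so a project with many old commits can still score below a project with a few recent ones.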
rl-trained-agents
- Where can I get pre trained machine learning models?
- Easily load and upload Stable-baselines3 models from the Hugging Face Hub 🤗
  Uploading RL-trained-agents models into the 🤗 Hub: a big collection of pre-trained reinforcement learning agents using stable-baselines3.
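As a rough sketch of what loading one of these agents from the Hub can look like, the snippet below uses `load_from_hub` from the `huggingface_sb3` package and the `load` class method of a Stable-Baselines3 algorithm. The `sb3` organization name and the `<algo>-<env>` naming pattern are assumptions about how the uploaded models are organized, and `hub_location` and `load_pretrained_agent` are hypothetical helper names. Checkpoints saved with older library versions may need extra arguments when loading.

```python
def hub_location(algo, env_id, org="sb3"):
    """Return (repo_id, filename) for a model, following the assumed naming pattern."""
    name = f"{algo}-{env_id}"
    return f"{org}/{name}", f"{name}.zip"


def load_pretrained_agent(algo_class, algo, env_id):
    """Download a checkpoint from the Hub and load it with the given SB3 class."""
    # Imported lazily so hub_location() works without the optional dependency.
    from huggingface_sb3 import load_from_hub

    repo_id, filename = hub_location(algo, env_id)
    # load_from_hub downloads the file and returns its local path.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return algo_class.load(checkpoint_path)
```

With `PPO` imported from `stable_baselines3`, calling `load_pretrained_agent(PPO, "ppo", "CartPole-v1")` would fetch and load the corresponding checkpoint, provided a repository with that name exists on the Hub.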
rl-baselines-zoo
- Agent trains great with PPO but terrible with SAC --> Advice for Hyperparameters
  Take a look at these tuned sets of hyperparameters for various problems in PPO and SAC. The batch sizes are WAY smaller regardless of the problem. Your initial learning rate may also be too high.
- How do I convert zoo / gym trained models to TensorFlow Lite or PyTorch TorchScript?
  https://github.com/araffin/rl-baselines-zoo (TensorFlow based, using https://github.com/hill-a/stable-baselines)
What are some alternatives?
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Minigrid - Simple and easily configurable grid world environments for reinforcement learning
learning-to-drive-in-5-minutes - Implementation of reinforcement learning approach to make a car learn to drive smoothly in minutes
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
pytorch-blender - Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
gym - A toolkit for developing and comparing reinforcement learning algorithms.
stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
gym-battleship - Battleship environment for reinforcement learning tasks