-
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
In some cases the PyTorch implementation is even faster (I created a branch with PyTorch optimizations that gives a 5% speed improvement: https://github.com/DLR-RM/stable-baselines3/tree/exp/torch-optim).
-
Ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Folks like me who use RLlib have observed this behavior: https://github.com/ray-project/ray/issues/12494
- tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715
-
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
For PyTorch, use the RL Zoo (https://github.com/DLR-RM/rl-baselines3-zoo) and SB3 ;) (https://github.com/DLR-RM/stable-baselines3)
Related posts
- [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
- [Question] Why there is so few algorithms implemented in SB3?
- Stable baselines! Where my people at?
- SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported
- Exporting an A2C model created with stable-baselines3 to PyTorch