JAX Implementations of Actor-Critic Algorithms

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

stable-baselines3

46 7,850 8.2 Python

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

I even have the PyTorch implementation faster in some cases (I created a branch with pytorch optimization that gives a 5% speed improvement https://github.com/DLR-RM/stable-baselines3/tree/exp/torch-optim ).
Ray

42 30,879 10.0 Python

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Folks like me using RLLib have observed this behavior: https://github.com/ray-project/ray/issues/12494
WorkOS

workos.com
sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
stable-baselines

10 4,000 0.0 Python

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

- tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715
rl-baselines3-zoo

11 1,764 6.3 Python

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

for pytorch, use the rl zoo (https://github.com/DLR-RM/rl-baselines3-zoo) and sb3 ;) https://github.com/DLR-RM/stable-baselines3

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project