sbx
jaxrl
sbx | jaxrl | |
---|---|---|
5 | 2 | |
265 | 576 | |
- | - | |
6.6 | 0.0 | |
24 days ago | over 1 year ago | |
Python | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sbx
-
Stable-Baselines3 v2.0: Gymnasium Support
Stable-Baselines Jax (SBX): https://github.com/araffin/sbx
-
JAX in Reinforcement Learning
If you want to learn from examples, you can take a look at clean rl or stable baselines jax (sbx): https://github.com/araffin/sbx
-
How can I speed up SAC?
You mean wallclock time or sample efficiency? For the former, you can take a look at Jax implementation like: https://github.com/araffin/sbx (SB3 + Jax)
-
Stable-Baselines3 v1.8 Release
The Hindsight Experience Replay (HER) buffer is compatible with all off-policy reinforcement learning algorithms. (and also compatible with the Jax version of SB3: https://github.com/araffin/sbx/pull/11).
-
JAX or PyTorch?
I haven't used JAX yet but quite excited about it - Little plug for SBX https://github.com/araffin/sbx which seems quite clean!
jaxrl
-
JAX in Reinforcement Learning
Have you looked at this repo or this repo ?
-
CleanRL now has a DDPG + JAX implementation roughly 2.5-4x faster than DDPG + PyTorch
https://github.com/ikostrikov/jaxrl would be another great reference implementation. Probably you want to also checkout the docs for jax, flax, and optax.
What are some alternatives?
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
jaxrl_m - Skeleton for scalable and flexible Jax RL implementations
indaba-pracs-2022 - Notebooks for the Practicals at the Deep Learning Indaba 2022.
fselect - Find files with SQL-like queries
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Study-Time-Tally - Track your study hours.
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Terminal-Video-Player - A program that can display video in the terminal using ascii characters
uvadlc_notebooks - Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
RL-X - A framework for Reinforcement Learning research.
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.