HJxB
jaxrl
HJxB | jaxrl | |
---|---|---|
3 | 2 | |
13 | 576 | |
- | - | |
0.0 | 0.0 | |
over 2 years ago | over 1 year ago | |
Python | Jupyter Notebook | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
HJxB
jaxrl
-
JAX in Reinforcement Learning
Have you looked at this repo or this repo ?
-
CleanRL now has a DDPG + JAX implementation roughly 2.5-4x faster than DDPG + PyTorch
https://github.com/ikostrikov/jaxrl would be another great reference implementation. Probably you want to also checkout the docs for jax, flax, and optax.
What are some alternatives?
jax-resnet - Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).
jaxrl_m - Skeleton for scalable and flexible Jax RL implementations
long-range-arena - Long Range Arena for Benchmarking Efficient Transformers
sbx - SBX: Stable Baselines Jax (SB3 + Jax)
adaptive-policy-iteration - JAX implementation of Adaptive Approximate Policy Iteration (Hao et al., 2021)
indaba-pracs-2022 - Notebooks for the Practicals at the Deep Learning Indaba 2022.
dymos - Open Source Optimization of Dynamic Multidisciplinary Systems
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Popular-RL-Algorithms - PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
uvadlc_notebooks - Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023