AgileRL
tnt
AgileRL | tnt | |
---|---|---|
12 | 1 | |
501 | 1,630 | |
4.2% | 1.2% | |
9.8 | 9.6 | |
5 days ago | 6 days ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AgileRL
- [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
-
[P] Significant improvements for multi-agent reinforcement learning!
Please check it out! https://github.com/AgileRL/AgileRL
- 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
-
(1/2) May 2023
Deep Reinforcement Learning library focused on improving development by introducing RLOps - MLOps for reinforcement learning (https://github.com/AgileRL/AgileRL)
-
[P] 10x faster reinforcement learning HPO - now for RLHF!
https://github.com/AgileRL/AgileRL/blob/main/CONTRIBUTING.md Has a link to our discord too
- 10x faster reinforcement learning HPO - now with CNNs!
- [P] 10x faster reinforcement learning HPO - now with CNNs!
-
[P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
GitHub: https://github.com/AgileRL/AgileRL
tnt
-
What skills are necessary to understand/be able to make meaningful contributions to PyTorch?
Shameless plug, my team works on torcheval and torchtnt. Neither of them are core pytorch, but if you're looking to help build out tooling for metric evaluation or training frameworks, both libraries are pretty new with very low hanging fruit.
What are some alternatives?
chat-ui - Open source codebase powering the HuggingChat app
torcheval - A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools for PyTorch model evaluations.
RLeXplore - RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven exploration (RIDE).
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
loopquest - A Production Tool for Embodied AI
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
de-torch - Minimal PyTorch Library for Differential Evolution
Muzero - Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
q-learning-algorithms - This repository will aim to provide implementations of q-learning algorithms (DQN, Double-DQN, ...) using Pytorch.
Open-Llama - The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
easyopt - zero-code hyperparameters optimization framework
hlb-gpt - Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).