AgileRL
RLeXplore
Metric | AgileRL | RLeXplore
---|---|---
Mentions | 12 | 1
Stars | 501 | 317
Growth | 4.2% | 0.3%
Activity | 9.8 | 6.8
Last commit | 5 days ago | 6 days ago
Language | Python | Jupyter Notebook
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
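The page does not give the formulas behind these numbers, so the following is only a rough sketch of how such metrics could be computed. The function names, the 30-day half-life for weighting recent commits, and the percentile mapping onto a 0-10 scale are all assumptions for illustration, not the site's actual method.

```python
from bisect import bisect_left


def month_over_month_growth(stars_prev: int, stars_now: int) -> float:
    """Percentage change in stars relative to the previous month."""
    if stars_prev <= 0:
        raise ValueError("stars_prev must be positive")
    return 100.0 * (stars_now - stars_prev) / stars_prev


def weighted_commit_activity(commit_ages_days, half_life_days=30.0):
    """Sum of commit weights, where a commit's weight halves every
    `half_life_days` (assumed value), so recent commits count more."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw, all_raw):
    """Map a raw activity value to a 0-10 scale by percentile rank.

    Under this assumed mapping, a score of 9.0 means 90% of the tracked
    projects have a strictly lower raw value, i.e. the project is in the
    top 10%.
    """
    ranked = sorted(all_raw)
    below = bisect_left(ranked, raw)  # number of projects strictly below
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    print(month_over_month_growth(200, 210))
    print(weighted_commit_activity([0, 30]))
    print(activity_score(10, list(range(1, 11))))
```

Under these assumptions, a project whose raw activity exceeds that of nine out of ten tracked projects gets a score of 9.0, which matches the "top 10%" reading above.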
AgileRL
- [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- [P] Significant improvements for multi-agent reinforcement learning!
  Please check it out! https://github.com/AgileRL/AgileRL
- 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- (1/2) May 2023
  Deep Reinforcement Learning library focused on improving development by introducing RLOps - MLOps for reinforcement learning (https://github.com/AgileRL/AgileRL)
- [P] 10x faster reinforcement learning HPO - now for RLHF!
  https://github.com/AgileRL/AgileRL/blob/main/CONTRIBUTING.md has a link to our Discord too.
- 10x faster reinforcement learning HPO - now with CNNs!
- [P] 10x faster reinforcement learning HPO - now with CNNs!
- [P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
  GitHub: https://github.com/AgileRL/AgileRL
RLeXplore
- Evolutionary hyperparameter optimization for RL - 10x speed up
  Definitely up for giving it a go. I recently put together an SB3 callback that integrates with the RLeXplore baselines (https://github.com/yuanmingqi/rl-exploration-baselines/issues/3), so I have a bit of experience with this sort of thing.
What are some alternatives?
chat-ui - Open source codebase powering the HuggingChat app
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
loopquest - A Production Tool for Embodied AI
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
de-torch - Minimal PyTorch Library for Differential Evolution
rex-gym - OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
Muzero - PyTorch implementation of MuZero for Gym environments. It supports any Discrete, Box, and Box2D configuration for the action and observation spaces.
stable-baselines - Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
q-learning-algorithms - This repository aims to provide implementations of Q-learning algorithms (DQN, Double-DQN, ...) using PyTorch.
stable-baselines3-contrib - Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Open-Llama - The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms