Muzero
AgileRL
Muzero | AgileRL | |
---|---|---|
1 | 12 | |
16 | 494 | |
- | 2.8% | |
10.0 | 9.8 | |
over 1 year ago | 8 days ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Muzero
AgileRL
- [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
-
[P] Significant improvements for multi-agent reinforcement learning!
Please check it out! https://github.com/AgileRL/AgileRL
- 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
-
(1/2) May 2023
Deep Reinforcement Learning library focused on improving development by introducing RLOps - MLOps for reinforcement learning (https://github.com/AgileRL/AgileRL)
-
[P] 10x faster reinforcement learning HPO - now for RLHF!
https://github.com/AgileRL/AgileRL/blob/main/CONTRIBUTING.md Has a link to our discord too
- 10x faster reinforcement learning HPO - now with CNNs!
- [P] 10x faster reinforcement learning HPO - now with CNNs!
-
[P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
GitHub: https://github.com/AgileRL/AgileRL
What are some alternatives?
neural-network-scratch - build a neural network to show as a demonstration on inner workings of a neural network
chat-ui - Open source codebase powering the HuggingChat app
LightZero - [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
RLeXplore - RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven exploration (RIDE).
Muzero-unplugged - Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
loopquest - A Production Tool for Embodied AI
Note - Easily implement parallel training and distributed training. Machine learning library. Note.neuralnetwork.tf package include Llama2, Llama3, CLIP, ViT, ConvNeXt, SwiftFormer, etc, these models built with Note are compatible with TensorFlow and can be trained with TensorFlow.
de-torch - Minimal PyTorch Library for Differential Evolution
Stochastic-muzero - Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
q-learning-algorithms - This repository will aim to provide implementations of q-learning algorithms (DQN, Double-DQN, ...) using Pytorch.
nn-morse - Decode morse using a neural network
Open-Llama - The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.