| | pomdp-baselines | machin |
|---|---|---|
| Mentions | 5 | 2 |
| Stars | 275 | 381 |
| Growth | - | - |
| Activity | 4.3 | 1.8 |
| Last commit | 7 months ago | over 2 years ago |
| Language | Python | Python |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
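The site does not publish the formula behind the activity number, so the following is only an illustrative sketch of how such a score could be built: each commit gets a weight that decays with age, and the raw score is then ranked against all tracked projects and scaled to 0-10. The function names, the exponential decay, and the 90-day half-life are all assumptions made for this example, not the tracker's actual method.

```python
from datetime import date


def recency_weighted_score(commit_dates, today, half_life_days=90.0):
    """Sum per-commit weights that halve every `half_life_days` days.

    Assumption: exponential decay is used here only for illustration.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_index(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank.

    The result is 10 times the fraction of tracked projects whose raw
    score is strictly lower, so 9.0 means the project outranks 90% of them.
    """
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


if __name__ == "__main__":
    today = date(2024, 1, 1)
    # One commit today (weight 1.0) and one exactly 90 days ago (weight 0.5).
    raw = recency_weighted_score([date(2024, 1, 1), date(2023, 10, 3)], today)
    print(raw)
    # Hypothetical raw scores for ten tracked projects.
    print(activity_index(9, list(range(10))))
```

Under this sketch, a project whose raw score beats nine out of ten tracked projects gets an index of 9.0, which matches the "top 10%" reading given above.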
pomdp-baselines
- Best recurrent RL library?
- In Latest Machine Learning Research, A Group at CMU Release a Simple and Efficient Implementation of Recurrent Model-Free Reinforcement Learning (RL) for Future Work to Use as a Baseline for POMDP Algorithms
  Check out the paper, GitHub link, project and reference article.
- [R] Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
  Code for https://arxiv.org/abs/2110.05038 found: https://github.com/twni2016/pomdp-baselines
machin
- Best PyTorch RL library for doing research
  Machin is really nice, it is very easy to use and to try different things, although it's developed by one person and maybe not appropriately tested yet.
- Is there a consensus about RL frameworks?
  I found this repo very helpful to get started: https://github.com/iffiX/machin
What are some alternatives?
tianshou - An elegant PyTorch deep reinforcement learning library.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
ElegantRL - Massively Parallel Deep Reinforcement Learning. 🔥
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Apache Impala - Apache Impala
DeepRL-TensorFlow2 - Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
RL-Adventure - PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL
minimalRL - Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
recurrent-ppo-truncated-bptt - Baseline implementation of recurrent PPO using truncated BPTT