d3rlpy
Minari
d3rlpy | Minari | |
---|---|---|
2 | 1 | |
1,215 | 222 | |
- | 7.2% | |
9.1 | 8.2 | |
10 days ago | 10 days ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
d3rlpy
- Python libraries for solving reinforcement learning problems implemented in OpenAI gym
-
Conservative Q Learning TD error not converging
Hi, I am using the discrete conservative Q learning implementation in the d3rlpy library (https://github.com/takuseno/d3rlpy) to train a policy offline to optimize mechanical ventilation treatment by using the MIMIC-III dataset (https://physionet.org/content/mimiciii-demo/1.4/).
Minari
-
Announcing Minari (Gym for offline RL, by the Farama Foundation) is going into public beta
You can also read the full release notes here: https://github.com/Farama-Foundation/Minari/releases/tag/v0.3.0
What are some alternatives?
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
exorl - ExORL: Exploratory Data for Offline Reinforcement Learning
gymprecice - A framework to design and develop reinforcement learning environments for single- and multi-physics active flow control.
Coursera_Reinforcement_Learning - Coursera Reinforcement Learning Specialization by University of Alberta & Alberta Machine Intelligence Institute
PettingZoo - An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
MO-Gymnasium - Multi-objective Gymnasium environments for reinforcement learning
rlai - This is a Python implementation of concepts and algorithms described in "Reinforcement Learning: An Introduction" (Sutton and Barto, 2018, 2nd edition).