tianshou
PantheonRL
tianshou | PantheonRL | |
---|---|---|
8 | 2 | |
7,459 | 117 | |
2.0% | 2.6% | |
9.5 | 5.0 | |
4 days ago | 6 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tianshou
-
Is it better to not use the Target Update Frequency in Double DQN or depends on the application?
The tianshou implementation I found at https://github.com/thu-ml/tianshou/blob/master/tianshou/policy/modelfree/dqn.py is DQN by default.
- 他們能回來嗎
-
Multi-Agent Stable Baselines
https://github.com/thu-ml/tianshou Imho there isn't a library that has it all, RLlib is quite good too, but I think that Tianshou is more similar to Pytorch and that helps to change the internals more intuitively and know what you are doing.
-
Question about the old policy and new policy in TRPO code
Good point...I'll check in more detail when I get a chance later today! I would suggest looking at a more recent implementation like https://github.com/DLR-RM/stable-baselines3 or https://github.com/thu-ml/tianshou if you're trying to build. https://spinningup.openai.com/en/latest/algorithms/trpo.html is particularly good for understanding
-
Tensorflow vs PyTorch for A3C
Do you absolutely need A3C? A2C has become more widely used (see, e.g., the comment in https://github.com/ikostrikov/pytorch-a3c, and the fact that both https://github.com/thu-ml/tianshou and https://github.com/facebookresearch/salina have A2C implementations, but no A3C at first glance).
-
"Tianshou: a Highly Modularized Deep Reinforcement Learning Library", Weng et al 2021 (Python PyTorch MuJuCo; PPO, DQN, A2C, DDPG, SAC, TD3, REINFORCE, NPG, TRPO, ACKTR)
Code for https://arxiv.org/abs/2107.14171 found: https://github.com/thu-ml/tianshou/
Get the code for Tianshou here (GitHub).
-
Best PyTorch RL library for doing research
I tried tianshou and thought it was well-designed for modularity, but it was early in development when I tried and missing some basic features
PantheonRL
-
Multi-Agent Stable Baselines
Maybe this is what you're looking for : https://github.com/Stanford-ILIAD/PantheonRL
- Policy for each of multi-agents in RL
What are some alternatives?
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
R-NaD - Experimentation with Regularized Nash Dynamics on a GPU accelerated game
cleanrl - High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Simple-MADRL-Chess - MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.
ElegantRL - Massively Parallel Deep Reinforcement Learning. 🔥
warp-drive - Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
pytorch-learn-reinforcement-learning - A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
pytorch-a3c - PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Deep-Reinforcement-Learning-Algorithms-with-PyTorch - PyTorch implementations of deep reinforcement learning algorithms and environments
pomdp-baselines - Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
DI-engine - OpenDILab Decision AI Engine