Emergent-Multiagent-Strategies vs pymarl2
Metric | Emergent-Multiagent-Strategies | pymarl2
---|---|---
Mentions | 1 | 1
Stars | 38 | 557
Growth | - | -
Activity | 0.0 | 5.0
Last commit | over 1 year ago | 4 months ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
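The page does not publish the formula behind the activity number, so the following is only a rough sketch of how a recency-weighted, percentile-based index of this kind could be computed. The exponential decay, the 30-day half-life, and the function names `recency_weighted_score` and `activity_index` are all assumptions made for illustration, not the site's actual method.

```python
from bisect import bisect_left


def recency_weighted_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights; each commit's weight halves every
    `half_life_days` (an assumed decay, not the site's formula)."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(score, all_scores):
    """Map a raw score to a 0-10 index: the fraction of tracked
    projects with a strictly lower score, multiplied by 10."""
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # number of projects scoring lower
    return round(10.0 * below / len(ranked), 1)


# Hypothetical data: 100 tracked projects with raw scores 0..99.
tracked = list(range(100))
index = activity_index(90, tracked)  # 90 of 100 projects score lower
```

Under these assumptions, an index of 9.0 means 90% of tracked projects score lower, which matches the "top 10%" reading given above.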
Emergent-Multiagent-Strategies
PPO with Transformer or Attention Mechanism
Not stable_baselines but I have an implementation of Attention + PPO in a multi-agent setting: https://github.com/Ankur-Deka/Emergent-Multiagent-Strategies
pymarl2
MARL top conference papers are ridiculous
https://github.com/hijkzzz/pymarl2 (RIIT)
What are some alternatives?
IC3Net - Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
nlp-recipes - Natural Language Processing Best Practices & Examples
Competitive-Programming
auto-sklearn - Automated Machine Learning with scikit-learn
DI-engine - OpenDILab Decision AI Engine
ai-economist - Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
pumpkin - The MAS Demonic Surveillance Platform. 🎃 [Moved to: https://github.com/scandale-project/pumpkin]
Mava - 🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
fast-reid - SOTA Re-identification Methods and Toolbox
SimpleView - Official Code for ICML 2021 paper "Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline"