maze
lumos
Our great sponsors
maze | lumos | |
---|---|---|
4 | 4 | |
257 | 406 | |
1.2% | 53.4% | |
0.0 | 8.9 | |
23 days ago | about 1 month ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
maze
-
[P] Maze: A Framework for Applied Reinforcement Learning
Check out Maze on GitHub - we'd love feedback from anybody with an interest and/or experience in reinforcement learning!
-
Maze: A Framework for Applied Reinforcement Learning
Check out Maze on GitHub and its documentation here.
-
Is there a consensus about RL frameworks?
For industrial and logistics problems this one looks promising: https://github.com/enlite-ai/maze saw their presentation 2 weeks ago at an international AI conference and was surprised that its already in use and available on github.
- MazeRL - Applied Reinforcement Learning with Python
lumos
-
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Guess you are looking for this - https://github.com/allenai/lumos/blob/main/README.md
- [R] Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
What are some alternatives?
dcss-ai-wrapper - An API for Dungeon Crawl Stone Soup for Artificial Intelligence research.
Awesome-Prompt-Engineering - This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
machin - Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
emukit - A Python-based toolbox of various methods in decision making, uncertainty quantification and statistical emulation: multi-fidelity, experimental design, Bayesian optimisation, Bayesian quadrature, etc.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
SwiftSage - SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
nle - The NetHack Learning Environment
ToolEmu - A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
dm_env - A Python interface for reinforcement learning environments
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
RL-Adventure - Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
multirotor - Multicopter UAV simulation for control/RL experiments.