ToolEmu
lumos

ToolEmu | lumos | |
---|---|---|
3 | 4 | |
127 | 460 | |
1.6% | 0.9% | |
5.5 | 8.9 | |
11 months ago | 11 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ToolEmu
-
[R] Identifying the Risks of LM Agents with an LM-Emulated Sandbox - University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!
Website: https://toolemu.com/
- ToolEmu: Identifying the Risks of LM Agents with an LM-Emulated Sandbox
-
Identifying the Risks of LM Agents with an LM-Emulated Sandbox - University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!
Github: https://github.com/ryoungj/toolemu
lumos
-
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Guess you are looking for this - https://github.com/allenai/lumos/blob/main/README.md
- [R] Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
What are some alternatives?
LOGICGUIDE - Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
SwiftSage - SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
ethics - Aligning AI With Shared Human Values (ICLR 2021)
emukit - A Python-based toolbox of various methods in decision making, uncertainty quantification and statistical emulation: multi-fidelity, experimental design, Bayesian optimisation, Bayesian quadrature, etc.
graph-of-thoughts - Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
maze - Maze Applied Reinforcement Learning Framework
npi - Action library for AI Agent
Awesome-Prompt-Engineering - This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
prompttools - Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
wheatley - Next-generation scheduling problem solver based on GNNs and Reinforcement Learning
