ToolEmu vs npi

Metric | ToolEmu | npi |
---|---|---|
Mentions | 3 | 4 |
Stars | 127 | 208 |
Growth | 1.6% | 2.4% |
Activity | 5.5 | 9.7 |
Last commit | 11 months ago | 7 days ago |
Language | Python | Python |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits are weighted more heavily than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
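The page does not publish the formulas behind these numbers, so the following is only a minimal sketch of how such metrics could be computed. The exponential decay, the 90-day half-life, the percentile mapping, and all function names are assumptions made for illustration, not the site's actual method.

```python
from datetime import date, timedelta


def weighted_commit_score(commit_dates, today, half_life_days=90.0):
    """Raw score in which each commit's weight halves every `half_life_days`.

    The half-life is an assumed parameter; the source only says that recent
    commits weigh more than older ones.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_index(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank (assumed mapping).

    Under this mapping, 9.0 means the project scores higher than 90% of the
    tracked projects, i.e. it sits in the top 10%.
    """
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


def monthly_growth(stars_now, stars_month_ago):
    """Month-over-month growth in stars, as a percentage."""
    return round(100.0 * (stars_now - stars_month_ago) / stars_month_ago, 1)


if __name__ == "__main__":
    today = date(2024, 4, 1)
    # Hypothetical commit history: one commit today, one 90 days ago.
    commits = [today, today - timedelta(days=90)]
    print(weighted_commit_score(commits, today))
    # Hypothetical raw scores for ten tracked projects.
    print(activity_index(9, list(range(10))))
    # Hypothetical star counts one month apart.
    print(monthly_growth(110, 100))
```

A percentile-based index is relative by construction, which is consistent with the description above: the same commit history can yield a different activity value as the set of tracked projects changes.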
ToolEmu
- [R] Identifying the Risks of LM Agents with an LM-Emulated Sandbox - University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!
  Website: https://toolemu.com/ (ToolEmu: Identifying the Risks of LM Agents with an LM-Emulated Sandbox)
- Identifying the Risks of LM Agents with an LM-Emulated Sandbox - University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!
  GitHub: https://github.com/ryoungj/toolemu
npi
What are some alternatives?
LOGICGUIDE - Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
Tiger - We do NOT and WILL not have any Crypto Projects, they are a complete SCAM | Neuralink for your AI Agents - LangChain - Autogen - CrewAI
lumos - Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
xllm - 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
ethics - Aligning AI With Shared Human Values (ICLR 2021)
open-assistant-api - The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the openai/langchain sdk.
graph-of-thoughts - Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
awesome-gpt-prompt-engineering - A curated list of awesome resources, tools, and other shiny things for LLM prompt engineering.
prompttools - Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
GPTCache - Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
