Metric | chain-of-thought-hub | gorilla
---|---|---
Mentions | 10 | 51
Stars | 2,371 | 10,026
Growth | - | -
Activity | 6.9 | 8.9
Last commit | 10 days ago | 6 days ago
Language | Jupyter Notebook | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
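The page does not publish the formula behind the activity number. As a rough sketch only, assuming an exponential decay of commit weight and a percentile rank scaled to 0-10, the idea could look like the following. The `half_life_days` parameter and both function names are hypothetical and are not taken from the site.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    The decay shape and the 30-day default are assumptions for illustration;
    the source only says recent commits weigh more than older ones.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rating(project_score, all_scores):
    """Map a raw score to a 0-10 rating by percentile among tracked projects."""
    ranked = sorted(all_scores)
    # Number of tracked projects with a strictly lower raw score.
    below = bisect_left(ranked, project_score)
    return round(10.0 * below / len(ranked), 1)
```

Under these assumptions, a project whose raw score beats 90 of 100 tracked projects gets `activity_rating(90, list(range(100)))`, which evaluates to `9.0`, matching the "top 10%" reading of a 9.0 activity.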
chain-of-thought-hub
- Chain-Of-Thought Hub: Measuring LLMs' Reasoning Performance
- All Model Leaderboards (that I know)
  Chain-of-Thought Hub https://github.com/FranxYao/chain-of-thought-hub - these are mostly gathered, although Yao Fu, the author, is working on specific CoT runs
- It looks likely that the MMLU score on Hugging Face's LLM leaderboard is wrong after all.
- (2/2) May 2023
  Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance (https://github.com/FranxYao/chain-of-thought-hub)
- Ask HN: Is it just me or GPT-4's quality has significantly deteriorated lately?
  https://github.com/FranxYao/chain-of-thought-hub
- [N] Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance
- Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance
gorilla
- Launch HN: Nango (YC W23) – Open-Source Unified API
  Do you leverage https://gorilla.cs.berkeley.edu/ at all? If not, perhaps consider if it would solve some pain for you.
- Autonomous LLM agents with human-out-of-loop
- Show HN: I made a script to scrape your Facebook group
- Pushing ChatGPT's Structured Data Support to Its Limits
  * Gorilla [https://github.com/ShishirPatil/gorilla]
  Could be interesting to try some of these exercises with these models.
- Guidance for selecting a function-calling library?
  gorilla
- Gorilla: An API Store for LLMs
- Show HN: OpenAPI DevTools – Chrome ext. that generates an API spec as you browse
  Nice, this made me go back and check up on the Gorilla LLM project [1] to see what they are doing with APIs and whether they have applied their fine-tuning to any of the newer foundation models. It looks like things have slowed down since they launched (?), or maybe development is happening elsewhere on some invisible Discord channel. I hope the intersection of API calling and LLMs as a logic-processing function keeps getting focus; it's an important direction for interop across the web.
  [1] https://github.com/ShishirPatil/gorilla
- RestGPT
  "Gorilla: Large Language Model Connected with Massive APIs" (2023) https://gorilla.cs.berkeley.edu/ :
  > Gorilla enables LLMs to use tools by invoking APIs. Given a natural language query, Gorilla comes up with the semantically- and syntactically- correct API to invoke. With Gorilla, we are the first to demonstrate how to use LLMs to invoke 1,600+ (and growing) API calls accurately while reducing hallucination. We also release APIBench, the largest collection of APIs, curated and easy to be trained on! Join us, as we try to expand the largest API store and teach LLMs how to write them!
  eval/:
- Calling APIs with Natural Language
- Shishir Patil: Teaching AI to Use APIs with Gorilla LLM – Humans of AI Podcast
  Humans of AI Podcast #7
  An amazing conversation with Shishir Patil, the creator of Gorilla LLM, a large language model specifically trained to use APIs!
  Shishir is currently a 5th-year PhD student at the University of California, Berkeley, whose work broadly covers ML-Systems, LLMs, Edge-ML, and Sky computing.
  Definitely give the episode a listen to hear Shishir's story.
  And to read more about #GorillaLLM, check out the project page!
  https://gorilla.cs.berkeley.edu
What are some alternatives?
DB-GPT - AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
llm-leaderboard - A joint community effort to create one central leaderboard for LLMs.
Voyager - An Open-Ended Embodied Agent with Large Language Models
tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
gorilla-cli - LLMs for your CLI
airoboros - Customizable implementation of the self-instruct paper.
Gin - Gin is a HTTP web framework written in Go (Golang). It features a Martini-like API with much better performance -- up to 40 times faster. If you need smashing performance, get yourself some Gin.
llm-humaneval-benchmarks
GirlfriendGPT - Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0
SuperAGI - <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.