airoboros
gorilla
airoboros | gorilla | |
---|---|---|
8 | 51 | |
948 | 10,026 | |
- | - | |
8.7 | 8.9 | |
about 2 months ago | 6 days ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
airoboros
- TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens
- Airoboros: Customizable implementation of the self-instruct paper
-
airoboros (tool) overhaul
Just wanted to drop a note that I overhauled the airoboros tool not the models to have most of the prompts I've been using to build the datasets, plus a couple extras.
-
(2/2) May 2023
airoboros: using large language models to fine-tune large language models (https://github.com/jondurbin/airoboros)
-
Airoboros [7B/13B]
This is a fine-tuned LlaMa model, using completely synthetic training data created by https://github.com/jondurbin/airoboros
-
airobors-13b - 98% eval vs gpt-3.5-turbo
I used airoboros, a python tool I wrote, to generate the synthetic instruction response pairs, and included a jailbreak prompt to attempt to bypass OpenAI censorship. This is the only dataset used to fine-tune the model.
-
[P] airoboros 7b - instruction tuned on 100k synthetic instruction/responses
This is a 7b parameter, fine-tuned on 100k synthetic instruction/response pairs generated by gpt-3.5-turbo using my version of self-instruct airoboros
-
[P] airoboros: a rewrite of self-instruct/alpaca synthetic prompt generation
GitHub Repo
gorilla
-
Launch HN: Nango (YC W23) – Open-Source Unified API
Do you leverage https://gorilla.cs.berkeley.edu/ at all? If not, perhaps consider if it would solve some pain for you.
- Autonomous LLM agents with human-out-of-loop
- Show HN: I made a script to scrape your Facebook group
-
Pushing ChatGPT's Structured Data Support to Its Limits
* Gorilla [https://github.com/ShishirPatil/gorilla]
Could be interesting to try some of these exercises with these models.
-
Guidance for selecting a function-calling library?
gorilla
- Gorilla: An API Store for LLMs
-
Show HN: OpenAPI DevTools – Chrome ext. that generates an API spec as you browse
Nice this made me go back and check up on the Gorilla LLM project [1] to see whats they are doing with API and if they have applied their fine tuning to any of the newer foundation models but looks like things have slowed down since they launched (?) or maybe development is happening elsewhere on some invisible discord channel but I hope the intersection of API calling and LLM as a logic processing function keep getting focus it's an important direction for interop across the web.
[1] https://github.com/ShishirPatil/gorilla
-
RestGPT
"Gorilla: Large Language Model Connected with Massive APIs" (2023) https://gorilla.cs.berkeley.edu/ :
> Gorilla enables LLMs to use tools by invoking APIs. Given a natural language query, Gorilla comes up with the semantically- and syntactically- correct API to invoke. With Gorilla, we are the first to demonstrate how to use LLMs to invoke 1,600+ (and growing) API calls accurately while reducing hallucination. We also release APIBench, the largest collection of APIs, curated and easy to be trained on! Join us, as we try to expand the largest API store and teach LLMs how to write them!
eval/:
- Calling APIs with Natural Language
-
Shishir Patil: Teaching AI to Use APIs with Gorilla LLM – Humans of AI Podcast
Humans of AI Podcast #7
An amazing conversation with Shishir Patil the creator of the Gorilla LLM, a large language model specifically trained to use APIs!
Shishir is currently a 5th year PhD student at the University of California, Berkeley whose work broadly covers ML-Systems, LLMs, Edge-ML, and Sky computing.
Definitely give the episode a listen to hear Shishir's story.
And to read more about #GorillaLLM, check out the project page!
https://gorilla.cs.berkeley.edu
What are some alternatives?
WizardLM - Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder and WizardMath
DB-GPT - AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
TinyLlama - The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Voyager - An Open-Ended Embodied Agent with Large Language Models
WizardVicunaLM - LLM that combines the principles of wizardLM and vicunaLM
gorilla-cli - LLMs for your CLI
datablations - Scaling Data-Constrained Language Models
Gin - Gin is a HTTP web framework written in Go (Golang). It features a Martini-like API with much better performance -- up to 40 times faster. If you need smashing performance, get yourself some Gin.
chain-of-thought-hub - Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
GirlfriendGPT - Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0
tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
SuperAGI - <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.