chain-of-thought-hub vs airoboros

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting (by FranxYao)

Suggest topics

Source Code

Suggest alternative

Edit details

airoboros

Customizable implementation of the self-instruct paper. (by jondurbin)

Suggest topics

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

chain-of-thought-hub		airoboros
	Project
10	Mentions	8
2,371	Stars	948
-	Growth	-
6.9	Activity	8.7
10 days ago	Latest Commit	about 2 months ago
Jupyter Notebook	Language	Python
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

chain-of-thought-hub

Posts with mentions or reviews of chain-of-thought-hub. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-08.

Chain-Of-Thought Hub: Measuring LLMs' Reasoning Performance
1 project | /r/AIPrompt_requests | 24 Jun 2023
All Model Leaderboards (that I know)
4 projects | /r/LocalLLaMA | 8 Jun 2023

Chain-of-Thought Hub https://github.com/FranxYao/chain-of-thought-hub - these are mostly gathered although Yao Fu, the author is working on specific CoT runs
It looks likely that the MMLU score on Hugginface's LLM leaderboard is wrong after all.
1 project | /r/LocalLLaMA | 8 Jun 2023
(2/2) May 2023
14 projects | /r/dailyainews | 2 Jun 2023

Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance (https://github.com/FranxYao/chain-of-thought-hub)
Ask HN: Is it just me or GPT-4's quality has significantly deteriorated lately?
4 projects | news.ycombinator.com | 31 May 2023

https://github.com/FranxYao/chain-of-thought-hub
[N] Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance
1 project | /r/MachineLearning | 30 May 2023
Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance
1 project | /r/agi | 30 May 2023

1 project | /r/hypeurls | 30 May 2023

3 projects | news.ycombinator.com | 30 May 2023

airoboros

Posts with mentions or reviews of airoboros. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-04.

TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens
4 projects | news.ycombinator.com | 4 Sep 2023
Airoboros: Customizable implementation of the self-instruct paper
1 project | news.ycombinator.com | 24 Aug 2023
airoboros (tool) overhaul
1 project | /r/LocalLLaMA | 20 Jul 2023

Just wanted to drop a note that I overhauled the airoboros tool not the models to have most of the prompts I've been using to build the datasets, plus a couple extras.
(2/2) May 2023
14 projects | /r/dailyainews | 2 Jun 2023

airoboros: using large language models to fine-tune large language models (https://github.com/jondurbin/airoboros)
Airoboros [7B/13B]
1 project | /r/LocalLLM | 24 May 2023

This is a fine-tuned LlaMa model, using completely synthetic training data created by https://github.com/jondurbin/airoboros
airobors-13b - 98% eval vs gpt-3.5-turbo
1 project | /r/LocalLLaMA | 21 May 2023

I used airoboros, a python tool I wrote, to generate the synthetic instruction response pairs, and included a jailbreak prompt to attempt to bypass OpenAI censorship. This is the only dataset used to fine-tune the model.
[P] airoboros 7b - instruction tuned on 100k synthetic instruction/responses
2 projects | /r/MachineLearning | 12 May 2023

This is a 7b parameter, fine-tuned on 100k synthetic instruction/response pairs generated by gpt-3.5-turbo using my version of self-instruct airoboros
[P] airoboros: a rewrite of self-instruct/alpaca synthetic prompt generation
1 project | /r/MachineLearning | 3 May 2023

GitHub Repo

What are some alternatives?

When comparing chain-of-thought-hub and airoboros you can also consider the following projects:

DB-GPT - AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

WizardLM - Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder and WizardMath

llm-leaderboard - A joint community effort to create one central leaderboard for LLMs.

TinyLlama - The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%

WizardVicunaLM - LLM that combines the principles of wizardLM and vicunaLM

llm-humaneval-benchmarks

datablations - Scaling Data-Constrained Language Models

GirlfriendGPT - Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0

gorilla - Gorilla: An API store for LLMs

gptqlora - GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ