airoboros vs hydra-moe

airoboros

Customizable implementation of the self-instruct paper. (by jondurbin)

hydra-moe

By SkunkworksAI

Project	airoboros	hydra-moe
Mentions	8	2
Stars	948	395
Growth	-	1.5%
Activity	8.7	8.4
Latest Commit	about 2 months ago	6 months ago
Language	Python	Python
License	Apache License 2.0	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
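
The two derived metrics above can be illustrated with a minimal sketch. The site does not publish its exact formulas, so everything here is an assumption: the function names are hypothetical, growth is taken as a plain percentage change between two monthly star counts, and the activity score is taken as a percentile rank scaled to 0-10 (which is consistent with "9.0 means top 10%"). The weighting of recent commits is not modelled.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Percentage change in stars from one month to the next (assumed formula)."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100


def activity_score(raw_activity: float, all_raw_activities: list) -> float:
    """Map a raw activity value to a 0-10 score by percentile rank (assumed scheme).

    A project that is more active than 90% of tracked projects scores 9.0.
    """
    if not all_raw_activities:
        raise ValueError("all_raw_activities must not be empty")
    # Count the tracked projects that are strictly less active than this one.
    below = sum(1 for value in all_raw_activities if value < raw_activity)
    return round(10 * below / len(all_raw_activities), 1)


# Hypothetical inputs: 389 is an invented previous-month star count chosen
# only to illustrate the arithmetic; it is not taken from the page.
growth = round(month_over_month_growth(389, 395), 1)
score = activity_score(90, list(range(100)))
print(growth, score)
```

Under these assumptions, a project that goes from 389 to 395 stars shows roughly the 1.5% growth listed for hydra-moe, and a project ranked above 90 of 100 tracked projects gets an activity score of 9.0.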

airoboros

Posts with mentions or reviews of airoboros. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-04.

TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens
4 projects | news.ycombinator.com | 4 Sep 2023
Airoboros: Customizable implementation of the self-instruct paper
1 project | news.ycombinator.com | 24 Aug 2023
airoboros (tool) overhaul
1 project | /r/LocalLLaMA | 20 Jul 2023

Just wanted to drop a note that I overhauled the airoboros tool (not the models) to include most of the prompts I've been using to build the datasets, plus a couple of extras.
(2/2) May 2023
14 projects | /r/dailyainews | 2 Jun 2023

airoboros: using large language models to fine-tune large language models (https://github.com/jondurbin/airoboros)
Airoboros [7B/13B]
1 project | /r/LocalLLM | 24 May 2023

This is a fine-tuned LlaMa model, using completely synthetic training data created by https://github.com/jondurbin/airoboros
airoboros-13b - 98% eval vs gpt-3.5-turbo
1 project | /r/LocalLLaMA | 21 May 2023

I used airoboros, a python tool I wrote, to generate the synthetic instruction response pairs, and included a jailbreak prompt to attempt to bypass OpenAI censorship. This is the only dataset used to fine-tune the model.
[P] airoboros 7b - instruction tuned on 100k synthetic instruction/responses
2 projects | /r/MachineLearning | 12 May 2023

This is a 7b parameter model, fine-tuned on 100k synthetic instruction/response pairs generated by gpt-3.5-turbo using my version of self-instruct, airoboros
[P] airoboros: a rewrite of self-instruct/alpaca synthetic prompt generation
1 project | /r/MachineLearning | 3 May 2023

hydra-moe

Posts with mentions or reviews of hydra-moe. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-04.

Hydra – Model of Experts
1 project | news.ycombinator.com | 20 Nov 2023
TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens
4 projects | news.ycombinator.com | 4 Sep 2023

Thanks. Yes, I've seen airoboros, it aims to use a mixture of fine-tunes of the base model if I recall correctly. Not a truly pre-trained MOE, but could be useful.
Hydra, is this it? https://github.com/SkunkworksAI/hydra-moe

What are some alternatives?

When comparing airoboros and hydra-moe you can also consider the following projects:

WizardLM - Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder and WizardMath

TinyLlama - The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

llama.cpp - LLM inference in C/C++

WizardVicunaLM - LLM that combines the principles of wizardLM and vicunaLM

datablations - Scaling Data-Constrained Language Models

chain-of-thought-hub - Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

gorilla - Gorilla: An API store for LLMs

tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%

DB-GPT - AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

GirlfriendGPT - Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0

prompt-engineering - Tips and tricks for working with Large Language Models like OpenAI's GPT-4.