Top 15 Python instruction-tuning Projects
-
LLaMA-Factory
Project mention: Llama-Factory: A WebUI for Efficient Fine-Tuning of 100 LLMs | news.ycombinator.com | 2024-07-17
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Project mention: Show HN: LLM Aided OCR (Correcting Tesseract OCR Errors with LLMs) | news.ycombinator.com | 2024-08-09
This package seems to use llama_cpp for local inference [1], so you can probably use anything supported by that [2]. However, I think it's just passing OCR output for correction - the language model doesn't actually see the original image.
That said, there are some large language models you can run locally which accept image input. Phi-3-Vision [3], LLaVA [4], MiniCPM-V [5], etc.
[1] - https://github.com/Dicklesworthstone/llm_aided_ocr/blob/main...
[2] - https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#de...
[3] - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct
[4] - https://github.com/haotian-liu/LLaVA
[5] - https://github.com/OpenBMB/MiniCPM-V
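The pattern described in that thread can be sketched with llama-cpp-python. The prompt wording, context size, and helper names below are illustrative assumptions, not code from the package itself:

```python
# Sketch of the pipeline described above: the language model only sees
# Tesseract's text output, never the original image. Prompt wording and
# parameters are assumptions for illustration.

def build_correction_prompt(ocr_text: str) -> str:
    """Wrap raw OCR output in an instruction asking for error correction only."""
    return (
        "The following text came from OCR and may contain recognition "
        "errors. Reproduce it with the errors fixed, changing nothing else:\n\n"
        + ocr_text
    )

def correct_ocr(ocr_text: str, model_path: str) -> str:
    """Run a locally hosted model (via llama-cpp-python) over the OCR text."""
    from llama_cpp import Llama  # pip install llama-cpp-python

    llm = Llama(model_path=model_path, n_ctx=4096, verbose=False)
    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": build_correction_prompt(ocr_text)}],
        temperature=0.0,  # deterministic cleanup, not creative rewriting
    )
    return out["choices"][0]["message"]["content"]
```

Since the correction step is text-in/text-out, any GGUF model llama.cpp supports should slot into `model_path`, which is why the thread points at the backend list [2] as the compatibility reference.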
-
LLMSurvey
Here’s another one - it’s older but has some interesting charts and graphs.
https://arxiv.org/abs/2303.18223
-
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
-
Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
-
cambrian
Project mention: Cambrian-1 an Open, Vision-Centric Exploration of Multimodal LLMs | news.ycombinator.com | 2024-06-25
Code: [cambrian-mllm/cambrian](https://github.com/cambrian-mllm/cambrian)
-
DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation (by NVlabs)
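The name spells out the technique: DoRA decomposes a pretrained weight into a per-column magnitude vector and a direction, and applies the low-rank LoRA update only to the directional part, renormalizing afterwards. A minimal numpy sketch of the merged weight under that decomposition; variable names are our own, not from the NVlabs implementation:

```python
# W' = m * (W0 + B @ A) / ||W0 + B @ A||_c, where ||.||_c is the
# column-wise L2 norm. B is zero-initialized, so training starts at W0.
import numpy as np

def dora_merged_weight(W0, B, A, m):
    """Merge a DoRA adapter: magnitude m rescales the LoRA-updated direction."""
    V = W0 + B @ A                        # directional part with low-rank update
    col_norm = np.linalg.norm(V, axis=0)  # L2 norm of each column of V
    return m * (V / col_norm)             # each column rescaled to magnitude m

# Tiny usage example
rng = np.random.default_rng(0)
d_out, d_in, r = 4, 3, 2
W0 = rng.standard_normal((d_out, d_in))
B = np.zeros((d_out, r))                  # zero init => B @ A == 0 at start
A = rng.standard_normal((r, d_in))
m = np.linalg.norm(W0, axis=0)            # magnitude initialized from W0
W = dora_merged_weight(W0, B, A, m)       # equals W0 before any training
```

With `B` at its zero initialization the merged weight reproduces `W0` exactly, so fine-tuning only moves the model away from the pretrained weights as `B`, `A`, and `m` are updated.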
-
HugNLP
CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformers. Start hugging for NLP now!😊 (by HugAILab)
-
Python instruction-tuning related posts
-
Google Bard AI Now Has the Ability to Understand YouTube Videos
-
Video-LLaVA
-
Share your favorite materials: intersection of LLMs and business applications
-
Recommended open LLMs with image input modality?
-
HugNLP: A Unified and Comprehensive Open-Source Library for NLP
-
[R] CodeCapybara: Another open source model for code generation based on instruction tuning, outperformed Llama and CodeAlpaca
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 Oct 2024
Index
What are some of the best open-source instruction-tuning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | LLaMA-Factory | 31,926 |
| 2 | LLaVA | 19,655 |
| 3 | LLMSurvey | 10,139 |
| 4 | self-instruct | 4,080 |
| 5 | Otter | 3,560 |
| 6 | NExT-GPT | 3,241 |
| 7 | Video-LLaVA | 2,888 |
| 8 | mPLUG-Owl | 2,271 |
| 9 | cambrian | 1,713 |
| 10 | InternVideo | 1,338 |
| 11 | DataDreamer | 813 |
| 12 | DoRA | 591 |
| 13 | HugNLP | 374 |
| 14 | CodeCapybara | 159 |
| 15 | tasksource | 144 |