Python instruction-tuning

Open-source Python projects categorized as instruction-tuning

Top 15 Python instruction-tuning Projects

instruction-tuning
  • LLaMA-Factory

    Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

    Project mention: Llama-Factory: A WebUI for Efficient Fine-Tuning of 100 LLMs | news.ycombinator.com | 2024-07-17
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Project mention: Show HN: LLM Aided OCR (Correcting Tesseract OCR Errors with LLMs) | news.ycombinator.com | 2024-08-09

    This package seems to use llama_cpp for local inference [1] so you can probably use anything supported by that [2]. However, I think it's just passing OCR output for correction - the language model doesn't actually see the original image.

    That said, there are some large language models you can run locally which accept image input. Phi-3-Vision [3], LLaVA [4], MiniCPM-V [5], etc.

    [1] - https://github.com/Dicklesworthstone/llm_aided_ocr/blob/main...

    [2] - https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#de...

    [3] - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct

    [4] - https://github.com/haotian-liu/LLaVA

    [5] - https://github.com/OpenBMB/MiniCPM-V

  • LLMSurvey

    The official GitHub page for the survey paper "A Survey of Large Language Models".

    Project mention: Ask HN: Textbook Regarding LLMs | news.ycombinator.com | 2024-03-23

    Here’s another one - it’s older but has some interesting charts and graphs.

    https://arxiv.org/abs/2303.18223

  • self-instruct

    Aligning pretrained language models with instruction data generated by themselves.

  • Otter

    🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

  • NExT-GPT

    Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

  • Video-LLaVA

    【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

    Project mention: FLaNK Stack Weekly for 27 November 2023 | dev.to | 2023-11-27
  • mPLUG-Owl

    mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

  • cambrian

    Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

    Project mention: Cambrian-1 an Open, Vision-Centric Exploration of Multimodal LLMs | news.ycombinator.com | 2024-06-25

    Code: [cambrian-mllm/*cambrian*](https://github.com/cambrian-mllm/cambrian)

  • InternVideo

    [ECCV2024] Video Foundation Models & Data for Multimodal Understanding

  • DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

    Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • DoRA

    [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation (by NVlabs)

    Project mention: FLaNK-AIM Weekly 06 May 2024 | dev.to | 2024-05-06
  • HugNLP

    CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 (by HugAILab)

  • CodeCapybara

    Open-source Self-Instruction Tuning Code LLM

  • tasksource

    Datasets collection and preprocessings framework for NLP extreme multitask learning

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python instruction-tuning discussion

Log in or Post with

Python instruction-tuning related posts

  • Google Bard AI Now Has the Ability to Understand YouTube Videos

    2 projects | news.ycombinator.com | 24 Nov 2023
  • Video-LLaVA

    1 project | /r/hypeurls | 23 Nov 2023
  • Video-LLaVA

    3 projects | news.ycombinator.com | 21 Nov 2023
  • Share your favorite materials: intersection of LLMs and business applications

    1 project | news.ycombinator.com | 29 Aug 2023
  • Recommended open LLMs with image input modality?

    3 projects | /r/LocalLLaMA | 8 Jul 2023
  • HugNLP: A Unified and Comprehensive Open-Source Library for NLP

    2 projects | news.ycombinator.com | 3 May 2023
  • [R] CodeCapybara: Another open source model for code generation based on instruction tuning, outperformed Llama and CodeAlpaca

    2 projects | /r/MachineLearning | 24 Apr 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 12 Oct 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source instruction-tuning projects in Python? This list will help you:

Project Stars
1 LLaMA-Factory 31,926
2 LLaVA 19,655
3 LLMSurvey 10,139
4 self-instruct 4,080
5 Otter 3,560
6 NExT-GPT 3,241
7 Video-LLaVA 2,888
8 mPLUG-Owl 2,271
9 cambrian 1,713
10 InternVideo 1,338
11 DataDreamer 813
12 DoRA 591
13 HugNLP 374
14 CodeCapybara 159
15 tasksource 144

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com