[P] Finetuning a commercially viable open source LLM (Flan-UL2) using Alpaca, Dolly15K and LoRA

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • alpaca-lora

    Instruct-tune LLaMA on consumer hardware

  • ue5-llama-lora

    A proof-of-concept project that showcases the potential for using small, locally trainable LLMs to create next-generation documentation tools.

  • dolly

    Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

  • Hi /u/meowkittykitty510, I'm embarking on a similar project, solely to teach myself about finetuning LLMs. I initially chose Dolly (a Pythia-based model), with modifications to the Dolly 2.0 trainer plus additional training data scraped from the web about The Expanse, just as a fun way to test.

  • AlpacaDataCleaned

    Alpaca dataset from Stanford, cleaned and curated

NOTE: The mention count for each project reflects mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.
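All of the projects above center on LoRA (low-rank adaptation) finetuning. The core idea is simple enough to sketch without any ML library: rather than updating a full d×k weight matrix W, LoRA learns a low-rank update ΔW = B·A, where B is d×r and A is r×k with r much smaller than d and k, so only B and A are trained. A minimal illustration of the parameter savings (the 4096×4096 shape and rank 8 below are illustrative assumptions, not values from the post):

```python
def lora_param_counts(d: int, k: int, r: int) -> tuple[int, int]:
    """Return (full-finetune params, LoRA params) for one d x k weight matrix.

    Full finetuning trains every entry of W; LoRA trains only the
    low-rank factors B (d x r) and A (r x k).
    """
    full = d * k
    lora = d * r + r * k
    return full, lora

# Example: one 4096 x 4096 projection matrix adapted with rank r = 8.
full, lora = lora_param_counts(4096, 4096, 8)
print(full, lora)  # 16777216 65536 -- roughly 256x fewer trainable params
```

This is why projects like alpaca-lora can instruct-tune a 7B-parameter model on consumer hardware: the base weights stay frozen (and can be quantized), while only the small adapter matrices receive gradients and optimizer state.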
