How can I train my custom dataset on top of Vicuna?

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

  • alpaca-lora

    Instruct-tune LLaMA on consumer hardware

  • LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. Plus a Gradio ChatGPT-like chat UI to demonstrate your language models.

  • stanford_alpaca

    Code and documentation to train Stanford's Alpaca models, and generate the data.

  • If you have access to a data-center-grade GPU, the quickest way to start would be to pick one of the existing fine-tuning efforts, for example Stanford Alpaca (https://github.com/tatsu-lab/stanford_alpaca/ ) or indeed Vicuna (https://github.com/lm-sys/FastChat ), and use your own data. The main issue for home users is that their VRAM is vastly insufficient for standard full-model tuning (the original weights, the updated weights, a copy for Adam's optimizer state, and a copy for the AI overlord…); LoRA is the usual workaround, as sketched after this list.

  • FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

  • simple-llm-finetuner

Simple UI for LLM model fine-tuning (discontinued)

  • text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), and Llama models.

  • Sure! This is the link to Oobabooga: https://github.com/oobabooga/text-generation-webui

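The VRAM workaround mentioned above is LoRA: freeze the base model and train only small low-rank adapter matrices, so gradients and optimizer state cover a few million parameters instead of billions. Below is a minimal sketch in the spirit of alpaca-lora, using the Hugging Face transformers, peft, and datasets libraries. The model path, hyperparameters, prompt template, and the my_data.json file (assumed to hold Alpaca-style records with "instruction", "input", and "output" fields) are illustrative assumptions, not code from any of the repositories above.

    # Minimal LoRA fine-tuning sketch (illustrative, not the alpaca-lora source).
    import torch
    from datasets import load_dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                              TrainingArguments, DataCollatorForLanguageModeling)

    BASE_MODEL = "huggyllama/llama-7b"  # placeholder: any LLaMA-family checkpoint

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
    tokenizer.pad_token = tokenizer.eos_token  # LLaMA tokenizers ship without a pad token

    model = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL,
        torch_dtype=torch.float16,  # half precision; 8-bit loading via bitsandbytes saves more
        device_map="auto",
    )

    # LoRA config: train low-rank adapters on the attention projections only,
    # as alpaca-lora does, leaving the base weights frozen.
    lora_config = LoraConfig(
        r=8, lora_alpha=16, lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],
        bias="none", task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()  # typically well under 1% of the base model

    def format_example(ex):
        # Flatten one Alpaca-style record into a single training prompt.
        prompt = (f"### Instruction:\n{ex['instruction']}\n\n"
                  f"### Input:\n{ex.get('input', '')}\n\n"
                  f"### Response:\n{ex['output']}")
        return tokenizer(prompt, truncation=True, max_length=512)

    dataset = load_dataset("json", data_files="my_data.json")["train"]
    dataset = dataset.map(format_example,
                          remove_columns=["instruction", "input", "output"])

    trainer = Trainer(
        model=model,
        train_dataset=dataset,
        args=TrainingArguments(
            output_dir="lora-out",
            per_device_train_batch_size=4,
            gradient_accumulation_steps=8,
            num_train_epochs=3,
            learning_rate=3e-4,
            fp16=True,
            logging_steps=10,
        ),
        # mlm=False makes the collator copy input_ids into labels (causal LM).
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()
    model.save_pretrained("lora-out")  # saves only the small adapter weights

At inference time the saved adapter is loaded on top of the frozen base model (e.g. peft's PeftModel.from_pretrained), which is also how tools like LLaMA-LoRA-Tuner and text-generation-webui apply LoRA checkpoints.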
NOTE: The number of mentions on this list reflects mentions in common posts plus user-suggested alternatives; a higher number therefore indicates a more popular project.

Suggest a related project

Related posts

  • [P] Uptraining a pretrained model using company data?

    4 projects | /r/MachineLearning | 25 May 2023
  • (HELP) Token Issue on Generation

    1 project | /r/LocalLLaMA | 19 May 2023
  • Help with Random Characters and Words on Output

    1 project | /r/LocalLLaMA | 18 May 2023
  • Fine-tuning LLaMA for research without Meta license

    1 project | /r/LocalLLaMA | 15 May 2023
  • Why run LLMs locally?

    4 projects | /r/LocalLLaMA | 8 May 2023