GPTQ-Merged vs axolotl

| | GPTQ-Merged | axolotl |
|---|---|---|
| Mentions | 2 | 29 |
| Stars | 2 | 6,105 |
| Growth | - | 13.7% |
| Activity | 8.1 | 9.8 |
| Last commit | 8 months ago | 5 days ago |
| Language | Python | Python |
| License | - | Apache License 2.0 |
Stars - the number of stars a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
GPTQ-Merged
- Slow inference on R720 w/P40 (or not)?
There is also autograd support here: https://github.com/Ph0rk0z/text-generation-webui-testing/ and its matching GPTQ fork: https://github.com/Ph0rk0z/GPTQ-Merged/tree/dual-model
- Finetuning on multiple GPUs
You'd probably need to add universal support to the native functions, because they currently handle llama only. If you edit the load_llama functions in autograd_4bit.py to use generic loaders, like this: https://github.com/Ph0rk0z/GPTQ-Merged/blob/dual-model/src/alpaca_lora_4bit/autograd_4bit.py, it has a good chance of working. You might also need to add trust_remote_code.
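As a rough illustration of that suggestion, here is a minimal sketch of a generic loader built on standard transformers APIs; the function name load_model_generic is hypothetical, and the 4-bit quantization plumbing from autograd_4bit.py is omitted:

```python
# Hypothetical sketch of a llama-agnostic loader; not the repo's actual API.
import torch
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

def load_model_generic(model_path: str):
    # AutoConfig/AutoModelForCausalLM resolve the architecture from
    # config.json, so the loader is no longer hard-wired to llama.
    config = AutoConfig.from_pretrained(model_path, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        config=config,
        torch_dtype=torch.float16,
        trust_remote_code=True,  # needed for architectures shipped as custom code
    )
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
    return model, tokenizer
```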
axolotl
- Ask HN: Most efficient way to fine-tune an LLM in 2024?
The approach I most often see is axolotl with QLoRA on cloud GPUs, which can be quite cheap.
https://github.com/OpenAccess-AI-Collective/axolotl
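For reference, the QLoRA recipe that axolotl automates roughly boils down to the following peft/bitsandbytes sketch; the model name and hyperparameters here are illustrative, not axolotl defaults:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize the frozen base weights to 4-bit
    bnb_4bit_quant_type="nf4",              # NF4, the data type from the QLoRA paper
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16 on top of 4-bit weights
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", quantization_config=bnb_config
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)  # only the LoRA adapters are trainable
model.print_trainable_parameters()
```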
- FLaNK AI - 01 April 2024
- LoRA from Scratch implementation for LLM finetuning
https://github.com/OpenAccess-AI-Collective/axolotl
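As a pointer to what such a from-scratch implementation involves, here is a minimal sketch of a LoRA-wrapped linear layer; it illustrates the idea and is not taken from the linked post or from axolotl:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # keep the pretrained weights frozen
        # A is small random, B starts at zero, so the update is zero at init.
        self.lora_a = nn.Parameter(torch.randn(base.in_features, r) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(r, base.out_features))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = W x + scaling * (x A) B  -- the low-rank adaptation
        return self.base(x) + (x @ self.lora_a @ self.lora_b) * self.scaling
```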
- Optimized Triton Kernels for full fine tunes
- Axolotl
- Let’s Collaborate to Build a High-Quality, Open-Source Dataset for LLMs!
One option is to look at what Axolotl uses. They have a list of dataset formats they support, mostly JSON with specific field names, so you could start putting a dataset together with a text editor or a JSON editor.
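As one concrete example, the widely used alpaca format (one of the formats axolotl accepts) stores instruction/input/output fields; a minimal sketch of writing such records as JSON Lines, with made-up content:

```python
import json

records = [
    {
        "instruction": "Summarize the following text.",
        "input": "Axolotl is a tool that streamlines fine-tuning of language models.",
        "output": "Axolotl simplifies fine-tuning LLMs.",
    },
]
# One JSON object per line, the usual layout for training data.
with open("train.jsonl", "w") as f:
    for rec in records:
        f.write(json.dumps(rec) + "\n")
```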
- Axolotl: Streamline fine-tuning of AI models
- Dataset Creation Tools?
You can save that overall set into a JSON file and load it as training data in whatever you're using; I'm using axolotl for that at the moment. That said, a GUI-based option is probably best for the first couple of tries, until you get a feel for the options.
- Progress on Reproducing Phi-1/1.5
Looking forward to the results! If it turns out the dataset is reproducible, it might be a good candidate for ReLoRA training on axolotl!
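For context, ReLoRA trains with low-rank adapters but periodically merges them into the base weights and restarts with fresh ones. A minimal sketch of that loop using peft; train_for is a hypothetical stand-in for an ordinary training loop, and details such as optimizer resets and learning-rate re-warmup are omitted:

```python
from peft import LoraConfig, get_peft_model

def relora_train(base_model, num_restarts: int, steps_per_restart: int, train_for):
    config = LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM")
    for _ in range(num_restarts):
        model = get_peft_model(base_model, config)  # fresh low-rank adapters
        train_for(model, steps_per_restart)         # ordinary LoRA training
        base_model = model.merge_and_unload()       # fold the update into the base
    return base_model
```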
What are some alternatives?
text-generation-webui-testing - A fork of textgen that still supports V1 GPTQ, 4-bit LoRA and other GPTQ models besides llama.
signal-cli - signal-cli provides an unofficial commandline, JSON-RPC and dbus interface for the Signal messenger.
gpt-llm-trainer
LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
LMFlow - An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
koboldcpp - A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
OpenPipe - Turn expensive prompts into cheap fine-tuned models
xTuring - Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
libsignal - Home to the Signal Protocol as well as other cryptographic primitives which make Signal possible.
org.signal.Signal
Signal-Desktop - A private messenger for Windows, macOS, and Linux.