Any news on training LoRAs in 4-bit mode?

This page summarizes the projects mentioned and recommended in the original post on /r/Oobabooga

  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  • I posted https://github.com/oobabooga/text-generation-webui/issues/813 earlier today. 4-bit LoRA training still hasn't been merged into the main repo, but I got the following fork suggestions:

  • LLaMA-8bit-LoRA

    Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.

  • https://github.com/serp-ai/LLaMA-8bit-LoRA/blob/main/docs/merging_the_weights.md < merge models

  • alpaca_lora_4bit

  • https://github.com/johnsmith0031/alpaca_lora_4bit < train loras

  • text-generation-webui-testing

    A fork of text-generation-webui that still supports V1 GPTQ, 4-bit LoRA, and GPTQ models other than LLaMA.

  • https://github.com/Ph0rk0z/text-generation-webui-testing < 4bit lora use from the UI on old GPTQ
