Sparsebit
LLaMA-8bit-LoRA
| | Sparsebit | LLaMA-8bit-LoRA |
|---|---|---|
| Mentions | 1 | 3 |
| Stars | 320 | 146 |
| Growth | 1.3% | 1.4% |
| Activity | 5.9 | 5.1 |
| Last commit | 4 months ago | 9 months ago |
| Language | Python | Python |
| License | Apache License 2.0 | - |
Stars: the number of stars that a project has on GitHub. Growth: month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Sparsebit
LLaMA-8bit-LoRA
- Any news on training LoRAs in 4-bit mode?
  https://github.com/serp-ai/LLaMA-8bit-LoRA/blob/main/docs/merging_the_weights.md (merge models)
- [R] 🤖🌟 Unlock the Power of Personal AI: Introducing ChatLLaMA, Your Custom Personal Assistant! 🚀💬
What are some alternatives?
sparsegpt-for-LLaMA - Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
alpaca-lora - Instruct-tune LLaMA on consumer hardware
tabmat - Efficient matrix representations for working with tabular data
text-generation-webui-testing - A fork of textgen that still supports V1 GPTQ, 4-bit lora and other GPTQ models besides llama.
FQ-ViT - [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
trl - Train transformer language models with reinforcement learning.
alpaca_lora_4bit
aimet - AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.