gptqlora
TheVault
Our great sponsors
gptqlora | TheVault | |
---|---|---|
2 | 4 | |
94 | 78 | |
- | - | |
7.6 | 7.9 | |
11 months ago | 3 months ago | |
Python | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gptqlora
-
(2/2) May 2023
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ (https://github.com/qwopqwop200/gptqlora/tree/main)
-
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
The difference from QLoRA is that GPTQ is used instead of NF4 (Normal Float4) + DQ (Double Quantization) for model quantization. The advantage is that you can expect better performance because it provides better quantization than conventional bitsandbytes. The downside is that it is a one-shot quantization methodology, so it is more inconvenient than bitsandbytes, and unlike bitsandbytes, it is not universal. I'm still experimenting, but it seems to work. At least, I hope it can be more options for people using LoRA. https://github.com/qwopqwop200/gptqlora/tree/main
TheVault
-
(2/2) May 2023
A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation (https://github.com/FSoft-AI4Code/TheVault)
-
List of code generation datasets (open source)
TheVault
-
[P] Fine-tuning LLaMA on TheVault by AI4Code
I essentially want to fine-tune LLaMA on a dataset that's geared towards code generation. After a bit of research I found TheVault which seems good enough for the job (let me know if there are better datasets tho).
-
[R] Introducing The Vault: A new multilingual dataset for advancing code understanding and generation.
Github page: https://github.com/FSoft-AI4Code/TheVault
What are some alternatives?
tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
DB-GPT - AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
GirlfriendGPT - Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0
chathub - All-in-one chatbot client
chain-of-thought-hub - Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
code_contests
guidance - A guidance language for controlling large language models. [Moved to: https://github.com/guidance-ai/guidance]
waymo-open-dataset - Waymo Open Dataset
gorilla - Gorilla: An API store for LLMs
whylogs - An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈