simple-llm-finetuner
paper-qa
simple-llm-finetuner | paper-qa | |
---|---|---|
12 | 10 | |
1,977 | 3,664 | |
- | - | |
10.0 | 8.7 | |
5 months ago | 12 days ago | |
Jupyter Notebook | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
simple-llm-finetuner
-
Ask HN: Resource to learn how to train and use ML Models
Just the appropriate reddit groups and follow folks on twitter, plus use a search engine.
1. Learn to run a model, checkout llama.cpp Tons of free models on huggingface.com
2. Learn to finetune a model - https://github.com/lxe/simple-llm-finetuner
3. Learn to train one. PyTorch, TensorFlow, HuggingFace libraries, etc.
Good luck.
- How can I train my custom dataset on top of Vicuna?
-
[D] The best way to train an LLM on company data
So as far as set up goes, you just need to: “”” Git clone https://github.com/lxe/simple-llama-finetuner Cd simple-llama-finetuner Pip install -r requirements.txt Python app.py ## if you’re on a remote machine (Paperspace is my go to) then you may need to edit the last line of this script to set ‘share=True’ in the launch args “””
-
Show HN: Document Q&A with GPT: web, .pdf, .docx, etc.
oobabooga's textgen webui has a tab for fine tuning now. You only need a single consumer GPU to fine tune up to 33B parameter models at a rate of about 200 epochs per hour, per GPU.
There are also one-click finetuning projects which run on free Google Colab GPUs like https://github.com/lxe/simple-llama-finetuner
It's easy and not complex at all.
-
How do I fine tune 4 bit or 8 bit models?
for a single 4090, easiest way to get started and simple to use: https://github.com/lxe/simple-llama-finetuner
- Are there publicly available datasets other than Alpaca that we can use to fine-tune LLaMA?
- Show HN: Finetune LLaMA-7B on commodity GPUs using your own text
- [Project] Finetune LLaMA-7B on commodity GPUs (and Colab) using your own text
paper-qa
-
Oracle of Zotero: LLM QA of Your Research Library
Why does this post link to a renamed fork of Paper-QA (https://github.com/whitead/paper-qa) which has made zero changes and is 19 commits behind the original?
-
[P] A Large Language Model for Healthcare | NHS-LLM and OpenGPT
To be honest, I'm not too sure about this part, and think that it is probably not the best approach to have the model itself generate references. I prefer the approach used in e.g. paperqa, but wanted to explore both options.
-
Looking for a paper summarizer
I’ve come across Paper QA (github page) and as a graduate student I loved the idea that when I do literature review and find tons of papers I can just ask the AI to find the info I’m looking for in the paper. However, this service requires OpenAI API key, which I’ve acquired but turns out it’s a paid service. Free key doesn’t get me anything. Is there a service/software like this that is free? Or something that I can host on my PC instead of using people’s servers so it’s cheaper/free?
-
ChatPDF – Chat with Any PDF
I tried it [1] a lot, but I must say it confuses me most of the time and I need to read the original text to check if it makes sense. Lots of times it doesn't.
[1] https://github.com/whitead/paper-qa
- Alternatives to Pinecone? (Vector databases) [D]
-
DIY natural language processing - How to start, techniques guidance
Have a look at this: https://github.com/whitead/paper-qa
-
Show HN: Document Q&A with GPT: web, .pdf, .docx, etc.
1: We are finding out. Someone else mentioned: https://github.com/whitead/paper-qa We're hoping to keep our service be accessible and easy to use, and add features. Such as from your other questions...
2: We are thinking of the website integration. Do you think OpenAI may release this too? Questions received by email is a new idea that sounds interesting!
3: Thanks for the suggestion – we will look into it.
- GitHub - whitead/paper-qa: LLM Chain for answering questions from documents with citations
- Paper QA: LLM Chain for answering questions from documents with citations
What are some alternatives?
alpaca-lora - Instruct-tune LLaMA on consumer hardware
vault-ai - OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.
peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
gpt4-pdf-chatbot-langchain - GPT4 & LangChain Chatbot for large PDF docs
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]
minimal-llama
google-research - Google Research
OpenChatKit
OpenGPT - A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
The-Oracle-of-Zotero - LLM Chain querying a scientific Zotero library, with citations