Metric | OpenChatKit | simple-llm-finetuner
---|---|---
Mentions | 23 | 12
Stars | 9,001 | 1,977
Growth | 0.1% | -
Activity | 7.1 | 10.0
Last commit | 29 days ago | 5 months ago
Language | Python | Jupyter Notebook
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
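The page does not give the exact formulas behind these numbers, so the following is only a minimal sketch of how such metrics could be computed. The exponential decay, the 30-day half-life, and all function names are assumptions made for illustration; only the ideas of month-over-month growth, recency weighting, and "9.0 means top 10%" come from the description above.

```python
from bisect import bisect_left


def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Percentage change in stars between two monthly snapshots."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100.0


def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights, where a commit's weight halves every
    `half_life_days` (an assumed decay, not the site's actual formula)."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw_value, all_raw_values):
    """Map a raw value to a 0-10 scale by percentile rank: a score of 9.0
    means 90% of tracked projects have a strictly lower raw value."""
    ranked = sorted(all_raw_values)
    below = bisect_left(ranked, raw_value)  # projects strictly below raw_value
    return 10.0 * below / len(ranked)


if __name__ == "__main__":
    # Hypothetical snapshot values, chosen only to illustrate the arithmetic.
    print(f"growth: {month_over_month_growth(1000, 1001):.1f}%")
    print(f"weighted commits: {recency_weighted_commits([0, 30, 60])}")
    print(f"activity: {activity_score(90, list(range(100)))}")
```

Ranking by percentile rather than by raw commit counts keeps the score comparable across projects of very different sizes, which matches the "relative number" wording above.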
OpenChatKit
- OpenChatKit - OSS Framework for building chatbots
- How should I get an in-depth mathematical understanding of generative AI?
  ChatGPT isn't open source, so we don't know what the actual implementation is. I think you can read Open Assistant's source code for application design. If that is too much, try Open Chat Toolkit's source code for developer tools. If you need a very bare implementation, you should go for lucidrains/PaLM-rlhf-pytorch.
- OpenChatKit
- OpenChatKit: Open-source kit for setting up a local, libre, LLM chatbot
- I created a locally-run ai assistant for UE5’s documentation
  For a locally run open source option, I'd recommend taking a look at OpenChatKit. It's built on top of a couple of different open source LLMs that have been fine-tuned for use as chatbots. I've only messed around with the online demo a little bit, but from what I've read it is supposed to run on a laptop and be almost as good as ChatGPT 3.5.
- [D] Are there any MIT licenced (or similar) open-sourced instruction-tuned LLMs available?
  OpenChatKit https://github.com/togethercomputer/OpenChatKit
- [D] Is there currently anything comparable to the OpenAI API?
  Togethercomputer released OpenChatKit a few weeks ago. I haven't tested it, but it looks promising: https://github.com/togethercomputer/OpenChatKit
simple-llm-finetuner
- Ask HN: Resource to learn how to train and use ML Models
  Just the appropriate Reddit groups and follow folks on Twitter, plus use a search engine.
  1. Learn to run a model: check out llama.cpp. Tons of free models on huggingface.com.
  2. Learn to finetune a model: https://github.com/lxe/simple-llm-finetuner
  3. Learn to train one: PyTorch, TensorFlow, HuggingFace libraries, etc.
  Good luck.
- How can I train my custom dataset on top of Vicuna?
- [D] The best way to train an LLM on company data
  So as far as setup goes, you just need to:
      git clone https://github.com/lxe/simple-llama-finetuner
      cd simple-llama-finetuner
      pip install -r requirements.txt
      python app.py
  If you're on a remote machine (Paperspace is my go-to), you may need to edit the last line of that script to set 'share=True' in the launch args.
- Show HN: Document Q&A with GPT: web, .pdf, .docx, etc.
  oobabooga's textgen webui has a tab for fine tuning now. You only need a single consumer GPU to fine tune up to 33B parameter models at a rate of about 200 epochs per hour, per GPU.
  There are also one-click finetuning projects which run on free Google Colab GPUs like https://github.com/lxe/simple-llama-finetuner
  It's easy and not complex at all.
- How do I fine tune 4 bit or 8 bit models?
  For a single 4090, the easiest way to get started, and simple to use: https://github.com/lxe/simple-llama-finetuner
- Are there publicly available datasets other than Alpaca that we can use to fine-tune LLaMA?
- Show HN: Finetune LLaMA-7B on commodity GPUs using your own text
- [Project] Finetune LLaMA-7B on commodity GPUs (and Colab) using your own text
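Several of the mentions above talk about fine-tuning 4-bit or 8-bit models on a single consumer GPU such as a 4090. As a rough sanity check on why quantization matters there, here is a minimal back-of-the-envelope sketch. It counts model weights only; activations, gradients, adapter parameters, and optimizer state add more on top, so treat the results as a lower bound. The function name and the nominal parameter counts (7B, 33B) are assumptions for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


GPU_MEMORY_GB = 24  # an RTX 4090 has 24 GB of VRAM

if __name__ == "__main__":
    for name, params in [("7B", 7e9), ("33B", 33e9)]:
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            verdict = "fits" if gb < GPU_MEMORY_GB else "does not fit"
            print(f"{name} at {bits}-bit: {gb:.1f} GB of weights "
                  f"({verdict} in {GPU_MEMORY_GB} GB)")
```

Under these assumptions, only the 4-bit version of a 33B model leaves any headroom on a 24 GB card, which is consistent with the quoted advice to start from quantized weights.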
What are some alternatives?
alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
alpaca-lora - Instruct-tune LLaMA on consumer hardware
roomGPT - Upload a photo of your room to generate your dream room with AI.
paper-qa - LLM Chain for answering questions from documents with citations
Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
wik - wik is used to get information about anything on the shell using Wikipedia.
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
minChatGPT - A minimum example of aligning language models with RLHF similar to ChatGPT
minimal-llama
simpleAI - An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.