FLaNK-Halifax
Metric | gpt-llm-trainer | FLaNK-Halifax |
---|---|---|
Mentions | 4 | 14 |
Stars | 3,825 | 1 |
Growth | - | - |
Activity | 5.4 | 7.3 |
Last commit | about 2 months ago | 6 months ago |
Language | Jupyter Notebook | TypeScript |
License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
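The page does not give the actual formula behind the activity number, so the following is only a minimal sketch of how a recency-weighted, relative score of this kind could be computed. The exponential decay, the 90-day half-life, and the function names `weighted_activity` and `relative_score` are all assumptions made for illustration.

```python
from datetime import date


def weighted_activity(commit_dates, today, half_life_days=90.0):
    """Sum one weight per commit; a commit's weight halves every
    `half_life_days` (assumed decay), so recent commits count more."""
    return sum(
        0.5 ** ((today - d).days / half_life_days) for d in commit_dates
    )


def relative_score(raw, all_raw):
    """Map a raw activity value to a 0-10 scale: 10 times the share of
    tracked projects with a strictly lower raw value (assumed mapping)."""
    below = sum(1 for r in all_raw if r < raw)
    return 10.0 * below / len(all_raw)


if __name__ == "__main__":
    today = date(2023, 12, 1)
    # One commit today and one commit 90 days ago (hypothetical data).
    commits = [date(2023, 12, 1), date(2023, 9, 2)]
    print(weighted_activity(commits, today))

    # Ten hypothetical projects with raw activity values 1..10.
    tracked = [float(v) for v in range(1, 11)]
    print(relative_score(10.0, tracked))
```

Under this mapping, a project that outranks 90% of the tracked projects scores 9.0, which matches the "top 10%" reading in the legend above.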
gpt-llm-trainer
- FLaNK Stack Weekly 06 Nov 2023
- Show HN: Fine-tune your own Llama 2 to replace GPT-3.5/4
  Very nice, thanks! Check out what Matt Shumer put together as well: https://github.com/mshumer/gpt-llm-trainer. I have used his trainer for auto-distillation of GPT-4 into GPT-3.5 fine-tunes, but plan to do the same for Llama as well. Cheers!
- [D] Anyone tried gpt-llm-trainer?
  Hey guys, I stumbled upon a LinkedIn post in which the author walked through a Jupyter notebook on Google Colab, explaining step by step how to train your own model to accomplish very specific tasks. I believe the base model he was using was a Llama 2 7B fine-tuning version. This is the GitHub link: https://github.com/mshumer/gpt-llm-trainer
- GPT-LLM-Trainer
FLaNK-Halifax
- FLaNK Stack Weekly for 27 November 2023
- FLaNK Stack Weekly for 20 Nov 2023
- FLaNK Stack Weekly for 13 November 2023
- FLaNK Stack Weekly 06 Nov 2023
- FLaNK Stack Weekly for 30 Oct 2023
- FLaNK Stack Weekly 23 Oct 2023
- FLaNK Stack Weekly 16 October 2023
- FLaNK Stack Weekly 09 Oct 2023
- FLaNK Stack Weekly 2 October 2023
- FLaNK Stack for 25 September 2023
What are some alternatives?
axolotl - Go ahead and axolotl questions
rivet - The open-source visual AI programming environment and TypeScript library
OpenPipe - Turn expensive prompts into cheap fine-tuned models
SeaGOAT - local-first semantic code search engine
Llama-2-Onnx
flink-cdc - Flink CDC is a streaming data integration tool
trieve - All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
vimGPT - Browse the web with GPT-4V and Vimium
open_model_zoo - Pre-trained Deep Learning models and demos (high quality and extremely fast)
CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2 - The prototype deploys an Application in CML using a Llama2 model from Hugging Face to answer questions augmented with knowledge extracted from the website. This prototype introduces Pinecone as a database for storing vectors for semantic search.
vllm - A high-throughput and memory-efficient inference and serving engine for LLMs
co-tracker - CoTracker is a model for tracking any point (pixel) on a video.