gpt-llm-trainer vs distil-whisper

| | gpt-llm-trainer | distil-whisper |
|---|---|---|
| Mentions | 4 | 9 |
| Stars | 3,825 | 3,225 |
| Growth | - | 7.0% |
| Activity | 5.4 | 8.9 |
| Latest commit | about 2 months ago | 15 days ago |
| Language | Jupyter Notebook | Python |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt-llm-trainer
- FLaNK Stack Weekly 06 Nov 2023
Show HN: Fine-tune your own Llama 2 to replace GPT-3.5/4
Very nice, thanks!
Check out what Matt Shumer put together as well: https://github.com/mshumer/gpt-llm-trainer.
I have used his trainer for auto-distillation of GPT-4 into GPT-3.5 fine-tunes, and plan to do the same for Llama as well.
Cheers!
[D] Anyone tried gpt-llm-trainer?
Hey guys, I stumbled upon a LinkedIn post where a guy was showing a Jupyter notebook on Google Colab and explaining step by step how to train your own model to accomplish very specific tasks. I believe the base model he was using was a Llama 2 7B fine-tuning version. This is the GitHub link: https://github.com/mshumer/gpt-llm-trainer
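The distillation workflow these posts describe (a large "teacher" model generates examples that a smaller model is fine-tuned on) can be sketched roughly as below. This is a minimal illustration, not gpt-llm-trainer's actual code: `teacher_model` is a hypothetical stand-in for an API call to the teacher, and the chat-style JSONL layout is the common fine-tuning format, assumed here for concreteness.

```python
import json

def teacher_model(prompt: str) -> str:
    # Hypothetical stand-in for a call to a large "teacher" model (e.g. GPT-4);
    # in practice this would be an API call.
    return f"Detailed answer to: {prompt}"

def build_distillation_dataset(prompts, system_message):
    # Each record follows the chat-style JSONL layout commonly used for fine-tuning:
    # a system message, the user prompt, and the teacher's answer as the target.
    records = []
    for prompt in prompts:
        records.append({
            "messages": [
                {"role": "system", "content": system_message},
                {"role": "user", "content": prompt},
                {"role": "assistant", "content": teacher_model(prompt)},
            ]
        })
    return records

prompts = ["Summarize quicksort.", "Explain the TCP handshake."]
dataset = build_distillation_dataset(prompts, "You are a concise technical tutor.")

# Write one JSON object per line, ready to upload as a fine-tuning file.
with open("train.jsonl", "w") as f:
    for rec in dataset:
        f.write(json.dumps(rec) + "\n")
```

The resulting `train.jsonl` would then be fed to whichever fine-tuning pipeline you use (OpenAI's API for GPT-3.5, or a local trainer for Llama 2).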
- GPT-LLM-Trainer
distil-whisper
- FLaNK Stack 05 Feb 2024
Distil-Whisper: a distilled variant of Whisper that is 6x faster
Training code will be released in the Distil-Whisper repository this week, enabling anyone in the community to distill a Whisper model in their choice of language!
- FLaNK Stack Weekly 06 Nov 2023
AI — weekly megathread!
Hugging Face released Distil-Whisper, a distilled version of Whisper that is 6 times faster, 49% smaller, and performs within 1% word error rate (WER) on out-of-distribution evaluation sets.
- Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
- Distil-Whisper is up to 6x faster than Whisper while performing within 1% Word-Error-Rate on out-of-distribution eval sets
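The "within 1% WER" claim refers to word error rate, the standard transcription metric: the word-level edit distance between a reference transcript and the model's output, divided by the reference length. A minimal self-contained implementation (not the evaluation code Distil-Whisper itself uses, which relies on standard libraries) looks like this:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    # WER = (substitutions + deletions + insertions) / reference word count,
    # computed as Levenshtein edit distance over words.
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,        # deletion
                dp[i][j - 1] + 1,        # insertion
                dp[i - 1][j - 1] + cost,  # substitution or match
            )
    return dp[len(ref)][len(hyp)] / len(ref)

# One dropped word out of a six-word reference.
print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

So "within 1% WER" means the distilled model's error rate on held-out audio stays within one percentage point of the original Whisper's.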
Distilling Whisper on 20,000 hours of open-sourced audio data
- GitHub page: https://github.com/huggingface/distil-whisper/tree/main
Talk-Llama
Is https://github.com/huggingface/distil-whisper on its way to whisper.cpp?
What are some alternatives?
axolotl - Go ahead and axolotl questions
WhisperInput - Offline voice input panel & keyboard with punctuation for Android.
OpenPipe - Turn expensive prompts into cheap fine-tuned models
pyvideotrans - Translate the video from one language to another and add dubbing.
Llama-2-Onnx
vid2cleantxt - Python API & command-line tool to easily transcribe speech-based video files into clean text
trieve - All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
faster-whisper - Faster Whisper transcription with CTranslate2
open_model_zoo - Pre-trained Deep Learning models and demos (high quality and extremely fast)
json-masker - High-performance JSON masker library in Java with no runtime dependencies
vllm - A high-throughput and memory-efficient inference and serving engine for LLMs
willow - Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative