distil-whisper
WhisperInput
distil-whisper | WhisperInput | |
---|---|---|
9 | 2 | |
3,225 | 75 | |
7.0% | - | |
8.9 | 1.7 | |
15 days ago | 7 months ago | |
Python | Java | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
distil-whisper
- FLaNK Stack 05 Feb 2024
-
Distil-Whisper: a distilled variant of Whisper that is 6x faster
Training code will be released in the Distil-Whisper repository this week, enabling anyone in the community to distill a Whisper model in their choice of language!
- FLaNK Stack Weekly 06 Nov 2023
-
AI — weekly megathread!
Hugging Face released Distil-Whisper, a distilled version of Whisper that is 6 times faster, 49% smaller, and performs within 1% word error rate (WER) on out-of-distribution evaluation sets [Details].
- Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
- Distil-Whisper is up to 6x faster than Whisper while performing within 1% Word-Error-Rate on out-of-distribution eval sets
-
Distilling Whisper on 20,000 hours of open-sourced audio data
- GitHub page: https://github.com/huggingface/distil-whisper/tree/main
-
Talk-Llama
Is https://github.com/huggingface/distil-whisper on its way to whisper.cpp?
WhisperInput
-
Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
Fortunately yes, recently i've been playing with this github.com/rpdrewes/whisper-websocket-server which uses K6nele as frontend on android if you really care about performance.
Tho if you're looking for a standalone app then you can give this a go : https://github.com/alex-vt/WhisperInput and run it right on your phone :]
For now they both run regular openai whisper thus tiny.en but as you can see there's tons of impromvement potential with faster-whisper and now distill-whisper :D
-
What voice inputs can you use other than googles?
Maybe this. https://github.com/alex-vt/WhisperInput
What are some alternatives?
pyvideotrans - Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
openWakeWord - An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
vid2cleantxt - Python API & command-line tool to easily transcribe speech-based video files into clean text
K6nele-service - Kõnele service is an Android app that offers a speech-to-text service to other apps, in particular to Kõnele. It implements SpeechRecognizer, backed by an open source speech recognition server software https://github.com/alumae/kaldi-gstreamer-server.
faster-whisper - Faster Whisper transcription with CTranslate2
whisper-turbo - Cross-Platform, GPU Accelerated Whisper 🏎️
json-masker - High-performance JSON masker library in Java with no runtime dependencies
streaming-llm - [ICLR 2024] Efficient Streaming Language Models with Attention Sinks
willow - Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
project-2501 - Project 2501 is an open-source AI assistant, written in C++.