openWakeWord
distil-whisper
Metric | openWakeWord | distil-whisper
---|---|---
Mentions | 5 | 9
Stars | 457 | 3,199
Growth | - | 6.2%
Activity | 8.4 | 8.9
Last commit | about 1 month ago | 8 days ago
Language | Jupyter Notebook | Python
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
openWakeWord
- OpenAI releases Whisper v3, new generation open source ASR model
https://github.com/dscripka/openWakeWord
Balancing wake reliability vs false wake activation is a tricky balance. OWW is decent but could certainly be better.
It's used with Home Assistant now so I expect the training data and implementation overall to get significantly better fairly soon.
- Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
There's also OpenWakeWord[0]. The models are readily available in tflite and ONNX formats and are impressively "light" in terms of compute requirements and performance.
It should be possible.
[0] - https://github.com/dscripka/openWakeWord
- Real-Time Noise Suppression for PipeWire written in Rust
Hey, quick question: do you mind if I use your STFT function in the speech preprocessing library I've been working on? We've been trying to add support for mel spectrograms to build a runner for openWakeWord, but progress is pretty slow because I've been soloing something I really don't have the right background for (I've never directly studied or worked with signal processing).
- I'm new to Rust but want to contribute
potentially build another runner for open wakeword
- I want to contribute in a big project
Here's what's in the pipeline next:
  - finish the mel-spectrogram implementation
  - publish an initial version on crates
  - move Python caching to the Rust side
  - finish implementing in the precise Rust port
  - potentially build another runner for [open wakeword](https://github.com/dscripka/openWakeWord)
  - build an Android app that supports user-defined wakewords and has some popular defaults to load
P.S. Not a voice assistant, just the thing that activates the voice assistant.
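Two of the comments above mention building a mel-spectrogram front end for an openWakeWord runner. As a rough illustration of what that step involves, here is a minimal pure-Python sketch. It is not openWakeWord's actual preprocessing and not the commenter's library. The HTK-style mel formula, the Hann window, the naive DFT, and all function names and default parameters are assumptions chosen for readability.

```python
import cmath
import math


def hz_to_mel(hz):
    """HTK-style mel scale (one common convention, assumed here)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Hann-window one frame and return |DFT|^2 for bins 0..N/2 (naive O(N^2) DFT)."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2.0 * math.pi * i / n))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters whose centres are evenly spaced on the mel scale."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_mels + 2 edge points: each filter spans three consecutive points.
    edges_hz = [mel_to_hz(mel_max * i / (n_mels + 1)) for i in range(n_mels + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(1, n_mels + 1):
        lo, centre, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            rising = (f - lo) / (centre - lo)
            falling = (hi - f) / (hi - centre)
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank


def mel_spectrogram(samples, sample_rate, n_fft=64, hop=32, n_mels=8):
    """Return a list of frames, each a list of n_mels filterbank energies."""
    bank = mel_filterbank(n_mels, n_fft, sample_rate)
    frames = []
    for start in range(0, len(samples) - n_fft + 1, hop):
        power = power_spectrum(samples[start:start + n_fft])
        frames.append([sum(w * p for w, p in zip(row, power)) for row in bank])
    return frames
```

A real implementation would replace the naive DFT with an FFT and usually take the log of the energies. The small defaults here only keep the example fast.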
distil-whisper
- FLaNK Stack 05 Feb 2024
- Distil-Whisper: a distilled variant of Whisper that is 6x faster
Training code will be released in the Distil-Whisper repository this week, enabling anyone in the community to distill a Whisper model in their choice of language!
- FLaNK Stack Weekly 06 Nov 2023
- AI — weekly megathread!
Hugging Face released Distil-Whisper, a distilled version of Whisper that is 6 times faster, 49% smaller, and performs within 1% word error rate (WER) on out-of-distribution evaluation sets.
- Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
- Distil-Whisper is up to 6x faster than Whisper while performing within 1% Word-Error-Rate on out-of-distribution eval sets
- Distilling Whisper on 20,000 hours of open-sourced audio data
- GitHub page: https://github.com/huggingface/distil-whisper/tree/main
- Talk-Llama
Is https://github.com/huggingface/distil-whisper on its way to whisper.cpp?
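Several of the mentions above quote Distil-Whisper as performing "within 1% WER" of Whisper. Word error rate is the word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words. The sketch below is one minimal way to compute it, assuming plain whitespace tokenization and no text normalization. The function name is hypothetical, and published evaluations typically normalize casing and punctuation first.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, "within 1% WER" would mean that the difference between the two models' scores is at most one percentage point.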
What are some alternatives?
WhisperInput - Offline voice input panel & keyboard with punctuation for Android.
mfcc-rust
pyvideotrans - Translate the video from one language to another and add dubbing.
project-2501 - Project 2501 is an open-source AI assistant, written in C++.
vid2cleantxt - Python API & command-line tool to easily transcribe speech-based video files into clean text
Clippy - A bunch of lints to catch common mistakes and improve your Rust code. Book: https://doc.rust-lang.org/clippy/
faster-whisper - Faster Whisper transcription with CTranslate2
whisper-dictation - Dictation app based on the OpenAI speech to text models
json-masker - High-performance JSON masker library in Java with no runtime dependencies
TX-2-simulator - Simulator for the pioneering TX-2 computer
willow - Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative