web-whisper
vid2cleantxt
web-whisper | vid2cleantxt | |
---|---|---|
2 | 1 | |
163 | 156 | |
- | - | |
10.0 | 0.0 | |
9 months ago | over 1 year ago | |
Jupyter Notebook | ||
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
web-whisper
-
Whisper.cpp example running fully in the browser
If someone wants to self host you can also try this decent web interface: https://codeberg.org/pluja/web-whisper
I'm not the creator, just a fan.
- Web-UI for Whisper, an awesome audio transcription AI. Easy to self-host.
vid2cleantxt
What are some alternatives?
whisper.cpp - Port of OpenAI's Whisper model in C/C++
SpecVQGAN - Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
LiveWhisper - A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
PipeWire-Guide - PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
subvert - Generate subtitles, summaries, and chapters from videos in seconds
distil-whisper - Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
SwiftWhisper - 🎤 The easiest way to transcribe audio in Swift
WOLOF-ASR-Wav2Vec2 - Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
timit - The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
web-speech-synthesis-and-recognition - Speech to Text and Text to Speech on a web browser