whisper-standalone-win
whisper-ctranslate2
whisper-standalone-win | whisper-ctranslate2 | |
---|---|---|
3 | 3 | |
801 | 755 | |
- | 5.8% | |
8.8 | 8.3 | |
18 days ago | 5 days ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
whisper-standalone-win
-
Question : is this a movie only tracker?
On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win
-
OpenAI’s whisper module might change the game of the speech-to-text (STT) industry
Try this: https://github.com/Purfview/whisper-standalone-win
-
Attempting to install whisper - Got an error on the last step. Can anyone help out?
I've no idea what you do wrong but you can try standalone executable from there: https://github.com/Purfview/whisper-standalone-win
whisper-ctranslate2
-
Firefox slow to load YouTube? Just another front in Google's war on ad blockers
Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2
-
StyleTTS2 – open-source Eleven Labs quality Text To Speech
There's several faster ones out there. I've been using https://github.com/Softcatala/whisper-ctranslate2 which includes a nice --live_transcribe flag. It's not as good as running it on a complete file but it's been helpful to get the gist of foreign language live streams.
- Transcribing your Interview Data
What are some alternatives?
pyannote-whisper
llama.cpp - LLM inference in C/C++
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
whisper-openai-gradio-implementation - Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
AI-Waifu-Vtuber - AI Vtuber for Streaming on Youtube/Twitch
whisper-playground - Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
whisper-subtitles-webui - A gradio interface for making transcribed and translated subtitles for videos
monotonic_align - Monotonic Alignment Search
LiveWhisper - A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
StyleTTS2 - StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models