Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems. Learn more β
whisperX Alternatives
Similar projects and alternatives to whisperX
-
VideoLAN Client (VLC)
VLC media player - All pull requests are ignored, please use MRs on https://code.videolan.org/videolan/vlc
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
-
-
-
-
-
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
-
-
open-ai
OpenAI PHP SDK : Most downloaded, forked, contributed, huge community supported, and used PHP (Laravel , Symfony, Yii, Cake PHP or any PHP framework) SDK for OpenAI GPT-3 and DALL-E. It also supports chatGPT-like streaming. (ChatGPT AI is supported)
-
-
-
-
-
-
transcribe-anything
Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it using Whisper AI. Completely private and Free π€―π€―π€―
-
-
-
syncabook
ππ§ A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
whisperX discussion
whisperX reviews and mentions
-
Ask HN: Is Whisper Still Relevant?
Yes it's still relevant but I prefer WhisperX for some tasks: https://github.com/m-bain/whisperX
-
Show HN: Mikey β No bot meeting notetaker for Windows
https://github.com/m-bain/whisperX looks promising - I'm hacking away on an always-on transcriber for my notes for later search&recall. It has support for diarization (the speaker detection you're looking for).
I'm currently hacking away on a mix of https://github.com/speaches-ai/speaches + https://github.com/ufal/whisper_streaming though - mostly because my laptop doesn't have a decent GPU, I stream the audio to a home server instead.
But overall it's pretty simple to do after you wrangle the Python dependencies - all you need is a sink for the text files (for example, create a new file for every Teams meeting, but that's another story...)
-
VLC tops 6B downloads, previews AI-generated subtitles
You don't need to wait, you can use: https://github.com/m-bain/whisperX right now for STT with timestamps and diarization.
-
Transcriber AI β Free, end-to-end machine based transcription with speaker id
I use whisper and pyannote (https://github.com/m-bain/whisperX), but it is a pain to run locally - I run it on a 4080. This seems to be actually trying to identify the speakers. Not sure what they are doing for that.
-
Supercharge Your AI Skills: 5 Open Source Repositories You Can't Afford to Miss
3. WhisperX
-
Show HN: Offline audiobook from any format with one CLI command
> And do you know a good speech to text model?
OpenAI's whisper, code+model are available, and multiple projects have built on it. You could try this wrapper: https://github.com/m-bain/whisperX -- or for short utterances on a smart-phone https://github.com/futo-org/whisper-acft
- WhisperX: Precise ASR with Word-Level Timestamps and Diarization
- WhisperX: Precise ASR with Word-Level Timestamps and Speaker Diarization
- Text-to-Speech with Speaker Diarization
-
Easy video transcription and subtitling with Whisper, FFmpeg, and Python
It uses this, which does support diarization: https://github.com/m-bain/whisperX
-
A note from our sponsor - InfluxDB
influxdata.com | 19 Apr 2025
Stats
m-bain/whisperX is an open source project licensed under BSD 2-clause "Simplified" License which is an OSI approved license.
The primary programming language of whisperX is Python.