Faster-whisper Alternatives
Similar projects and alternatives to faster-whisper
-
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
-
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
stable-ts
ASR with reliable word-level timestamps using OpenAI's Whisper
-
whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
-
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
-
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
-
tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
-
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
-
whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
-
Photonix
A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms.
-
ermine-ai
https://ermine.ai -- 100% client-side live audio transcription, powered by transformers.js
-
willow-inference-server
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
faster-whisper reviews and mentions
-
Whisper Turbo: transcribe 20x faster than realtime using Rust and WebGPU
Neat to see a new implementation, although I'll note that for those looking for a drop-in replacement for the whisper library, I believe that both faster-whisper https://github.com/guillaumekln/faster-whisper and https://github.com/m-bain/whisperX are easier (PyTorch-based, doesn't require a web browser), and a lot faster (WhisperX is up to 70X realtime).
-
Whisper.api: An open source, self-hosted speech-to-text with fast transcription
One caveat here is that whisper.cpp does not offer any CUDA support at all; acceleration is only available for Apple Silicon.
If you have Nvidia hardware, the CTranslate2-based faster-whisper is very fast: https://github.com/guillaumekln/faster-whisper
-
Faster Whisper Transcription with CTranslate2
-
[Discussion] Video Translation Task
you could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad
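As a rough illustration of the VAD option mentioned in this comment, here is a minimal sketch based on the usage shown in the faster-whisper README. The function names `format_segment` and `transcribe_with_vad`, the `"small"` model size, the CPU/int8 settings, and the 500 ms silence threshold are illustrative assumptions, not recommendations from the source.

```python
def format_segment(start: float, end: float, text: str) -> str:
    """Render one transcribed segment as '[start -> end] text'."""
    return f"[{start:.2f}s -> {end:.2f}s] {text.strip()}"


def transcribe_with_vad(audio_path: str, model_size: str = "small") -> list[str]:
    """Transcribe a file with faster-whisper, skipping non-speech via VAD.

    Hypothetical helper: requires `pip install faster-whisper` and downloads
    the model on first use.
    """
    # Imported lazily so format_segment works without the package installed.
    from faster_whisper import WhisperModel

    # Assumed settings; on Nvidia hardware device="cuda" with
    # compute_type="float16" is the usual alternative.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")

    # vad_filter=True applies Silero VAD to drop audio without speech
    # before it reaches the Whisper model.
    segments, _info = model.transcribe(
        audio_path,
        vad_filter=True,
        vad_parameters=dict(min_silence_duration_ms=500),
    )

    # `segments` is a generator, so transcription runs as it is consumed.
    return [format_segment(s.start, s.end, s.text) for s in segments]
```

Calling `transcribe_with_vad("audio.mp3")` (the file name is a placeholder) should return one formatted line per detected speech segment.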
-
[D] What is the most efficient version of OpenAI Whisper?
Faster Whisper: https://github.com/guillaumekln/faster-whisper. 4x faster than the original, including for short-form audio samples, but with no extra gains for long-form audio on top of this.
-
Does OpenAI Whisper work on Termux?
Since then I figured out live transcription and also how to get faster-whisper running. I still need to write things down in detail at some point, though.
-
Show HN: Ermine.ai – Record and transcribe speech, 100% client-side (WASM)
You can speed that up 16x with "faster whisper" https://github.com/guillaumekln/faster-whisper
-
Universal Speech Model
Faster Whisper is 8x faster than real time on CPU and even faster on GPU. https://github.com/guillaumekln/faster-whisper
Vocode uses Whisper for real-time zero latency voicechat with chatGPT. Give their demo line a call to see how well it works: +1-650-729-9536
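The speed figures quoted in these mentions mix two different measures: "N times faster than real time" (audio duration divided by processing time) and "N times faster than the original" (baseline processing time divided by the new processing time). A small sketch of both calculations follows; the function names and the numbers in the usage lines are hypothetical and are not measurements from any of the projects listed.

```python
def realtime_speed(audio_seconds: float, processing_seconds: float) -> float:
    """Seconds of audio transcribed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


def relative_speedup(baseline_seconds: float, candidate_seconds: float) -> float:
    """How many times faster the candidate is than the baseline on the same audio."""
    if candidate_seconds <= 0:
        raise ValueError("candidate_seconds must be positive")
    return baseline_seconds / candidate_seconds


# Made-up example: an 8-minute recording transcribed in 60 seconds.
speed = realtime_speed(audio_seconds=480.0, processing_seconds=60.0)

# Made-up example: a baseline takes 120 s, another implementation takes 30 s.
speedup = relative_speedup(baseline_seconds=120.0, candidate_seconds=30.0)
```

The two numbers are not interchangeable: a tool can be several times faster than another implementation and still be slower than real time on weak hardware.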
-
Do we have good, GPU-accelerated text-to-speech, speech-to-text, and image/video-to-text face/object recognition that is open source and self-hosted?
Or https://github.com/guillaumekln/faster-whisper
Stats
guillaumekln/faster-whisper is an open source project licensed under the MIT License, which is an OSI-approved license.
The primary programming language of faster-whisper is Python.