whisper-diarization
whisperer
whisper-diarization | whisperer | |
---|---|---|
5 | 1 | |
2,101 | 55 | |
- | - | |
6.8 | 4.8 | |
6 days ago | 9 months ago | |
Jupyter Notebook | Python | |
BSD 2-clause "Simplified" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
whisper-diarization
-
MacWhisper: Transcribe audio files on your Mac
https://github.com/MahmoudAshraf97/whisper-diarization
This project has been alright for transcribing audio with speaker diarization. A big finicky. The OpenAI model is better than other paid products(Descript, Riverside) so Iām looking forward to trying MacWhisper.
-
Faster Whisper Transcription with CTranslate2
The project page mentions whisper-diarization (speaker recognition) as a user of faster-whisper. I've been in the market for that, definitely going to try it out.
https://github.com/MahmoudAshraf97/whisper-diarization
- GitHub - MahmoudAshraf97/whisper-diarization: Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
-
AI or technique for distinguishing between speakers for podcast?
Here
- Services for transcription
whisperer
-
MacWhisper: Transcribe audio files on your Mac
I have a Python script on my mac that detects when I press-and-hold the right option key, and records audio while it's pressed. On release, it transcribes it with whispercpp and pastes it. Makes it very easy to record quick voice notes. Here it is: https://github.com/corlinp/whisperer
I was working on a native version in the form of a taskbar app with customizable prompt and all. However I quickly realized that the behaviors I want the app to do require a bunch of accessibility permissions that would block it from the app store and require more setup steps.
Would anybody still find that useful?
What are some alternatives?
faster-whisper - Faster Whisper transcription with CTranslate2
audapolis - an editor for spoken-word audio with automatic transcription
speechbrain - A PyTorch-based Speech Toolkit
LLMStack - No-code platform to build LLM Agents, workflows and applications with your data
whisper-youtube - š Youtube Videos Transcription with OpenAI's Whisper
buzz - Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
tinydiarize - Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens