Metric | audapolis | whisperer |
---|---|---|
Mentions | 8 | 1 |
Stars | 641 | 55 |
Growth | 2.2% | - |
Activity | 6.7 | 4.8 |
Last commit | 7 months ago | 9 months ago |
Language | TypeScript | Python |
License | GNU Affero General Public License v3.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
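The page does not give the formula behind the activity number, so the following is only an illustrative sketch of the two properties it describes: recent commits count more than old ones, and the final score reflects a project's rank among all tracked projects. The function names, the exponential decay, the 30-day half-life, and the percentile scaling are all assumptions made for this example.

```python
from bisect import bisect_left


def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commits so that newer ones weigh more (assumed exponential decay).

    A commit made today counts as 1.0; a commit one half-life old counts as 0.5.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw, all_raw):
    """Map a raw activity value to a 0-10 scale by percentile rank (assumed scaling).

    A project that beats 90% of the tracked projects scores 9.0,
    which places it in the top 10%.
    """
    ranked = sorted(all_raw)
    if not ranked:
        raise ValueError("need at least one tracked project")
    below = bisect_left(ranked, raw)  # number of projects strictly below `raw`
    return round(10 * below / len(ranked), 1)
```

Under these assumptions, a project whose raw value exceeds that of 90 out of 100 tracked projects gets `activity_score(...) == 9.0`.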
audapolis
- Audapolis: An editor for spoken-word audio with automatic transcription
- MacWhisper: Transcribe audio files on your Mac
  Here's a multi-platform open source app that does the same thing but uses vosk instead of whisper.
  https://github.com/bugbakery/audapolis
- Will Kden ever have Ai
- Self-hosted audio transcription?
  Audapolis is also an interesting option: https://github.com/audapolis/audapolis
- [Looking for] Ai audio denoise & transcript
- Audapolis – Edit audio and video by selecting text
whisperer
- MacWhisper: Transcribe audio files on your Mac
  I have a Python script on my mac that detects when I press-and-hold the right option key, and records audio while it's pressed. On release, it transcribes it with whispercpp and pastes it. Makes it very easy to record quick voice notes. Here it is: https://github.com/corlinp/whisperer
  I was working on a native version in the form of a taskbar app with customizable prompt and all. However I quickly realized that the behaviors I want the app to do require a bunch of accessibility permissions that would block it from the app store and require more setup steps.
  Would anybody still find that useful?
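The press-and-hold flow described in the comment above (start recording on key press, then transcribe and paste on release) can be sketched as a small state machine. This is not the code from the whisperer repository: the class name `PushToTalk` and its five constructor arguments are hypothetical, and the recorder, transcriber, and paste steps are injected as plain callables so that the control flow can be read and tested without a keyboard hook, a microphone, or a speech model.

```python
from typing import Callable, Optional


class PushToTalk:
    """Record while a hotkey is held; transcribe and paste when it is released."""

    def __init__(
        self,
        hotkey: str,
        start_recording: Callable[[], None],
        stop_recording: Callable[[], bytes],
        transcribe: Callable[[bytes], str],
        paste: Callable[[str], None],
    ) -> None:
        self.hotkey = hotkey
        self.start_recording = start_recording
        self.stop_recording = stop_recording
        self.transcribe = transcribe
        self.paste = paste
        self._recording = False

    def on_press(self, key: str) -> None:
        # Ignore other keys, and ignore repeat events sent while the key is held.
        if key != self.hotkey or self._recording:
            return
        self._recording = True
        self.start_recording()

    def on_release(self, key: str) -> Optional[str]:
        # Only a release of the hotkey during an active recording ends it.
        if key != self.hotkey or not self._recording:
            return None
        self._recording = False
        audio = self.stop_recording()
        text = self.transcribe(audio).strip()
        if text:
            self.paste(text)  # skip pasting when nothing was recognized
        return text
```

In a real script, `on_press` and `on_release` would be wired to a global keyboard listener, and the callables would wrap an audio recorder, a Whisper binding, and a clipboard or keystroke paste. On macOS those hooks are what require the accessibility permissions mentioned in the comment.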
What are some alternatives?
- vosk-server - WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
- LLMStack - No-code platform to build LLM Agents, workflows and applications with your data
- whisper-diarization - Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
- buzz - Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
- SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
- whisper - Robust Speech Recognition via Large-Scale Weak Supervision
- oTranscribe - A free & open tool for transcribing audio interviews