-
> iirc whisper-diarization uses whisperx under the hood.
It seems like it does:
https://github.com/MahmoudAshraf97/whisper-diarization/blob/...
-
subgen
Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr
https://github.com/McCloudS/subgen worked very well for me. I had a TV series where the timestamps for the last few seasons somehow did not match any of the subtitle files I could find online. I used subgen and it worked surprisingly well.
-
* Allowing users to provide their own models
https://github.com/Stack-Studio-Digital-Collective/Auditif
-
SubtitleEdit is the most complete option: it runs offline on your desktop (a portable version is also available) and has many online tutorials.
Make sure the tutorials are recent, because they will probably cover the automated generation tools/plugins that weren't available years ago.
https://github.com/SubtitleEdit/subtitleedit