-
I've been using this:
https://github.com/bugbakery/transcribee
It's noticeably work-in-progress but it does the job and has a nice UI to edit transcriptions and speakers etc.
It's running on the CPU for me, would be nice to have something that can make use of a 4GB Nvidia GPU, which faster-whisper is actually able to [1]
https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file...
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
-
I've been using this:
https://github.com/bugbakery/transcribee
It's noticeably work-in-progress but it does the job and has a nice UI to edit transcriptions and speakers etc.
It's running on the CPU for me, would be nice to have something that can make use of a 4GB Nvidia GPU, which faster-whisper is actually able to [1]
https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file...
-
Re: Diarization, I had decent results with testing this on Colab a while ago:
https://github.com/MahmoudAshraf97/whisper-diarization
I remember having the usual python package hell when NeMo was updated somewhere, but it seems to be decently well maintained so give it a go.
Related posts
-
Ask HN: What API or software are people using for transcription?
-
Amazon Is Discontinuing the "Do Not Send Voice Recordings" Feature on Echo
-
Ask HN: Is Whisper Still Relevant?
-
Transcriber AI – Free, end-to-end machine based transcription with speaker id
-
WhisperX: Precise ASR with Word-Level Timestamps and Diarization