Self-hosted offline transcription and diarization service with LLM summary

transcribee

2 198 8.9 TypeScript

open source audio and video transcription software

I've been using this:
https://github.com/bugbakery/transcribee
It's noticeably a work in progress, but it does the job and has a nice UI for editing transcriptions, speakers, etc.
It runs on the CPU for me. It would be nice to have something that can make use of a 4GB Nvidia GPU, which faster-whisper is actually able to do [1].
[1] https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file...

transcriptionstream

2 599 8.2 Python

turnkey self-hosted offline transcription and diarization service with LLM summary

faster-whisper

24 9,691 8.3 Python

Faster Whisper transcription with CTranslate2
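
The comment quoted under transcribee mentions wanting to use a 4GB Nvidia GPU with faster-whisper. A minimal sketch of how that might look is below. The `WhisperModel` constructor and `transcribe` call are part of faster-whisper, but `WhisperConfig`, `pick_config`, the 4 GB threshold, and the model sizes chosen are hypothetical illustrations, not recommendations from the thread or the project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WhisperConfig:
    """Hypothetical container for the three settings passed to WhisperModel."""
    model_size: str
    device: str
    compute_type: str


def pick_config(has_cuda: bool, vram_gb: float = 0.0) -> WhisperConfig:
    """Pick settings from available hardware.

    The threshold and model sizes are assumptions for illustration only;
    measure memory use on your own hardware before relying on them.
    """
    if not has_cuda:
        # CPU fallback: int8 quantization keeps memory use and latency down.
        return WhisperConfig("small", "cpu", "int8")
    if vram_gb <= 4:
        # Small GPU: int8 weights with float16 activations.
        return WhisperConfig("small", "cuda", "int8_float16")
    # Larger GPU: full float16.
    return WhisperConfig("large-v2", "cuda", "float16")


def transcribe(path: str, cfg: WhisperConfig) -> list[tuple[float, float, str]]:
    """Transcribe an audio file and return (start, end, text) tuples."""
    # Imported lazily so the helper above works without the package installed.
    from faster_whisper import WhisperModel  # pip install faster-whisper

    model = WhisperModel(
        cfg.model_size, device=cfg.device, compute_type=cfg.compute_type
    )
    # transcribe() returns a lazy generator of segments plus audio metadata.
    segments, _info = model.transcribe(path, beam_size=5)
    return [(seg.start, seg.end, seg.text) for seg in segments]
```

With this sketch, `transcribe("meeting.wav", pick_config(has_cuda=True, vram_gb=4))` would run a quantized model on the GPU, where `meeting.wav` is a placeholder file name. Note that faster-whisper handles transcription only; speaker diarization needs a separate tool.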

whisper-diarization

8 2,282 7.1 Jupyter Notebook

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Re: diarization, I had decent results testing this on Colab a while ago:
https://github.com/MahmoudAshraf97/whisper-diarization
I remember hitting the usual Python package hell when NeMo was updated at some point, but it seems to be decently well maintained, so give it a go.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

Ask HN: Using Voice Cloning and TTS to Fine-Tune Diarizer
1 project | news.ycombinator.com | 11 Jun 2024

Text-to-Speech with Speaker Diarization
1 project | news.ycombinator.com | 2 Jun 2024

SignWave: Program to transcribe text, audio files into a sign language animation
2 projects | news.ycombinator.com | 27 May 2024

Easy video transcription and subtitling with Whisper, FFmpeg, and Python
1 project | news.ycombinator.com | 6 Apr 2024

SOTA ASR Tooling: Long-Form Transcription
1 project | news.ycombinator.com | 31 Mar 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Post date: 26 May 2024