Self-hosted offline transcription and diarization service with LLM summary

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
  1. transcribee

    open source audio and video transcription software

    I've been using this:

    https://github.com/bugbakery/transcribee

    It's noticeably work-in-progress but it does the job and has a nice UI to edit transcriptions and speakers etc.

    It's running on the CPU for me, would be nice to have something that can make use of a 4GB Nvidia GPU, which faster-whisper is actually able to [1]

    https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file...

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. transcriptionstream

    turnkey self-hosted offline transcription and diarization service with llm summary

  4. faster-whisper

    Faster Whisper transcription with CTranslate2

    I've been using this:

    https://github.com/bugbakery/transcribee

    It's noticeably work-in-progress but it does the job and has a nice UI to edit transcriptions and speakers etc.

    It's running on the CPU for me, would be nice to have something that can make use of a 4GB Nvidia GPU, which faster-whisper is actually able to [1]

    https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file...

  5. whisper-diarization

    Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

    Re: Diarization, I had decent results with testing this on Colab a while ago:

    https://github.com/MahmoudAshraf97/whisper-diarization

    I remember having the usual python package hell when NeMo was updated somewhere, but it seems to be decently well maintained so give it a go.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ask HN: What API or software are people using for transcription?

    10 projects | news.ycombinator.com | 9 Jun 2025
  • Amazon Is Discontinuing the "Do Not Send Voice Recordings" Feature on Echo

    3 projects | news.ycombinator.com | 16 Mar 2025
  • Ask HN: Is Whisper Still Relevant?

    2 projects | news.ycombinator.com | 12 Feb 2025
  • Transcriber AI – Free, end-to-end machine based transcription with speaker id

    1 project | news.ycombinator.com | 16 Dec 2024
  • WhisperX: Precise ASR with Word-Level Timestamps and Diarization

    1 project | news.ycombinator.com | 5 Sep 2024

Did you know that Python is
the 2nd most popular programming language
based on number of references?