OTranscribe: A free and open tool for transcribing audio interviews

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. whisper-diarization

    Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

    > iirc whisper-diarization uses whisperx under the hood.

    It seems like it does:

    https://github.com/MahmoudAshraf97/whisper-diarization/blob/...

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. oTranscribe

    A free & open tool for transcribing audio interviews

  4. subgen

    Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr

    https://github.com/McCloudS/subgen worked very well for me. I had a TV series where somehow the last few seasons timestamps did not match up with subtitle files I could find online. I used subgen and it worked surprisingly well.

  5. whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

  6. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  7. Auditif

    Auditif

    * Allowing users to provide their own models

    https://github.com/Stack-Studio-Digital-Collective/Auditif

  8. subtitleedit

    the subtitle editor :)

    SubtitleEdit is the most complete, it runs offline in your desktop (portable version available also), and has many online tutorials.

    Make sure they are recent tutorials because they will probably mention how to use the automated generation tools/plugins that wasn't available years ago.

    https://github.com/SubtitleEdit/subtitleedit

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ask HN: What API or software are people using for transcription?

    10 projects | news.ycombinator.com | 9 Jun 2025
  • MacWhisper: Transcribe audio files on your Mac

    8 projects | news.ycombinator.com | 23 Aug 2023
  • Whisper Playground - launch speech2text web apps using OpenAI's Whisper

    3 projects | /r/LanguageTechnology | 2 Nov 2022
  • Whispercpp – Local, Fast, and Private Audio Transcription for Ruby

    1 project | news.ycombinator.com | 7 Jun 2025
  • Build Your Own Siri. Locally. On-Device. No Cloud

    1 project | news.ycombinator.com | 13 May 2025

Did you know that Python is
the 2nd most popular programming language
based on number of references?