Show HN: Transcribe YouTube Videos

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB high-performance time series database
Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
influxdata.com
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
  1. yt-transcribe

    Transcribe a youtube video into an easily readable HTML file

    Presumably that used whisper's bundled tiny model, which is no better than youtube CC. A beef I have with whisper-cpp is that they totally outsource model management.

    With mlx_whisper, I just have to tell it to use a model and it will download it if it's not already present: https://github.com/llimllib/yt-transcribe/blob/244841f83d833...

    so if I add whisper.cpp as a dependency, I also have to add huggingface-cli or something similar

  2. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

    InfluxDB logo
  3. youtube-dl

    Unofficial daily builds for youtube-dl. DO NOT OPEN PULL REQUESTS HERE (by ytdl-patched)

  4. Scoop

    A command-line installer for Windows.

  5. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    Not as convenient, but you could also have the user manually install the model, like whisper does.

    Just forward the error message output by whisper, or even make a more user-friendly error message with instructions on how/where to download the models.

    Whisper does provide a simple bash script to download models: https://github.com/ggerganov/whisper.cpp/blob/master/models/...

    (As a Windows user, I can run bash scripts via Git Bash for Windows[1])

    [1]: https://git-scm.com/download/win

  6. otrgen

    Generate OTR (oTranscribe) files from (YouTube) SBV/TTML files.

    Well, thanks to you I found out whisper generates decent audio transcriptions using a local LLM (relatively) easily, even on my 6+ year-old laptop.

    (I used to upload videos to YouTube just to get the auto captions.)

    I did some investigation, and it would not be difficult to convert the whisper LRC subtitle output into the format my fork of oTranscribe expects.

    I already made a simple tool to convert YouTube TTML/SBV subtitle output: https://github.com/Leftium/otrgen

  7. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • It's simple,but I feel statified that I automated the process

    2 projects | /r/PowerShell | 17 Jun 2023
  • Microsoft to force Windows 11 24H2 on Home and Pro users

    1 project | news.ycombinator.com | 22 Jan 2025
  • How to Set Up Goose on Windows 🦢

    1 project | dev.to | 16 Nov 2024
  • Git Bash Is My Preferred Windows Shell

    1 project | news.ycombinator.com | 10 Sep 2024
  • Scoop. A command line installer for windows

    1 project | news.ycombinator.com | 22 Apr 2024