Creating Whisper Video Captions: OpenAI VTT

This page summarizes the projects mentioned and recommended in the original post on dev.to

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

  • You do not need to buy an expensive GPU just to run your transcription! Services like Paperspace offer (free and paid) GPU compute. This lets you run the Whisper Python model for example in web-based Jupyter notebooks. That is exactly how I first used Whisper. However, we will see, if your audio is not too long (or alternatively, you are patient) you can run C++ code locally on your CPU. If you want to see how, read on!

  • whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  • whisper.cpp is a lightweight C++ implementation, by Georgi Gerganov, of the original Whisper Python model. It is optimized to run on Apple Silicon processors, but also runs on Intel processors. The app is CPU intensive, so not ideal for running on your 15-year-old laptop, already on its last legs! To follow this guide, you need to be comfortable running code in the Terminal. It is not essential that you know C++ but some previous experience compiling C++ code would help make setting things up a little easier.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • FFmpeg

    Mirror of https://git.ffmpeg.org/ffmpeg.git

  • Before you can use whisper.cpp, you need to clone the repo and compile the C++ code into a binary. We use CMake to help build the binary. CMake is cross-platform tooling useful when working with C++. It generates a make file, setting compiler paths for any third-party libraries. On macOS, you can install CMake with Homebrew. We will also need to have FFmpeg installed locally, so let’s feed two birds with one scone!

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • AI Coding Companions: My Experiences in 2023

    2 projects | dev.to | 26 Sep 2023
  • Show HN: CompressX, my FFmpeg wrapper for macOS, made $9k in the last 4 months

    3 projects | news.ycombinator.com | 10 Apr 2024
  • FFmpeg 7.0 Released

    11 projects | news.ycombinator.com | 4 Apr 2024
  • Microsoft offered FFmpeg one-time payment instead of support contract

    1 project | news.ycombinator.com | 2 Apr 2024
  • Looking for a good file converter for upload testing

    1 project | /r/softwaretesting | 7 Dec 2023