[P] Transcribe any podcast episode in just 1 minute with optimized OpenAI/whisper

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews

    The code is here: github.com/modal-labs/modal-examples/tree/main/misc/whisper_pod_transcriber

  • yt-whisper

    Using OpenAI's Whisper to automatically generate YouTube subtitles

  • With minimal changes to https://github.com/m1guelpf/yt-whisper i got a setup to transcribe subs from YouTube videos or local files bit it might take an hour or so running the large model on my CPU.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • openai-whisper-cpu

    Improving transcription performance of OpenAI Whisper for CPU based deployment

  • There is a very simple method built-in to PyTorch which can give you over 3x speed improvement for the large model, which you could also combine with the method proposed in this post. https://github.com/MiscellaneousStuff/openai-whisper-cpu

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts