The code is here: github.com/modal-labs/modal-examples/tree/main/misc/whisper_pod_transcriber
With minimal changes to https://github.com/m1guelpf/yt-whisper I got a setup that transcribes subtitles from YouTube videos or local files, but it can take an hour or so when running the large model on my CPU.
There is a very simple method built into PyTorch that can give you over a 3x speedup for the large model, and you could also combine it with the method proposed in this post: https://github.com/MiscellaneousStuff/openai-whisper-cpu
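The comment above does not name the method. Assuming it refers to PyTorch's dynamic quantization (which the linked repository appears to rely on), a minimal sketch could look like the following. `TinyModel` and `quantize_for_cpu` are hypothetical names used only for illustration, and the small stack of `Linear` layers is a stand-in for a real Whisper model, not Whisper itself. Actual speedups depend on the model and the CPU.

```python
import torch
import torch.nn as nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for a model dominated by Linear layers."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(256, 512),
            nn.ReLU(),
            nn.Linear(512, 256),
        )

    def forward(self, x):
        return self.net(x)


def quantize_for_cpu(model):
    """Return a copy of the model with its Linear layers quantized to int8.

    Weights are stored as int8; activations are quantized on the fly
    at inference time. The original model is left unchanged.
    """
    model.eval()
    return torch.quantization.quantize_dynamic(
        model, {nn.Linear}, dtype=torch.qint8
    )


model = TinyModel()
qmodel = quantize_for_cpu(model)

# Run a forward pass on CPU with the quantized copy.
x = torch.randn(4, 256)
with torch.no_grad():
    out = qmodel(x)
```

With an actual Whisper checkpoint, the same call would be applied to the loaded model object before transcription. Whether accuracy stays acceptable is something to check on your own audio.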