jetson-voice
whisper-auto-transcribe
jetson-voice | whisper-auto-transcribe | |
---|---|---|
1 | 8 | |
168 | 195 | |
- | - | |
0.0 | 6.1 | |
3 months ago | about 1 year ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
jetson-voice
-
Dusty-nv containers on Orin Nano
I've had some trouble running them on a regular Jetson nano. Specifically https://github.com/dusty-nv/jetson-voice. Has anyone tried them with Orin Nano?
whisper-auto-transcribe
- Using Whisper to transcribe the entire Forensic Files series
- Dougal Dixon's "After Man", 1990 Japanese Documentary with English Subtitles
-
Whisper Auto Transcribe - An all-in-one solution for automatic transcription
Also, don't forget to check out this project on GitHub
-
Whisper Auto Transcribe - An all-in-one solution for transcription
git clone --branch v3-alpha https://github.com/tomchang25/whisper-auto-transcribe.git whisper-auto-transcribe-v3
-
Freeze!
I'm still hoping u/blakeo_x decides to sub the second season. Maybe someone could even implement those whisper.ai subs to help give a loose framework. Since there's not a lot of dialogue in Freeze, it would be ideal circumstances to use them.
-
The Making of Godzilla (1984), Japanese Language with English Subtitles
https://github.com/tomchang25/whisper-auto-transcribe https://github.com/c0decracker/video-splitter
-
The Great Escape - Episode 1
I've used Whisper Auto Transcribe, based on OpenAI-whisper, to generate subtitles for the first episode.
-
I create an auto transcribe tool inspired by OpenAI latest AI project
Detail and Demo: https://github.com/tomchang25/whisper-auto-transcribe/
What are some alternatives?
jetson-voice - ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
subgen - Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr
silero-vad - Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Transcriptify - One script that uses OpenAI to transcribe audio into text.
subsync - Subtitle Speech Synchronizer
FunASR - A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
video-splitter - Simple Python script to split video into equal length chunks or chunks of equal size, duration, etc.
subsai - 🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
whisper-timestamped - Multilingual Automatic Speech Recognition with word-level timestamps and confidence