soundless
AutoSub
Metric | soundless | AutoSub
---|---|---
Mentions | 1 | 2
Stars | 0 | 556
Growth | - | -
Activity | 0.0 | 4.1
Last commit | almost 2 years ago | 4 months ago
Language | Python | Python
License | GNU General Public License v3.0 only | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
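The exact formula behind the activity number is not given here, so the following is only a minimal sketch of how a recency-weighted, relative score of this kind could be computed. The function names, the 90-day half-life, and the percentile mapping are all assumptions made for illustration, not the site's actual method.

```python
from datetime import datetime, timedelta


def weighted_commit_score(commit_dates, now, half_life_days=90.0):
    """Sum per-commit weights; a commit's weight halves every half_life_days.

    The half-life value is a hypothetical choice for this sketch.
    """
    total = 0.0
    for committed_at in commit_dates:
        age_days = (now - committed_at).total_seconds() / 86400.0
        total += 0.5 ** (age_days / half_life_days)
    return total


def activity_index(raw_scores, project):
    """Map a raw score to a 0-10 scale by the share of tracked projects scoring lower.

    raw_scores is a dict of project name -> raw weighted score (hypothetical input).
    """
    values = list(raw_scores.values())
    below = sum(1 for value in values if value < raw_scores[project])
    return round(10.0 * below / len(values), 1)
```

Under these assumptions, a project that outscores 90% of the tracked projects gets an index of 9.0, which matches the "top 10%" reading above, and a commit made 90 days ago counts half as much as one made today.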
soundless
- soundless - Cross-platform audio files optimizer (FFmpeg + SoX)
AutoSub
- Does there exist a free as in freedom solution to apply speech-to-text recognition to a video to get subtitles?
  There's an interesting project here https://github.com/abhirooptalasila/AutoSub which uses FFmpeg & Deepspeech. YMMV around the accuracy.
- Need to create subtitles
  Check autosub
What are some alternatives?
music-metadata-browser - Browser version of the music-metadata parser, supporting a wide range of audio and tag formats.
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
sync-audio-tracks - Audio tracks synchronization command-line tool for video editors that don't support it
ffsubsync - Automagically synchronize subtitles with video.
DeepSpeech-Italian-Model - Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
netflix-to-srt - Rip, extract and convert subtitles to .srt closed captions from .xml/dfxp/ttml and .vtt/WebVTT (e.g. Netflix, YouTube)
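ffmpeg - An FFmpeg-based Python video processing package. Because of the pandemic, work has been busy and I have not been in the mood to keep researching video, so this project has been shelved. The source code is simple; feel free to study and extend it yourself.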
cleanvid - cleanvid is a little script to mute profanity in video files
cheetah - On-device streaming speech-to-text engine powered by deep learning
amazon-transcribe-output-word-document - An Amazon Transcribe demo to produce a Microsoft Word document containing the turn-by-turn transcription of the audio. This will include additional metadata depending upon the options selected, such as caller sentiment, category identification and issue detection
sox-noise - Noise generator GUI powered by SoX
motion-tracking-video-crop - Crop motion tracked video with added smoothing movement