sox-noise vs AutoSub
Metric | sox-noise | AutoSub |
---|---|---|
Mentions | 1 | 2 |
Stars | 19 | 557 |
Growth | - | - |
Activity | 0.0 | 4.1 |
Last commit | over 1 year ago | 5 months ago |
Language | Python | Python |
License | The Unlicense | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Posts mentioning AutoSub
- Does there exist a free as in freedom solution to apply speech-to-text recognition to a video to get subtitles?
  There's an interesting project here https://github.com/abhirooptalasila/AutoSub which uses FFmpeg & DeepSpeech. YMMV around the accuracy.
- Need to create subtitles
  Check autosub
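
For readers wondering what an FFmpeg-plus-speech-recognition subtitle pipeline involves, here is a minimal Python sketch of the two steps around the recognizer: pulling a mono 16 kHz WAV track out of a video, and writing timed text as SubRip (.srt) entries. This is not AutoSub's actual code. The function names, the 16 kHz mono target (an assumption about what the recognizer wants as input), and the `(start, end, text)` segment shape are all illustrative. The speech-to-text step itself is left out.

```python
from typing import List, Tuple


def ffmpeg_extract_audio_cmd(video_path: str, wav_path: str) -> List[str]:
    """Build an ffmpeg command line that writes mono 16 kHz 16-bit PCM audio.

    The sample rate and channel count are assumptions about the recognizer's
    expected input, not values taken from AutoSub.
    """
    return [
        "ffmpeg", "-y",          # overwrite the output file if it exists
        "-i", video_path,        # input video
        "-vn",                   # drop the video stream
        "-ac", "1",              # one audio channel (mono)
        "-ar", "16000",          # 16 kHz sample rate
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        wav_path,
    ]


def srt_timestamp(seconds: float) -> str:
    """Format a non-negative time in seconds as HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, total_ms = divmod(total_ms, 3_600_000)
    minutes, total_ms = divmod(total_ms, 60_000)
    secs, millis = divmod(total_ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: List[Tuple[float, float, str]]) -> str:
    """Render (start, end, text) segments as numbered SubRip entries."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # Entries are separated by a blank line.
    return "\n".join(blocks)
```

The command list can be passed to `subprocess.run(cmd, check=True)` on a machine with ffmpeg installed. The segments would come from whatever recognizer is used, and accuracy depends entirely on that step.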
What are some alternatives?
inaSpeechSegmenter - CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
nPerlinNoise - A robust open source implementation of Perlin Noise Algorithm for N-Dimensions in Python
ffsubsync - Automagically synchronize subtitles with video.
DeepSpeech-Italian-Model - Tooling for producing an Italian model (public release available) for DeepSpeech, plus a text corpus
netflix-to-srt - Rip, extract and convert subtitles to .srt closed captions from .xml/dfxp/ttml and .vtt/WebVTT (e.g. Netflix, YouTube)
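ffmpeg - A Python video-processing package based on FFmpeg. Because of the pandemic, work has been busy and I have not been in the mood to keep researching video, so this project has been shelved. The source code is simple, so feel free to study and extend it yourself.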
cleanvid - cleanvid is a little script to mute profanity in video files
cheetah - On-device streaming speech-to-text engine powered by deep learning
amazon-transcribe-output-word-document - An Amazon Transcribe demo to produce a Microsoft Word document containing the turn-by-turn transcription of the audio. This will include additional metadata depending upon the options selected, such as caller sentiment, category identification and issue detection
motion-tracking-video-crop - Crop motion tracked video with added smoothing movement
CCAligner - 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.