inaSpeechSegmenter
subaligner
Our great sponsors
inaSpeechSegmenter | subaligner | |
---|---|---|
3 | 3 | |
695 | 415 | |
2.9% | - | |
6.4 | 6.5 | |
about 1 month ago | 3 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
inaSpeechSegmenter
-
Listen to HD radio with a $30 RTL SDR dongle
I have a little hobby project where I record an FM radio music station using a SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.
I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion
Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps
The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.
I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.
-
(Unpopular?) opinion: KEXP’s John Richards is annoying AF
One key library that makes it possible is this one: https://github.com/ina-foss/inaSpeechSegmenter
-
ytmdl Web - A webapp that lets you download music by getting the audio from YouTube and metadata from various sources like Itunes, Last.FM, Gaana and others. v2 released with lots of fixes.
After looking for a few options, I came across inaSpeechSegmenter. It is a speech segmenter and if you pass it an audio file, it returns the time segments of noises and music.
subaligner
- Show HN: Subaligner – A Multilingual Subtitle Generator
- Show HN: Containerized subtitle synchronizer and translator
-
Show HN: A retrainable subtitle synchronizer you can now build your own
Now you can customize and train a new synchronizer using your own subtitles and audiovisual content: https://github.com/baxtree/subaligner
What are some alternatives?
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
madmom - Python audio and music signal processing library
alass - "Automatic Language-Agnostic Subtitle Synchronization"
GuitarTuner - Guitar tuner program made with Python, Tkinter and PyAudio.
JavascriptSubtitlesOctopus - Displays subtitles in .ass format from JavaScript. Supports most SSA/ASS features, easily integrates with HTML5 videos.
ytmdl - A simple app to get songs from YouTube in mp3 format with artist name, album name etc from sources like iTunes, Spotify, LastFM, Deezer, Gaana etc.
netflix-to-srt - Rip, extract and convert subtitles to .srt closed captions from .xml/dfxp/ttml and .vtt/WebVTT (e.g. Netflix, YouTube)
ffsubsync - Automagically synchronize subtitles with video.
sub37 - A set of libraries to parse, serve and show subtitles in video medias
sox-noise - Noise generator GUI powered by SoX
video-subtitle-extractor - 视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.