SaaSHub helps you find the best software and product alternatives Learn more →
Python speech-activity-detection Projects
-
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
I have a little hobby project where I record an FM radio music station using a SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.
I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion
Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps
The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.
I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.
Python speech-activity-detection related posts
- I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!
- I won several speaker diarization challenges with pyannote.audio
- Can Whisper differentiate between different voices?
- Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro
- A quick and dirty tool for automatically analyzing speaking time in online debates (Effortpost)
- Maybe next time I'll count how many words each one said
- [D] What is the best package for combined speech recognition and diarization on long conversation audio files?
-
A note from our sponsor - SaaSHub
www.saashub.com | 27 Apr 2024
Index
Project | Stars | |
---|---|---|
1 | inaSpeechSegmenter | 695 |
Sponsored