speech-activity-detection

Open-source projects categorized as speech-activity-detection

speech-activity-detection Open-Source Projects

speech-activity-detection
  • pyannote-audio

    Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

  • Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    pyannote/pyannote-audio

  • inaSpeechSegmenter

    CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

  • Project mention: Listen to HD radio with a $30 RTL SDR dongle | news.ycombinator.com | 2023-11-05

    I have a little hobby project where I record an FM radio music station using a SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.

    I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion

    Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps

    The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.

    I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

speech-activity-detection related posts

  • AI Transcribing tool for video with two voices?

    1 project | /r/ChatGPT | 22 Jun 2023
  • I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!

    2 projects | /r/apple | 1 Feb 2023
  • I won several speaker diarization challenges with pyannote.audio

    1 project | news.ycombinator.com | 2 Dec 2022
  • Can Whisper differentiate between different voices?

    1 project | /r/OpenAI | 16 Nov 2022
  • Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro

    1 project | /r/Destiny | 3 Sep 2022
  • A quick and dirty tool for automatically analyzing speaking time in online debates (Effortpost)

    1 project | /r/Destiny | 1 Sep 2022
  • Maybe next time I'll count how many words each one said

    2 projects | /r/Destiny | 1 Sep 2022
  • A note from our sponsor - SaaSHub
    www.saashub.com | 4 Jun 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

Project Stars
1 pyannote-audio 5,266
2 inaSpeechSegmenter 702

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com