pyannote-audio VS inaSpeechSegmenter

Compare pyannote-audio and inaSpeechSegmenter and see how they differ.

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender. (by ina-foss)
              pyannote-audio     inaSpeechSegmenter
Mentions      15                 3
Stars         5,077              695
Growth        3.4%               1.3%
Activity      8.6                6.4
Last commit   3 days ago         about 2 months ago
Language      Jupyter Notebook   Python
License       MIT License        MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

pyannote-audio

Posts with mentions or reviews of pyannote-audio. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-02.

inaSpeechSegmenter

Posts with mentions or reviews of inaSpeechSegmenter. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-05.
  • Listen to HD radio with a $30 RTL SDR dongle
    15 projects | news.ycombinator.com | 5 Nov 2023
    I have a little hobby project where I record an FM radio music station using an SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.

    I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion

    Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps.

    The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.

    I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.

  • (Unpopular?) opinion: KEXP’s John Richards is annoying AF
    1 project | /r/Seattle | 9 Aug 2022
    One key library that makes it possible is this one: https://github.com/ina-foss/inaSpeechSegmenter
  • ytmdl Web - A webapp that lets you download music by getting the audio from YouTube and metadata from various sources like Itunes, Last.FM, Gaana and others. v2 released with lots of fixes.
    5 projects | /r/Piracy | 26 Feb 2021
    After looking for a few options, I came across inaSpeechSegmenter. It is a speech segmenter and if you pass it an audio file, it returns the time segments of noises and music.
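
The posts above describe using inaSpeechSegmenter output to keep music and drop speech. A minimal sketch of that filtering step follows, assuming the segmentation is available as a list of (label, start, end) tuples with times in seconds and a "music" label. The helper name `music_intervals`, its `max_gap` and `min_length` parameters, and the sample data are hypothetical and are not part of the library; the code does not call inaSpeechSegmenter itself.

```python
from typing import Iterable, List, Tuple

# Assumed segment shape: (label, start_seconds, end_seconds).
Segment = Tuple[str, float, float]


def music_intervals(
    segments: Iterable[Segment],
    max_gap: float = 0.0,
    min_length: float = 0.0,
) -> List[Tuple[float, float]]:
    """Return merged (start, end) intervals for segments labelled "music".

    Two music segments are merged when the gap between them is at most
    max_gap seconds, so short interruptions are kept inside the interval.
    Merged intervals shorter than min_length seconds are dropped.
    """
    merged: List[Tuple[float, float]] = []
    for label, start, end in sorted(segments, key=lambda s: s[1]):
        if label != "music":
            continue
        if merged and start - merged[-1][1] <= max_gap:
            # Extend the previous interval instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return [(s, e) for s, e in merged if e - s >= min_length]


# Hypothetical segmentation of a short radio recording.
example = [
    ("music", 0.0, 10.0),
    ("speech", 10.0, 40.0),
    ("music", 40.0, 100.0),
    ("noEnergy", 100.0, 101.0),
    ("music", 101.0, 200.0),
]

strict = music_intervals(example)
tolerant = music_intervals(example, max_gap=1.0, min_length=30.0)
```

With the sample data, `strict` keeps three separate music intervals, while `tolerant` bridges the one-second silence and drops the ten-second fragment. The resulting intervals could then be handed to an audio tool to cut the recording; that step is left out here.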

What are some alternatives?

When comparing pyannote-audio and inaSpeechSegmenter you can also consider the following projects:

NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

madmom - Python audio and music signal processing library

speechbrain - A PyTorch-based Speech Toolkit

GuitarTuner - Guitar tuner program made with Python, Tkinter and PyAudio.

Resemblyzer - A python package to analyze and compare voices with deep learning

ytmdl - A simple app to get songs from YouTube in mp3 format with artist name, album name etc from sources like iTunes, Spotify, LastFM, Deezer, Gaana etc.

Kaldi Speech Recognition Toolkit - kaldi-asr/kaldi is the official location of the Kaldi project.

ffsubsync - Automagically synchronize subtitles with video.

uis-rnn - This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

subaligner - Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

segmentation_models.pytorch - Segmentation models with pretrained backbones. PyTorch.

sox-noise - Noise generator GUI powered by SoX