5 Best Open Source Libraries and APIs for Speaker Diarization

pyannote-audio

Mentions: 15 | Stars: 5,077 | Activity: 8.6 | Language: Jupyter Notebook

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Like Kaldi ASR, pyannote is an open source speaker diarization toolkit. It is written in Python and built on the PyTorch machine learning framework.
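To make the first building block listed above (speech activity detection) concrete, here is a minimal sketch in plain Python. It is not pyannote's method: pyannote uses neural models, while this toy version simply thresholds per-frame energy. The function name `detect_speech`, the threshold, and the frame length are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def detect_speech(
    frame_energies: List[float],
    threshold: float,
    frame_sec: float,
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds for runs of frames above threshold.

    Toy stand-in for speech activity detection: a frame counts as speech
    when its energy is strictly greater than `threshold`.
    """
    segments: List[Tuple[float, float]] = []
    start = None  # index of the first frame of the current speech run
    for i, energy in enumerate(frame_energies):
        if energy > threshold and start is None:
            start = i  # a speech run begins
        elif energy <= threshold and start is not None:
            segments.append((start * frame_sec, i * frame_sec))
            start = None  # the speech run ended at frame i
    if start is not None:
        # The audio ended while still inside a speech run.
        segments.append((start * frame_sec, len(frame_energies) * frame_sec))
    return segments


# Hypothetical energies for eight half-second frames.
energies = [0.0, 0.9, 0.8, 0.1, 0.0, 0.7, 0.6, 0.9]
print(detect_speech(energies, threshold=0.5, frame_sec=0.5))
```

In a full diarization system, the segments found this way would then be passed to the later stages (speaker change detection, embedding extraction) rather than used directly.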

Kaldi Speech Recognition Toolkit

Mentions: 22 | Stars: 13,735 | Activity: 6.7 | Language: Shell

kaldi-asr/kaldi is the official location of the Kaldi project.

Kaldi ASR is a well-known open source speech recognition platform. To use its speaker diarization library, you need to either download the project's PLDA backend or pre-trained x-vectors, or train your own models.
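As a rough illustration of what an embedding-plus-scoring backend does, the sketch below assigns speaker labels to per-segment embeddings. It is not Kaldi's implementation: cosine similarity stands in for PLDA scoring, the two-dimensional vectors stand in for real x-vectors, and the greedy assignment replaces proper clustering. The names `cosine` and `assign_speakers` and the 0.8 threshold are hypothetical.

```python
import math
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def assign_speakers(
    embeddings: Sequence[Sequence[float]],
    threshold: float = 0.8,
) -> List[int]:
    """Greedily label each segment embedding with a speaker index.

    Each embedding is compared with the first embedding seen for every
    known speaker. If the best score reaches `threshold`, the segment
    joins that speaker; otherwise it starts a new speaker.
    """
    references: List[List[float]] = []  # first embedding seen per speaker
    labels: List[int] = []
    for emb in embeddings:
        scores = [cosine(emb, ref) for ref in references]
        if scores and max(scores) >= threshold:
            labels.append(scores.index(max(scores)))
        else:
            references.append(list(emb))
            labels.append(len(references) - 1)
    return labels


# Toy embeddings: segments 0, 1, 4 point one way; segments 2, 3 another.
segments = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.95], [1.0, 0.05]]
print(assign_speakers(segments))
```

The threshold plays the same role here as a stopping criterion does in a real clustering backend: raising it splits the audio into more speakers, lowering it merges them.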


NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.


Related posts

Unsupervised (Semi-Supervised) ASR/STT training recipes
2 projects | /r/deeplearning | 3 Nov 2023

[D] ASR/Automatic Speech Recognition toolkit that provides precise word-level timing data? (eg, where in the audio stream a word starts and ends?)
2 projects | /r/MachineLearning | 23 Aug 2021

Show HN: Sonauto – a more controllable AI music creator
1 project | news.ycombinator.com | 10 Apr 2024

Amazon plans to charge for Alexa in June–unless internal conflict delays revamp
1 project | news.ycombinator.com | 20 Jan 2024

Steve's Explanation of the Viterbi Algorithm
1 project | news.ycombinator.com | 16 Oct 2023

This page summarizes the projects mentioned and recommended in the original post on dev.to
speaker-verification Audio Pytorch Kaldi speech-processing
Post date: 10 Feb 2022
