InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 8 Python speaker-recognition Projects
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA NeMo To perform speaker diarization using NVIDIA NeMo , follow these steps:
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Simple Diarizer Simple Diarizer is a speaker diarization library that utilizes pretrained models from SpeechBrain . To get started with simple_diarizer, follow these steps:
-
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
-
-
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
-
Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
-
Falcon Speaker Diarization Falcon Speaker Diarization is an on-device speaker diarization engine powered by deep learning. To get started with Falcon Speaker Diarization, follow these steps:
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Next Steps See the GitHub Python Demo for a more complete example, including how to handle Enrollment feedback, save Speaker Profiles to disk and use files as the audio input. You can also view the Python API docs for details on the package.
Python speaker-recognition discussion
Python speaker-recognition related posts
-
Speaker Diarization in Python
-
SpeechBrain 1.0: A free and open-source AI toolkit for all things speech
-
[D] Training ASR model using SpeechBrain
-
Whisper.cpp
-
Specific Voice recognition
-
[D] Is there a way to distinguish different human voices from 1 audio file ?
-
[D] Speech Enhancement SOTA
-
A note from our sponsor - InfluxDB
www.influxdata.com | 16 May 2025
Index
What are some of the best open-source speaker-recognition projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | NeMo | 14,217 |
2 | speechbrain | 9,808 |
3 | uis-rnn | 1,573 |
4 | SincNet | 1,171 |
5 | ECAPA-TDNN | 666 |
6 | Angular-Penalty-Softmax-Losses-Pytorch | 486 |
7 | falcon | 45 |
8 | eagle | 35 |