ECAPA-TDNN
SincNet
| | ECAPA-TDNN | SincNet |
|---|---|---|
| Mentions | 1 | 3 |
| Stars | 529 | 1,097 |
| Growth | - | - |
| Activity | 1.0 | 0.0 |
| Last commit | 26 days ago | about 3 years ago |
| Language | Python | Python |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ECAPA-TDNN
- Using Edge Biometrics For Better AI Security System Development
The previous model, based on the Jasper architecture, could not verify recordings of the same person taken with different microphones. We solved this problem by switching to the ECAPA-TDNN architecture from the SpeechBrain framework, trained on the VoxCeleb2 dataset, which did a better job of verifying employees.
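As a rough illustration of the verification step described in that post, the sketch below scores two speaker embeddings with cosine similarity and accepts the pair if the score clears a threshold. It assumes the embeddings were already extracted by a model such as ECAPA-TDNN; the toy 4-dimensional vectors, the function names `cosine_similarity` and `same_speaker`, and the threshold of 0.25 are all hypothetical choices for the example, not values from the post.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(emb_a, emb_b, threshold=0.25):
    """Return (score, decision). The threshold is an illustrative assumption
    and would normally be tuned on held-out trials."""
    score = cosine_similarity(emb_a, emb_b)
    return score, score >= threshold


# Toy vectors standing in for real speaker embeddings (assumption).
enrolled = [0.9, 0.1, 0.3, 0.2]
same_person_other_mic = [0.8, 0.2, 0.35, 0.1]
different_person = [-0.2, 0.9, -0.4, 0.1]

print(same_speaker(enrolled, same_person_other_mic))
print(same_speaker(enrolled, different_person))
```

In practice the threshold is picked on a validation set that includes recordings from several microphones, since channel mismatch is exactly what lowered the scores of the earlier model.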
SincNet
- Does this SincNet (neural architecture) contain a discriminator?
- TypeError: layer_norm(): argument 'input' (position 1) must be Tensor, not SincNet.
The SincNet class is taken from here: https://github.com/mravanelli/SincNet/blob/master/dnn_models.py
- [R][P] Announcing audax, an audio ML/DL framework in Jax
Code for https://arxiv.org/abs/1808.00158 found: https://github.com/mravanelli/SincNet
What are some alternatives?
uis-rnn - This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
retinaface - RetinaFace: Deep Face Detection Library for Python
speechbrain - A PyTorch-based Speech Toolkit
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
cnnimageretrieval-pytorch - CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch
Image-Forgery-Detection-CNN - Image forgery detection using convolutional neural networks. Group 10's final project for TU Delft's course CS4180 Deep Learning 2019.
UniSpeech - UniSpeech - Large Scale Self-Supervised Learning for Speech
ruptures - ruptures: change point detection in Python
UHV-OTS-Speech - A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
stereo-image-generation - This repository contains code to generate stereo (Side by side) image from a single image.