n2d VS uis-rnn

Compare n2d vs uis-rnn and see what are their differences.

n2d

A deep clustering algorithm. Code to reproduce results for our paper N2D: (Not Too) Deep Clustering via Clustering the Local Manifold of an Autoencoded Embedding. (by rymc)

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization. (by google)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
n2d uis-rnn
1 3
122 1,529
- 0.3%
0.0 3.5
6 months ago 8 months ago
Python Python
GNU General Public License v3.0 only Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

n2d

Posts with mentions or reviews of n2d. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-09-12.
  • Time-Series image clustering. Advice needed!
    2 projects | /r/gis | 12 Sep 2021
    So far I found some approaches that looks promising, for example n2d or k-means with DTW distance, and there are some more (e.g. T-DPSOM), but I want to start from these.

uis-rnn

Posts with mentions or reviews of uis-rnn. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-10-03.
  • [D] Is there a way to distinguish different human voices from 1 audio file ?
    2 projects | /r/MachineLearning | 3 Oct 2022
    Looks like you can get an put of the box here: https://github.com/google/uis-rnn
  • Putting my degree to use. (Exclude Specials and Guests)
    1 project | /r/TrashTaste | 5 Jun 2021
    Discussion: - When I started this, I thought I would use something like the VoxSort Diarization and it would be easy. But these apps are terrible, especially in recognizing Joey apart from Garnt. Connor has a distinct voice so it was recognizable but still bad. But I didn't think Joey's and Garnt's voices were so similar. - Tested the thing and it's accuracy is almost 99%. - You can still improve this by cutting the episode into smaller chunk but 1 second is the maximum for my computer, any smaller than that i will run out of RAM. I can work to get around this but hey I'm lazy. - The library to implement yourself from google.
  • Finally, my degree can be useful
    1 project | /r/TrashTaste | 5 Jun 2021
    I used this algorithm from Google to determine "who spoke when".

What are some alternatives?

When comparing n2d and uis-rnn you can also consider the following projects:

minisom - :red_circle: MiniSom is a minimalistic implementation of the Self Organizing Maps

pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pyDenStream - Implementation of the DenStream algorithm in Python.

lightning-bolts - Toolbox of models, callbacks, and datasets for AI/ML researchers.

orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis

hover - :speedboat: Label data at scale. Fun and precision included.

ECAPA-TDNN - Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Clover - An Efficient DNA Clustering algorithm based on Tree Structure.