uis-rnn
hover
Our great sponsors
uis-rnn | hover | |
---|---|---|
3 | 1 | |
1,529 | 313 | |
0.3% | - | |
3.5 | 3.0 | |
8 months ago | 10 days ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
uis-rnn
-
[D] Is there a way to distinguish different human voices from 1 audio file ?
Looks like you can get an put of the box here: https://github.com/google/uis-rnn
-
Putting my degree to use. (Exclude Specials and Guests)
Discussion: - When I started this, I thought I would use something like the VoxSort Diarization and it would be easy. But these apps are terrible, especially in recognizing Joey apart from Garnt. Connor has a distinct voice so it was recognizable but still bad. But I didn't think Joey's and Garnt's voices were so similar. - Tested the thing and it's accuracy is almost 99%. - You can still improve this by cutting the episode into smaller chunk but 1 second is the maximum for my computer, any smaller than that i will run out of RAM. I can work to get around this but hey I'm lazy. - The library to implement yourself from google.
-
Finally, my degree can be useful
I used this algorithm from Google to determine "who spoke when".
hover
What are some alternatives?
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
diffgram - The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
pyDenStream - Implementation of the DenStream algorithm in Python.
lightning-bolts - Toolbox of models, callbacks, and datasets for AI/ML researchers.
orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis
ECAPA-TDNN - Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Clover - An Efficient DNA Clustering algorithm based on Tree Structure.