uis-rnn
pyDenStream
Our great sponsors
uis-rnn | pyDenStream | |
---|---|---|
3 | 1 | |
1,529 | 9 | |
0.3% | - | |
3.5 | 5.2 | |
8 months ago | 1 day ago | |
Python | Python | |
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
uis-rnn
-
[D] Is there a way to distinguish different human voices from 1 audio file ?
Looks like you can get an put of the box here: https://github.com/google/uis-rnn
-
Putting my degree to use. (Exclude Specials and Guests)
Discussion: - When I started this, I thought I would use something like the VoxSort Diarization and it would be easy. But these apps are terrible, especially in recognizing Joey apart from Garnt. Connor has a distinct voice so it was recognizable but still bad. But I didn't think Joey's and Garnt's voices were so similar. - Tested the thing and it's accuracy is almost 99%. - You can still improve this by cutting the episode into smaller chunk but 1 second is the maximum for my computer, any smaller than that i will run out of RAM. I can work to get around this but hey I'm lazy. - The library to implement yourself from google.
-
Finally, my degree can be useful
I used this algorithm from Google to determine "who spoke when".
pyDenStream
-
[P] Implementation of DenStream
The implementation can be found here: https://github.com/MrParosk/pyDenStream
What are some alternatives?
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
stringlifier - Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.
lightning-bolts - Toolbox of models, callbacks, and datasets for AI/ML researchers.
dedupe - :id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis
impfuzzy - Fuzzy Hash calculated from import API of PE files
hover - :speedboat: Label data at scale. Fun and precision included.
ECAPA-TDNN - Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
hazelcast-python-client - Hazelcast Python Client
Clover - An Efficient DNA Clustering algorithm based on Tree Structure.