pyDenStream
uis-rnn
Metric | pyDenStream | uis-rnn |
---|---|---|
Mentions | 1 | 3 |
Stars | 9 | 1,528 |
Growth | - | 0.2% |
Activity | 5.2 | 3.5 |
Last commit | about 2 months ago | 8 months ago |
Language | Python | Python |
License | - | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pyDenStream
-
[P] Implementation of DenStream
The implementation can be found here: https://github.com/MrParosk/pyDenStream
uis-rnn
-
[D] Is there a way to distinguish different human voices from 1 audio file ?
Looks like you can get an out-of-the-box solution here: https://github.com/google/uis-rnn
-
Putting my degree to use. (Exclude Specials and Guests)
Discussion:
- When I started this, I thought I would use something like the VoxSort Diarization and it would be easy. But these apps are terrible, especially at telling Joey apart from Garnt. Connor has a distinct voice, so it was recognizable but still bad. I didn't think Joey's and Garnt's voices were so similar.
- Tested the thing and its accuracy is almost 99%.
- You can still improve this by cutting the episode into smaller chunks, but 1 second is the limit for my computer; any smaller than that and I will run out of RAM. I can work to get around this, but hey, I'm lazy.
- The library, from Google, if you want to implement this yourself.
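The chunking step mentioned in the post above could look roughly like the following. This is a minimal sketch, not the poster's actual code: the function name `split_into_chunks`, the 16 kHz sample rate, and the plain-list signal are all assumptions made for illustration, and the step that would turn each chunk into a speaker embedding for a diarization library is left out.

```python
from typing import List, Sequence


def split_into_chunks(samples: Sequence[float], sample_rate: int,
                      chunk_seconds: float = 1.0) -> List[Sequence[float]]:
    """Split a mono signal into consecutive fixed-length chunks.

    The last chunk may be shorter than the others.
    """
    chunk_len = int(sample_rate * chunk_seconds)
    if chunk_len <= 0:
        raise ValueError("chunk length must be at least one sample")
    return [samples[i:i + chunk_len] for i in range(0, len(samples), chunk_len)]


# Hypothetical input: 2.5 seconds of silence sampled at 16 kHz.
signal = [0.0] * 40000
chunks = split_into_chunks(signal, sample_rate=16000, chunk_seconds=1.0)
print(len(chunks), [len(c) for c in chunks])
```

Shorter chunks give finer "who spoke when" boundaries but produce more segments to process, which is consistent with the memory limit the poster describes.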
-
Finally, my degree can be useful
I used this algorithm from Google to determine "who spoke when".
What are some alternatives?
stringlifier - Stringlifier is an open-source ML library for detecting random strings in raw text. It can be used for sanitising logs, detecting accidentally exposed credentials, and as a pre-processing step in unsupervised ML-based analysis of application text data.
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
dedupe - A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
lightning-bolts - Toolbox of models, callbacks, and datasets for AI/ML researchers.
impfuzzy - Fuzzy Hash calculated from import API of PE files
orange - 🍊 Orange: Interactive data analysis
hover - Label data at scale. Fun and precision included.
hazelcast-python-client - Hazelcast Python Client
ECAPA-TDNN - Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Clover - An Efficient DNA Clustering algorithm based on Tree Structure.