Our great sponsors
-
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Discussion: - When I started this, I thought I would use something like the VoxSort Diarization and it would be easy. But these apps are terrible, especially in recognizing Joey apart from Garnt. Connor has a distinct voice so it was recognizable but still bad. But I didn't think Joey's and Garnt's voices were so similar. - Tested the thing and it's accuracy is almost 99%. - You can still improve this by cutting the episode into smaller chunk but 1 second is the maximum for my computer, any smaller than that i will run out of RAM. I can work to get around this but hey I'm lazy. - The library to implement yourself from google.