- pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, and speaker embedding.
If any CS fellows want to try this out, https://github.com/pyannote/pyannote-audio seems to be the right tool for the job.
Related posts
- AI Transcribing tool for video with two voices?
- I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!
- I won several speaker diarization challenges with pyannote.audio
- Can Whisper differentiate between different voices?
- Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro