Can Whisper differentiate between different voices?

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

pyannote-audio

15 5,027 8.6 Jupyter Notebook

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Whisper can’t, but pyannote-audio can. I’ve seen a couple of prototypes out there which link the two together.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

AI Transcribing tool for video with two voices?

1 project | /r/ChatGPT | 22 Jun 2023
I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!

2 projects | /r/apple | 1 Feb 2023
I won several speaker diarization challenges with pyannote.audio

1 project | news.ycombinator.com | 2 Dec 2022
Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro

1 project | /r/Destiny | 3 Sep 2022
A quick and dirty tool for automatically analyzing speaking time in online debates (Effortpost)

1 project | /r/Destiny | 1 Sep 2022

Can Whisper differentiate between different voices?

This page summarizes the projects mentioned and recommended in the original post on /r/OpenAI
Pytorch speech-processing speaker-diarization speech-activity-detection speaker-change-detection
Post date: 16 Nov 2022

pyannote-audio

InfluxDB

Related posts

AI Transcribing tool for video with two voices?

I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!

I won several speaker diarization challenges with pyannote.audio

Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro

A quick and dirty tool for automatically analyzing speaking time in online debates (Effortpost)

Can Whisper differentiate between different voices?

This page summarizes the projects mentioned and recommended in the original post on /r/OpenAI Pytorch speech-processing speaker-diarization speech-activity-detection speaker-change-detection Post date: 16 Nov 2022

pyannote-audio

InfluxDB

Related posts

AI Transcribing tool for video with two voices?

I wanted to use OpenAI's Whisper speech-to-text on my Mac without installing stuff in the Terminal so I made MacWhisper, a free Mac app to transcribe audio and video files for easy transcription and subtitle generation. Would love to hear some feedback on it!

I won several speaker diarization challenges with pyannote.audio

Post-Game Analysis: Destiny &amp; Alex VS Andrew &amp; Zen Shapiro

A quick and dirty tool for automatically analyzing speaking time in online debates (Effortpost)

This page summarizes the projects mentioned and recommended in the original post on /r/OpenAI
Pytorch speech-processing speaker-diarization speech-activity-detection speaker-change-detection
Post date: 16 Nov 2022

Post-Game Analysis: Destiny & Alex VS Andrew & Zen Shapiro