WhisperSpeech Alternatives

Similar projects and alternatives to WhisperSpeech

llama.cpp

769 56,891 10.0 C++ WhisperSpeech VS llama.cpp

LLM inference in C/C++
whisper

343 60,303 6.4 Python WhisperSpeech VS whisper

Robust Speech Recognition via Large-Scale Weak Supervision
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Retrieval-based-Voice-Conversion-WebUI

56 19,077 9.6 Python WhisperSpeech VS Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!
piper

33 3,902 8.9 C++ WhisperSpeech VS piper

A fast, local neural text to speech system (by rhasspy)
espnet

15 7,872 10.0 Python WhisperSpeech VS espnet

End-to-End Speech Processing Toolkit
OpenVoice

14 17,263 8.8 Python WhisperSpeech VS OpenVoice

Instant voice cloning by MyShell.
WhisperFusion

3 1,379 8.7 Python WhisperSpeech VS WhisperFusion

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
whisper-ctranslate2

3 743 8.5 Python WhisperSpeech VS whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.
vocode-python

9 2,287 9.1 Python WhisperSpeech VS vocode-python

🤖 Build voice-based LLM agents. Modular + open source.
StyleTTS2

7 4,036 8.7 Python WhisperSpeech VS StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
tts-generation-webui

5 1,260 8.6 Python WhisperSpeech VS tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
monotonic_align

1 63 10.0 Cython WhisperSpeech VS monotonic_align

Monotonic Alignment Search
EmotiVoice

5 6,303 8.9 Python WhisperSpeech VS EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
VoiceCraft

3 6,587 5.4 Jupyter Notebook WhisperSpeech VS VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild
WhisperLive

4 1,180 9.4 Python WhisperSpeech VS WhisperLive

A nearly-live implementation of OpenAI's Whisper.
Retrieval-based-Voice-Convers

4 - - WhisperSpeech VS Retrieval-based-Voice-Convers
emotivoice-cli

1 5 6.2 JavaScript WhisperSpeech VS emotivoice-cli

CLI wrapper around Emotivoice TTS Synthesis
moondream

3 3,607 9.0 Jupyter Notebook WhisperSpeech VS moondream

tiny vision language model
llm-companion

2 23 6.7 JavaScript WhisperSpeech VS llm-companion

Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs
captioner

1 4 10.0 TypeScript WhisperSpeech VS captioner

Generate subtitles of videos in the browser (by Rodeoclash)
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better WhisperSpeech alternative or higher similarity.

Suggest an alternative to WhisperSpeech

WhisperSpeech reviews and mentions

Posts with mentions or reviews of WhisperSpeech. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-29.

OpenVoice: Versatile Instant Voice Cloning
7 projects | news.ycombinator.com | 29 Mar 2024

I haven't tried openvoice, but I did try whisperspeech and it will do the same thing. You can optionally pass in a file with a reference voice, and the tts uses it.
https://github.com/collabora/whisperspeech
I found it to be kind of creepy hearing it in my own voice. I also tried a friend of mine who had a french canadian accent and strangely the output didn't have his accent.
Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot
7 projects | news.ycombinator.com | 29 Jan 2024

- WhisperSpeech for the text-to-speech - https://github.com/collabora/WhisperSpeech
and an LLM (phi-2, Mistral, etc.) in between
WhisperFusion: Ultra-low latency conversations with an AI chatbot
2 projects | news.ycombinator.com | 25 Jan 2024

Hi, I used the [WhisperSpeech](https://github.com/collabora/WhisperSpeech) model for the TTS part after I did some serious torch.compile optimizations to bring the latency down. The Whisper speech recognition and the LLM were optimized through TensorRT-LLM by Marcus and Vineet.
It's not perfect but I am still extremely proud of how it came out. :)
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
9 projects | news.ycombinator.com | 17 Jan 2024
StyleTTS2 – open-source Eleven Labs quality Text To Speech
10 projects | news.ycombinator.com | 19 Nov 2023

I think you’re talking about just using Whisper to annotate audio for a TTS pipeline but someone from Collabora actually created a TTS model directly from Whisper embeddings https://github.com/collabora/WhisperSpeech
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →