SaaSHub helps you find the best software and product alternatives Learn more β
TTS Alternatives
Similar projects and alternatives to TTS
-
-
InfluxDB
InfluxDB β Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
-
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
-
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
-
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
-
willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
-
-
-
-
-
STT
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
-
voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
TTS discussion
TTS reviews and mentions
-
Real-time Voice Chat at ~500ms Latency
That is probably the reason you can't find that much.
*https://coqui.ai/
-
Show HN: Voice-Pro β AI Voice Cloning Magic: Transform Any Voice in 15 Seconds
It's really easy for a technical person to do as well.
I use Coqui TTS[0] as part of my home automation, I wrote a small python script that lets me upload a voice clip for it to clone after I got the idea from HeyWillow[1], and a small shim that lets me send the output to a Home Assistant media player instead of using their standard output device. I run the TTS container on a VM with a Tesla P4 (~Β£100 to buy) and get about 1x-2x (roughly the same time it'd take to say it, to process) using the large model.
Just for a giggle, I uploaded a few 3s-5s second clip of myself speaking and cloned my voice, then executed a command to our living room media player to call my wife into the room; from another room, she was 100% convinced it was myself speaking words I'd never spoken.
I tried playing with a variety of sentences for a few hours and overall, it sounded almost exactly like me, to me, with the exception of some "attitude" and "intonation" I know I wouldn't use in my speech. I didn't notice much of an improvement using much longer clips; the short ones were "good enough".
Tangentially, it really bugs me that most phone providers in the UK insist you record a "personal greeting" now before they'll let you check your voice mail box, I just record silence, because the last thing I want/need is a voicemail greeting in my voice confirming to some randomer I didn't want calling me, who I am and that my number is active, even more so knowing how I can
[0] https://github.com/coqui-ai/TTS
-
Show HN: Offline audiobook from any format with one CLI command
For anyone who is interested, CoquiTTS (formerly, MozillaTTS) was great, but the project isn't maintained anymore (athough there's been some confusion about whether or not it's active. See: https://github.com/coqui-ai/TTS/issues/4022).
Looks like there's an effort to keep an actively maintained fork here, though: https://github.com/idiap/coqui-ai-TTS
-
Ask HN: What is the state of OSS voice cloning?
I am super impressed by the quality of voice cloning offered by Eleven Labs and Play.ai. I feel like I see impressive OSS demos on social frequently, but last weekend I took a few popular ones for a spin and quality wasn't even close to the proprietary models.
https://github.com/coqui-ai/tts
- AIM Weekly 17 June 2024
-
Coqui.ai TTS: A Deep Learning Toolkit for Text-to-Speech
The license is the MPL, which allows commercial use?
https://github.com/coqui-ai/TTS/blob/dev/LICENSE.txt
-
Show HN: Pi-C.A.R.D, a Raspberry Pi Voice Assistant
When I did a similar thing (but with less LLM) I liked https://github.com/coqui-ai/TTS but back then I needed to cut out the conversion step from tensor to a list of numbers to make it work really nicely.
-
Ask HN: Open-source, local Text-to-Speech (TTS) generators
I just noticed that https://coqui.ai/ is "Shutting down".
I'm building a web app (React / Django) which takes a list of affirmations & goals (in Markdown files), puts them into a database (SQlite), and uses voice synthesis to create voice audio files of the phrases. These are combined with a relaxed backing track (ffmpeg), made into playlists of 10-20 phrases (randomly sampled, or according to a theme: "mind" "body" "soul") and then play automatically in the morning & evening (cron). This allows you to persistently hear & vocalize your own goals & good vibes over time.
I had been planning to use Coqui TTS as the local text-to-speech engine, but with this cancellation, I'd love to hear from the community what is a great open-source, local text-to-speech engine?
Generally, I learn both the highest quality commercially available technology (example: ElevenLabs), and also the best open-source equivalent. Would love to hear suggestions & perspectives on this. What voice synth tools are you investing your time into learning & building with?
-
OpenAI deems its voice cloning tool too risky for general release
lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.
-
What things are happening in ML that we can't hear oer the din of LLMs?
Not sure how relevant this is but note that Coqui TTS (the realistic TTS) has already shut down
https://coqui.ai
-
A note from our sponsor - SaaSHub
www.saashub.com | 23 May 2025
Stats
coqui-ai/TTS is an open source project licensed under Mozilla Public License 2.0 which is an OSI approved license.
The primary programming language of TTS is Python.