TTS

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production (by coqui-ai)

TTS Alternatives

Similar projects and alternatives to TTS

  1. llama.cpp

    900 TTS VS llama.cpp

    LLM inference in C/C++

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. whisper

    367 TTS VS whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

  4. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  5. tortoise-tts

    148 TTS VS tortoise-tts

    A multi-voice TTS system trained with an emphasis on quality

  6. Real-Time-Voice-Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

  7. DeepSpeech

    69 TTS VS DeepSpeech

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

  8. common-voice

    67 TTS VS common-voice

    Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. piper

    57 TTS VS piper

    A fast, local neural text to speech system

  11. TTS

    62 TTS VS TTS

    :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)

  12. silero-models

    34 TTS VS silero-models

    Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

  13. willow

    40 TTS VS willow

    Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

  14. NeMo

    30 TTS VS NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  15. espeak-ng

    eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

  16. mimic3

    24 TTS VS mimic3

    A fast local neural text to speech engine for Mycroft

  17. larynx

    18 TTS VS larynx

    Discontinued End to end text to speech system using gruut and onnx

  18. audio-webui

    15 TTS VS audio-webui

    A webui for different audio related Neural Networks

  19. opentts

    11 TTS VS opentts

    Open Text to Speech Server

  20. STT

    11 TTS VS STT

    🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

  21. voice-pro

    11 TTS VS voice-pro

    Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

  22. gTTS

    3 TTS VS gTTS

    Python library and CLI tool to interface with Google Translate's text-to-speech API

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better TTS alternative or higher similarity.

TTS discussion

Log in or Post with

TTS reviews and mentions

Posts with mentions or reviews of TTS. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-05-05.
  • Real-time Voice Chat at ~500ms Latency
    11 projects | news.ycombinator.com | 5 May 2025
    That is probably the reason you can't find that much.

    *https://coqui.ai/

  • Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds
    10 projects | news.ycombinator.com | 27 Nov 2024
    It's really easy for a technical person to do as well.

    I use Coqui TTS[0] as part of my home automation, I wrote a small python script that lets me upload a voice clip for it to clone after I got the idea from HeyWillow[1], and a small shim that lets me send the output to a Home Assistant media player instead of using their standard output device. I run the TTS container on a VM with a Tesla P4 (~Β£100 to buy) and get about 1x-2x (roughly the same time it'd take to say it, to process) using the large model.

    Just for a giggle, I uploaded a few 3s-5s second clip of myself speaking and cloned my voice, then executed a command to our living room media player to call my wife into the room; from another room, she was 100% convinced it was myself speaking words I'd never spoken.

    I tried playing with a variety of sentences for a few hours and overall, it sounded almost exactly like me, to me, with the exception of some "attitude" and "intonation" I know I wouldn't use in my speech. I didn't notice much of an improvement using much longer clips; the short ones were "good enough".

    Tangentially, it really bugs me that most phone providers in the UK insist you record a "personal greeting" now before they'll let you check your voice mail box, I just record silence, because the last thing I want/need is a voicemail greeting in my voice confirming to some randomer I didn't want calling me, who I am and that my number is active, even more so knowing how I can

    [0] https://github.com/coqui-ai/TTS

  • Show HN: Offline audiobook from any format with one CLI command
    7 projects | news.ycombinator.com | 6 Oct 2024
    For anyone who is interested, CoquiTTS (formerly, MozillaTTS) was great, but the project isn't maintained anymore (athough there's been some confusion about whether or not it's active. See: https://github.com/coqui-ai/TTS/issues/4022).

    Looks like there's an effort to keep an actively maintained fork here, though: https://github.com/idiap/coqui-ai-TTS

  • Ask HN: What is the state of OSS voice cloning?
    6 projects | news.ycombinator.com | 30 Sep 2024
    I am super impressed by the quality of voice cloning offered by Eleven Labs and Play.ai. I feel like I see impressive OSS demos on social frequently, but last weekend I took a few popular ones for a spin and quality wasn't even close to the proprietary models.

    https://github.com/coqui-ai/tts

  • AIM Weekly 17 June 2024
    26 projects | dev.to | 17 Jun 2024
  • Coqui.ai TTS: A Deep Learning Toolkit for Text-to-Speech
    6 projects | news.ycombinator.com | 11 Jun 2024
    The license is the MPL, which allows commercial use?

    https://github.com/coqui-ai/TTS/blob/dev/LICENSE.txt

  • Show HN: Pi-C.A.R.D, a Raspberry Pi Voice Assistant
    3 projects | news.ycombinator.com | 13 May 2024
    When I did a similar thing (but with less LLM) I liked https://github.com/coqui-ai/TTS but back then I needed to cut out the conversion step from tensor to a list of numbers to make it work really nicely.
  • Ask HN: Open-source, local Text-to-Speech (TTS) generators
    2 projects | news.ycombinator.com | 7 May 2024
    I just noticed that https://coqui.ai/ is "Shutting down".

    I'm building a web app (React / Django) which takes a list of affirmations & goals (in Markdown files), puts them into a database (SQlite), and uses voice synthesis to create voice audio files of the phrases. These are combined with a relaxed backing track (ffmpeg), made into playlists of 10-20 phrases (randomly sampled, or according to a theme: "mind" "body" "soul") and then play automatically in the morning & evening (cron). This allows you to persistently hear & vocalize your own goals & good vibes over time.

    I had been planning to use Coqui TTS as the local text-to-speech engine, but with this cancellation, I'd love to hear from the community what is a great open-source, local text-to-speech engine?

    Generally, I learn both the highest quality commercially available technology (example: ElevenLabs), and also the best open-source equivalent. Would love to hear suggestions & perspectives on this. What voice synth tools are you investing your time into learning & building with?

  • OpenAI deems its voice cloning tool too risky for general release
    1 project | news.ycombinator.com | 31 Mar 2024
    lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.
  • What things are happening in ML that we can't hear oer the din of LLMs?
    3 projects | news.ycombinator.com | 28 Mar 2024
    Not sure how relevant this is but note that Coqui TTS (the realistic TTS) has already shut down

    https://coqui.ai

  • A note from our sponsor - SaaSHub
    www.saashub.com | 23 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more β†’

Stats

Basic TTS repo stats
241
40,169
8.1
9 months ago

coqui-ai/TTS is an open source project licensed under Mozilla Public License 2.0 which is an OSI approved license.

The primary programming language of TTS is Python.


Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?