TTS vs TensorFlowTTS

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)

Source Code

Suggest alternative

Edit details

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages) (by TensorSpeech)

Source Code

tensorspeech.github.io

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

TTS		TensorFlowTTS
	Project
62	Mentions	6
8,806	Stars	3,702
2.2%	Growth	1.4%
0.0	Activity	0.0
6 months ago	Latest Commit	5 months ago
Jupyter Notebook	Language	Python
Mozilla Public License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

TTS

Posts with mentions or reviews of TTS. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-04.

Any recommendation for human like voice AI model for conversation AI?
2 projects | news.ycombinator.com | 4 Jan 2024

Fast or good, choose one
Mozilla's TTS is a python package installable with pip and uses cpu or gpu resources to render a choice of voices, they mostly sound natural and this is the good. https://github.com/mozilla/TTS
Mycroft's mimic3 is the default voice renderer for the Mycroft project that runs on pi hardware and sounds ok-ish, that is the fast. https://github.com/MycroftAI/mimic3
There are many others but these are the two I use according to if it needs to run on limited hardware or if the cycles fall freely from the sky.
Coqui.ai Is Shutting Down
4 projects | news.ycombinator.com | 3 Jan 2024

Coqui-ai was a commercial continuation of Mozilla TTS and STT (https://github.com/mozilla/TTS).
At the time (2018-ish), it was really impressive for on-device voice synthesis (with a quality approaching the Google and Azure cloud-based voice synthesis options) and open source, so a lot of people in the FOSS community were hoping it could be used for a privacy-respecting home assistant, Linux speech synthesis that doesn't suck, etc.
After Mozilla abandoned the project, Coqui continued development and had some really impressive one-shot voice cloning, but pivoted to marketing speech synthesis for game developers. They were probably having trouble monetizing it, and it doesn't surprise me that they shut down.
An equivalent project that's still in active development and doing really well is Piper TTS (https://github.com/rhasspy/piper).
What self hosted app do you wish existed?
17 projects | /r/selfhosted | 18 Jun 2023

An RSS reader that integrates TTS (or TTS)
Audio Converter! How to write one in c/c++?
1 project | /r/AskProgramming | 14 May 2023

My solution would be to use a speech synthesis library, maybe eSpeak or Festival, just for ease of use; I think they each provide a library that you could use from C or C++ easily. This one from Mozilla is a more modern system with better-quality output, but it looks like it's set up to run through Python, and I haven't looked at it closely enough to see how much work it would be to get it working for you.
Web Speech API is (still) broken on Linux circa 2023
8 projects | /r/javascript | 15 Apr 2023

There is a lot of TTS and SST development going on (https://github.com/mozilla/TTS; https://github.com/mozilla/DeepSpeech; https://github.com/common-voice/common-voice). That is the only way they work: Contributions from the wild.
[P] Balacoon: free-to-use text-to-speech
3 projects | /r/MachineLearning | 13 Apr 2023

unfortunately not yet. I need to expand the library of languages and voices. looking around, it seems only Coqui had some traction re Brazilian Portuguese: https://github.com/mozilla/TTS/issues/160. If you foresee wide adoption of the tech for this locale, hit me up with DM
Text to speech free
1 project | /r/software | 9 Apr 2023

I haven't used it, but there's also mozilla/TTS.
Does anyone know how to set up Mozilla TTS to work with firefox's reader view?
1 project | /r/firefox | 31 Mar 2023

Mozilla TTS
Conteúdo removido do rb que fiz sobre a destruição do Rio Doce 853KM de rio pela Vale e BHP Billings
1 project | /r/brasilivre | 25 Mar 2023
[D] Looking for someone to do a small coding job
2 projects | /r/MachineLearning | 25 Feb 2023

Instead, just use Firefox's open-source TTS model: https://github.com/mozilla/TTS

TensorFlowTTS

Posts with mentions or reviews of TensorFlowTTS. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-17.

Ask HN: On-Device Text to Speech
1 project | news.ycombinator.com | 31 Aug 2023

Hey HN, has anyone found a viable solution for doing this locally and offline on iOS? I'd like to offer a privacy-friendly text to speech feature to my App, and Apple's speech synthesis sounds awful compared to some newer models and TTS engines. The only thing I've found is an older TensorflowTTS example here: https://github.com/TensorSpeech/TensorFlowTTS/tree/master/examples/ios
Any pointers or tips appreciated.
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
14 projects | news.ycombinator.com | 17 May 2022

I had a lot of success using [FastSpeech2 + MB MelGAN via TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS). There are demos for [iOS](https://github.com/TensorSpeech/TensorFlowTTS/tree/master/ex...) and [Android](https://github.com/TensorSpeech/TensorFlowTTS/tree/master/ex...) which will allow you to run pretty convincing, modern TTS models with only a few hundred milliseconds of processing latency.
TTS mobile help
2 projects | /r/tensorflow | 30 Mar 2022

I need an example of how I would go about it. I've combed through examples but it's just not clicking for me.
A Working TTS feature has been found (No Google Services Required)
1 project | /r/fossdroid | 11 Oct 2021

https://github.com/TensorSpeech/TensorFlowTTS was the project. It was pretty much a direct compile and run. I went through and added the required features to enable it as TTS service for Android. I also moved the Tensorflow portion into a separate thread from the TTS service directly, since Android restricts it's TTS service to a single thread, and the Tensorflow service uses five threads to run at a good speed. It's a much much heavier solution than a C/C++ compiled library, but it works out of the box and I will worry about optimizations later
Free library for text-to-speech
1 project | /r/speechrecognition | 26 Jan 2021

You need to try, it implements most advanced algorithms and not as ad-hoc as nvidia https://github.com/TensorSpeech/TensorFlowTTS
Reviving the 1973 Unix text to voice translator
1 project | news.ycombinator.com | 3 Jan 2021

For open source offline TTS with more or less recent algorithms you can check
https://github.com/TensorSpeech/TensorFlowTTS

What are some alternatives?

When comparing TTS and TensorFlowTTS you can also consider the following projects:

Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time

tortoise-tts - A multi-voice TTS system trained with an emphasis on quality

STT - 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

flowtron - Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

hifi-gan - HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

FairMOT - [IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking

PaddleSpeech - Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Lip2Speech - A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.

TTS vs Real-Time-Voice-Cloning TensorFlowTTS vs tortoise-tts TTS vs STT TensorFlowTTS vs TTS TTS vs DeepSpeech TensorFlowTTS vs flowtron TTS vs NeMo TensorFlowTTS vs hifi-gan TTS vs TTS TensorFlowTTS vs FairMOT TTS vs PaddleSpeech TensorFlowTTS vs Lip2Speech

Compare TTS vs TensorFlowTTS and see what are their differences.

TTS

TensorFlowTTS

TTS

TensorFlowTTS

What are some alternatives?