SaaSHub helps you find the best software and product alternatives Learn more →
Similar projects and alternatives to TTS
A multi-voice TTS system trained with an emphasis on quality
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
End to end text to speech system using gruut and onnx
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
Docker images for Coqui AI
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
A fast local neural text to speech engine for Mycroft
Open Text to Speech Server
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Robust Speech Recognition via Large-Scale Weak Supervision
NeMo: a toolkit for conversational AI
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Docker image for Mozilla TTS server
A high-quality, varied ~30hr voice dataset suitable for training a TTS model
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
TTS reviews and mentions
What solarpunk projects are you working on this week?
3 projects | reddit.com/r/solarpunk | 20 Mar 2023
As for TTS I was working with coqui-tts (https://github.com/coqui-ai/TTS) and also you can do inference well without GPU, even on something like RPi.
Completely free, unlimited ElevenLabs alternative?
2 projects | reddit.com/r/artificial | 11 Mar 2023
As others already mentioned, there's Tortoise-tts. I used coqui.ai a bunch the other day but hit the limit on free usage for my account. I suppose you could just make a new account and retrain all your voices whenever you hit the limit, but that's just gonna make them shut down the free usage trial (if they notice how many people are doing that).
Convert EPUB files to speech using Azure's Text-to-Speech with Python
3 projects | reddit.com/r/Python | 25 Feb 2023
Sounds cool, i've been interested in this for a while now. Does it work with DRM ebooks? Also you might be interested in https://github.com/coqui-ai/TTS , it's a offline tts generator. and here's a odd little tts that works offline and is super fast, https://github.com/rendchevi/nix-tts. Sometimes your passion projects are your own, most rabbit holes only fit one rabbit. well not this time.
Text to speech
4 projects | reddit.com/r/selfhosted | 21 Feb 2023
Try https://github.com/coqui-ai/TTS. Runs nicely via docker or on Linux. You Just have to find the voices that work. Ford english, it's one of the Susan voices.
Is there any competitor for elevenlabs as of right now?
2 projects | reddit.com/r/ElevenLabs | 19 Feb 2023
So, has someone made an open-source version of ElevenLabs yet?
2 projects | reddit.com/r/singularity | 19 Feb 2023
I'm also just googling around for this and found this one: https://github.com/CorentinJ/Real-Time-Voice-Cloning - have a look at the youtube video in there. https://github.com/coqui-ai/TTS is also worth looking into maybe.
Open source TTS for bajs
2 projects | reddit.com/r/forsen | 18 Feb 2023
Read this first! Installing dependencies (tts to be exact) will probably fail, since some of the modules are do not have wheels compiled for m1 chips, so do pip install --no-dependencies tts, then manually install them one by one that you need. If you get torchaudio errors like torchaudio/lib/libtorchaudio.so, 0x0006): Symbol not found: __ZN2at8internal15invoke_parallelExxxRKNSt3__18functionIFvxxEEE, uninstall pytorch and install it with torchaudio conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 -c pytorch. If you see ModuleNotFoundError: [!] Config for vits cannot be found, run python and try importing tts core import TTS.tts.configs.vits_config. I will probably fail with mecab error, you can try building it from source to install mecab, but it didn't work for me, so since this lib is only used for japanese tts, I decided to comment out the imports in site-packages/TTS/tts/utils/text/japanese/phonemizer.py, same for jieba in /site-packages/TTS/tts/utils/text/chinese_mandarin/phonemizer.py just because it is too heavy. Last error: No espeak backend found. Install espeak-ng or espeak to your system, soulution for macos: brew install espeak.
[D] What are the best ways to make and run a fast custom TTS?
2 projects | reddit.com/r/MachineLearning | 14 Feb 2023
YourTTS is available in Coqui TTS. It's fast and rather easy to use, but at the cost of quality. It does English, French and Portuguese in the same model.
AI MoistCritical roasts the fuck out of Athene
3 projects | reddit.com/r/Destiny | 10 Feb 2023
Voice clone/training -> TTS: Coqui
David Guetta uses ChatGPT and uberduck.ai to create fake Eminem rap for DJ set
3 projects | news.ycombinator.com | 10 Feb 2023
I will also say -- UberDuck and AI TTS in general, when compared to the SURGE of development and tools that's happened on the image/video side of AI, is TERRIBLE.
UberDuck's community specifically seems geared towards kids making memes -- I suspect they just ended up there and didn't design it that way, but wading through the terrible user created models to find ones that work was tiresome.
I tried to get https://coqui.ai/ setup to do similar things, but have not been successful.
Surely this will all explode in the next 18 mo max
A note from our sponsor - #<SponsorshipServiceOld:0x00007f1610c7d8a0>
www.saashub.com | 24 Mar 2023
coqui-ai/TTS is an open source project licensed under Mozilla Public License 2.0 which is an OSI approved license.