The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Tacotron2 Alternatives
Similar projects and alternatives to tacotron2
-
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
-
Voice-Cloning-App
Discontinued A Python/Pytorch app for easily synthesising human voices
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
-
-
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
-
RHVoice
a free and open source speech synthesizer for Russian and other languages
-
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
-
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
-
Pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
-
tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
-
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
-
dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
-
flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
-
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
-
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.
-
DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech) (by keonlee9420)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
tacotron2 reviews and mentions
- [D] What is the best open source text to speech model?
-
What voice-changing apps are available right now?
We have the TorToiSe repo, the SV2TTS repo, and from here you have the other models like Tacotron 2, FastSpeech 2, and such. A there is a lot that goes into training a baseline for these models on the LJSpeech and LibriTTS datasets. Fine tuning is left up to the user.
-
XQC Falls for dono thinkng its Adept
I had tried tacotron2 + waveglow and it's quite easy to get very good results. The hardest part is collecting clean data.
-
Recommendation: This sub should have a Wiki with resources to help noobs get started
I've been trying to find out what the most popular tools are for vocal synthesis. I've stumbled upon https://github.com/NVIDIA/tacotron2 and https://github.com/Kyubyong/dc_tts but those git repos haven't been updated since June 2020 and April 2018 respectively. Does anyone know what the most common tools are that folks use to do vocal synthesis?
- Speech Synthesis on Linux
-
New Voice Cloning App
Unfortunately, the training process uses https://github.com/NVIDIA/tacotron2 which is a complex deep learning model and requires training on an NVIDIA GPU. This app is targeted at people interested in creating voices, but if you just want to try some existing voices out https://15.ai/ and https://vo.codes/ are worth checking out
-
A note from our sponsor - WorkOS
workos.com | 29 Mar 2024
Stats
NVIDIA/tacotron2 is an open source project licensed under BSD 3-clause "New" or "Revised" License which is an OSI approved license.
The primary programming language of tacotron2 is Jupyter Notebook.