assem-vc
YourTTS
| | assem-vc | YourTTS |
|---|---|---|
| Mentions | 1 | 3 |
| Stars | 260 | 832 |
| Growth | 0.4% | - |
| Activity | 0.0 | 1.6 |
| Last commit | about 2 years ago | about 1 year ago |
| Language | Jupyter Notebook | Jupyter Notebook |
| License | BSD 3-clause "New" or "Revised" License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
assem-vc
- [D] Voice modification ML: state of the art and resources
Papers with Code has a good overview of papers and repos in this area. A pretty good recent project is assem-vc; it has a pretrained model and even a Colab notebook to try it out.
YourTTS
- [D] What are the best ways to make and run a fast custom TTS?
YourTTS is available in Coqui TTS. It's fast and rather easy to use, but at the cost of quality. It does English, French and Portuguese in the same model.
- How can I use better text-to-speech services in Linux?
For "natural" output you need a trained model for your language and software for WaveNNN. YourTTS and coqui.ai are the two best approaches for real-time TTS.
- Use deep fake tech to say stuff with your favorite characters
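One of the posts above notes that YourTTS is available in Coqui TTS and is rather easy to use. A minimal sketch of what that might look like follows. It assumes the Coqui `TTS` Python package is installed and that the model name and language codes shown are still the ones it publishes; `build_request` and `synthesize` are hypothetical helper names introduced only for this illustration, and the reference audio path is a placeholder.

```python
# Sketch only: assumes the Coqui TTS package is installed (pip install TTS)
# and that this model name is still listed in its model catalogue.
YOURTTS_MODEL = "tts_models/multilingual/multi-dataset/your_tts"

# Assumed language codes for the three languages mentioned in the post.
LANGUAGES = {"english": "en", "french": "fr-fr", "portuguese": "pt-br"}


def build_request(text, language, speaker_wav, out_path="output.wav"):
    """Assemble keyword arguments for TTS.tts_to_file (hypothetical helper)."""
    key = language.lower()
    if key not in LANGUAGES:
        raise ValueError(f"unsupported language: {language}")
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "text": text,
        "speaker_wav": speaker_wav,  # short reference clip of the target voice
        "language": LANGUAGES[key],
        "file_path": out_path,
    }


def synthesize(text, language, speaker_wav, out_path="output.wav"):
    """Run YourTTS through Coqui TTS and write a WAV file (hypothetical helper)."""
    # Imported lazily so build_request works without the package installed.
    from TTS.api import TTS

    tts = TTS(model_name=YOURTTS_MODEL)  # downloads the model on first use
    tts.tts_to_file(**build_request(text, language, speaker_wav, out_path))
    return out_path
```

Calling `synthesize("Bonjour tout le monde", "French", "reference.wav")` would then write the result to `output.wav`, provided a reference recording exists at that path.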
What are some alternatives?
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
StarGANv2-VC - StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
voice_conversion
autovc - AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
espeak-ng - eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
tt-vae-gan - Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production