soft-vc
YourTTS
Metric | soft-vc | YourTTS |
---|---|---|
Mentions | 2 | 3 |
Stars | 376 | 832 |
Growth | - | - |
Activity | 2.2 | 1.6 |
Last commit | 2 months ago | about 1 year ago |
Language | Jupyter Notebook | Jupyter Notebook |
License | MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
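The page does not publish the formula behind the activity number, so the following is only a minimal sketch of one way such a score could work. It assumes an exponential decay with a 30-day half-life for commit weights and a simple percentile-to-score mapping; the function names and the `half_life_days` parameter are hypothetical and chosen for illustration.

```python
from bisect import bisect_left
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=30.0):
    """Sum per-commit weights so that recent commits count more.

    ASSUMPTION: a commit's weight halves every `half_life_days`.
    The real weighting used by the site is not documented here.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_from_rank(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank.

    ASSUMPTION: activity = 10 * (fraction of tracked projects with a
    strictly lower raw score), so 9.0 means "above 90% of projects",
    i.e. within the top 10%.
    """
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # number of projects scoring lower
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    today = date(2023, 1, 31)
    raw = weighted_commit_score([date(2023, 1, 31), date(2023, 1, 1)], today)
    print(raw)
    print(activity_from_rank(90, list(range(100))))
```

Under these assumptions, a project whose raw score beats 90 of 100 tracked projects gets an activity of 9.0, which matches the "top 10%" reading given above.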
soft-vc
-
Where can I find more info on the various model files
I've also been looking into CycleGANs to do "voice conversions." I found that the term "voice conversion" is a research-friendly way to say you are doing voice clones. I found this one really good and I've been quite impressed: https://github.com/bshall/soft-vc
-
Divine conference of the 'sisters' :P
but Lydiasarunrat (my project-partner) had different ideas, and knew of the existence of https://github.com/bshall/soft-vc which he also slightly adjusted to get better results. The title of the work that we're basing the female voice-pack conversion on: "A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion". It's a beautiful title.. very.. Sekiro lore-friendly 🤣 - use this for Pro Shinobi whisper voice-line conversion!
YourTTS
-
[D] What are the best ways to make and run a fast custom TTS?
YourTTS is available in Coqui TTS. It's fast and rather easy to use, but at the cost of quality. It does English, French and Portuguese in the same model.
-
How can I use better text to speech services in linux?
For "natural" output you need a trained model for your language and software for WaveNNN. YourTTS and coqui.ai are the two best approaches for realtime TTS.
-
Use deep fake tech to say stuff with your favorite characters
What are some alternatives?
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
StarGANv2-VC - StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
SfMLearner - An unsupervised learning framework for depth and ego-motion estimation from monocular videos
voice_conversion
simclr - SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
autovc - AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
espeak-ng - eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
tt-vae-gan - Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production