vits
vall-e
vits | vall-e | |
---|---|---|
6 | 3 | |
6,324 | 2,875 | |
- | - | |
0.0 | 0.0 | |
5 months ago | about 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vits
-
[D] TTS systems to download & run offline
And the voice encapsulation system VITS https://github.com/jaywalnut310/vits
- [D] What is the best open source text to speech model?
- githubで公開されている音声自動生成AI、日本のアニメキャラ2890名分の音声を学習素材に超速度で進化中
- 日本語英語中国語を読み上げできる音声自動生成AIがgithubで公開され話題に
- Adversarial Learning for End-to-End Text-to-Speech
- [R] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vall-e
- VALL-E unoffical implementation (text to speech synthesis)
-
Do you think Vall-E will ever be Open Source?
Isn't it MIT-licensed? https://github.com/enhuiz/vall-e
- ‘Contract terms’ on neighbors front door for ringing doorbell
What are some alternatives?
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
Voice-Cloning-App - A Python/Pytorch app for easily synthesising human voices
tortoise-tts-fast - Fast TorToiSe inference (5x or your money back!)
WaveRNN - WaveRNN Vocoder + TTS
tacotron2 - Tacotron 2 - PyTorch implementation with faster-than-realtime inference
RadioTTS - RadioTTS lets you generate audio tracks with TTS introductions, directly from their file names!
Parallel-Tacotron2 - PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Amphion - Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
tacotron - A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
glow-tts - A Generative Flow for Text-to-Speech via Monotonic Alignment Search
VALL-E-X - An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io