dectalk vs VALL-E-X
Metric | dectalk | VALL-E-X
---|---|---
Mentions | 3 | 2
Stars | 229 | 7,169
Growth | 7.0% | -
Activity | 6.3 | 8.8
Last commit | about 1 month ago | 3 months ago
Language | PostScript | Python
License | GNU General Public License v3.0 or later | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
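As a rough illustration of the growth metric described above, the sketch below assumes that month-over-month growth is computed as (current - previous) / previous. The page does not state the exact formula, and the previous-month star count of 214 is a hypothetical value chosen for illustration; only the current count of 229 appears in the table.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return the fractional change between two monthly star counts.

    Assumed formula: (current - previous) / previous.
    """
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars


# 214 is a hypothetical previous-month count; 229 is the star count from the table.
growth = month_over_month_growth(214, 229)
print(f"{growth:.1%}")  # prints "7.0%"
```

Under this assumption, a project with no recorded previous-month count has no defined growth, which may be why one column shows "-".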
dectalk
- Dectalk as Twitch TTS
  Link to the GitHub repository with the source code (it has to be built manually, but it's probably easier to work with than the plain .exe)
- DECtalk source code and new version under development
- Compiled DECtalk from source code
  15th September 2022 build: https://github.com/dectalk/dectalk/releases/tag/2022-09-15
VALL-E-X
What are some alternatives?
TTS - 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Amphion - Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
DiffSinger - DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Coeditor - Coeditor: Leveraging Repo-level Diffs for Code Auto-editing
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
vall-e - An unofficial PyTorch implementation of the audio LM VALL-E
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
OpenVoice - Instant voice cloning by MyShell.
vits - VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech