DiffSinger
dectalk
DiffSinger | dectalk | |
---|---|---|
1 | 3 | |
4,107 | 229 | |
- | 3.9% | |
2.5 | 6.3 | |
almost 1 year ago | about 1 month ago | |
Python | PostScript | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DiffSinger
dectalk
-
Dectalk as Twitch TTS
Link to Github repository with the source code (it has to be built manually though, but it's probably easier to do something with this than the plain .exe)
- DECtalk source code and new version under development
-
Compiled DECtalk from source code
15th September 2022 build: https://github.com/dectalk/dectalk/releases/tag/2022-09-15
What are some alternatives?
nnsvs - Neural network-based singing voice synthesis library for research
VALL-E-X - An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
OpenUtau - Open singing synthesis platform / Open source UTAU successor
TTS - :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
RealTimeSingingSynthesizer - Live Coding Singing Synthesizer. Python sinsy-NG wrapper.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
StableVideo - [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
tiktok-voice - Simple Python script to interact with the TikTok TTS API
vits - VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
THIRTY-DOLLAR-HAIRCUT-GENERATOR - 30 dollar haircut website MIDI converter - Using MIDIs, QUICKLY generate a chart for the "DON'T YOU LECTURE ME WITH YOUR THIRTY DOLLAR HAIRCUT" website. The site's by GDcolon, if you need to search it up.
RadioTTS - RadioTTS lets you generate audio tracks with TTS introductions, directly from their file names!