tango
ai-text-to-audio-latent-diffusion
tango | ai-text-to-audio-latent-diffusion | |
---|---|---|
2 | 1 | |
923 | 30 | |
6.4% | - | |
8.7 | 2.3 | |
14 days ago | 9 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tango
-
[Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Found relevant code at https://github.com/declare-lab/tango + all code implementations here
ai-text-to-audio-latent-diffusion
-
How far are we from this ai
No, this is though - https://github.com/serp-co/ai-text-to-audio-latent-diffusion
What are some alternatives?
audio-diffusion-pytorch - Audio generation using diffusion models, in PyTorch.
sd-webui-inpaint-anything - Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
nuwa-pytorch - Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
word2wave - Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
soundstorm - Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.
IOPaint - Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
naturalspeech2-pytorch - Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch