tango
Amphion
tango | Amphion | |
---|---|---|
2 | 4 | |
923 | 3,975 | |
6.4% | 6.5% | |
8.7 | 8.6 | |
14 days ago | 5 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tango
-
[Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Found relevant code at https://github.com/declare-lab/tango + all code implementations here
Amphion
- FLaNK Stack Weekly 11 Dec 2023
- Technique makes Taylor Swift to sing perfect Mandarin Chinese song
-
Novel vocoder for high-quality audio generation
Code: https://github.com/open-mmlab/Amphion/blob/main/models/vocod...
What are some alternatives?
audio-diffusion-pytorch - Audio generation using diffusion models, in PyTorch.
VALL-E-X - An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
ai-text-to-audio-latent-diffusion - text-to-audio-latent-diffusion
vall-e - An unofficial PyTorch implementation of the audio LM VALL-E
nuwa-pytorch - Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
canopy - Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
furnace - a multi-system chiptune tracker compatible with DefleMask modules
word2wave - Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Retrieval-based-Voice-Conversion-WebUI - Easily train a good VC model with voice data <= 10 mins!
Gooey - Turn (almost) any Python command line program into a full GUI application with one line
table-transformer - Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.