AudioLDM vs Amphion
Metric | AudioLDM | Amphion
---|---|---
Mentions | 10 | 4
Stars | 2,238 | 3,975
Growth | - | 6.5%
Activity | 6.0 | 8.6
Last commit | 6 months ago | 5 days ago
Language | Python | Python
License | GNU General Public License v3.0 or later | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
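The exact formulas behind these numbers are not published here, so the following is only a rough sketch of how such metrics could be computed. The exponential decay, the 30-day half-life, the percentile mapping, and all function names are assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def month_over_month_growth(stars_prev, stars_now):
    """Percentage change in stars between two monthly snapshots."""
    return 100.0 * (stars_now - stars_prev) / stars_prev


def weighted_commit_score(commit_dates, today, half_life_days=30.0):
    """Sum of per-commit weights; a commit's weight halves every
    `half_life_days` (assumed decay), so recent commits count more."""
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_index(raw_scores):
    """Map raw scores to a 0-10 scale by percentile rank (assumed mapping).

    A value of 9.0 means 90% of tracked projects scored lower,
    i.e. the project sits in the top 10%.
    """
    n = len(raw_scores)
    index = {}
    for name, score in raw_scores.items():
        below = sum(1 for other in raw_scores.values() if other < score)
        index[name] = round(10.0 * below / n, 1)
    return index
```

With ten hypothetical projects whose raw scores are all different, the most active one has nine projects below it and gets 9.0, which matches the "top 10%" reading above.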
AudioLDM
- Want to know if there's an ai for text (prompt) to sound effects like stable diffusion
- GitHub - haoheliu/AudioLDM: AudioLDM: Generate speech, sound effects, music and beyond, with text.
- AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
  Take a look at AudioLDM (https://github.com/haoheliu/AudioLDM), it might be more what you expected:
  * Text-to-Audio Generation: Generate audio given text input.
- Are you digital or traditional artist or student and use Stable Diffusion?
  As a part time filmmaker, there's no way that I could be this close to being done after a week worth of work. AudioLDM (https://audioldm.github.io/) saved me so much time bc instead of looking for sonic textures or futzing around with a synth, I was able to prompt my way to a 30s audio output.
- [N] AudioLM now available on GitHub and HF with demo and checkpoint
  GitHub: https://github.com/haoheliu/AudioLDM
Amphion
- FLaNK Stack Weekly 11 Dec 2023
- Technique makes Taylor Swift to sing perfect Mandarin Chinese song
- Novel vocoder for high-quality audio generation
  Code: https://github.com/open-mmlab/Amphion/blob/main/models/vocod...
What are some alternatives?
AudioGPT - AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
VALL-E-X - An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
vall-e - An unofficial PyTorch implementation of the audio LM VALL-E
canopy - Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
furnace - a multi-system chiptune tracker compatible with DefleMask modules
Retrieval-based-Voice-Conversion-WebUI - Easily train a good VC model with voice data <= 10 mins!
Gooey - Turn (almost) any Python command line program into a full GUI application with one line
table-transformer - Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
ava - All-in-one desktop app for running LLMs locally.
ast-grep - ⚡A CLI tool for code structural search, lint and rewriting. Written in Rust
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
trippy - A network diagnostic tool