AudioLDM
audio-diffusion-pytorch
Metric | AudioLDM | audio-diffusion-pytorch |
---|---|---|
Mentions | 10 | 1 |
Stars | 2,238 | 1,787 |
Growth | - | 1.9% |
Activity | 6.0 | 2.9 |
Last commit | 6 months ago | 11 months ago |
Language | Python | Python |
License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AudioLDM
- Want to know if there's an ai for text (prompt) to sound effects like stable diffusion
- GitHub - haoheliu/AudioLDM: AudioLDM: Generate speech, sound effects, music and beyond, with text.
- AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Take a look at AudioLDM (https://github.com/haoheliu/AudioLDM), it might be more what you expected:
* Text-to-Audio Generation: Generate audio given text input.
- Are you digital or traditional artist or student and use Stable Diffusion?
As a part time filmmaker, there's no way that I could be this close to being done after a week worth of work. AudioLDM (https://audioldm.github.io/) saved me so much time bc instead of looking for sonic textures or futzing around with a synth, I was able to prompt my way to a 30s audio output.
- [N] AudioLM now available on GitHub and HF with demo and checkpoint
GitHub: https://github.com/haoheliu/AudioLDM
audio-diffusion-pytorch
- Quick question about diffusion models, and how pie torch may be relevant to this paper, and how I could possibly set it up locally?
This is the source code: https://github.com/archinetai/audio-diffusion-pytorch
What are some alternatives?
AudioGPT - AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
soundstorm-pytorch - Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
tts-generation-webui - TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
tango - A family of diffusion models for text-to-audio generation.
pytorch-lightning - Build high-performance AI models with PyTorch Lightning (organized PyTorch). Deploy models with Lightning Apps (organized Python to build end-to-end ML systems). [Moved to: https://github.com/Lightning-AI/lightning]
magic3d-pytorch - Implementation of Magic3D, Text to 3D content synthesis, in Pytorch
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
pytorch-lightning - Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.