Top 8 Python audio-generation Projects
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
-
-
For instance:
https://github.com/descriptinc/descript-audio-codec/blob/mai...
https://github.com/NVIDIA/BigVGAN/blob/main/loss.py#L23
https://arxiv.org/pdf/2210.13438 (the github repo doesn't include training, just inference)
It is INCREDIBLY common to use multi-scale spectral loss as the audio distance / objective measure in audio generation. They have some issues (i.e. they aren't always well correlated with human perception) but they are the known-current-best.
-
-
word2wave
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
-
neuralnoise
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
Project mention: Show HN: AI agents working together in a virtual podcast studio. NotebookLM alt | news.ycombinator.com | 2024-10-26
Python audio-generation discussion
Python audio-generation related posts
Index
What are some of the best open-source audio-generation projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | AudioLDM | 2,463 |
2 | audio-diffusion-pytorch | 1,909 |
3 | soundstorm-pytorch | 1,432 |
4 | tango | 1,094 |
5 | BigVGAN | 903 |
6 | modular-diffusion | 267 |
7 | word2wave | 119 |
8 | neuralnoise | 100 |