Top 8 Python audio-generation Projects

Amphion

4 3,898 8.7 Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Project mention: FLaNK Stack Weekly 11 Dec 2023 | dev.to | 2023-12-11

AudioLDM

10 2,227 6.0 Python

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Project mention: Want to know if there's an ai for text (prompt) to sound effects like stable diffusion | /r/StableDiffusion | 2023-05-19

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
audio-diffusion-pytorch

1 1,782 2.9 Python

Audio generation using diffusion models, in PyTorch.
tts-generation-webui

5 1,260 8.6 Python

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

https://github.com/rsxdalv/tts-generation-webui

soundstorm-pytorch

1 1,115 7.7 Python

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Project mention: Meta introduces Voicebox: state-of-the-art generative AI model for speech | news.ycombinator.com | 2023-06-19

got a response here https://github.com/lucidrains/soundstorm-pytorch/discussions...

tango

2 910 8.7 Python

A family of diffusion models for text-to-audio generation. (by declare-lab)

Project mention: [Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model | /r/MachineLearning | 2023-05-04

Found relevant code at https://github.com/declare-lab/tango + all code implementations here

modular-diffusion

1 253 8.0 Python

Python library for designing and training your own Diffusion Models with PyTorch.

Project mention: I Built a Modular Python Library for Designing and Training Diffusion Models from Scratch | /r/SideProject | 2023-09-06

Last week, I released a project I've been working on for months: Modular Diffusion. It's a modular Python library for designing and training your own Diffusion Models in just a few lines of code. I also wrote a documentation page. The project has already gotten some great community feedback and I'm hoping you guys like it too!

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
word2wave

1 116 0.0 Python

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).