Python audio-generation

Open-source Python projects categorized as audio-generation

Top 8 Python audio-generation Projects

  • Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • Project mention: FLaNK Stack Weekly 11 Dec 2023 | dev.to | 2023-12-11
  • AudioLDM

    AudioLDM: Generate speech, sound effects, music and beyond, with text.

  • Project mention: Want to know if there's an ai for text (prompt) to sound effects like stable diffusion | /r/StableDiffusion | 2023-05-19
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • audio-diffusion-pytorch

    Audio generation using diffusion models, in PyTorch.

  • tts-generation-webui

    TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

  • Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

    https://github.com/rsxdalv/tts-generation-webui

  • soundstorm-pytorch

    Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

  • Project mention: Meta introduces Voicebox: state-of-the-art generative AI model for speech | news.ycombinator.com | 2023-06-19

    got a response here https://github.com/lucidrains/soundstorm-pytorch/discussions...

  • tango

    A family of diffusion models for text-to-audio generation. (by declare-lab)

  • Project mention: [Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model | /r/MachineLearning | 2023-05-04

    Found relevant code at https://github.com/declare-lab/tango + all code implementations here

  • modular-diffusion

    Python library for designing and training your own Diffusion Models with PyTorch.

  • Project mention: I Built a Modular Python Library for Designing and Training Diffusion Models from Scratch | /r/SideProject | 2023-09-06

    Last week, I released a project I've been working on for months: Modular Diffusion. It's a modular Python library for designing and training your own Diffusion Models in just a few lines of code. I also wrote a documentation page. The project has already gotten some great community feedback and I'm hoping you guys like it too!

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • word2wave

    Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python audio-generation related posts

Index

What are some of the best open-source audio-generation projects in Python? This list will help you:

Project Stars
1 Amphion 3,898
2 AudioLDM 2,227
3 audio-diffusion-pytorch 1,782
4 tts-generation-webui 1,260
5 soundstorm-pytorch 1,115
6 tango 910
7 modular-diffusion 253
8 word2wave 116

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com