Top 23 Python audio-processing Projects

spleeter

230 24,878 1.5 Python

Deezer source separation library including pretrained models.

Project mention: Are stems a good way of making mashups | /r/Beatmatch | 2023-12-10

The stem separators in Virtual DJ and other tools are shrunken versions of this model: https://github.com/deezer/spleeter. You will get better results by downloading the original along with its large model.

speechbrain

26 7,869 9.8 Python

A PyTorch-based Speech Toolkit

Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28

audio-reactive-led-strip

7 2,636 0.0 Python

🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
auto-editor

24 2,481 9.2 Python

Auto-Editor: Effort free video editing!

Project mention: How can I decrease my editing time? | /r/VideoEditing | 2023-05-22

A few days ago I discovered a program that automatically trims the pauses from your video. This can decrease my raw footage duration by around 25%. I've used this for editing two videos so far, and this has been such a helpful tool.

ailia-models

4 1,814 9.8 Python

The collection of pre-trained, state-of-the-art AI models for ailia SDK
LedFx

8 1,202 9.9 Python

LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
SincNet

3 1,097 0.0 Python

SincNet is a neural architecture for efficiently processing raw audio samples.
audio-slicer

1 1,097 2.8 Python

A simple GUI application that slices audio with silence detection

Project mention: Mahiru's wholesome lines | /r/OtonariNoTenshiSama | 2023-05-19

Believe it or not, gathering the samples wasn't the hardest part. Using this tool it only takes fiddling around with the settings until you're happy with the results.
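
The silence-detection idea behind tools such as audio-slicer can be illustrated with a minimal sketch. It is not audio-slicer's actual implementation: the function name `find_non_silent_spans`, the amplitude threshold, and the `min_silence` parameter are assumptions made for illustration, and real tools typically measure energy over frames rather than single samples.

```python
def find_non_silent_spans(samples, threshold=0.01, min_silence=3):
    """Return (start, end) index pairs (end exclusive) of audible spans.

    A span is closed once at least `min_silence` consecutive samples
    have an absolute amplitude at or below `threshold`.
    """
    spans = []
    start = None    # start index of the current audible span, or None
    quiet_run = 0   # consecutive quiet samples seen inside a span

    for i, sample in enumerate(samples):
        if abs(sample) > threshold:
            if start is None:
                start = i
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_silence:
                # Close the span just before the quiet run began.
                spans.append((start, i - quiet_run + 1))
                start = None
                quiet_run = 0

    if start is not None:
        # Close a span still open at the end, dropping trailing quiet samples.
        spans.append((start, len(samples) - quiet_run))
    return spans


# Hypothetical toy signal: two audible bursts separated by a long pause.
signal = [0, 0, 0.5, 0.6, 0, 0, 0, 0, 0.4, 0.3, 0.2, 0, 0]
slices = [signal[a:b] for a, b in find_non_silent_spans(signal)]
```

Raising `min_silence` keeps short pauses inside a slice, which is the kind of setting the comment above describes fiddling with.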

nnAudio

1 953 5.3 Python

Audio processing using PyTorch 1D convolution networks
SALMONN

2 796 9.0 Python

SALMONN: Speech Audio Language Music Open Neural Network

Project mention: Comparing Humans, GPT-4, and GPT-4V on Abstraction and Reasoning Tasks | news.ycombinator.com | 2023-11-19

> In other words, if you express a problem in a more complicated space (e.g. a visual problem, or an abstract algebra problem), you will not be able to solve it in the smaller token space, there's not enough information
You're aware multimodal transformers do exactly this?
https://github.com/bytedance/SALMONN

unsilence

7 520 0.0 Python

Console Interface and Library to remove silent parts of a media file 🔈

Project mention: Automatic video cut for podcast style recording - Like autopod | /r/kdenlive | 2023-05-04

TimeSide

0 365 0.0 Python

Scalable audio processing framework and server written in Python
spectrographic

2 248 0.0 Python

Turn an image into sound whose spectrogram looks like the image.
moseca

1 198 7.6 Python

A Streamlit web app for music source separation & karaoke

Project mention: From Frustration to Creation: How I Built My Own Free AI Music Separation App | /r/opensource | 2023-08-23

Then I added a Karaoke experience from YouTube as suggested by my family. But here's the best part: You can now clone Moseca with a single click and set it up online for absolutely zero cost, all thanks to Hugging Face's magic! I genuinely built this out of my love for music and the desire to democratize access to high-quality music separation. So, whether you're like me, trying to jam to pure instrumentals, or looking for a karaoke tool, Moseca is here for you. Want to dive deeper? Contribute, or simply peek behind the curtain? Here's the GitHub repo: https://github.com/fabiogra/moseca

stemgen

2 168 7.5 Python

🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.
pyCrossfade

1 114 0.0 Python

pyCrossfade is the result of a personal project to use beat matching, gradual bpm shift on bars, and EQ modification to provide smooth and tunable transitions between music files.
gensound

2 79 0.0 Python

Pythonic audio processing and generation framework
ipytone

1 54 5.0 Python

Interactive audio in Jupyter
SoundSage---LLM-Audio-Processing

2 25 9.1 Python

Open source Python program for automating gain staging. Part 1 of a series on automating audio processing tasks; the end goal is a full set of tools that an AI can use to automate audio processing for music, film, games, and other applications. The UI for AutoGain is very basic, but the app is very functional. Currently macOS only.

Project mention: Text-to-Audio Processing *Help Needed* | /r/LLMDevs | 2023-07-01

I am currently working on a project called SoundSage - LLM Audio Processing, which is hosted on GitHub. The project is aimed at developing a system for audio processing using various tools and techniques. You can find the repository here: SoundSage - LLM Audio Processing

soundstorm

2 19 7.2 Python

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

Project mention: Help needed in developing this! It’s an AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers that features AI text-to-audio, onboard fx, onscreen ChatGPT, and more. Send a line if you can help! | /r/aiMusic | 2023-09-24

Common-Voice

2 16 0.0 Python

Audio Classification with machine learning (by dachosen1)
cyberpunk

2 12 0.0 Python

Audio Processing Server (by jonaylor89)
vtc-py

1 12 10.0 Python

An SMPTE video timecode library for Python

Project mention: Homebrew/DIY Timecode Box - Raspberry Pi Pico based | /r/LocationSound | 2023-05-10

I wonder if this could help in some way https://github.com/opencinemac/vtc-py
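
As a rough illustration of what an SMPTE timecode library handles, the sketch below converts between frame counts and non-drop-frame `HH:MM:SS:FF` timecode at an integer frame rate. It does not use vtc-py's API: `frames_to_timecode` and `timecode_to_frames` are hypothetical helper names, and drop-frame rates such as 29.97 fps are deliberately not covered.

```python
def frames_to_timecode(frame_count, fps=24):
    """Format a frame count as non-drop-frame HH:MM:SS:FF (integer fps only)."""
    frames = frame_count % fps
    total_seconds = frame_count // fps
    seconds = total_seconds % 60
    minutes = (total_seconds // 60) % 60
    hours = total_seconds // 3600
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}:{frames:02d}"


def timecode_to_frames(timecode, fps=24):
    """Parse non-drop-frame HH:MM:SS:FF back into a frame count."""
    hours, minutes, seconds, frames = (int(part) for part in timecode.split(":"))
    return ((hours * 60 + minutes) * 60 + seconds) * fps + frames


one_hour = frames_to_timecode(86400, fps=24)
round_trip = timecode_to_frames(frames_to_timecode(1497, fps=24), fps=24)
```

A dedicated library is still the safer choice for real work, since fractional and drop-frame rates need careful handling that this sketch omits.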

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python audio-processing related posts

Are stems a good way of making mashups
1 project | /r/Beatmatch | 10 Dec 2023
Big News!
1 project | /r/OnePieceMangaCut | 9 Dec 2023
Anybody here know what AI model does Steinberg's Spectralayers use to do stem separation?
1 project | /r/audioengineering | 8 Dec 2023
Comparing Humans, GPT-4, and GPT-4V on Abstraction and Reasoning Tasks
2 projects | news.ycombinator.com | 19 Nov 2023
Help needed in developing this! It’s an AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers that features AI text-to-audio, onboard fx, onscreen ChatGPT, and more. Send a line if you can help!
1 project | /r/aiMusic | 24 Sep 2023
AI tools list sorted by category in one place
1 project | /r/ChatGPT | 11 Jul 2023
Software to lower tracks?
1 project | /r/gratefulguitar | 3 Jul 2023

Index

What are some of the best open-source audio-processing projects in Python? This list will help you:

#	Project	Stars
1	spleeter	24,878
2	speechbrain	7,869
3	audio-reactive-led-strip	2,636
4	auto-editor	2,481
5	ailia-models	1,814
6	LedFx	1,202
7	SincNet	1,097
8	audio-slicer	1,097
9	nnAudio	953
10	SALMONN	796
11	unsilence	520
12	TimeSide	365
13	spectrographic	248
14	moseca	198
15	stemgen	168
16	pyCrossfade	114
17	gensound	79
18	ipytone	54
19	SoundSage---LLM-Audio-Processing	25
20	soundstorm	19
21	Common-Voice	16
22	cyberpunk	12
23	vtc-py	12