Top 23 Python Audio Projects

ultimatevocalremovergui

82 14,833 8.9 Python

GUI for a Vocal Remover that uses Deep Neural Networks.

Project mention: GUI for a Vocal Remover That Uses Deep Neural Networks | news.ycombinator.com | 2023-12-13

beets

186 12,393 9.7 Python

music library manager and MusicBrainz tagger

Project mention: Show HN: Synced lyrics database with a free, easy-to-use API | news.ycombinator.com | 2024-02-23

I was always frustrated that there is no solid source for synced lyrics that also offers decent API support. There is good ol' Crintsoft's MiniLyrics that is thankfully free software, was what I used a lot in my childhood, but unfortunately the API is highly obfuscated. Another popular choice is the Musixmatch API, which has a very large database of synced lyrics, but with "free" API that are reverse-engineered from their app, you will quickly run into rate-limit.
That's why I created LRCLIB. It's aimed to provide completely free synchronized lyrics for everyone, especially for FOSS music players, with zero profit intention. It currently has nearly 3,000,000 (not deduplicated) lyrics in database. You can also contribute to the database by adding and syncing lyrics for your favorite songs using the LRCGET client.
I'm trying my best to make LRCLIB server-side code open-source as soon as possible. But right now, full LRCLIB's database dumps have already been uploaded regularly and publicly, which are simply sqlite3 files. Feel free to download, look at or do anything you want with the database at https://lrclib.net/db-dumps.
Many open-source projects have already begun integrating LRCLIB, including:
- beets - music library metadata management (https://github.com/beetbox/beets)

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
pydub

25 8,339 0.0 Python

Manipulate audio with a simple and easy high level interface

Project mention: Looking for help with a winamp project please. | /r/learnpython | 2023-05-12

SpeechRecognition

16 8,040 8.7 Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Project mention: help with script (beginner) | /r/learnpython | 2023-12-07

Start and Stop Listening Example

speechbrain

26 7,869 9.8 Python

A PyTorch-based Speech Toolkit

Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28

demucs

108 7,644 5.4 Python

Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.

Project mention: Best way to extract a vocal stem from a song | /r/edmproduction | 2023-10-19

I've had the best results from Facebook's DEMUCs. It's not too difficult to install, and I like the sound quality of their mdx_extra model. This is the command line I use (this will use the 2 stem version -- vocals, and everything else)

jukebox

129 7,563 0.0 Python

Code for the paper "Jukebox: A Generative Model for Music"

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

openai/jukebox: Music Generation

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
librosa

14 6,681 7.2 Python

Python library for audio and music analysis

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

librosa/librosa: Python library for audio and music analysis

ffsubsync

31 6,495 4.8 Python

Automagically synchronize subtitles with video.

Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | news.ycombinator.com | 2023-10-23

> Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.
Not necessarily indicative of foul play. I have two projects like this (https://github.com/smacke/ffsubsync and https://github.com/ipyflow/ipyflow) and I attribute it to not having great developer documentation.

dejavu

15 6,316 0.0 Python

Audio fingerprinting and recognition in Python (by worldveil)

Project mention: Audio Fingerprinting and Recognition in Python | news.ycombinator.com | 2024-04-14

pyAudioAnalysis

11 5,668 5.0 Python

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Project mention: How would I compare two voice recordings of the same sentence and advise one speaker how to get closer to the second? | /r/learnpython | 2023-07-10

I actually came up with an el cheapo version of what I want to accomplish that isn't perfect but without any research can implement it and it may actually prove useful to language learners. PM me if you're interested in hearing it and critiquing it. I can share here that I'm using this guy's multiple repos though: https://github.com/tyiannak/pyAudioAnalysis

picard

238 3,551 9.8 Python

A cross-platform music tagger powered by the MusicBrainz database. Picard organizes your music collection by updating your tags, renaming your files, and sorting them into a folder structure, exactly the way you want it.

Project mention: Fans quitting Spotify to save their love of music (2022) | news.ycombinator.com | 2024-03-24

Have you tried https://www.funkwhale.audio/?
It can be used effectively as a "private spotify". Labeling is a solved problem thanks to https://picard.musicbrainz.org/, and the fact that a lot of the music you buy these days comes pre-labeled already.
You then have a web-app (and/or a mobile app, if that's your thing) where you can stream music as you would with Spotify.
You can even build yourself a little music-sharing commune with friends, where you all upload the albums you buy and accumulate a nice collection of diverse tunes.

Porcupine

31 3,424 9.1 Python

On-device wake word detection powered by deep learning
riffusion

13 3,164 3.2 Python

Stable diffusion for real-time music generation

Project mention: You know what I REALLY want? Something like img2img but for sound/music. | /r/StableDiffusion | 2023-12-04

Why don’t you try Riffusion? https://github.com/riffusion/riffusion The general idea is convert your audio waveform into an image (spectrogram).

distil-whisper

9 3,125 8.5 Python

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05

basic-pitch

8 2,901 8.4 Python

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

spotify/basic-pitch: Audio to midi converter

auto-editor

24 2,481 9.2 Python

Auto-Editor: Effort free video editing!

Project mention: How can I decrease my editing time? | /r/VideoEditing | 2023-05-22

A few days ago I discovered a program that automatically trims the pauses from your video. This can decrease my raw footage duration by around 25%. I've used this for editing two videos so far, and this has been such a helpful tool.

aeneas

4 2,379 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
mkchromecast

18 2,173 8.9 Python

Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices

Project mention: Stream to Chromecast with resolved, vlc and bash | news.ycombinator.com | 2024-01-07

You can forward a Chrome window to it via your web browser, or you can use something like https://mkchromecast.com/ to push local videos to it.

m3u8

2 1,918 6.8 Python

Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions
DeepFilterNet

10 1,914 9.1 Python

Noise supression using deep filtering

Project mention: Anyone know of a good TTS pipeline for raw speech data? | /r/AudioAI | 2023-10-03

You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.

TauonMusicBox

9 1,578 8.6 Python

The desktop music player of today! :city_sunrise:
polymath

3 1,456 0.6 Python

Convert any music library into a music production sample-library with ML
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Audio related posts

Audio Fingerprinting and Recognition in Python
2 projects | news.ycombinator.com | 14 Apr 2024
Show HN: Sonauto – a more controllable AI music creator
1 project | news.ycombinator.com | 10 Apr 2024
TSAC: Low Bitrate Audio Compression
4 projects | news.ycombinator.com | 8 Apr 2024
Fans quitting Spotify to save their love of music (2022)
1 project | news.ycombinator.com | 24 Mar 2024
Show HN: Synced lyrics database with a free, easy-to-use API
3 projects | news.ycombinator.com | 23 Feb 2024
A transformer-based network model for pitch detection
5 projects | news.ycombinator.com | 19 Dec 2023
What do you use to tag your music on an NAS (Unraid)?
1 project | /r/musichoarder | 9 Dec 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Audio projects in Python? This list will help you:

	Project	Stars
1	ultimatevocalremovergui	14,833
2	beets	12,393
3	pydub	8,339
4	SpeechRecognition	8,040
5	speechbrain	7,869
6	demucs	7,644
7	jukebox	7,563
8	librosa	6,681
9	ffsubsync	6,495
10	dejavu	6,316
11	pyAudioAnalysis	5,668
12	picard	3,551
13	Porcupine	3,424
14	riffusion	3,164
15	distil-whisper	3,125
16	basic-pitch	2,901
17	auto-editor	2,481
18	aeneas	2,379
19	mkchromecast	2,173
20	m3u8	1,918
21	DeepFilterNet	1,914
22	TauonMusicBox	1,578
23	polymath	1,456