Python Audio

Open-source Python projects categorized as Audio

Top 23 Python Audio Projects

  • ultimatevocalremovergui

    GUI for a Vocal Remover that uses Deep Neural Networks.

    Project mention: GUI for a Vocal Remover That Uses Deep Neural Networks | | 2023-12-13
  • beets

    music library manager and MusicBrainz tagger

    Project mention: Show HN: Synced lyrics database with a free, easy-to-use API | | 2024-02-23

    I was always frustrated that there is no solid source for synced lyrics that also offers decent API support. There is good ol' Crintsoft's MiniLyrics that is thankfully free software, was what I used a lot in my childhood, but unfortunately the API is highly obfuscated. Another popular choice is the Musixmatch API, which has a very large database of synced lyrics, but with "free" API that are reverse-engineered from their app, you will quickly run into rate-limit.

    That's why I created LRCLIB. It's aimed to provide completely free synchronized lyrics for everyone, especially for FOSS music players, with zero profit intention. It currently has nearly 3,000,000 (not deduplicated) lyrics in database. You can also contribute to the database by adding and syncing lyrics for your favorite songs using the LRCGET client.

    I'm trying my best to make LRCLIB server-side code open-source as soon as possible. But right now, full LRCLIB's database dumps have already been uploaded regularly and publicly, which are simply sqlite3 files. Feel free to download, look at or do anything you want with the database at

    Many open-source projects have already begun integrating LRCLIB, including:

    - beets - music library metadata management (

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • pydub

    Manipulate audio with a simple and easy high level interface

    Project mention: Looking for help with a winamp project please. | /r/learnpython | 2023-05-12
  • SpeechRecognition

    Speech recognition module for Python, supporting several engines and APIs, online and offline.

    Project mention: help with script (beginner) | /r/learnpython | 2023-12-07

    Start and Stop Listening Example

  • demucs

    Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.

    Project mention: Best way to extract a vocal stem from a song | /r/edmproduction | 2023-10-19

    I've had the best results from Facebook's DEMUCs. It's not too difficult to install, and I like the sound quality of their mdx_extra model. This is the command line I use (this will use the 2 stem version -- vocals, and everything else)

  • jukebox

    Code for the paper "Jukebox: A Generative Model for Music"

    Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    openai/jukebox: Music Generation

  • speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | | 2024-02-28
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • librosa

    Python library for audio and music analysis

    Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    librosa/librosa: Python library for audio and music analysis

  • ffsubsync

    Automagically synchronize subtitles with video.

    Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | | 2023-10-23

    > Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.

    Not necessarily indicative of foul play. I have two projects like this ( and and I attribute it to not having great developer documentation.

  • dejavu

    Audio fingerprinting and recognition in Python (by worldveil)

    Project mention: Contacting Collectors or Creating API to help with searching | /r/TheMysteriousSong | 2023-05-22

    This doesn't seem hard, you can use something like this to dwoanload the songs: and something like this to calculate how much they match: The question is would you create a (dedicated) server to do your work? Or your own pc? You could also create a very simple page where someone would paste you a YouTube profile URL and you would check all songs of this URL. Also to have a db and save information about the matching and which youtube profiles have alsready been checked. Something like that could work.

  • pyAudioAnalysis

    Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

    Project mention: How would I compare two voice recordings of the same sentence and advise one speaker how to get closer to the second? | /r/learnpython | 2023-07-10

    I actually came up with an el cheapo version of what I want to accomplish that isn't perfect but without any research can implement it and it may actually prove useful to language learners. PM me if you're interested in hearing it and critiquing it. I can share here that I'm using this guy's multiple repos though:

  • picard

    A cross-platform music tagger powered by the MusicBrainz database. Picard organizes your music collection by updating your tags, renaming your files, and sorting them into a folder structure, exactly the way you want it.

    Project mention: What do you use to tag your music on an NAS (Unraid)? | /r/musichoarder | 2023-12-09

    I use picard for my collection. My work flow is Picard > Lyrics Finder > Foobar for BPM, RealGain and DR > Custom python script to pull genre and mood from Last.FM and spotify> then Advanced Renamer to perform naming clean up of folder names. I lowercase and underscore spaces. Also does files if I happen to need Mp3Tag when Picard doesn't find anything. Once this is done I move files on to may NAS.

  • Porcupine  

    On-device wake word detection powered by deep learning

    Project mention: I made a ChatGPT virtual assistant that you can talk to | /r/ArtificialInteligence | 2023-04-05

    I call it DaVinci. DaVinci uses Picovoice ( solutions for wake word and voice activity detection and for converting speech to text, Amazon Polly to convert its responses into a natural sounding voice, and OpenAI’s GPT 3.5 to do the heavy lifting. It’s all contained in about 300 lines of Python code.

  • riffusion

    Stable diffusion for real-time music generation

    Project mention: You know what I REALLY want? Something like img2img but for sound/music. | /r/StableDiffusion | 2023-12-04

    Why don’t you try Riffusion? The general idea is convert your audio waveform into an image (spectrogram).

  • distil-whisper

    Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

    Project mention: FLaNK Stack 05 Feb 2024 | | 2024-02-05
  • basic-pitch

    A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

    Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    spotify/basic-pitch: Audio to midi converter

  • auto-editor

    Auto-Editor: Effort free video editing!

    Project mention: How can I decrease my editing time? | /r/VideoEditing | 2023-05-22

    A few days ago I discovered a program that automatically trims the pauses from your video. This can decrease my raw footage duration by around 25%. I've used this for editing two videos so far, and this has been such a helpful tool.

  • aeneas

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

  • mkchromecast

    Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices

    Project mention: Stream to Chromecast with resolved, vlc and bash | | 2024-01-07

    You can forward a Chrome window to it via your web browser, or you can use something like to push local videos to it.

  • m3u8

    Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions

  • DeepFilterNet

    Noise supression using deep filtering

    Project mention: Anyone know of a good TTS pipeline for raw speech data? | /r/AudioAI | 2023-10-03

    You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.

  • TauonMusicBox

    The desktop music player of today! :city_sunrise:

  • Watson Developer Cloud Python SDK

    :snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-28.

Python Audio related posts


What are some of the best open-source Audio projects in Python? This list will help you:

Project Stars
1 ultimatevocalremovergui 13,837
2 beets 12,288
3 pydub 8,178
4 SpeechRecognition 7,906
5 demucs 7,457
6 jukebox 7,455
7 speechbrain 7,307
8 librosa 6,531
9 ffsubsync 6,397
10 dejavu 6,261
11 pyAudioAnalysis 5,602
12 picard 3,491
13 Porcupine   3,352
14 riffusion 3,026
15 distil-whisper 2,889
16 basic-pitch 2,786
17 auto-editor 2,364
18 aeneas 2,319
19 mkchromecast 2,150
20 m3u8 1,873
21 DeepFilterNet 1,748
22 TauonMusicBox 1,520
23 Watson Developer Cloud Python SDK 1,450
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives