Python Audio

Open-source Python projects categorized as Audio

Top 23 Python Audio Projects

  1. ultimatevocalremovergui

    GUI for a Vocal Remover that uses Deep Neural Networks.

    Project mention: Ultimate Vocal Remover GUI, a FOSS audio stem splitter | news.ycombinator.com | 2025-05-09
  3. beets

    music library manager and MusicBrainz tagger

    Project mention: Jellyfin as a Spotify Alternative | news.ycombinator.com | 2025-04-16

    I've used beets to import and tag a huge personal music library:

    https://beets.io/

  4. AudioGPT

    AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

  5. speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: Speaker Diarization in Python | dev.to | 2024-08-22

    Simple Diarizer is a speaker diarization library that uses pretrained models from SpeechBrain. To get started with simple_diarizer, follow these steps:

  6. pydub

    Manipulate audio with a simple and easy high level interface

  7. SpeechRecognition

    Speech recognition module for Python, supporting several engines and APIs, online and offline.

  8. jukebox

    Code for the paper "Jukebox: A Generative Model for Music"

  10. librosa

    Python library for audio and music analysis

    Project mention: Librosa: Python library for audio and music analysis | news.ycombinator.com | 2024-09-03
  11. ffsubsync

    Automagically synchronize subtitles with video.

    Project mention: Ten years after the last release, Aegisub 3.4.0 released | news.ycombinator.com | 2024-12-21

    Aegisub is great for authoring new subtitles, but if you're just looking to sync, take a look at https://github.com/smacke/ffsubsync

    Plex also recently added auto-sync subtitles to the Plex Pass

    https://support.plex.tv/articles/auto-sync-subtitles/

  12. dejavu

    Audio fingerprinting and recognition in Python (by worldveil)

  13. pyAudioAnalysis

    Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

  14. Porcupine

    On-device wake word detection powered by deep learning

  15. picard

    A cross-platform music tagger powered by the MusicBrainz database. Picard organizes your music collection by updating your tags, renaming your files, and sorting them into a folder structure, exactly the way you want it.

    Project mention: The Open Music Encyclopedia | news.ycombinator.com | 2024-09-30

    Make sure to check out Picard:

    https://picard.musicbrainz.org/

    It uses the MusicBrainz DB to auto-tag and correct audio file names, which makes it really easy to organize a large collection of (pirated) audio.

  16. basic-pitch

    A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

  17. distil-whisper

    Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

    Project mention: New OpenAI Whisper model: "turbo" | news.ycombinator.com | 2024-09-30

    Details will be shared tomorrow, but from what I have read they have distilled the large model decoder into this turbo that only has 4 layers instead of 32, the encoder should remain the same size. Similar to https://github.com/huggingface/distil-whisper but the model is distilled using multilingual data instead of just English, and the decoder is 4 layers instead of 2.

  18. riffusion-hobby

    Stable diffusion for real-time music generation

    Project mention: Benn Jordan's AI poison pill and the weird world of adversarial noise | news.ycombinator.com | 2025-04-15

    https://github.com/riffusion/riffusion-hobby

    The more advanced music generators out now I believe have more of a 'stems' approach and a larger processing pipeline to increase fidelity and add tracking vocal capability but the underlying idea is the same.

    Any adversarial attack that hides information in the spectrogram to fool the model into categorizing the track as something it is not is no different from the image adversarial attacks, for which mitigations have been found.

    Various forms of filtering for inaudible spectral information coupled with methods that destroy and re-synthesize/randomize phase information would likely break this poisoning attack.

  19. auto-editor

    Auto-Editor: Efficient media analysis and rendering

  20. DeepFilterNet

    Noise suppression using deep filtering

    Project mention: Real-time ML audio noise suppression on Raspberry Pi Pico 2 | news.ycombinator.com | 2024-08-09

    Very cool! Would be curious to see how this compares to https://github.com/Rikorose/DeepFilterNet written in Rust.

    Or this Samsung Research paper https://research.samsung.com/blog/FSPEN-AN-ULTRA-LIGHTWEIGHT...

  21. aeneas

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

  22. mkchromecast

    Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices

  23. m3u8

    Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions

  24. matchering

    🎚️ Open Source Audio Matching and Mastering

  25. Tauon

    The music player of today! 🌇

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Audio projects in Python? This list will help you:

# Project Stars
1 ultimatevocalremovergui 20,601
2 beets 13,475
3 AudioGPT 10,143
4 speechbrain 9,808
5 pydub 9,355
6 SpeechRecognition 8,723
7 jukebox 7,978
8 librosa 7,612
9 ffsubsync 7,144
10 dejavu 6,534
11 pyAudioAnalysis 6,009
12 Porcupine 4,103
13 picard 4,074
14 basic-pitch 3,913
15 distil-whisper 3,850
16 riffusion-hobby 3,682
17 auto-editor 3,296
18 DeepFilterNet 3,038
19 aeneas 2,640
20 mkchromecast 2,257
21 m3u8 2,149
22 matchering 2,048
23 Tauon 2,047
