Python audio-processing

Open-source Python projects categorized as audio-processing

Top 23 Python audio-processing Projects

  1. spleeter

    Deezer source separation library including pretrained models.

    Project mention: DeepMind releases Lyria 2 music generation model | news.ycombinator.com | 2025-04-24

  2. speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: Speaker Diarization in Python | dev.to | 2024-08-22

    Simple Diarizer is a speaker diarization library that utilizes pretrained models from SpeechBrain. To get started with simple_diarizer, follow these steps:

  3. auto-editor

    Auto-Editor: Efficient media analysis and rendering

  4. audio-reactive-led-strip

    🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi

  5. ailia-models

    The collection of pre-trained, state-of-the-art AI models for ailia SDK

  6. LedFx

    LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

  7. audio-slicer

    A simple GUI application that slices audio with silence detection

  8. SALMONN

    SALMONN: Speech Audio Language Music Open Neural Network

  9. SincNet

    SincNet is a neural architecture for efficiently processing raw audio samples.

  10. StreamSpeech

    StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

    Project mention: Ask HN: Real-time speech-to-speech translation | news.ycombinator.com | 2024-10-24

    Has anyone had any luck with an offline, free, open-source real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)?

    * https://github.com/ictnlp/StreamSpeech

    * https://github.com/k2-fsa/sherpa-onnx

    * https://github.com/openai/whisper

    I'm looking for a simple app that can listen for English, translate into Korean (and other languages), then perform speech synthesis on the translation. Basically, a Babelfish that doesn't stick in the ear. Although real-time would be great, a 3- to 5-second delay is manageable.

    RTranslator is awkward (couldn't get it to perform speech-to-speech using a single phone). 3PO sprouts errors like dandelions and requires an online connection.

    Any suggestions?

  11. nnAudio

    Audio processing using PyTorch 1D convolution networks

  12. wunjo.wladradchenko.ru

    Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

    Project mention: Why My Open Source Project Wunjo Can’t Reach 1K Stars? 😒 | dev.to | 2025-03-25

    I’ve been building Wunjo, an Open Source AI-powered video editing tool that can today automatically cut, highlight, and transform videos with a simple text prompt. Sounds cool, right? Yet, getting to 1K stars on GitHub feels like an endless grind. It is a set of software tools for streamlining video and photo editing, with a built-in API (API Docs) for use in other pet projects.

  13. FoleyCrafter

    FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

    Project mention: Bring Silent Videos to Life Sounds(Open-Source) | news.ycombinator.com | 2025-02-27

  14. unsilence

    Console Interface and Library to remove silent parts of a media file 🔈

  15. TimeSide

    Scalable audio processing framework and server written in Python

  16. whisper-at

    Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

    Project mention: Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds | news.ycombinator.com | 2024-11-27

    Have you considered supporting whisper-at - https://github.com/YuanGongND/whisper-at ? Being able to identify sounds on a timeline can be useful, e.g. for a politician's speech and how the audience is reacting to it (clapping, applauding).

  17. moseca

    A Streamlit web app for music source separation & karaoke

  18. spectrographic

    Turn an image into sound whose spectrogram looks like the image.

  19. stemgen

    🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.

  20. pyCrossfade

    pyCrossfade is the result of a personal project to use beat matching, gradual bpm shift on bars, and EQ modification to provide smooth and tunable transitions between music files.

  21. see2sound

    Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

    Project mention: Generate Spatial Audio from Image/Video | news.ycombinator.com | 2024-07-05

  22. voice-safety-classifier

    Voice safety classifier

    Project mention: Initiative from Google, OpenAI, Discord, others could transform trust and safety | news.ycombinator.com | 2025-02-11

  23. gensound

    Pythonic audio processing and generation framework

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python audio-processing related posts

  • Are stems a good way of making mashups

    1 project | /r/Beatmatch | 10 Dec 2023
  • Big News!

    1 project | /r/OnePieceMangaCut | 9 Dec 2023
  • Anybody here know what AI model does Steinberg's Spectralayers use to do stem separation?

    1 project | /r/audioengineering | 8 Dec 2023
  • Comparing Humans, GPT-4, and GPT-4V on Abstraction and Reasoning Tasks

    2 projects | news.ycombinator.com | 19 Nov 2023
  • Help needed in developing this! It’s an AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers that features AI text-to-audio, onboard fx, onscreen ChatGPT, and more. Send a line if you can help!

    1 project | /r/aiMusic | 24 Sep 2023
  • AI tools list sorted by category in one place

    1 project | /r/ChatGPT | 11 Jul 2023
  • Software to lower tracks?

    1 project | /r/gratefulguitar | 3 Jul 2023

Index

What are some of the best open-source audio-processing projects in Python? This list will help you:

 #  Project                    Stars
 1  spleeter                  26,840
 2  speechbrain                9,834
 3  auto-editor                3,296
 4  audio-reactive-led-strip   2,750
 5  ailia-models               2,196
 6  LedFx                      1,545
 7  audio-slicer               1,344
 8  SALMONN                    1,227
 9  SincNet                    1,171
10  StreamSpeech               1,073
11  nnAudio                    1,064
12  wunjo.wladradchenko.ru     1,024
13  FoleyCrafter                 582
14  unsilence                    576
15  TimeSide                     385
16  whisper-at                   355
17  moseca                       322
18  spectrographic               277
19  stemgen                      234
20  pyCrossfade                  128
21  see2sound                    124
22  voice-safety-classifier       84
23  gensound                      81
