Top 23 audio-analysis Open-Source Projects

Retrieval-based-Voice-Conversion-WebUI

56 19,255 9.6 Python

Easily train a good VC model with voice data <= 10 mins!

Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

RVC does live voice changing with a little latency: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...
The product isn't exactly spectacular, but most of the works seems to have bene done. Just needs someone to go over the UI and make it less unstable, really.

essentia

2 2,694 8.7 C++

C++ library for audio and music analysis, description and synthesis, including Python bindings
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
audioFlux

57 2,056 7.8 C

A library for audio and music analysis, feature extraction.

Project mention: AudioFlux: Open-source for audio and music analysis, feature extraction | news.ycombinator.com | 2024-03-27

madmom

2 1,242 4.4 Python

Python audio and music signal processing library
DSWaveformImage

2 947 7.6 Swift

Generate waveform images from audio files on iOS, macOS & visionOS in Swift. Native SwiftUI & UIKit views.

Project mention: Are there any libraries that I can use or any way to make such audio waveforms? I looked into a library called "react-native-audiowaveform" but it's not being maintained anymore and doesn't work for newer versions of RN | /r/reactnative | 2023-05-21

https://github.com/dmrschmidt/DSWaveformImage for iOS

chromaprint

4 894 0.0 C++

C library for generating audio fingerprints used by AcoustID

Project mention: How to remove repeating segments from an video file? | /r/ffmpeg | 2023-12-09

you could always cut up the video into segments A,B,C,D,E using like ffmpeg -ss 00:00:00 -t 00:10:00 -i input -c copy A.ext (where A is the first 10 minutes of video) and then recombine with https://trac.ffmpeg.org/wiki/Concatenate, might take a while though and some basic math. maybe you could do something programatically with chromaprint by identifying the repeating audio segments? that's just an idea since that's what this jellyfin plugin does to identify repeated segments of shows in the form of opening credits.

friture

13 856 3.8 Python

Real-time audio visualizations (spectrum, spectrogram, etc.)

Project mention: Friture – a FOSS real-time audio analyzer for Windows, macOS, and Linux | news.ycombinator.com | 2023-09-24

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
PipeWire-Guide

12 829 5.8 Shell

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

Project mention: Alternatives to MultiPSK? | /r/RTLSDR | 2023-06-05

It is far easier to use a non-Windows operating system and simply direct the IQ data into the application you want, or use a better app which can take data directly from the RSPdx. However, in an RTL-SDR book, I saw a reference to VB-Cable, which is a separate software from VAC. Pipewire is another tool, definitely open-source and free, which should work.

LabSound

1 705 7.4 C++

:microscope: :speaker: graph-based audio engine
inaSpeechSegmenter

3 695 6.4 Python

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Project mention: Listen to HD radio with a $30 RTL SDR dongle | news.ycombinator.com | 2023-11-05

I have a little hobby project where I record an FM radio music station using a SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.
I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion
Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps
The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.
I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.

Sushi

4 613 0.0 Python

Automatic subtitle shifter based on audio (by tp7)
audioMotion-analyzer

2 529 7.8 JavaScript

High-resolution real-time graphic audio spectrum analyzer JavaScript module with no dependencies.
machinehearing

2 220 7.3 Jupyter Notebook

Machine Learning applied to sound

Project mention: Zimtohrli: A New Psychoacoustic Perceptual Metric for Audio Compression | news.ycombinator.com | 2024-05-08

PEAQ/PESQ and visqol is worth trying for that. In principle they operate as you suggest. I keep a short overview of audio quality methods/tools here: https://github.com/jonnor/machinehearing/blob/master/audio-q...

pitch-detection

2 212 0.0 Rust

A collection of algorithms to determine the pitch of a sound sample.
GuitarTuner

2 130 3.4 Python

Guitar tuner program made with Python, Tkinter and PyAudio.
ebur128

1 82 3.4 Rust

Implementation of the EBU R128 loudness standard
sync-audio-tracks

1 59 3.9 C++

Audio tracks synchronization command-line tool for video editors that don't support it
WLEDAudioSync-Chataigne-Module

2 32 8.4 JavaScript

Stream music/audio to WLED Sound Reactive. Real time audio data analysis: volume, FFT, pitch detection etc.Include RTMGC. Include Real Time Music Mood Detection.WLED audio sync integrated v1 for esp8266 & v2 message for esp32.

Project mention: BPM synchronization | /r/WLED | 2023-07-06

SoundSage---LLM-Audio-Processing

2 25 9.1 Python

Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a full set of tools for an AI to use for automating Audio processing for Music, Film, Game and any other possible applications. UI for AutoGain is very basic but the app is very functional. currently only for MacOS

Project mention: Text-to-Audio Processing *Help Needed* | /r/LLMDevs | 2023-07-01

I am currently working on a project called SoundSage - LLM Audio Processing, which is hosted on GitHub. The project is aimed at developing a system for audio processing using various tools and techniques. You can find the repository here: SoundSage - LLM Audio Processing

simple_dr_meter

1 17 0.0 Python

An (optimized) implementation of the music DR measurement (compliant with http://dr.loudness-war.info/), it supports CUE sheets and is faster than all currently available alternatives (at the time of writing, not sure about now)
AudioInsightsGenerator

2 11 5.2 Jupyter Notebook

Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!

Project mention: GitHub - Ravi-Teja-konda/AudioInsightsGenerator: Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions! | /r/programming | 2023-07-07

audioparam-visualization

1 7 5.8 TypeScript

Visualization of how Web Audio API's AudioParam value changes over time
PlotAssert

0 6 0.0 Kotlin

Test the shape of your functions!
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

audio-analysis related posts

Zimtohrli: A New Psychoacoustic Perceptual Metric for Audio Compression

2 projects | news.ycombinator.com | 8 May 2024
AudioFlux: Open-source for audio and music analysis, feature extraction

1 project | news.ycombinator.com | 27 Mar 2024
How to remove repeating segments from an video file?

2 projects | /r/ffmpeg | 9 Dec 2023
Friture – a FOSS real-time audio analyzer for Windows, macOS, and Linux

1 project | news.ycombinator.com | 24 Sep 2023
GitHub - Ravi-Teja-konda/AudioInsightsGenerator: Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!

1 project | /r/programming | 7 Jul 2023
A library for audio processing , support Android platform

2 projects | /r/androiddev | 29 May 2023
A library for DSP and audio analysis, support iOS and macOS

1 project | /r/iOSProgramming | 25 May 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 10 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source audio-analysis projects? This list will help you:

	Project	Stars
1	Retrieval-based-Voice-Conversion-WebUI	19,255
2	essentia	2,694
3	audioFlux	2,056
4	madmom	1,242
5	DSWaveformImage	947
6	chromaprint	894
7	friture	856
8	PipeWire-Guide	829
9	LabSound	705
10	inaSpeechSegmenter	695
11	Sushi	613
12	audioMotion-analyzer	529
13	machinehearing	220
14	pitch-detection	212
15	GuitarTuner	130
16	ebur128	82
17	sync-audio-tracks	59
18	WLEDAudioSync-Chataigne-Module	32
19	SoundSage---LLM-Audio-Processing	25
20	simple_dr_meter	17
21	AudioInsightsGenerator	11
22	audioparam-visualization	7
23	PlotAssert	6

audio-analysis

Top 23 audio-analysis Open-Source Projects

audio-analysis related posts

Zimtohrli: A New Psychoacoustic Perceptual Metric for Audio Compression

AudioFlux: Open-source for audio and music analysis, feature extraction

How to remove repeating segments from an video file?

Friture – a FOSS real-time audio analyzer for Windows, macOS, and Linux

GitHub - Ravi-Teja-konda/AudioInsightsGenerator: Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!

A library for audio processing , support Android platform

A library for DSP and audio analysis, support iOS and macOS

Index