Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 audio-analysis Open-Source Projects
-
essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
DSWaveformImage
Generate waveform images from audio files on iOS, macOS & visionOS in Swift. Native SwiftUI & UIKit views.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
PipeWire-Guide
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
-
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
-
audioMotion-analyzer
High-resolution real-time graphic audio spectrum analyzer JavaScript module with no dependencies.
-
sync-audio-tracks
Audio tracks synchronization command-line tool for video editors that don't support it
-
WLEDAudioSync-Chataigne-Module
Stream music/audio to WLED Sound Reactive. Real time audio data analysis: volume, FFT, pitch detection etc.Include RTMGC. Include Real Time Music Mood Detection.WLED audio sync integrated v1 for esp8266 & v2 message for esp32.
-
SoundSage---LLM-Audio-Processing
Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a full set of tools for an AI to use for automating Audio processing for Music, Film, Game and any other possible applications. UI for AutoGain is very basic but the app is very functional. currently only for MacOS
-
simple_dr_meter
An (optimized) implementation of the music DR measurement (compliant with http://dr.loudness-war.info/), it supports CUE sheets and is faster than all currently available alternatives (at the time of writing, not sure about now)
-
AudioInsightsGenerator
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
RVC does live voice changing with a little latency: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...
The product isn't exactly spectacular, but most of the works seems to have bene done. Just needs someone to go over the UI and make it less unstable, really.
Project mention: AudioFlux: Open-source for audio and music analysis, feature extraction | news.ycombinator.com | 2024-03-27
Project mention: Are there any libraries that I can use or any way to make such audio waveforms? I looked into a library called "react-native-audiowaveform" but it's not being maintained anymore and doesn't work for newer versions of RN | /r/reactnative | 2023-05-21https://github.com/dmrschmidt/DSWaveformImage for iOS
you could always cut up the video into segments A,B,C,D,E using like ffmpeg -ss 00:00:00 -t 00:10:00 -i input -c copy A.ext (where A is the first 10 minutes of video) and then recombine with https://trac.ffmpeg.org/wiki/Concatenate, might take a while though and some basic math. maybe you could do something programatically with chromaprint by identifying the repeating audio segments? that's just an idea since that's what this jellyfin plugin does to identify repeated segments of shows in the form of opening credits.
Project mention: Friture – a FOSS real-time audio analyzer for Windows, macOS, and Linux | news.ycombinator.com | 2023-09-24
It is far easier to use a non-Windows operating system and simply direct the IQ data into the application you want, or use a better app which can take data directly from the RSPdx. However, in an RTL-SDR book, I saw a reference to VB-Cable, which is a separate software from VAC. Pipewire is another tool, definitely open-source and free, which should work.
I have a little hobby project where I record an FM radio music station using a SDR and then remove all the non-music portions for offline listening. I like the music selections the DJs pick, but I prefer not to listen to the DJ commentary and the advertisements.
I evaluated three methods of recording: analog capture from a standalone FM receiver, using this nrsc5 library to record the "HD" radio stream, and using an AirSpy SDR with this library: https://github.com/jj1bdx/airspy-fmradion
Recording the "HD" (what a misnomer) radio was nice in that there was no hiss or multipath effects, but in comparison to the other methods the digital compression artifacts became impossible to un-hear. It seems to top out at about 96 kbps
The airspy-fmradion library has some nice stuff in it to address multipath, resulting in the best audio quality of the three methods I tested.
I use https://github.com/ina-foss/inaSpeechSegmenter to identify which segments of the recordings are speech vs. music.
Project mention: Zimtohrli: A New Psychoacoustic Perceptual Metric for Audio Compression | news.ycombinator.com | 2024-05-08PEAQ/PESQ and visqol is worth trying for that. In principle they operate as you suggest. I keep a short overview of audio quality methods/tools here: https://github.com/jonnor/machinehearing/blob/master/audio-q...
I am currently working on a project called SoundSage - LLM Audio Processing, which is hosted on GitHub. The project is aimed at developing a system for audio processing using various tools and techniques. You can find the repository here: SoundSage - LLM Audio Processing
Project mention: GitHub - Ravi-Teja-konda/AudioInsightsGenerator: Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions! | /r/programming | 2023-07-07
audio-analysis related posts
-
Zimtohrli: A New Psychoacoustic Perceptual Metric for Audio Compression
-
AudioFlux: Open-source for audio and music analysis, feature extraction
-
How to remove repeating segments from an video file?
-
Friture – a FOSS real-time audio analyzer for Windows, macOS, and Linux
-
GitHub - Ravi-Teja-konda/AudioInsightsGenerator: Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation, narratives, and content filtering. Explore your audio's hidden dimensions!
-
A library for audio processing , support Android platform
-
A library for DSP and audio analysis, support iOS and macOS
-
A note from our sponsor - InfluxDB
www.influxdata.com | 10 May 2024
Index
What are some of the best open-source audio-analysis projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Retrieval-based-Voice-Conversion-WebUI | 19,255 |
2 | essentia | 2,694 |
3 | audioFlux | 2,056 |
4 | madmom | 1,242 |
5 | DSWaveformImage | 947 |
6 | chromaprint | 894 |
7 | friture | 856 |
8 | PipeWire-Guide | 829 |
9 | LabSound | 705 |
10 | inaSpeechSegmenter | 695 |
11 | Sushi | 613 |
12 | audioMotion-analyzer | 529 |
13 | machinehearing | 220 |
14 | pitch-detection | 212 |
15 | GuitarTuner | 130 |
16 | ebur128 | 82 |
17 | sync-audio-tracks | 59 |
18 | WLEDAudioSync-Chataigne-Module | 32 |
19 | SoundSage---LLM-Audio-Processing | 25 |
20 | simple_dr_meter | 17 |
21 | AudioInsightsGenerator | 11 |
22 | audioparam-visualization | 7 |
23 | PlotAssert | 6 |
Sponsored