pydub vs pyAudioAnalysis

pydub

Manipulate audio with a simple and easy high level interface (by jiaaro)

Audio

pydub.com

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications (by tyiannak)

Audio Machine Learning signal-processing audio-data audio-analysis-tasks Pyaudioanalysis Python

Project         pydub          pyAudioAnalysis
Mentions        25             11
Stars           8,316          5,659
Growth          -              -
Activity        0.0            5.0
Latest Commit   14 days ago    19 days ago
Language        Python         Python
License         MIT License    Apache License 2.0

Mentions: the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars: the number of stars that a project has on GitHub.
Growth: month-over-month growth in stars.
Activity: a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.

pydub

Posts with mentions or reviews of pydub. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-12.

Looking for help with a winamp project please.
3 projects | /r/learnpython | 12 May 2023
Best language(s) for creating/manipulating sounds
1 project | /r/learnprogramming | 30 Apr 2023

Honestly, while C++ is used for professional audio software, you can get a lot done with Python and a library like pydub, or you can even learn to manipulate audio files without any libraries in any language. So if you are not particularly interested in C++ at the moment, you can start with Python, which is easier to learn. You can check out other Python audio manipulation libraries here
ChatGPT and Whisper APIs
15 projects | news.ycombinator.com | 1 Mar 2023

I doubt it will matter if you're breaking up mid sentence if you pass in the previous as a prompt and split words. This is how Whisper does it internally.
It's not absolutely perfect, but splitting on the word boundary is one line of code with the same package in their docs: https://github.com/jiaaro/pydub/blob/master/API.markdown#sil...
25MB is also a lot. That's 30 minutes to an hour on MP3 at reasonable compression. A 2 hour movie would have three splits.
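
The size estimate above can be checked with a little arithmetic. This is a rough sketch that assumes constant-bitrate MP3 and 1 MB = 10^6 bytes; the 64 and 128 kbps bitrates are illustrative assumptions, not figures from the post.

```python
import math


def minutes_per_limit(limit_mb: float, bitrate_kbps: float) -> float:
    """Minutes of constant-bitrate audio that fit in limit_mb megabytes."""
    limit_bits = limit_mb * 1_000_000 * 8  # assumes 1 MB = 10**6 bytes
    seconds = limit_bits / (bitrate_kbps * 1000)
    return seconds / 60


def pieces_needed(duration_min: float, limit_mb: float, bitrate_kbps: float) -> int:
    """Number of files needed to cover duration_min under the size limit."""
    return math.ceil(duration_min / minutes_per_limit(limit_mb, bitrate_kbps))


# Assumed bitrates; a 2-hour recording is 120 minutes.
for kbps in (64, 128):
    print(kbps, round(minutes_per_limit(25, kbps), 1), pieces_needed(120, 25, kbps))
```

Under these assumptions, 25 MB holds roughly 52 minutes at 64 kbps and roughly 26 minutes at 128 kbps, so a 2-hour recording needs three pieces at the lower bitrate and five at the higher one.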
FFmpeg 6.0
10 projects | news.ycombinator.com | 27 Feb 2023

Even given an option, it can be difficult to find the corresponding documentation, if only because of the many different submodules and encoders and decoders and filters that have oh-so-slightly different options. That said, I've just switched from pydub to ffmpeg-python (due to memory issues of the former[1]) and judging from the Jupyter notebook[2] it seems a much more intuitive method of constructing ffmpeg pipelines.
[1] https://github.com/jiaaro/pydub/issues/135
[2] https://github.com/kkroening/ffmpeg-python/tree/master/examp...
Download & Trim MP3 from Youtube with Python
3 projects | dev.to | 21 Dec 2022

With the file downloaded, we're now going to arbitrarily slice it locally (you might have considered whether it is possible to simply download a clip from YouTube; all reliable methods I've found will essentially boil down to downloading the whole and then editing locally). For that we'll use the pydub library:
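
The post's own snippet is not part of this excerpt. As a minimal sketch of what such a slice might look like, assuming pydub and ffmpeg are installed: the helper names `to_ms` and `trim_mp3` and the file names are hypothetical, while `AudioSegment.from_mp3`, millisecond slicing, and `export` are pydub's documented interface.

```python
def to_ms(timestamp: str) -> int:
    """Convert 'MM:SS' or 'HH:MM:SS' to milliseconds."""
    seconds = 0
    for part in timestamp.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds * 1000


def trim_mp3(src: str, dst: str, start: str, end: str) -> None:
    """Cut the span [start, end) out of an MP3 file (hypothetical helper)."""
    # Imported here so to_ms() stays usable without pydub installed.
    from pydub import AudioSegment

    audio = AudioSegment.from_mp3(src)       # decoding MP3 requires ffmpeg
    clip = audio[to_ms(start):to_ms(end)]    # pydub slices by milliseconds
    clip.export(dst, format="mp3")


# Example call with hypothetical file names:
# trim_mp3("download.mp3", "clip.mp3", "00:30", "01:45")
```

Note that pydub decodes the whole file into memory before slicing, which is consistent with the memory concerns raised in other posts on this page.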
Playing multiple .wav and/or mp3 files in Python
1 project | /r/learnpython | 17 Oct 2022

I guess it's possible in theory; a quick search suggests the pydub library. But you may find something better if you do a little research.
I made a cross-platform command-line app called maestro to play music!
4 projects | /r/Python | 20 Jul 2022

Uses https://github.com/cheofusi/just_playback to play sound. It's actually surprising how hard it was to find a cross-platform Python module to play sound that doesn't require an external dependency like ffmpeg. Even then, modules like https://github.com/jiaaro/pydub don't support features like seeking/scrubbing, which was a must-have for my project.
Batch conversion FLAC to WAV
1 project | /r/audiophile | 16 May 2022

Once Python is installed, you will also need to install the "pydub" package for this script to work. If you're on a Windows computer, you can do this from the command line (run the "cmd" program). If you're on a Mac, you can do this from the terminal. Basically, the way that you do this is using "pip", a "helper" program that comes with Python. Once you launch the command line, just run the command python -m pip install pydub --upgrade and you should see a message showing that it successfully installed. If you're struggling with this step, just google how to "pip install python packages" and you can find a lot of beginner guides.
How can I modify the pitch of an audio file and save it to disk?
1 project | /r/FlutterDev | 20 Feb 2022

That is kinda what serverless functions are built for. Looks like python has some good libraries for this: https://github.com/jiaaro/pydub.
Playing large audio files?
1 project | /r/learnpython | 7 Jan 2022

The files are big, so it's not feasible to load one in all at once. They have to be streamed/chunked somehow. (sadly, pydub doesn't support this...)
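
For uncompressed WAV files, chunked reading is possible with the standard library alone. The sketch below is not a pydub feature; it assumes the input is a PCM WAV file, and the function name `iter_wav_chunks` and the default chunk size are hypothetical.

```python
import wave
from typing import Iterator


def iter_wav_chunks(path: str, frames_per_chunk: int = 65536) -> Iterator[bytes]:
    """Yield raw PCM data from a WAV file a fixed number of frames at a time."""
    with wave.open(path, "rb") as wav:
        while True:
            data = wav.readframes(frames_per_chunk)
            if not data:  # empty bytes means end of file
                break
            yield data
```

Only one chunk is held in memory at a time, so memory use stays bounded regardless of file length. Compressed formats such as MP3 would need a streaming decoder instead.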

pyAudioAnalysis

Posts with mentions or reviews of pyAudioAnalysis. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-10.

How would I compare two voice recordings of the same sentence and advise one speaker how to get closer to the second?
2 projects | /r/learnpython | 10 Jul 2023

I actually came up with an el cheapo version of what I want to accomplish that isn't perfect but without any research can implement it and it may actually prove useful to language learners. PM me if you're interested in hearing it and critiquing it. I can share here that I'm using this guy's multiple repos though: https://github.com/tyiannak/pyAudioAnalysis
How do I run code only when an audio file has bass
1 project | /r/learnpython | 27 May 2023
A Python library for audio feature extraction, classification, segmentation and applications
1 project | /r/coding | 26 Mar 2023
Phonetic search for audio files
1 project | /r/audio | 16 Jan 2023

Update: From one researcher to another. I was referred to a Python Audio AI project. Once I determine exactly which module to use I should be smooth sailing. I'll send more updates soon.
Clustering songs with different lengths
1 project | /r/MLQuestions | 23 Aug 2022

Hey folks, I'm looking into clustering audio files with features extracted by pyAudioAnalysis. However, every feature (I'm interested in MFCC, spectral centroid and spread, and BPM) is extracted for each frame of the song (by default 0.05s, excluding BPM, which relates to the whole track), so tracks with different lengths produce arrays with different shapes.
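
One common way around the shape mismatch is to collapse the per-frame values into per-track statistics, so every track yields a vector of the same length. The sketch below is an assumption about how that could be done, not the poster's solution or pyAudioAnalysis code; the function name `summarize` and the toy data are hypothetical.

```python
from statistics import mean, pstdev
from typing import Sequence


def summarize(frames: Sequence[Sequence[float]]) -> list[float]:
    """Collapse an (n_frames x n_features) matrix into [means..., stds...]."""
    columns = list(zip(*frames))  # one tuple of values per feature
    return [mean(c) for c in columns] + [pstdev(c) for c in columns]


# Toy data: two "tracks" with 2 and 3 frames, 2 features per frame.
short_track = [[1.0, 10.0], [3.0, 10.0]]
long_track = [[2.0, 0.0], [2.0, 4.0], [2.0, 8.0]]

# Both summaries have 4 entries, regardless of track length.
print(len(summarize(short_track)), len(summarize(long_track)))
```

Mean and standard deviation discard temporal order, which may or may not matter for a given clustering task.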
AUDIO ANALYSIS WITH LIBROSA
2 projects | dev.to | 23 May 2022

To learn more about pyAudioAnalysis here you go.
Creating Audio Features with PyAudio Analysis
1 project | dev.to | 6 Apr 2022

Humans are great at classifying noises. We can hear a chirp and surmise that it belongs to a bird, we can hear an abstract noise and classify it as speech with a particular meaning and definition. This relationship between humans and audio classification forms the basis of speech and human communication as a whole. Translating this incredible ability to computers, on the other hand, can be a difficult challenge to say the least. Whilst we can naturally decompose signals, how do we teach computers to do this, and how do we show what parts of the signal matter and what parts of the signal are irrelevant or noisy? This is where PyAudio Analysis comes in. PyAudio Analysis is an open source Python project by Theodoros Giannakopoulos, a Principal researcher of multimodal machine learning at the Multimedia Analysis Group of the Computational Intelligence Lab (MagCIL). The package aims to simplify the feature extraction and classification process by providing a number of helpful tools that can sift through the signal and create relevant features. These features can then be used to train models for classification tasks.
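
To make "creating features from a signal" concrete, here is a plain-Python sketch of two simple frame-level features, energy and zero-crossing rate. It illustrates the general idea only and is not pyAudioAnalysis code; the function name `frame_features` and its parameters are hypothetical.

```python
from typing import Sequence


def frame_features(
    signal: Sequence[float], frame_len: int, hop: int
) -> list[tuple[float, float]]:
    """Return (energy, zero_crossing_rate) for each full frame of the signal."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        # Mean squared amplitude of the frame.
        energy = sum(x * x for x in frame) / frame_len
        # Fraction of adjacent sample pairs whose sign differs.
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
        )
        feats.append((energy, crossings / (frame_len - 1)))
    return feats
```

Each frame becomes one row of numbers, and rows like these are what a classifier would be trained on.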
[P] Feature extraction for acoustic signals
1 project | /r/MachineLearning | 16 Jan 2022

This might be relevant, which has a set of feature extraction methods implemented: https://github.com/tyiannak/pyAudioAnalysis/wiki/3.-Feature-Extraction
Hacker News top posts: Dec 11, 2021
3 projects | /r/hackerdigest | 11 Dec 2021

A library for audio feature extraction, regression, classification, segmentation (2 comments)
Audio feature extraction, classification, segmentation and applications
2 projects | news.ycombinator.com | 9 Dec 2021

What are some alternatives?

When comparing pydub and pyAudioAnalysis you can also consider the following projects:

librosa - Python library for audio and music analysis

SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.

ffmpeg-python - Python bindings for FFmpeg - with complex filtering support

pyAcoustics - A collection of python scripts for extracting and analyzing acoustics from audio files.

mutagen - Python module for handling audio metadata

mingus - Mingus is a music package for Python

audioread - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Watson Developer Cloud Python SDK - Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud

m3u8 - Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions

aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)