InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises. Learn more →
Top 23 Python Audio Projects
-
-
SpeechRecognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
I am less familiar with speech recognition myself. I have implemented something similar many years ago, back when Google had a REST API that allowed you to upload audio and they would respond with the recognized words/sentence. I think they still have the same API available, though. They limited how much you could send, but for voice commands it was pretty solid. However, SpeechRecognition looks like a library worth trying out for this, as that seems like it could do offline processing depending on the underlying library. They also have some examples to look at.
-
InfluxDB
Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.
-
With the file downloaded, we're now going to arbitrarily slice it locally (you might have considered wheter it is possible to simply download a clip from youtube; all reliable methods I've found will essentially boil down to downloading the whole and then editing locally). For that we'll use the pydub library:
-
Project mention: Tiny bit of experience but need to compile a Github program. What is the best video / resource to learn to do this quickly? | reddit.com/r/learnprogramming | 2023-01-16
If you read the installation.md file it clearly states that it has only been tested on UNIX systems, so you might be on your own trying to get it to wor in windows.
-
That already exists
-
-
Project mention: Looking for a program that will examine a folder full of mp3s or flacs and list out ones with lower or higher than average volume | reddit.com/r/software | 2022-10-29
librosa can do that easily but I think there is an easier way to find what are you looking for:
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
Project mention: [D] What's stopping you from working on speech and voice? | reddit.com/r/MachineLearning | 2023-01-30
- https://github.com/speechbrain/speechbrain
-
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Update: From one researcher to another. I was referred to a Python Audio AI project . Once I determine exactly which module to use I should be smooth sailing. I'll send more updates soon.
-
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
Project mention: A stem splitting algorythm update would be cool! | reddit.com/r/KoalaSampler | 2023-01-26I think Koala is using Spleeter. There's also an open source alternative called Demucs, which yields far better results, especially on vocals and drums.
-
-
MusicBrainz Picard supports it. I use it all the time to tag my albums on my Macintosh.
-
Download Porcupine (i.e. Deep Neural Network). Run the following to turn the binary model into a base64 string, from the project folder.
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Project mention: Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries? | reddit.com/r/LanguageTechnology | 2023-02-03You could try aeneas. Syncabook apparently uses the afaligner library, which says that it was inspired by aeneas but uses FastDTW to find an approximation to the optimal warping path. This might make it slightly less accurate than aeneas.
-
I need to stream a captured video input from my Ubuntu Kinetic to a smart TV or iPad sink via Chromecast/Miracast or whatever. Can this be done without VLC (not reliable)? Mkchromecast is not working in Kinetic yet, and GNOME Network Displays only casts physical monitors.
-
-
Watson Developer Cloud Python SDK
:snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud
-
auto-editor — removing silent portions from video recordings
-
There's a riffusion app you can run locally.
-
-
Whatever you name it, try to make the UI look like Tauon Music Box. I love that application but its built on GTK3 I believe.
-
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
I am developing such a simulation tool in python called pyroomacoustics. It is similar to wayverb linked in a different comment, but can be operated in python and is probably easier to get started with. https://github.com/LCAV/pyroomacoustics
-
Project mention: Does this SincNet (neural architecture) contain a discriminator? | reddit.com/r/learnmachinelearning | 2022-12-30
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Audio related posts
- Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries?
- OK Google, Add Hotword Detection to Chrome
- Creating an asynchronous audio controller (using python-sounddevice) for my future wife
- Adding metadata to AIFF (Windows)
- Any app available to add lyrics to a song?
- Mongolian Gabba Goat Techno
- El éxito continuo de OpenAI: Y como llegaron a crear la IA más avanzada del 2023. ChatGPT.
-
A note from our sponsor - InfluxDB
www.influxdata.com | 4 Feb 2023
Index
What are some of the best open-source Audio projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | beets | 11,401 |
2 | SpeechRecognition | 6,732 |
3 | pydub | 6,727 |
4 | dejavu | 5,991 |
5 | jukebox | 5,772 |
6 | ffsubsync | 5,768 |
7 | librosa | 5,634 |
8 | speechbrain | 5,241 |
9 | pyAudioAnalysis | 5,116 |
10 | demucs | 4,946 |
11 | ultimatevocalremovergui | 4,840 |
12 | picard | 3,050 |
13 | Porcupine | 2,848 |
14 | aeneas | 2,102 |
15 | mkchromecast | 2,032 |
16 | m3u8 | 1,590 |
17 | Watson Developer Cloud Python SDK | 1,427 |
18 | auto-editor | 1,400 |
19 | riffusion | 1,382 |
20 | pyo | 1,139 |
21 | TauonMusicBox | 1,132 |
22 | pyroomacoustics | 1,051 |
23 | SincNet | 997 |