InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises. Learn more →
Top 23 Python Audio Projects
music library manager and MusicBrainz taggerProject mention: beets VS Musort - a user suggested alternative | libhunt.com/r/beets | 2023-01-27
Speech recognition module for Python, supporting several engines and APIs, online and offline.Project mention: Voice commands in Doom Eternal possible? | reddit.com/r/linux_gaming | 2022-12-23
I am less familiar with speech recognition myself. I have implemented something similar many years ago, back when Google had a REST API that allowed you to upload audio and they would respond with the recognized words/sentence. I think they still have the same API available, though. They limited how much you could send, but for voice commands it was pretty solid. However, SpeechRecognition looks like a library worth trying out for this, as that seems like it could do offline processing depending on the underlying library. They also have some examples to look at.
Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.
Manipulate audio with a simple and easy high level interfaceProject mention: Download & Trim MP3 from Youtube with Python | dev.to | 2022-12-21
With the file downloaded, we're now going to arbitrarily slice it locally (you might have considered wheter it is possible to simply download a clip from youtube; all reliable methods I've found will essentially boil down to downloading the whole and then editing locally). For that we'll use the pydub library:
Audio fingerprinting and recognition in Python (by worldveil)Project mention: Tiny bit of experience but need to compile a Github program. What is the best video / resource to learn to do this quickly? | reddit.com/r/learnprogramming | 2023-01-16
If you read the installation.md file it clearly states that it has only been tested on UNIX systems, so you might be on your own trying to get it to wor in windows.
Code for the paper "Jukebox: A Generative Model for Music"Project mention: Mongolian Gabba Goat Techno | reddit.com/r/BrandNewSentence | 2023-02-02
That already exists
Automagically synchronize subtitles with video.Project mention: Drifting subtitles? | reddit.com/r/ffmpeg | 2022-12-16
Python library for audio and music analysisProject mention: Looking for a program that will examine a folder full of mp3s or flacs and list out ones with lower or higher than average volume | reddit.com/r/software | 2022-10-29
librosa can do that easily but I think there is an easier way to find what are you looking for:
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
A PyTorch-based Speech ToolkitProject mention: [D] What's stopping you from working on speech and voice? | reddit.com/r/MachineLearning | 2023-01-30
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and ApplicationsProject mention: Phonetic search for audio files | reddit.com/r/audio | 2023-01-16
Update: From one researcher to another. I was referred to a Python Audio AI project . Once I determine exactly which module to use I should be smooth sailing. I'll send more updates soon.
Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.Project mention: A stem splitting algorythm update would be cool! | reddit.com/r/KoalaSampler | 2023-01-26
I think Koala is using Spleeter. There's also an open source alternative called Demucs, which yields far better results, especially on vocals and drums.
GUI for a Vocal Remover that uses Deep Neural Networks.Project mention: And no message could've been any clearer | reddit.com/r/MadeMeSmile | 2023-01-24
MusicBrainz Picard audio file taggerProject mention: Adding metadata to AIFF (Windows) | reddit.com/r/audiophile | 2023-02-03
MusicBrainz Picard supports it. I use it all the time to tag my albums on my Macintosh.
On-device wake word detection powered by deep learningProject mention: OK Google, Add Hotword Detection to Chrome | dev.to | 2023-02-03
Download Porcupine (i.e. Deep Neural Network). Run the following to turn the binary model into a base64 string, from the project folder.
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)Project mention: Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries? | reddit.com/r/LanguageTechnology | 2023-02-03
You could try aeneas. Syncabook apparently uses the afaligner library, which says that it was inspired by aeneas but uses FastDTW to find an approximation to the optimal warping path. This might make it slightly less accurate than aeneas.
Cast macOS and Linux Audio/Video to your Google Cast and Sonos DevicesProject mention: Stream from Ubuntu to Chromecast or Miracast | reddit.com/r/Ubuntu | 2022-11-17
I need to stream a captured video input from my Ubuntu Kinetic to a smart TV or iPad sink via Chromecast/Miracast or whatever. Can this be done without VLC (not reliable)? Mkchromecast is not working in Kinetic yet, and GNOME Network Displays only casts physical monitors.
Python m3u8 Parser for HTTP Live Streaming (HLS) TransmissionsProject mention: proxy for live tv m3u | reddit.com/r/jellyfin | 2022-07-17
:snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud
Auto-Editor: Effort free video editing!Project mention: Name a program that doesn't get enough love! | reddit.com/r/linux | 2022-12-29
auto-editor — removing silent portions from video recordings
Stable diffusion for real-time music generationProject mention: Downloading songs? | reddit.com/r/riffusion | 2023-01-25
There's a riffusion app you can run locally.
Python DSP module
The desktop music player from the future! :city_sunset:Project mention: Any suggestions for a new media player name? | reddit.com/r/gnome | 2022-11-06
Whatever you name it, try to make the UI look like Tauon Music Box. I love that application but its built on GTK3 I believe.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.Project mention: physically modeling reverb | reddit.com/r/DSP | 2022-09-15
I am developing such a simulation tool in python called pyroomacoustics. It is similar to wayverb linked in a different comment, but can be operated in python and is probably easier to get started with. https://github.com/LCAV/pyroomacoustics
SincNet is a neural architecture for efficiently processing raw audio samples.Project mention: Does this SincNet (neural architecture) contain a discriminator? | reddit.com/r/learnmachinelearning | 2022-12-30
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Audio related posts
Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries?
4 projects | reddit.com/r/LanguageTechnology | 3 Feb 2023
OK Google, Add Hotword Detection to Chrome
1 project | dev.to | 3 Feb 2023
Creating an asynchronous audio controller (using python-sounddevice) for my future wife
1 project | reddit.com/r/pythonhelp | 3 Feb 2023
Adding metadata to AIFF (Windows)
1 project | reddit.com/r/audiophile | 3 Feb 2023
Any app available to add lyrics to a song?
1 project | reddit.com/r/Fedora | 3 Feb 2023
Mongolian Gabba Goat Techno
1 project | reddit.com/r/BrandNewSentence | 2 Feb 2023
El éxito continuo de OpenAI: Y como llegaron a crear la IA más avanzada del 2023. ChatGPT.
2 projects | dev.to | 2 Feb 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 4 Feb 2023
What are some of the best open-source Audio projects in Python? This list will help you:
|17||Watson Developer Cloud Python SDK||1,427|