Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Python Audio Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
SpeechRecognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
-
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
-
picard
A cross-platform music tagger powered by the MusicBrainz database. Picard organizes your music collection by updating your tags, renaming your files, and sorting them into a folder structure, exactly the way you want it.
-
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: GUI for a Vocal Remover That Uses Deep Neural Networks | news.ycombinator.com | 2023-12-13
Project mention: Show HN: Synced lyrics database with a free, easy-to-use API | news.ycombinator.com | 2024-02-23I was always frustrated that there is no solid source for synced lyrics that also offers decent API support. There is good ol' Crintsoft's MiniLyrics that is thankfully free software, was what I used a lot in my childhood, but unfortunately the API is highly obfuscated. Another popular choice is the Musixmatch API, which has a very large database of synced lyrics, but with "free" API that are reverse-engineered from their app, you will quickly run into rate-limit.
That's why I created LRCLIB. It's aimed to provide completely free synchronized lyrics for everyone, especially for FOSS music players, with zero profit intention. It currently has nearly 3,000,000 (not deduplicated) lyrics in database. You can also contribute to the database by adding and syncing lyrics for your favorite songs using the LRCGET client.
I'm trying my best to make LRCLIB server-side code open-source as soon as possible. But right now, full LRCLIB's database dumps have already been uploaded regularly and publicly, which are simply sqlite3 files. Feel free to download, look at or do anything you want with the database at https://lrclib.net/db-dumps.
Many open-source projects have already begun integrating LRCLIB, including:
- beets - music library metadata management (https://github.com/beetbox/beets)
Start and Stop Listening Example
Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
I've had the best results from Facebook's DEMUCs. It's not too difficult to install, and I like the sound quality of their mdx_extra model. This is the command line I use (this will use the 2 stem version -- vocals, and everything else)
openai/jukebox: Music Generation
librosa/librosa: Python library for audio and music analysis
Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | news.ycombinator.com | 2023-10-23> Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.
Not necessarily indicative of foul play. I have two projects like this (https://github.com/smacke/ffsubsync and https://github.com/ipyflow/ipyflow) and I attribute it to not having great developer documentation.
Project mention: How would I compare two voice recordings of the same sentence and advise one speaker how to get closer to the second? | /r/learnpython | 2023-07-10I actually came up with an el cheapo version of what I want to accomplish that isn't perfect but without any research can implement it and it may actually prove useful to language learners. PM me if you're interested in hearing it and critiquing it. I can share here that I'm using this guy's multiple repos though: https://github.com/tyiannak/pyAudioAnalysis
Project mention: Fans quitting Spotify to save their love of music (2022) | news.ycombinator.com | 2024-03-24Have you tried https://www.funkwhale.audio/?
It can be used effectively as a "private spotify". Labeling is a solved problem thanks to https://picard.musicbrainz.org/, and the fact that a lot of the music you buy these days comes pre-labeled already.
You then have a web-app (and/or a mobile app, if that's your thing) where you can stream music as you would with Spotify.
You can even build yourself a little music-sharing commune with friends, where you all upload the albums you buy and accumulate a nice collection of diverse tunes.
Project mention: You know what I REALLY want? Something like img2img but for sound/music. | /r/StableDiffusion | 2023-12-04Why don’t you try Riffusion? https://github.com/riffusion/riffusion The general idea is convert your audio waveform into an image (spectrogram).
spotify/basic-pitch: Audio to midi converter
A few days ago I discovered a program that automatically trims the pauses from your video. This can decrease my raw footage duration by around 25%. I've used this for editing two videos so far, and this has been such a helpful tool.
Project mention: Stream to Chromecast with resolved, vlc and bash | news.ycombinator.com | 2024-01-07You can forward a Chrome window to it via your web browser, or you can use something like https://mkchromecast.com/ to push local videos to it.
You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.
Python Audio related posts
- Audio Fingerprinting and Recognition in Python
- Show HN: Sonauto – a more controllable AI music creator
- TSAC: Low Bitrate Audio Compression
- Fans quitting Spotify to save their love of music (2022)
- Show HN: Synced lyrics database with a free, easy-to-use API
- A transformer-based network model for pitch detection
- What do you use to tag your music on an NAS (Unraid)?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024
Index
What are some of the best open-source Audio projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | ultimatevocalremovergui | 14,833 |
2 | beets | 12,393 |
3 | pydub | 8,339 |
4 | SpeechRecognition | 8,040 |
5 | speechbrain | 7,869 |
6 | demucs | 7,644 |
7 | jukebox | 7,563 |
8 | librosa | 6,681 |
9 | ffsubsync | 6,495 |
10 | dejavu | 6,316 |
11 | pyAudioAnalysis | 5,668 |
12 | picard | 3,551 |
13 | Porcupine | 3,424 |
14 | riffusion | 3,164 |
15 | distil-whisper | 3,125 |
16 | basic-pitch | 2,901 |
17 | auto-editor | 2,481 |
18 | aeneas | 2,379 |
19 | mkchromecast | 2,173 |
20 | m3u8 | 1,918 |
21 | DeepFilterNet | 1,914 |
22 | TauonMusicBox | 1,578 |
23 | polymath | 1,456 |
Sponsored