SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Audio Open-Source Projects
-
You might have heard of FFMPEG or ImageMagick for image and video edition in a programmatic way. MoviePy is a Python module for video editing (Python wrapper for FFMPEG and ImageMagick). It provides functions for cutting, concatenations, title insertions, video compositing, video processing, and the creation of custom effects. It can read and write common video and audio formats and be run on any platform with Python 2.7 or 3+.
-
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
Project mention: privateGPT VS quivr - a user suggested alternative | libhunt.com/r/privateGPT | 2024-01-12 -
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Project mention: MPV: Vulkan Video Decoding: Usage Guide and FAQ | news.ycombinator.com | 2023-12-20
-
srs
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
Project mention: What's the state of screen-sharing games to friends on linux? | /r/linux_gaming | 2023-06-26 -
Project mention: Howler JavaScript – Audio library for the modern web | news.ycombinator.com | 2024-02-12
-
Koel Music Server (https://github.com/koel/koel) Opensource music server, nice GUI and allows high fidelity streaming of all your music files
-
awesome-python-applications
💿 Free software that works great, and also happens to be open-source Python.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
digital_video_introduction
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
There's a great introduction to video tech, including codecs, at https://github.com/leandromoreira/digital_video_introduction
-
BackgroundMusic
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
-
Project mention: GUI for a Vocal Remover That Uses Deep Neural Networks | news.ycombinator.com | 2023-12-13
-
Project mention: Amazon plans to charge for Alexa in June–unless internal conflict delays revamp | news.ycombinator.com | 2024-01-20
Yeah, whisper is the closest thing we have, but even it requires more processing power than is present in most of these edge devices in order to feel smooth. I've started a voice interface project on a Raspberry Pi 4, and it takes about 3 seconds to produce a result. That's impressive, but not fast enough for Alexa.
From what I gather a Pi 5 can do it in 1.5 seconds, which is closer, so I suspect it's only a matter of time before we do have fully local STT running directly on speakers.
> Probably anathema to the space, but if the devices leaned into the ~five tasks people use them for (timers, weather, todo list?) could probably tighten up the AI models to be more accurate and/or resource efficient.
Yes, this is the approach taken by a lot of streaming STT systems, like Kaldi [0]. Rather than use a fully capable model, you train a specialized one that knows what kinds of things people are likely to say to it.
-
BlackHole
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Project mention: Use Tonex software to RIP Amplitube Tones without Hardware. | /r/AmpliTube | 2023-12-10I did something similar using BlackHole , with an old Mac, not as fast, but I'm quite happy with the results.
-
As their github page says, based on https://ffmpegwasm.netlify.app ...
I'm guessing no one did GPU-optimizations? I saw a web app (not an ffmpeg transpilation) that went clever and used WebGL so it can access the GPU and use its parallel processing capabilities...
-
-
Project mention: Show HN: Synced lyrics database with a free, easy-to-use API | news.ycombinator.com | 2024-02-23
I was always frustrated that there is no solid source for synced lyrics that also offers decent API support. There is good ol' Crintsoft's MiniLyrics that is thankfully free software, was what I used a lot in my childhood, but unfortunately the API is highly obfuscated. Another popular choice is the Musixmatch API, which has a very large database of synced lyrics, but with "free" API that are reverse-engineered from their app, you will quickly run into rate-limit.
That's why I created LRCLIB. It's aimed to provide completely free synchronized lyrics for everyone, especially for FOSS music players, with zero profit intention. It currently has nearly 3,000,000 (not deduplicated) lyrics in database. You can also contribute to the database by adding and syncing lyrics for your favorite songs using the LRCGET client.
I'm trying my best to make LRCLIB server-side code open-source as soon as possible. But right now, full LRCLIB's database dumps have already been uploaded regularly and publicly, which are simply sqlite3 files. Feel free to download, look at or do anything you want with the database at https://lrclib.net/db-dumps.
Many open-source projects have already begun integrating LRCLIB, including:
- beets - music library metadata management (https://github.com/beetbox/beets)
-
#5467 Fix 24-bit recording.
-
I have wondered what grooves it could come with using https://sonic-pi.net/
-
You should also check out AudioKit. The latest release should have an audio engine completely rewritten purely with Swift iirc.
-
OpenFrameworks
openFrameworks is a community-developed cross platform toolkit for creative coding in C++.
Not exactly VJ, but could be used for it. https://openframeworks.cc
-
mediamtx
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
Project mention: Bad audio quality when re-encoding RTSP stream from AAC to Opus in MediaMTX | /r/ffmpeg | 2023-12-09I started a discussion on the MediaMTX Github repo, where you can find more information on my problem: https://github.com/bluenviron/mediamtx/discussions/2787
-
Take a look at SFML. Very easy to get started, and quite capable.
-
Project mention: Free and privacy-friendly screen recorder with no limits | news.ycombinator.com | 2024-01-10
-
react-player
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
Project mention: A little side project I have been working on for the past couple of weekends. | /r/australia | 2023-12-10 -
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Audio related posts
- Apple Buys DarwinAI Ahead of Major Generative AI Updates Coming in iOS 18
- Kdenlive 24.02 open source video editor released
- Show HN: React Mixcloud Player – Stream DJ sets with dynamic visual effects
- Show HN: React Geiger – performance profiling using sound
- Show HN: Mixmotion Player – Stream Mixcloud music with dynamic visual effects
- Ask HN: Is There a Blender for Music?
- The KDE desktop gets an overhaul with Plasma 6
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 Mar 2024
Index
What are some of the best open-source Audio projects? This list will help you:
Project | Stars | |
---|---|---|
1 | FFmpeg | 41,479 |
2 | quivr | 29,928 |
3 | mpv | 25,478 |
4 | srs | 23,920 |
5 | howler.js | 22,772 |
6 | koel | 15,360 |
7 | awesome-python-applications | 15,227 |
8 | digital_video_introduction | 15,022 |
9 | BackgroundMusic | 14,893 |
10 | ultimatevocalremovergui | 14,096 |
11 | Kaldi Speech Recognition Toolkit | 13,579 |
12 | BlackHole | 13,065 |
13 | ffmpeg.wasm | 12,639 |
14 | Pion WebRTC | 12,517 |
15 | beets | 12,315 |
16 | audacity | 11,181 |
17 | Sonic Pi | 10,452 |
18 | AudioKit | 10,313 |
19 | OpenFrameworks | 9,746 |
20 | mediamtx | 9,528 |
21 | SFML | 9,524 |
22 | screenity | 9,263 |
23 | react-player | 8,712 |