Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
In my first post, quite a lot of alternatives were discussed: https://news.ycombinator.com/item?id=36707877
The model I'm using is called Open-Unmix (https://github.com/sigsep/open-unmix-pytorch). In 2021, there was an update to Open-Unmix to include new weights, UMX-L, which made it perform better than it used to on the older weights (UMXHQ).
In the grand landscape of music demixing, I don't think UMX-L is near the top anymore.
_However_, the demixing performance of freemusicdemixer.com is very close to the full PyTorch performance of Open-Unmix UMX-L, despite the tricks I needed to get it working in the browser, such as splitting up the inference to operate on segments of the song, or making the LSTM operate on streaming segments rather than holding the entire track in the LSTM memory.
In my first release, I loaded and did inference on the entire track at once (like the PyTorch model), which frequently crashed or exceeded the 4GB WASM memory for medium or large-size tracks.
For those interested, Facebook's Demucs page (https://github.com/facebookresearch/demucs) gives performance comparison for several models including open-unmix.
See also: https://www.stemroller.com This runs as a local app on Windows and Mac.
For those interested, Facebook's Demucs page (https://github.com/facebookresearch/demucs) gives performance comparison for several models including open-unmix.
See also: https://www.stemroller.com This runs as a local app on Windows and Mac.
Related posts
- are Acapellas hard to find?
- Is there an AI that can seperate vocal tracks from a song that have multiple people singing at once?
- StemRoller – Isolate vocals, drums, bass, and other stems from any song (FOSS)
- Free-music-demixer adds multi-threading to run Demucs faster in the browser
- Best way to extract a vocal stem from a song