Descript-audio-codec Alternatives

Similar projects and alternatives to descript-audio-codec based on common topics and language

opus

Mentions: 26 | Stars: 2,103 | Activity: 9.6 | Language: C

Modern audio compression for the internet.
lyra

Mentions: 18 | Stars: 3,720 | Activity: 0.0 | Language: C++

A Very Low-Bitrate Codec for Speech Compression
encodec

Mentions: 18 | Stars: 3,172 | Activity: 3.9 | Language: Python

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
pytorch-CycleGAN-and-pix2pix

Mentions: 10 | Stars: 21,998 | Activity: 2.8 | Language: Python

Image-to-Image Translation in PyTorch
edge-connect

Mentions: 1 | Stars: 2,474 | Activity: 4.1 | Language: Python

EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Anime2Sketch

Mentions: 7 | Stars: 1,888 | Activity: 3.4 | Language: Python

A sketch extractor for anime/illustration.
anycost-gan

Mentions: 1 | Stars: 769 | Activity: 2.5 | Language: Python

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better descript-audio-codec alternative or higher similarity.


descript-audio-codec reviews and mentions

Posts with mentions or reviews of descript-audio-codec. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-08.

Show HN: Sonauto – a more controllable AI music creator
1 project | news.ycombinator.com | 10 Apr 2024

Hey HN,
My cofounder (four months ago, classmate) and I trained an AI music generation model and after a month of testing we're launching 1.0 today. Ours is interesting because it's a latent diffusion model instead of a language model, which makes it more controllable: https://sonauto.ai/
Others do music generation by training a Vector Quantized Variational Autoencoder like Descript Audio Codec (https://github.com/descriptinc/descript-audio-codec) to turn music into tokens, then training an LLM on those tokens. Instead, we ripped the tokenization part off and replaced it with a normal variational autoencoder bottleneck (along with some other important changes to enable insane compression ratios). This gave us a nice, normally distributed latent space on which to train a diffusion transformer (like Sora). Our diffusion model is also particularly interesting because it is the first audio diffusion model to generate coherent lyrics!
We like diffusion models for music generation because they have some interesting properties that make controlling them easier (so you can make your own music instead of just taking what the machine gives you). For example, we have a rhythm control mode where you can upload your own percussion line or set a BPM. Very soon you'll also be able to generate proper variations of an uploaded or previously generated song (e.g., you could even sing into Voice Memos for a minute and upload that!). @Musicians of HN, try uploading your songs and using Rhythm Control/let us know what you think! Our goal is to enable more of you, not replace you.
For example, we turned this drum line (https://sonauto.ai/songs/uoTKycBghUBv7wA2YfNz) into this full song (https://sonauto.ai/songs/KSK7WM1PJuz1euhq6lS7, skip to 1:05 if impatient) or this other song I like better (https://sonauto.ai/songs/qkn3KYv0ICT9kjWTmins, though we accidentally compressed it with AAC instead of Opus, which hurt quality).
We also like diffusion models because while they're expensive to train, they're cheap to serve. We built our own efficient inference infrastructure instead of using those expensive inference-as-a-service startups that are all the rage. That's why we're making generations on our site FREE and UNLIMITED for as long as possible.
We'd love to answer your questions. Let us know what you think of our first model! https://sonauto.ai/
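
The post above contrasts a vector-quantized bottleneck (latents become discrete tokens for a language model) with a plain variational bottleneck (latents stay continuous and roughly normally distributed, which suits a diffusion model). The toy sketch below shows only that contrast. It is not Sonauto's or DAC's implementation, and the function names, codebook, and values are hypothetical and chosen purely for illustration.

```python
import math
import random


def quantize(vector, codebook):
    """Vector-quantized bottleneck: snap a latent to its nearest codebook entry.

    Returns (token_index, quantized_vector). The index is the discrete token
    that a language model could be trained on.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    index = min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))
    return index, list(codebook[index])


def gaussian_bottleneck(mean, log_var, rng):
    """Plain VAE bottleneck: sample z = mean + sigma * eps with eps ~ N(0, 1)."""
    return [
        m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
        for m, lv in zip(mean, log_var)
    ]


# Hypothetical toy values, for illustration only.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
latent = [0.9, 0.8]

# Discrete path: the latent collapses to one of three possible tokens.
token, snapped = quantize(latent, codebook)

# Continuous path: the latent stays a real-valued vector near its mean.
rng = random.Random(0)
z = gaussian_bottleneck(mean=latent, log_var=[-2.0, -2.0], rng=rng)
```

In the discrete path, everything downstream sees only the token index, so the detail lost by snapping to the codebook cannot be recovered. In the continuous path, the sample keeps real-valued coordinates, which is the kind of latent space the post describes training a diffusion transformer on.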
TSAC: Low Bitrate Audio Compression
4 projects | news.ycombinator.com | 8 Apr 2024

Another useful model to compare to would be DAC https://github.com/descriptinc/descript-audio-codec
This is the codec that TSAC extended, so it would be a useful comparison. I'd also echo the Vocos suggestion (from a sibling comment): it operates on the same Encodec tokens but generally has better reconstruction quality.