encodec
audiolm-pytorch
encodec | audiolm-pytorch | |
---|---|---|
18 | 4 | |
3,185 | 2,249 | |
2.0% | - | |
3.9 | 9.0 | |
4 months ago | 3 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
encodec
-
TSAC: Low Bitrate Audio Compression
Since Ballard's codec is "AI" based, can you add google's lyrav2 ( https://github.com/google/lyra ) and Facebook's/meta EnCodec ( https://github.com/facebookresearch/encodec ).
Also I don't seem to be able to access your page, so there might be error.
Finally, when doing opus comparison it's good now to denote if it is using Lace or NoLace decoder post processing filters that became available in opus 1.5 (note, this feature need to be enabled at compile time, and defying decode a new API call needs to be made to force higher complexity decoder) . See https://opus-codec.org/demo/opus-1.5/
-
[R] Neural network for audio training sample size
But models rarely work on raw audio. You can also check EnCodec (https://github.com/facebookresearch/encodec) or SoundStream.
- Bark: A transformer based text to audio system
-
[D]: Is voice cloning or natural TTS (like Elevenlabs) possible due to LLMs?
VALL-E from Microsoft is transformer over Encodec code. SPEAR-TTS from Google is basically AudioLM for TTS.
- Why hasn't Meta made LLaMA open source?
- ML Codecs Similar to Encodec by Facebook?
- Wie Österreich die Glasfaser verschlief - ORF Topos
- EnCodec: State-of-the-art deep learning based audio codec
- High Fidelity Neural Audio Compression
- EnCodec: High Fidelity Neural Audio Compression
audiolm-pytorch
-
Bark: A transformer based text to audio system
It’s mostly there in https://github.com/lucidrains/audiolm-pytorch#hierarchical-t....
- FLiPN-FLaNK Stack Weekly 27Feb2023
-
Implementation of Google's MusicLM in PyTorch
This one is AudioLM modified from here https://github.com/lucidrains/audiolm-pytorch repository to support the music generation needs here.
-
Microsoft’s new text-to-speech model can duplicate anyone's voice in 3 seconds
There is an open source implementation of these features in Pytorch:
https://github.com/lucidrains/audiolm-pytorch
What are some alternatives?
bark - 🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
bark - 🔊 Text-Prompted Generative Audio Model
FlexGen - Running large language models on a single GPU for throughput-oriented scenarios.
bark-with-voice-clone - 🔊 Text-prompted Generative Audio Model - With the ability to clone voices
highlight - highlight.io: The open source, full-stack monitoring platform. Error monitoring, session replay, logging, distributed tracing, and more.
musiclm-pytorch - Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
iTransformer - Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group
jxc - JXC is a structured data language similar to JSON, but with a focus on being expressive, extensible, and human-friendly.
yobulkdev - 🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative
tortoise-tt
marp-cli - A CLI interface for Marp and Marpit based converters