lyra
jukebox
Metric | lyra | jukebox
---|---|---
Mentions | 18 | 129
Stars | 3,720 | 7,554
Growth | 0.9% | 1.7%
Activity | 0.0 | 0.0
Last commit | over 1 year ago | about 2 months ago
Language | C++ | Python
License | Apache License 2.0 | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lyra
-
TSAC: Low Bitrate Audio Compression
Since Bellard's codec is "AI" based, can you add Google's Lyra V2 ( https://github.com/google/lyra ) and Meta's (Facebook's) EnCodec ( https://github.com/facebookresearch/encodec )?
Also, I don't seem to be able to access your page, so there might be an error.
Finally, when doing an Opus comparison, it's now worth noting whether it uses the LACE or NoLACE decoder post-processing filters that became available in Opus 1.5 (note: this feature needs to be enabled at compile time, and during decoding a new API call needs to be made to force a higher-complexity decoder). See https://opus-codec.org/demo/opus-1.5/
-
Opus Databending Drumkit
I've thought about doing something similar for Google's voice compression codec, Lyra: https://github.com/google/lyra
-
Is it safe to say AV1 for video and OPUS for audio are best codecs respectively?
Edit: it seems Lyra is open source: https://github.com/google/lyra
-
New Release of Audio Codec "Lyra" 1.3 (43% smaller and 20% faster)
1) https://github.com/google/lyra/releases/tag/v1.3.0
- Release Lyra 1.3.0 · google/lyra - By performing arithmetic operations in 8-bit integers instead of 32-bit floats, the new model is 43% smaller (TFLite model size) and 20% faster
- Using AI to compress audio files for quick and easy sharing
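The release note above attributes the size and speed gains to doing arithmetic in 8-bit integers instead of 32-bit floats. As a rough illustration of the general idea only, here is a minimal sketch of symmetric int8 quantization in plain Python. It is not Lyra's or TFLite's actual scheme: the weights are random toy values, and `quantize_int8` and `dequantize` are hypothetical helper names.

```python
import array
import random


def quantize_int8(weights):
    """Symmetric quantization: one shared scale maps floats onto [-127, 127]."""
    max_abs = max(abs(w) for w in weights) or 1.0
    scale = max_abs / 127.0
    quantized = array.array("b", (round(w / scale) for w in weights))
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the int8 codes."""
    return [q * scale for q in quantized]


random.seed(0)
weights = [random.uniform(-1.0, 1.0) for _ in range(1000)]  # made-up toy weights

# Storage cost: 4 bytes per float32 value versus 1 byte per int8 code.
float32_bytes = len(weights) * array.array("f").itemsize
quantized, scale = quantize_int8(weights)
int8_bytes = len(quantized) * quantized.itemsize

# Rounding to the nearest code loses at most half a quantization step.
restored = dequantize(quantized, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))

print(f"float32: {float32_bytes} bytes, int8: {int8_bytes} bytes")
print(f"largest rounding error: {max_error:.5f} (scale = {scale:.5f})")
```

In this toy case the raw weights shrink fourfold, which is larger than the 43% quoted for the whole TFLite model file; the sketch only shows where savings of this kind come from, not how the released model was produced.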
-
Lyra V2 – a better, faster, and more versatile speech codec
Very impressive.
It'd be interesting to see what the lift would be to get encoding & decoding running in WebAssembly (wasm). Further, it'd be really neat to try to take something like the tflite_model_wrapper[1] and get it backed by something like tfjs-tflite[2], perhaps even atop, for example, tfjs-backend-webgpu[3].
In the longer run, the web-nn[4] spec should hopefully simplify or bake some of these libraries into the web platform, making running inference much easier. But there's still an interesting challenge and question that I'm not sure how to tackle: how to take native code and compile it to wasm, but have some of the implementation provided elsewhere.
[1] https://github.com/google/lyra/pull/89/files#diff-ed2f131a63...
[2] https://www.npmjs.com/package/@tensorflow/tfjs-tflite
[3] https://www.npmjs.com/package/@tensorflow/tfjs-backend-webgp...
-
Lyra 1.2.0 released with 5x speed improvement, higher quality speech, selectable bitrate (3.2, 6.0 and 9.2 kb/s), lower latency and Mac and Windows support
You can find an Android, Linux and macOS app here: https://github.com/google/lyra/actions/runs/3156735950
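To put the selectable bitrates from the release title (3.2, 6.0 and 9.2 kb/s) in perspective, the arithmetic below converts each one into a payload size per packet. The 20 ms packet duration is an assumption made for this sketch, not something stated in the post, and `packet_bytes` is a hypothetical helper name.

```python
FRAME_MS = 20  # assumption: one packet carries 20 ms of audio


def packet_bytes(bitrate_bps, frame_ms=FRAME_MS):
    """Payload bytes per packet at a constant bitrate."""
    bits = bitrate_bps * frame_ms // 1000
    return bits // 8


for bitrate_bps in (3200, 6000, 9200):
    size = packet_bytes(bitrate_bps)
    print(f"{bitrate_bps / 1000:.1f} kb/s -> {size} bytes per {FRAME_MS} ms packet")
```

Under that assumption each packet is only a couple of dozen bytes at most, which is why codecs in this bitrate range are attractive for very constrained links.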
-
(Noob): Can Signal implement Lyra-Codec (developed by Google) for better audio quality?
Here's the repository: https://github.com/google/lyra and it's licensed under Apache.
- Lyra 0.0.2 · The main improvement is the open-source release of the sparse_matmul library code, which was co-developed by Google and DeepMind. There are no more pre-compiled .so dynamic library binaries and no more restrictions on which toolchain to use, which opens the door to porting to different platforms
jukebox
-
Open Source Libraries
openai/jukebox: Music Generation
- Will AI be able to create similar sounding music based off input?
-
Best model for music generation?
https://github.com/openai/jukebox The demo code is there.
-
Why didn't OpenAI MIT license Jukebox the same way they did CLIP?
I didn't even know about it until I heard Sam Altman casually mention it in an interview. I was expecting some basic tune generator, but this is so amazing! I mean, yeah, the voices are not clear and sound muffled, but look at how far image models have progressed. If you applied the same amount of collaborative effort here, the results could be amazing! ElevenLabs showed how good and clear AI-created voices can sound. The only reason I can think of is that the Jukebox code is under a view-only license.
-
[R] [N] Noise2Music - Diffusion models for generating high quality music audio from text prompts, by Google Research
OpenAI had this figured out 3 years ago: https://openai.com/blog/jukebox/ . You could then even define your own text. Model is open source too.
-
Is music next?
They've had Jukebox for a few years now, so I'm sure some new model will get released and explode overnight, like ChatGPT did.
-
Mongolian Gabba Goat Techno
That already exists
- OpenAI's continued success: and how they came to create the most advanced AI of 2023, ChatGPT.
-
Implementation of Google's MusicLM in PyTorch
This model is designed to output raw audio.
However, there are many models that do output MIDI. That's actually much simpler, and was already done a few years ago.
I thought OpenAI did this. But then, I might be misremembering, because their Jukebox actually also seems to produce raw audio (https://openai.com/blog/jukebox/).
However, MIDI generation is so easy that you can even find it in some tutorials: https://www.tensorflow.org/tutorials/audio/music_generation
- FREE AI THINGS
What are some alternatives?
codec2 - Open source speech codec designed for communications quality speech between 700 and 3200 bit/s. The main application is low bandwidth HF/VHF digital radio.
lucid-sonic-dreams
ESP32_Codec2 - Codec2 library for ESP32 (Arduino)
ultimatevocalremovergui - GUI for a Vocal Remover that uses Deep Neural Networks.
minisearch - Tiny and powerful JavaScript full-text search engine for browser and Node
spleeter - Deezer source separation library including pretrained models.
Bazel - a fast, scalable, multi-language and extensible build system
music-demixing-challenge-starter-kit - Starter kit for getting started in the Music Demixing Challenge.
elasticsearch-py - Official Python client for Elasticsearch
dalle-mini - DALL·E Mini - Generate images from a text prompt
regex-benchmark - It's just a simple regex benchmark of different programming languages.
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models