basic-pitch
audio-webui
basic-pitch | audio-webui | |
---|---|---|
8 | 15 | |
2,941 | 902 | |
2.9% | - | |
8.4 | 9.0 | |
4 days ago | 20 days ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
basic-pitch
-
Open Source Libraries
spotify/basic-pitch: Audio to midi converter
-
Mac users: is it best to just rent a linux server?
I did just get it to work setting an alias for Python pointing to 3.11 but ran into this issue: https://github.com/spotify/basic-pitch/issues/63
-
Transcribing music from audio?
There's https://github.com/spotify/basic-pitch, a free converter, but require CLI usage.
-
Recommended python library for converting audio file into midi ?
Might be worthwhile checking out Spotify’s basic pitch library.
-
How to make a sub bass follow a bit from a mudpie?
quick add on: definitely take a look at this https://github.com/spotify/basic-pitch its a much much better offline pitch detection algo that you could use to get the midi for your mudpie. I'd probably bounce a low passed copy at around 150-200Hz, making sure its completley mono and then feed that wav/aiff file to that pitch detector
-
Spotify Research Open-Sources ‘Basic Pitch’: A Machine Learning Tool For Converting Audio Into MIDI
Continue reading | Check out the paper, github, project and post
-
Spotify Introduces an Open-Source Tool to Fix a Big Problem for Modern Musicians - It's FOSS News
GitHub link
audio-webui
-
Sub for AI voice models
I mean, just use gitmylo's repo.
-
What are some good tools for text2audio that I can run locally?
For pure voice and not autogeneration from the LLM you have stuff like: https://github.com/gitmylo/audio-webui
-
Open Source Libraries
gitmylo/audio-webui
-
Dedicated Riffusion Gradio training interface?
I was wondering if there might be some way to incorporate Riffusion and it's various capabilities into this platform? Multiple attempts have been made by me on my local server to combine the Automatic111 SD-Web-UI extensions and such into the Audiocraft_Plus (https://github.com/GrandaddyShmax/audiocraft_plus) and Audio Web (https://github.com/gitmylo/audio-webui) Ui's platform, but truth be told I am a total beginner and keep coming up short!
-
Any local voice models?
audio-webui is the stable diffusion of txt 2 speech stuff but don't expect high quality voice replication for a while. https://github.com/gitmylo/audio-webui
-
Best Tool for creating an AI celebrity voice clone?
You can try Audio-Webui if you're technically savvy. There are some voice cloning workflows as well as RVC, voice conversion.
-
Are there any AI resources to help create audiobooks from text to speech?
Have not tested but it looks like the audio-webui repo is ready for long texts (just click the COLAB link to test it). I would test it and then go tortoise if the quality is not as needed.
-
I found a youtube tutorial voiceover made by AI, and I'm blown away by its quality. Can you help me figure out which tool did the author use?
This is the best open source voice cloning. Super easy to install also.
-
How to change your voice to someone else’s for a song? What are the best ways being used right now?
People use https://github.com/gitmylo/audio-webui and https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI for that Check out this tutorial : https://www.youtube.com/watch?v=-JcvdDErkAU It's possible to separate music or background noises from voice with these tech and recombine them together or with other songs, it's amazing and fun.
-
What would be the Stable Diffusion equivalent, for AI music generation?
Check this out : https://github.com/gitmylo/audio-webui/wiki/Features
What are some alternatives?
ai-music - A vanilla Trasformer Decoder music generation model trained on Final Fantasy OST MIDI songs
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
THIRTY-DOLLAR-HAIRCUT-GENERATOR - 30 dollar haircut website MIDI converter - Using MIDIs, QUICKLY generate a chart for the "DON'T YOU LECTURE ME WITH YOUR THIRTY DOLLAR HAIRCUT" website. The site's by GDcolon, if you need to search it up.
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
gitpod - The developer platform for on-demand cloud development environments to create software faster and more securely.
audiocraft_plus - Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
concordia - Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov
DeepFilterNet - Noise supression using deep filtering
PiDTLN - Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
bark - 🔊 Text-Prompted Generative Audio Model
torchlambda - Lightweight tool to deploy PyTorch models to AWS Lambda
Retrieval-based-Voice-Conversion-WebUI - Easily train a good VC model with voice data <= 10 mins!