SaaSHub helps you find the best software and product alternatives Learn more β
Audio-webui Alternatives
Similar projects and alternatives to audio-webui
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
-
-
-
-
-
-
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
-
-
-
-
-
easydiffusion
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
-
-
-
-
bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
-
audio-webui discussion
audio-webui reviews and mentions
-
Sub for AI voice models
I mean, just use gitmylo's repo.
-
What are some good tools for text2audio that I can run locally?
For pure voice and not autogeneration from the LLM you have stuff like: https://github.com/gitmylo/audio-webui
-
Open Source Libraries
gitmylo/audio-webui
-
Dedicated Riffusion Gradio training interface?
I was wondering if there might be some way to incorporate Riffusion and it's various capabilities into this platform? Multiple attempts have been made by me on my local server to combine the Automatic111 SD-Web-UI extensions and such into the Audiocraft_Plus (https://github.com/GrandaddyShmax/audiocraft_plus) and Audio Web (https://github.com/gitmylo/audio-webui) Ui's platform, but truth be told I am a total beginner and keep coming up short!
-
Any local voice models?
audio-webui is the stable diffusion of txt 2 speech stuff but don't expect high quality voice replication for a while. https://github.com/gitmylo/audio-webui
-
Best Tool for creating an AI celebrity voice clone?
You can try Audio-Webui if you're technically savvy. There are some voice cloning workflows as well as RVC, voice conversion.
-
Are there any AI resources to help create audiobooks from text to speech?
Have not tested but it looks like the audio-webui repo is ready for long texts (just click the COLAB link to test it). I would test it and then go tortoise if the quality is not as needed.
-
I found a youtube tutorial voiceover made by AI, and I'm blown away by its quality. Can you help me figure out which tool did the author use?
This is the best open source voice cloning. Super easy to install also.
-
How to change your voice to someone elseβs for a song? What are the best ways being used right now?
People use https://github.com/gitmylo/audio-webui and https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI for that Check out this tutorial : https://www.youtube.com/watch?v=-JcvdDErkAU It's possible to separate music or background noises from voice with these tech and recombine them together or with other songs, it's amazing and fun.
-
What would be the Stable Diffusion equivalent, for AI music generation?
Check this out : https://github.com/gitmylo/audio-webui/wiki/Features
-
A note from our sponsor - SaaSHub
www.saashub.com | 14 Jun 2026
Stats
gitmylo/audio-webui is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of audio-webui is Python.
Popular Comparisons
- audio-webui VS SillyTavern
- audio-webui VS DeepFilterNet
- audio-webui VS TTS
- audio-webui VS Retrieval-based-Voice-Conversion-WebUI
- audio-webui VS tortoise-tts
- audio-webui VS bark
- audio-webui VS easydiffusion
- audio-webui VS fish-diffusion
- audio-webui VS bark-voice-cloning-HuBERT-quantizer
- audio-webui VS bark-with-voice-clone