Audio-webui Alternatives

Similar projects and alternatives to audio-webui

whisper

344 60,303 6.4 Python audio-webui VS whisper

Robust Speech Recognition via Large-Scale Weak Supervision
TTS

231 29,420 9.4 Python audio-webui VS TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
whisper.cpp

187 31,174 9.8 C audio-webui VS whisper.cpp

Port of OpenAI's Whisper model in C/C++
koboldcpp

180 3,817 10.0 C++ audio-webui VS koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
tortoise-tts

144 11,755 8.2 Jupyter Notebook audio-webui VS tortoise-tts

A multi-voice TTS system trained with an emphasis on quality
SillyTavern

76 5,930 10.0 JavaScript audio-webui VS SillyTavern

LLM Frontend for Power Users.
bark

67 32,668 5.4 Jupyter Notebook audio-webui VS bark

🔊 Text-Prompted Generative Audio Model
Retrieval-based-Voice-Conversion-WebUI

56 19,077 9.6 Python audio-webui VS Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!
audiocraft

37 19,649 8.3 Python audio-webui VS audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
piper

33 4,075 8.6 C++ audio-webui VS piper

A fast, local neural text to speech system (by rhasspy)
pedalboard

24 4,846 8.3 C++ audio-webui VS pedalboard

🎛 🔊 A Python library for audio.
faster-whisper

23 8,899 8.1 Python audio-webui VS faster-whisper

Faster Whisper transcription with CTranslate2
bark-with-voice-clone

19 2,818 7.5 Python audio-webui VS bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices
DeepFilterNet

10 1,933 8.9 Python audio-webui VS DeepFilterNet

Noise suppression using deep filtering
easydiffusion

16 9,116 9.4 JavaScript audio-webui VS easydiffusion

Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
basic-pitch

8 2,925 8.4 Python audio-webui VS basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
audiocraft_plus

1 455 8.9 Python audio-webui VS audiocraft_plus

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
bark-voice-cloning-HuBERT-quantizer

2 595 7.2 Python audio-webui VS bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.
wenet

5 3,691 9.6 Python audio-webui VS wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit
LLamaSharp

3 1,917 9.8 C# audio-webui VS LLamaSharp

A C#/.NET library to run LLM models (🦙LLaMA/LLaVA) on your local device efficiently.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better audio-webui alternative or higher similarity.

audio-webui reviews and mentions

Posts with mentions or reviews of audio-webui. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-02.

Sub for AI voice models
1 project | /r/findareddit | 8 Dec 2023

I mean, just use gitmylo's repo.
What are some good tools for text2audio that I can run locally?
1 project | /r/LocalLLaMA | 6 Dec 2023

For pure voice and not autogeneration from the LLM you have stuff like: https://github.com/gitmylo/audio-webui
Open Source Libraries
25 projects | /r/AudioAI | 2 Oct 2023

gitmylo/audio-webui
Dedicated Riffusion Gradio training interface?
2 projects | /r/riffusion | 20 Aug 2023

I was wondering if there might be some way to incorporate Riffusion and its various capabilities into this platform? I have made multiple attempts on my local server to combine the Automatic1111 SD-Web-UI extensions and such into the Audiocraft_Plus (https://github.com/GrandaddyShmax/audiocraft_plus) and Audio Web (https://github.com/gitmylo/audio-webui) UIs, but truth be told I am a total beginner and keep coming up short!
Any local voice models?
2 projects | /r/ArtificialInteligence | 10 Jul 2023

audio-webui is the stable diffusion of txt 2 speech stuff but don't expect high quality voice replication for a while. https://github.com/gitmylo/audio-webui
Best Tool for creating an AI celebrity voice clone?
1 project | /r/singularity | 9 Jul 2023

You can try Audio-Webui if you're technically savvy. There are some voice cloning workflows as well as RVC, voice conversion.
Are there any AI resources to help create audiobooks from text to speech?
5 projects | /r/artificial | 9 Jul 2023

Have not tested but it looks like the audio-webui repo is ready for long texts (just click the COLAB link to test it). I would test it and then go tortoise if the quality is not as needed.
I found a youtube tutorial voiceover made by AI, and I'm blown away by its quality. Can you help me figure out which tool did the author use?
3 projects | /r/artificial | 8 Jul 2023

This is the best open source voice cloning. Super easy to install also.
How to change your voice to someone else’s for a song? What are the best ways being used right now?
2 projects | /r/artificial | 6 Jul 2023

People use https://github.com/gitmylo/audio-webui and https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI for that. Check out this tutorial: https://www.youtube.com/watch?v=-JcvdDErkAU With these tools it's possible to separate the voice from music or background noise and recombine them, either together or with other songs. It's amazing and fun.
What would be the Stable Diffusion equivalent, for AI music generation?
2 projects | /r/StableDiffusion | 4 Jul 2023

Check this out : https://github.com/gitmylo/audio-webui/wiki/Features