chatgpt-voice-assistant
bark-with-voice-clone
chatgpt-voice-assistant | bark-with-voice-clone | |
---|---|---|
6 | 19 | |
60 | 2,850 | |
- | 3.3% | |
1.7 | 7.5 | |
10 months ago | 6 months ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chatgpt-voice-assistant
- Easiest text to audio solution?
-
Just added voice support to ChatGPT to practice my english!
It is trivial to change the speech recognizer's input to Japanese. Simply change config.toml line 12 to lang = 'ja'. You may have to do additional work to get this to function properly, but Google speech recognition already caters to the international userbase.
Check it out at chatgpt-voice-assistant and feel free to contribute (tested on Ubuntu 18 only).
- ChatGPT com TTS e STT para ajudar a praticar o Inglรชs!
- Just added voice support to ChatGPT to practice my english
bark-with-voice-clone
-
I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now.
And more stuff Iโm often checking back on: - https://github.com/staghado/vit.cpp - https://github.com/serp-ai/bark-with-voice-clone - https://github.com/leejet/stable-diffusion.cpp (generate images) - etc โฆ thereโs too much fun stuff out there. Wish I had more free time haha.
-
Any local voice models?
Check out the Bark model: serp-ai/bark-with-voice-clone: ๐ Text-prompted Generative Audio Model - With the ability to clone voices (github.com)
-
Are there any AI resources to help create audiobooks from text to speech?
You can run a fork of bark locally https://serp.ai/tools/bark-text-to-speech-ai-voice-clone-app/
-
How to install Bark with voice cloning locally?
I want to install bark with voice cloning locally and I do not find any installation help with this. Can someone please provide step by step instructions for this? I tried the collab but that is very limited, and I can't get that to work properly, and it'd be way easier if I just set it up locally. I do not know of any scripts for local installs either so if someone could point me to a python script to use a custom voice with bark and run the TTS that'd be great too.
-
Descript alternative for voice cloning a video game character?
It's beyond my ability but apparently there's this: https://github.com/serp-ai/bark-with-voice-clone I'm forced to wait for a more user friendly alternative.
- Easiest text to audio solution?
- Bark: A transformer based text to audio system
- [R] Bark: Real-time Open-Source Text-to-Audio Rivaling ElevenLabs
- Bark: Real-time Open-Source Text-to-Audio Rivaling ElevenLabs
-
This is surreal: ElevenLabs AI can now clone the voice of someone that speaks English (BBC's David Attenborough in this case) and let them say things in a language, they don't speak, like German.
This fork with voice cloning unlocked
What are some alternatives?
Aetherius_AI_Assistant - A completely private, locally-operated Ai Assistant/Chatbot/Sub-Agent Framework with realistic Long Term Memory and thought formation using Open Source LLMs. Qdrant is used for the Vector DB.
bark - ๐ Text-Prompted Generative Audio Model
valheim-ai-assistant - Ask AI (Google Bard) questions what to make next in Valheim, by parsing your save data to Bard
bark_tts - Oobabooga extension for Bark TTS
GPT-agents - Browsing-enabled GPT agents with different personalities.
bark-voice-cloning-HuBERT-quantizer - The code for the bark-voicecloning model. Training and inference.
Talk2GPT - GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language. Includes a free text2image
bark-gui - ๐ Text-Prompted Generative Audio Model with Gradio
wunjo.wladradchenko.ru - Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
bark - ๐ BARK INFINITY GUI CMD ๐ถ Powered Up Bark Text-prompted Generative Audio Model
audio-webui - A webui for different audio related Neural Networks
encodec - State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.