bark-with-voice-clone
bark
bark-with-voice-clone | bark | |
---|---|---|
19 | 9 | |
2,864 | 960 | |
3.3% | - | |
3.9 | 8.7 | |
5 days ago | 7 months ago | |
Python | Jupyter Notebook | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bark-with-voice-clone
-
I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now.
And more stuff Iβm often checking back on: - https://github.com/staghado/vit.cpp - https://github.com/serp-ai/bark-with-voice-clone - https://github.com/leejet/stable-diffusion.cpp (generate images) - etc β¦ thereβs too much fun stuff out there. Wish I had more free time haha.
-
Any local voice models?
Check out the Bark model: serp-ai/bark-with-voice-clone: π Text-prompted Generative Audio Model - With the ability to clone voices (github.com)
-
Are there any AI resources to help create audiobooks from text to speech?
You can run a fork of bark locally https://serp.ai/tools/bark-text-to-speech-ai-voice-clone-app/
-
How to install Bark with voice cloning locally?
I want to install bark with voice cloning locally and I do not find any installation help with this. Can someone please provide step by step instructions for this? I tried the collab but that is very limited, and I can't get that to work properly, and it'd be way easier if I just set it up locally. I do not know of any scripts for local installs either so if someone could point me to a python script to use a custom voice with bark and run the TTS that'd be great too.
-
Descript alternative for voice cloning a video game character?
It's beyond my ability but apparently there's this: https://github.com/serp-ai/bark-with-voice-clone I'm forced to wait for a more user friendly alternative.
- Easiest text to audio solution?
- Bark: A transformer based text to audio system
- [R] Bark: Real-time Open-Source Text-to-Audio Rivaling ElevenLabs
- Bark: Real-time Open-Source Text-to-Audio Rivaling ElevenLabs
-
This is surreal: ElevenLabs AI can now clone the voice of someone that speaks English (BBC's David Attenborough in this case) and let them say things in a language, they don't speak, like German.
This fork with voice cloning unlocked
bark
-
To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]
So I looked around and decided to use Bark Infinity. (Originally wanted to use Amazon Polly, but don't have a credit card) I tried around and found out that the female storyteller voice sounds quite decently. So I used that and a reference clip of Myne's voice as prompt (which I think might have helped a little... I don't get all that program's features) to generate a whole chapter. That worked quite well.
- Free/Affordable Text to Speech AI?
- Local and open-source equivalent to HeyGen Text-to-Speech (TTS) AI?
-
Whispers of Frostcliff Lodge
AI-generated voice. I'll have to try Bark Infinity and Speechify.
-
Bark: A transformer based text to audio system
I'll link my Bark fork with long audio generation and other features on the root thread, I suppose: https://github.com/JonathanFly/bark
There's going to be a big update this week with some new stuff I haven't talked about. And a bunch of amazing, clear voices, with a huge variety of styles, that blow the default Suno voices out of the water.
Don't get too attached though. I was just playing around and made a Bark fork and it got more popular than expected. But I wasn't thinking about the hours of unpaid support and maintenance in my future that I definitely can NOT afford, for software I don't even really have a personal use case for. I'm not generating my own audiobooks or anything, I wonβt be using it long term myself, I was just curious what Bark could do. (Turns out a LOT more than you might think at first glance, as you'll see this week.) So I'm trying to work out how I can elegantly wind this down and transition people somewhere else. But I'll keep it updated for at least a little while.
- Converting a Subreddit into a Podcast with GPT-4
- Ask a Text-To-Speech AI (Bark) to say "Why was six afraid of seven?" but ignore the "I'm done" token and force it to just keep talking.
- [R] πΆ Bark - Text2Speech...But with Custom Voice Cloning using your own audio/text samples ποΈπ
What are some alternatives?
bark - π Text-Prompted Generative Audio Model
encodec - State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
bark_tts - Oobabooga extension for Bark TTS
bark-voice-cloning-HuBERT-quantizer - The code for the bark-voicecloning model. Training and inference.
crowdcast - Converts a subreddit into a podcast
bark-gui - π Text-Prompted Generative Audio Model with Gradio
audiolm-pytorch - Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
audio-webui - A webui for different audio related Neural Networks
TTS - :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
chatgpt-voice-assistant - A voice assistant powered by OpenAI's ChatGPT language model, currently available in six languages.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple