| | crowdcast | bark |
|---|---|---|
| | 74 | 9 |
| Stars | 101 | 968 |
| Growth | - | - |
| Activity | 6.1 | 8.7 |
| Last commit | about 1 year ago | 8 months ago |
| Language | Python | Jupyter Notebook |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
crowdcast
- Suggestions for exercises/projects programmers can do in their spare time to stay up to date with GPT's capabilities?
  Try making a podcast - https://github.com/AdmTal/crowdcast
- Podcast made by AI - combination of ChatGPT and ElevenLabs
  I recently abandoned a similar project, feel free to steal code if you want - https://github.com/AdmTal/crowdcast
- Prompts for building a weekly, interactive Podcast
- 100% open source, AI Generated Podcast -- Do y'all have any feedback on these prompts?
  All of the code is here, but I'm going to give some quick links to the prompts below.
- Interactive, AI Generated Podcast -- Python Open Source
- Show HN: Crowdcast – An Open-Source, AI-Powered Interactive Podcast [4 Episodes]
- Episode #4! - Pyramid Schemes, How history repeats itself, and top 5 negotiation tips from Pawn Stars
  As always, the script is on Github - if you're more of a reading type of person 👀
- Shout out to ahonnecke on GitHub for being the first listener to contribute to the codebase!!!
- Interactive, AI Generated Podcast
  Otherwise, it’s pretty simple, I have a script that pulls the top three comments every week, and I run them through a series of prompts in order to build a full podcast script. Then I use Eleven Labs to generate the voice. I stitch it all together with some music using Python, and then publish it using Buzzsprout. I’m currently using GPT-4 to run the prompts.
- Generating a Podcast using Reddit Comments, AI, and Python
  Of course, all of the code is open source if you’re into that kind of thing - feel free to open a PR, would love to collaborate and get feedback.
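
The weekly workflow described in the "Interactive, AI Generated Podcast" post above (pull the top three comments, run them through a series of prompts, then generate the voice) could be sketched roughly as follows. This is only an illustrative outline, not crowdcast's actual code: the `Comment` fields, the prompt wording, and the function names are all hypothetical, and the GPT-4 and Eleven Labs calls are replaced by injected placeholder callables. The music stitching and Buzzsprout publishing steps are left out.

```python
from dataclasses import dataclass


@dataclass
class Comment:
    """A listener comment. Field names are hypothetical, not crowdcast's schema."""
    author: str
    body: str
    score: int


def top_comments(comments, n=3):
    """Return the n highest-scoring comments, best first."""
    return sorted(comments, key=lambda c: c.score, reverse=True)[:n]


def build_prompts(comments, n=3):
    """Build one segment prompt per selected comment (placeholder wording)."""
    return [
        f"Write a short podcast segment responding to {c.author}, who said: {c.body}"
        for c in top_comments(comments, n)
    ]


def run_pipeline(comments, generate_text, synthesize_voice):
    """Chain the steps: comments -> prompts -> script segments -> audio clips.

    generate_text and synthesize_voice are injected callables standing in
    for the language-model and text-to-speech services.
    """
    script = [generate_text(prompt) for prompt in build_prompts(comments)]
    return [synthesize_voice(segment) for segment in script]


if __name__ == "__main__":
    sample = [
        Comment("alice", "Cover pyramid schemes!", 42),
        Comment("bob", "More negotiation tips please.", 17),
        Comment("carol", "Does history repeat itself?", 30),
        Comment("dave", "First!", 1),
    ]
    # Stub services: echo the prompt as the "script" and fake an audio file name.
    clips = run_pipeline(
        sample,
        generate_text=lambda prompt: f"[script for] {prompt}",
        synthesize_voice=lambda text: f"clip_{abs(hash(text)) % 1000}.mp3",
    )
    print(len(clips), "clips generated")
```

Passing the two services in as callables keeps the selection and prompt-building logic testable without network access or API keys; real clients would be swapped in at the call site.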
bark
- To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]
  So I looked around and decided to use Bark Infinity. (Originally wanted to use Amazon Polly, but don't have a credit card) I tried around and found out that the female storyteller voice sounds quite decently. So I used that and a reference clip of Myne's voice as prompt (which I think might have helped a little... I don't get all that program's features) to generate a whole chapter. That worked quite well.
- Free/Affordable Text to Speech AI?
- Local and open-source equivalent to HeyGen Text-to-Speech (TTS) AI?
- Whispers of Frostcliff Lodge
  AI-generated voice. I'll have to try Bark Infinity and Speechify.
- Bark: A transformer based text to audio system
  I'll link my Bark fork with long audio generation and other features on the root thread, I suppose: https://github.com/JonathanFly/bark
  There's going to be a big update this week with some new stuff I haven't talked about. And a bunch of amazing, clear voices, with a huge variety of styles, that blow the default Suno voices out of the water.
  Don't get too attached though. I was just playing around and made a Bark fork and it got more popular than expected. But I wasn't thinking about the hours of unpaid support and maintenance in my future that I definitely can NOT afford, for software I don't even really have a personal use case for. I'm not generating my own audiobooks or anything, I won’t be using it long term myself, I was just curious what Bark could do. (Turns out a LOT more than you might think at first glance, as you'll see this week.) So I'm trying to work out how I can elegantly wind this down and transition people somewhere else. But I'll keep it updated for at least a little while.
- Converting a Subreddit into a Podcast with GPT-4
- Ask a Text-To-Speech AI (Bark) to say "Why was six afraid of seven?" but ignore the "I'm done" token and force it to just keep talking.
- [R] 🐶 Bark - Text2Speech...But with Custom Voice Cloning using your own audio/text samples 🎙️📝
What are some alternatives?
edge-tts - Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
encodec - State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
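tts-server-android - An Android TTS app with a built-in Microsoft demo API. It supports custom HTTP requests, importing other local TTS engines, and simple narration/dialogue detection based on Chinese double quotation marks, plus automatic retry, fallback configuration, text replacement, and more. | Microsoft TTS Android APP implementation (Use demo API)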
bark-with-voice-clone - 🔊 Text-prompted Generative Audio Model - With the ability to clone voices
elevenlabs-python - The official Python API for ElevenLabs Text to Speech.
bark - 🔊 Text-Prompted Generative Audio Model
audiolm-pytorch - Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
TTS - 🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
emoji-puzzles - The code behind the book
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple