Bark: A transformer based text to audio system

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

bark

67 32,668 5.4 Jupyter Notebook

🔊 Text-Prompted Generative Audio Model

With some tinkering you can create really interesting stuff with Bark. I managed to generate a couple of song snippets / intros using free form text [1]
Haven't tested it personally yet but if you are interested in voice cloning, you might wanna check this fork of Bark [2]
[1] https://github.com/suno-ai/bark/discussions/249

bark

9 959 8.7 Jupyter Notebook

🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model (by JonathanFly)

I'll link my Bark fork with long audio generation and other features on the root thread, I suppose: https://github.com/JonathanFly/bark
There's going to be a big update this week with some new stuff I haven't talked about. And a bunch of amazing, clear voices, with a huge variety of styles, that blow the default Suno voices out of the water.
Don't get too attached though. I was just playing around and made a Bark fork and it got more popular than expected. But I wasn't thinking about the hours of unpaid support and maintenance in my future that I definitely can NOT afford, for software I don't even really have a personal use case for. I'm not generating my own audiobooks or anything, I won’t be using it long term myself, I was just curious what Bark could do. (Turns out a LOT more than you might think at first glance, as you'll see this week.) So I'm trying to work out how I can elegantly wind this down and transition people somewhere else. But I'll keep it updated for at least a little while.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
bark-with-voice-clone

19 2,838 7.5 Python

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

[2] https://github.com/serp-ai/bark-with-voice-clone

audiolm-pytorch

4 2,249 9.0 Python

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

It’s mostly there in https://github.com/lucidrains/audiolm-pytorch#hierarchical-t....

encodec

18 3,185 3.9 Python

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Microsoft’s new text-to-speech model can duplicate anyone's voice in 3 seconds

3 projects | news.ycombinator.com | 9 Jan 2023
Q-Transformer

2 projects | news.ycombinator.com | 30 Nov 2023
To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]

1 project | /r/HonzukiNoGekokujou | 19 Oct 2023
LongLlama

2 projects | /r/LocalLLaMA | 7 Jul 2023
Which features you wish that were added to Character Ai?

1 project | /r/CharacterAI | 7 Jul 2023

Bark: A transformer based text to audio system

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Artificial intelligence bark attention-mechanisms Tts audio-synthesis
Post date: 14 May 2023

bark

bark

InfluxDB

bark-with-voice-clone

audiolm-pytorch

encodec

SaaSHub

Related posts

Microsoft’s new text-to-speech model can duplicate anyone's voice in 3 seconds

Q-Transformer

To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]

LongLlama

Which features you wish that were added to Character Ai?

Bark: A transformer based text to audio system

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Artificial intelligence bark attention-mechanisms Tts audio-synthesis Post date: 14 May 2023

bark

bark

InfluxDB

bark-with-voice-clone

audiolm-pytorch

encodec

SaaSHub

Related posts

Microsoft’s new text-to-speech model can duplicate anyone's voice in 3 seconds

Q-Transformer

To Bridge the Gap Until the Official Audiobooks Are Released I Tried Making a Myne TTS [P5V5]

LongLlama

Which features you wish that were added to Character Ai?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Artificial intelligence bark attention-mechanisms Tts audio-synthesis
Post date: 14 May 2023