Bark: A transformer-based text-to-audio system

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • bark

    🔊 Text-Prompted Generative Audio Model

  • With some tinkering you can create really interesting stuff with Bark. I managed to generate a couple of song snippets / intros using free-form text [1].

    Haven't tested it personally yet, but if you are interested in voice cloning, you might wanna check this fork of Bark [2].

    [1] https://github.com/suno-ai/bark/discussions/249

  • bark

    🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model (by JonathanFly)

  • I'll link my Bark fork with long audio generation and other features on the root thread, I suppose: https://github.com/JonathanFly/bark

    There's going to be a big update this week with some new stuff I haven't talked about. And a bunch of amazing, clear voices, with a huge variety of styles, that blow the default Suno voices out of the water.

    Don't get too attached though. I was just playing around and made a Bark fork and it got more popular than expected. But I wasn't thinking about the hours of unpaid support and maintenance in my future that I definitely can NOT afford, for software I don't even really have a personal use case for. I'm not generating my own audiobooks or anything, I won’t be using it long term myself, I was just curious what Bark could do. (Turns out a LOT more than you might think at first glance, as you'll see this week.) So I'm trying to work out how I can elegantly wind this down and transition people somewhere else. But I'll keep it updated for at least a little while.

  • bark-with-voice-clone

    🔊 Text-prompted Generative Audio Model - With the ability to clone voices

  • [2] https://github.com/serp-ai/bark-with-voice-clone

  • audiolm-pytorch

    Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

  • It’s mostly there in https://github.com/lucidrains/audiolm-pytorch#hierarchical-t....

  • encodec

    State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.
