Base TTS (Amazon): The largest text-to-speech model to-date

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • bark

    An inference server for Bark (by SaladTechnologies)

    Bark and Tortoise work fairly well. Bark does super fast inference[1] on my M1.

    [1] https://github.com/SaladTechnologies/bark

  • metavoice-src

    Foundational model for human-like, expressive TTS

    Interesting. Just a couple of hours ago I came across MetaVoice-1B [0] (Demo [1]) and was amazed by the quality of their TTS in English (sadly no other languages available).

    If this year becomes the year when high quality Open Source TTS and ASR models appear that can run in real-time on an Nvidia RTX 40x0 or 30x0, then that would be great. On CPU even better.

    [0] https://github.com/metavoiceio/metavoice-src

    [1] https://ttsdemo.themetavoice.xyz/

  • TTS

    πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    I've used coqui.ai's TTS models[0] and library[1] to great success. I was able to get a cloned voice rendered in about 80% of the audio clip's length, and I believe you can also stream the response. Do note the model license for XTTS: it is one they wrote themselves, and it has some restrictions.

    [0] https://huggingface.co/coqui/XTTS-v2

    [1] https://github.com/coqui-ai/TTS
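
    If the "about 80% of the audio clip length" figure in the comment above is read as synthesis time relative to the duration of the generated audio, it corresponds to a real-time factor (RTF) of roughly 0.8, where anything below 1.0 is faster than real time. A minimal sketch of that arithmetic follows; the function name and the timings are hypothetical and are not taken from the thread or from the Coqui library.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Return synthesis time divided by the duration of the generated audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


# Hypothetical numbers: a 10-second clip that took 8 seconds to render.
rtf = real_time_factor(synthesis_seconds=8.0, audio_seconds=10.0)
print(f"RTF = {rtf:.2f}")
print("faster than real time" if rtf < 1.0 else "not faster than real time")
```

    An RTF below 1.0 is also the usual precondition for streaming playback without gaps, since audio is produced faster than it is consumed.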

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • OpenAI deems its voice cloning tool too risky for general release

    1 project | news.ycombinator.com | 31 Mar 2024
  • Coqui Is Shutting Down

    1 project | news.ycombinator.com | 11 Jan 2024
  • Coqui.ai Is Shutting Down

    4 projects | news.ycombinator.com | 3 Jan 2024
  • Hello guys, any selfhosted alternative to eleven labs?

    3 projects | /r/selfhosted | 11 Dec 2023
  • Demo of Anagnorisis - completely local recommendation system powered by Llama 2. Radio mode. Work in progress.

    2 projects | /r/LocalLLaMA | 11 Dec 2023
