Ask HN: Open-source, local Text-to-Speech (TTS) generators

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • TTS

    πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

  • I just noticed that https://coqui.ai/ is "Shutting down".

    I'm building a web app (React / Django) which takes a list of affirmations & goals (in Markdown files), puts them into a database (SQlite), and uses voice synthesis to create voice audio files of the phrases. These are combined with a relaxed backing track (ffmpeg), made into playlists of 10-20 phrases (randomly sampled, or according to a theme: "mind" "body" "soul") and then play automatically in the morning & evening (cron). This allows you to persistently hear & vocalize your own goals & good vibes over time.

    I had been planning to use Coqui TTS as the local text-to-speech engine, but with this cancellation, I'd love to hear from the community what is a great open-source, local text-to-speech engine?

    Generally, I learn both the highest quality commercially available technology (example: ElevenLabs), and also the best open-source equivalent. Would love to hear suggestions & perspectives on this. What voice synth tools are you investing your time into learning & building with?

  • piper

    A fast, local neural text to speech system (by rhasspy)

  • Mozilla's browser tts is kind of not bad, just parse and buffer one sentence at a time and it does all right.

    For the backend, I've experimented with piper, which has a lot of voices and accents, though it's tricky to buffer and sync long texts.

    https://github.com/rhasspy/piper

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN)

    2 projects | /r/MachineLearning | 6 Jul 2023
  • NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality

    14 projects | news.ycombinator.com | 17 May 2022
  • WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper

    9 projects | news.ycombinator.com | 17 Jan 2024
  • [D] TTS systems to download & run offline

    3 projects | /r/MachineLearning | 14 May 2023
  • AI-genereeritud Politseikroonika

    1 project | /r/Eesti | 17 Apr 2023