ChatTTS-Best TTS Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  1. piper

    A fast, local neural text to speech system

    > Last time I looked into TTS systems for German, Google was the only game in town. What I wouldn't give for a viable alternative! It doesn't even need to be open source, I'd be quite ready to pay top dollar.

    Will you still pay top dollar if it is open source though? :D

    Piper TTS[0] (MIT Licensed; developed by main dev of Larynx TTS, Mimic3 TTS & Rhasspy voice assistant) has support for ~30 languages, at least some of which have multiple voices available--in a range of quality & data licenses.

    And, particularly fortuitous for your needs, potentially, there's at least one German voice that was recorded[1] specifically for Piper[2] (with emotion variants and CC0-licensed, no less :) )...

    Check out `thorsten` & `thorsten_emotional` on the samples page: https://rhasspy.github.io/piper-samples/

    I can't speak to the quality of the German voice specifically but for English at least I've found Piper's quality & range of voices of use[3].

    ---- footnotes ----

    [0] https://github.com/rhasspy/piper

    [1] https://www.youtube.com/playlist?list=PL19C7uchWZeojjI5FUk3q...

    [2] In addition to other German voices based on other sources: https://huggingface.co/rhasspy/piper-voices/tree/main/de/de_...

    [3] Somewhat of an understatement.

  3. ChatTTS

    A generative speech model for daily dialogue.

    You can try it out by running the inference notebook:

    https://github.com/2noise/ChatTTS/blob/main/infer.ipynb

  4. My interest in TTS is around "indie" game creation, animation and "radio plays".

    A couple of years ago I started development of a tool to help with the generation of game audio such as NPC dialogue, "barks" or narration for those without access to/budget for human voice actors: https://rancidbacon.itch.io/dialogue-tool-for-larynx-text-to...

    One thing I found interesting is that writing a small "scene" and then hearing dialogue being spoken by a variety of voices often prompted the writing of further lines of dialogue in response to perceived emotion contained in voices in the generated output. Plus it was just fun. :)

    The version of the tool on that page is based on Larynx TTS which has continued development more recently as Piper TTS: https://github.com/rhasspy/piper

    I'm yet to publish my port which uses Piper TTS though: https://gitlab.com/RancidBacon/larynx-dialogue/-/tree/featur...

    Though I did upload some sample output (including some "radio announcer" samples in response to a HN comment :) ): https://rancidbacon.gitlab.io/piper-tts-demos/

    Obviously there are variations in voice quality, and the ability to control expression is currently limited, but it beats hearing my own voice. :D
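The "write a small scene, hear it in several voices" workflow described above can be sketched roughly as follows. This is a minimal illustration, not the dialogue tool's actual code. It assumes a made-up script format of `NAME: line`, and the speaker-to-voice mapping and model file names are placeholders for whichever Piper voices you have downloaded. The only Piper details relied on are the `--model` and `--output_file` options and the fact that Piper reads text from standard input, as shown in the Piper README. The sketch only builds the commands; it does not run Piper.

```python
import re
import shlex

# Hypothetical speaker -> voice model mapping. The file names are
# assumptions; substitute the Piper voices you actually downloaded.
VOICES = {
    "GUARD": "en_US-lessac-medium.onnx",
    "MERCHANT": "en_GB-alan-medium.onnx",
}

# Assumed script format: an upper-case speaker name, a colon, then the line.
LINE_RE = re.compile(r"^\s*([A-Z][A-Z0-9_]*):\s*(.+?)\s*$")


def parse_scene(script):
    """Return (speaker, text) pairs for lines shaped like 'NAME: text'."""
    pairs = []
    for raw in script.splitlines():
        match = LINE_RE.match(raw)
        if match:
            pairs.append((match.group(1), match.group(2)))
    return pairs


def piper_jobs(script, voices=VOICES):
    """Build one (text, command) job per dialogue line, in script order."""
    jobs = []
    for index, (speaker, text) in enumerate(parse_scene(script), start=1):
        model = voices[speaker]  # KeyError if a speaker has no voice assigned
        out_file = f"{index:03d}_{speaker.lower()}.wav"
        command = ["piper", "--model", model, "--output_file", out_file]
        jobs.append((text, command))
    return jobs


SCENE = """
GUARD: Halt! Who goes there?
MERCHANT: Just a trader, passing through.
GUARD: Move along, then.
"""

if __name__ == "__main__":
    for text, command in piper_jobs(SCENE):
        # Piper reads the text to speak from standard input.
        print(f"echo {shlex.quote(text)} | {shlex.join(command)}")
```

Numbering the output files keeps the generated clips in script order, which makes it easy to listen through a scene and then write further lines in response.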

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.


