NATSpeech: High Quality Text-to-Speech Implementation with HuggingFace Demo

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • NATSpeech

    A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

  • larynx

    Discontinued End to end text to speech system using gruut and onnx

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • gruut

    A tokenizer, text cleaner, and phonemizer for many human languages.

    Due to licensing constraints, I wrote my own text frontend (MIT): https://github.com/rhasspy/gruut

    I plan to release a version on Larynx that uses eSpeak for phonemization, since it covers many of those corner cases.

  • rhasspy

    Offline private voice assistant for many human languages

    I usually announce things on the Rhasspy voice assistant forums: https://community.rhasspy.org/

    I also have a Twitter account (@rhasspy) used almost exclusively for announcements.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts