Pink-Trombone VS piper-phonemize

Compare Pink-Trombone vs piper-phonemize and see what are their differences.

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App
With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
surveyjs.io
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
Pink-Trombone piper-phonemize
2 1
151 54
- -
5.2 7.7
5 months ago 2 months ago
JavaScript C++
GNU General Public License v3.0 only MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Pink-Trombone

Posts with mentions or reviews of Pink-Trombone. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-01.

piper-phonemize

Posts with mentions or reviews of piper-phonemize. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-01.
  • ESpeak-ng: speech synthesizer with more than one hundred languages and accents
    21 projects | news.ycombinator.com | 1 May 2024
    Yeah, it would be nice if the financial backing behind Rhasspy/Piper led to improvements in espeak-ng too but based on my own development-related experience with the espeak-ng code base (related elsewhere in the thread) I suspect it would be significantly easier to extract the specific required text to phonemes functionality or (to a certain degree) reimplement it (or use a different project as a base[3]) than to more closely/fully integrate changes with espeak-ng itself[4]. :/

    It seems Piper currently abstracts its phonemize-related functionality with a library[0] that currently makes use of a espeak-ng fork[1].

    Unfortunately it also seems license-related issues may have an impact[2] on whether Piper continues to make use of espeak-ng.

    For your specific example of handling 1984 as a year, my understanding is that espeak-ng can handle situations like that via parameters/configuration but in my experience there can be unexpected interactions between different configuration/API options[6].

    [0] https://github.com/rhasspy/piper-phonemize

    [1] https://github.com/rhasspy/espeak-ng

    [2] https://github.com/rhasspy/piper-phonemize/issues/30#issueco...

    [3] Previously I've made note of some potential options here: https://gitlab.com/RancidBacon/notes_public/-/blob/main/note...

    [4] For example, as I note here[5] there's currently at least four different ways to access espeak-ng's phoneme-related functionality--and it seems that they all differ in their output, sometimes consistently and other times dependent on configuration (e.g. audio output mode, spoken punctuation) and probably also input. :/

    [5] https://gitlab.com/RancidBacon/floss-various-contribs/-/blob...

    [6] For example, see my test cases for some other numeric-related configuration options here: https://gitlab.com/RancidBacon/floss-various-contribs/-/blob...

What are some alternatives?

When comparing Pink-Trombone and piper-phonemize you can also consider the following projects:

espeak-ng - eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

xVA-Synth - Machine learning based speech synthesis Electron app, with voices from specific characters from video games

web-speech-synthesis-and-recognition - Speech to Text and Text to Speech on a web browser

audioworklet-polyfill - 🔊 Polyfill AudioWorklet using the legacy ScriptProcessor API.