Pink-Trombone vs piper-phonemize

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

Pink-Trombone		piper-phonemize
	Project
2	Mentions	1
151	Stars	54
-	Growth	-
5.2	Activity	7.7
5 months ago	Latest Commit	2 months ago
JavaScript	Language	C++
GNU General Public License v3.0 only	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Pink-Trombone

Posts with mentions or reviews of Pink-Trombone. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-01.

ESpeak-ng: speech synthesizer with more than one hundred languages and accents
21 projects | news.ycombinator.com | 1 May 2024

Too late to edit, but to any one who needs "convincing" of the flexibility of a formant synthesizer, you should 1) play with Pink Trombone[1], a Javascript formant synthesizer with a UI that lets you graphically manipulate a vocal tract, and 2) have a look at this programmable version of it[2]
[1] https://dood.al/pinktrombone/
[2] https://github.com/zakaton/Pink-Trombone
How to convert phonetic units into words by writing some code (general software approach)?
2 projects | /r/learnprogramming | 23 Jun 2022

Can I do it automatically without any audio/voice recordings at all, such as taking advantage of the pink trombone linguistics tool, (for which there is some source code)?

piper-phonemize

Posts with mentions or reviews of piper-phonemize. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-01.

ESpeak-ng: speech synthesizer with more than one hundred languages and accents
21 projects | news.ycombinator.com | 1 May 2024

Yeah, it would be nice if the financial backing behind Rhasspy/Piper led to improvements in espeak-ng too but based on my own development-related experience with the espeak-ng code base (related elsewhere in the thread) I suspect it would be significantly easier to extract the specific required text to phonemes functionality or (to a certain degree) reimplement it (or use a different project as a base[3]) than to more closely/fully integrate changes with espeak-ng itself[4]. :/
It seems Piper currently abstracts its phonemize-related functionality with a library[0] that currently makes use of a espeak-ng fork[1].
Unfortunately it also seems license-related issues may have an impact[2] on whether Piper continues to make use of espeak-ng.
For your specific example of handling 1984 as a year, my understanding is that espeak-ng can handle situations like that via parameters/configuration but in my experience there can be unexpected interactions between different configuration/API options[6].
[0] https://github.com/rhasspy/piper-phonemize
[1] https://github.com/rhasspy/espeak-ng
[2] https://github.com/rhasspy/piper-phonemize/issues/30#issueco...
[3] Previously I've made note of some potential options here: https://gitlab.com/RancidBacon/notes_public/-/blob/main/note...
[4] For example, as I note here[5] there's currently at least four different ways to access espeak-ng's phoneme-related functionality--and it seems that they all differ in their output, sometimes consistently and other times dependent on configuration (e.g. audio output mode, spoken punctuation) and probably also input. :/
[5] https://gitlab.com/RancidBacon/floss-various-contribs/-/blob...
[6] For example, see my test cases for some other numeric-related configuration options here: https://gitlab.com/RancidBacon/floss-various-contribs/-/blob...

What are some alternatives?

When comparing Pink-Trombone and piper-phonemize you can also consider the following projects:

espeak-ng - eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

xVA-Synth - Machine learning based speech synthesis Electron app, with voices from specific characters from video games

web-speech-synthesis-and-recognition - Speech to Text and Text to Speech on a web browser

audioworklet-polyfill - 🔊 Polyfill AudioWorklet using the legacy ScriptProcessor API.

Pink-Trombone vs espeak-ng Pink-Trombone vs xVA-Synth Pink-Trombone vs web-speech-synthesis-and-recognition Pink-Trombone vs audioworklet-polyfill

Compare Pink-Trombone vs piper-phonemize and see what are their differences.

Pink-Trombone

piper-phonemize

Pink-Trombone

piper-phonemize

What are some alternatives?