GoogleNetworkSpeechSynthesis vs TTS

| | GoogleNetworkSpeechSynthesis | TTS |
|---|---|---|
| Mentions | 11 | 232 |
| Stars | 0 | 29,631 |
| Growth | - | 4.7% |
| Activity | 10.0 | 9.4 |
| Latest commit | over 1 year ago | 5 days ago |
| Language | JavaScript | Python |
| License | BSD 3-clause "New" or "Revised" License | Mozilla Public License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
GoogleNetworkSpeechSynthesis
-
Who is using Web Speech API in their Web sites?
If you want, you can use Google's TTS service without the Web Speech API via a proxy; see GoogleNetworkSpeechSynthesis. I filed a feature request (Issue 263510047: Release TTS and STT source code and Google voices as FOSS) asking Google to release the source code as FOSS, so we don't have to make external requests just to use the Web Speech API.
-
Is there a good text to speech program for linux?
I requested that Google release its TTS and STT source code and Google voices as FOSS; those voices can be requested over the network via GoogleNetworkSpeechSynthesis. They are the voices Google Chrome uses for the Web Speech API. Feel free to chime in on the feature request in support of Google releasing, as FOSS, the TTS and STT code of the network-based cloud service it uses for its Web Speech API implementation.
-
Google Cloud Text to Speech API: The Future of AI Voice Synthesis
Technically you can use the Google API key shipped in the browser; see guest271314/GoogleNetworkSpeechSynthesis. No need to sign up for a Google account to use this. I don't think Google's TTS is related to "AI".
-
Web Speech API is (still) broken on Linux circa 2023
This is how you can make the request yourself: GoogleNetworkSpeechSynthesis.
-
Build a Text-to-Speech component in React
If anybody wants to build a TTS "module" or "component" that accepts and processes SSML input, and that the user can control to an appreciable degree on the front end, you can play around with this: https://github.com/guest271314/GoogleNetworkSpeechSynthesis (it makes an external request). Have fun!
-
how do you make mockups and portfolio with everything requiring paid API
It is possible to use the Google Network Speech Synthesis service "free of charge"; see GoogleNetworkSpeechSynthesis. On Chrome, if you use Google voices with SpeechSynthesisUtterance() and window.speechSynthesis.speak(), you are doing that anyway.
-
[AskJS] You have mastered writing JavaScript from scratch, why use TypeScript?
I implemented SSML parsing in JavaScript by hand for the Web Speech API, per the SSML specification: https://github.com/guest271314/SSMLParser. Neither Firefox nor Chrome/Chromium browsers have implemented SSML parsing for the Web Speech API (Google does implement SSML parsing as a service: https://github.com/guest271314/GoogleNetworkSpeechSynthesis).
-
I Created A Web Speech API NPM Package Called SpeechKit
FYI, when Google voices are used in Chrome or Chromium-based browsers, the browser also makes a remote request; see https://github.com/guest271314/GoogleNetworkSpeechSynthesis.
-
[AskJS] Do specification authors and implementers listen to developers in the field?
How is processing SSML an attack vector? Google provides that capability as a service, with embedded limitations on character input that are not specified. This can and should be implemented in the browser, without making an external request; see native-messaging-espeak-ng and GoogleNetworkSpeechSynthesis.
- Google Network Speech Synthesis
TTS
-
Ask HN: Open-source, local Text-to-Speech (TTS) generators
I just noticed that https://coqui.ai/ is "Shutting down".
I'm building a web app (React / Django) which takes a list of affirmations & goals (in Markdown files), puts them into a database (SQLite), and uses voice synthesis to create voice audio files of the phrases. These are combined with a relaxed backing track (ffmpeg), made into playlists of 10-20 phrases (randomly sampled, or according to a theme: "mind", "body", "soul"), and then played automatically in the morning & evening (cron). This allows you to persistently hear & vocalize your own goals & good vibes over time.
I had been planning to use Coqui TTS as the local text-to-speech engine, but with this cancellation, I'd love to hear from the community what is a great open-source, local text-to-speech engine?
Generally, I learn both the highest quality commercially available technology (example: ElevenLabs), and also the best open-source equivalent. Would love to hear suggestions & perspectives on this. What voice synth tools are you investing your time into learning & building with?
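The pipeline described above (Markdown phrases → SQLite → randomly sampled, themed playlists) can be sketched roughly as follows; the schema, theme tags, and function names are illustrative assumptions, and the TTS/ffmpeg rendering steps are left out:

```python
# Rough sketch of the affirmations pipeline: phrases into SQLite, then a
# sampled, optionally themed playlist. Schema and names are illustrative.
import random
import sqlite3

def load_phrases(db, phrases):
    """phrases: iterable of (text, theme) pairs, e.g. parsed from Markdown lists."""
    db.execute("CREATE TABLE IF NOT EXISTS affirmations (text TEXT, theme TEXT)")
    db.executemany("INSERT INTO affirmations VALUES (?, ?)", phrases)
    db.commit()

def sample_playlist(db, theme=None, size=10):
    """Randomly sample up to `size` phrases, optionally restricted to one theme."""
    if theme:
        rows = db.execute("SELECT text FROM affirmations WHERE theme = ?",
                          (theme,)).fetchall()
    else:
        rows = db.execute("SELECT text FROM affirmations").fetchall()
    return random.sample([r[0] for r in rows], min(size, len(rows)))

db = sqlite3.connect(":memory:")
load_phrases(db, [("I am focused.", "mind"), ("I move daily.", "body"),
                  ("I am at peace.", "soul"), ("I learn every day.", "mind")])
playlist = sample_playlist(db, theme="mind", size=2)
```

Each sampled phrase would then be rendered to audio by whichever TTS engine is chosen and mixed with the backing track via ffmpeg before being scheduled with cron.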
-
OpenAI deems its voice cloning tool too risky for general release
lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.
-
What things are happening in ML that we can't hear over the din of LLMs?
Not sure how relevant this is but note that Coqui TTS (the realistic TTS) has already shut down
https://coqui.ai
-
Base TTS (Amazon): The largest text-to-speech model to-date
I've used coqui.ai's TTS models[0] and library[1] to great success. I was able to get a cloned voice to be rendered in about 80% of the audio clip length, and I believe you can also stream the response. Do note the model license for XTTS; it is one they wrote themselves and has some restrictions.
[0] https://huggingface.co/coqui/XTTS-v2
[1] https://github.com/coqui-ai/TTS
- FLaNK Stack Weekly 12 February 2024
- Coqui Is Shutting Down
-
Coqui.ai Is Shutting Down
My only exposure to Coqui was their text-to-speech software. If I remember correctly, the website was a commercialized service with TTS and probably some other related things. I hope work on the software continues in the open.
https://github.com/coqui-ai/TTS
-
Hello guys, any selfhosted alternative to eleven labs?
Coqui.ai TTS (https://github.com/coqui-ai/TTS)
-
Demo of Anagnorisis - completely local recommendation system powered by Llama 2. Radio mode. Work in progress.
"tts_models/multilingual/multi-dataset/xtts_v2" model from https://github.com/coqui-ai/TTS. It gives pretty good results and works with references, so it's pretty easy to change the voice. By the way the source code of the project is open: https://github.com/volotat/Anagnorisis but be ready, the code is pretty raw for now.
-
XTTS voice cloning with only seconds of audio
A recent update to their GitHub also has a no-code gradio ui to facilitate fine-tuning and inferencing locally. https://github.com/coqui-ai/TTS/releases/tag/v0.21.3
What are some alternatives?
piper - A fast, local neural text to speech system
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
native-messaging-espeak-ng - Native Messaging => eSpeak NG => MediaStreamTrack
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
SAM - Software Automatic Mouth - Tiny Speech Synthesizer
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
captureSystemAudio - Capture system audio ("What-U-Hear")
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
AudioWorkletStream - fetch() => ReadableStream => AudioWorklet
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
pocketsphinx - A small speech recognizer