Web Speech API is (still) broken on Linux circa 2023

Our great sponsors

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

Our great sponsors

native-messaging-espeak-ng

21 4 6.7 JavaScript

Native Messaging => eSpeak NG => MediaStreamTrack

I already implemented direct connection to a local speech synthesis engine that processes SSML input, provides a means to pause and resume the audio output because we can play the file using HTMLMediaElement in the browser, and creates a MediaStreamTrack from the parsed WAV for the ability to record and share the stream with peers - to prove the requirement is possible https://github.com/guest271314/native-messaging-espeak-ng.

SSMLParser

9 33 10.0 JavaScript

Implement SSML parsing for Web Speech API

Or, as you noted, split the synthesis over multiple SpeechSynthesisUtterance instances in a user-defined queue calling speak() in succession. That's what I do here https://guest271314.github.io/SSMLParser/.

SurveyJS

surveyjs.io sponsored

Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
GoogleNetworkSpeechSynthesis

11 0 10.0 JavaScript

Google's Network Speech Synthesis: Bring your own Google API key and proxy

This is how you can make the request yourself GoogleNetworkSpeechSynthesis.

TTS

62 8,806 0.0 Jupyter Notebook

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)

There is a lot of TTS and SST development going on (https://github.com/mozilla/TTS; https://github.com/mozilla/DeepSpeech; https://github.com/common-voice/common-voice). That is the only way they work: Contributions from the wild.

DeepSpeech

67 24,278 0.0 C++

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

There is a lot of TTS and SST development going on (https://github.com/mozilla/TTS; https://github.com/mozilla/DeepSpeech; https://github.com/common-voice/common-voice). That is the only way they work: Contributions from the wild.

common-voice

66 3,250 10.0 TypeScript

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

There is a lot of TTS and SST development going on (https://github.com/mozilla/TTS; https://github.com/mozilla/DeepSpeech; https://github.com/common-voice/common-voice). That is the only way they work: Contributions from the wild.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Common Voice
5 projects | news.ycombinator.com | 5 Dec 2023
Offline speech to text software
2 projects | /r/AskTechnology | 16 Oct 2021
Voice Anonymization
4 projects | /r/privacytoolsIO | 7 Jun 2021
Anyone purchasing Novus internet should be aware, their unlimited data plans are not actually unlimited.
2 projects | /r/vancouver | 14 May 2021
Show HN: Cognita – open-source RAG framework for modular applications
2 projects | news.ycombinator.com | 27 Apr 2024

Web Speech API is (still) broken on Linux circa 2023

This page summarizes the projects mentioned and recommended in the original post on /r/javascript
Deep Learning open-data native-messaging Machine Learning text-to-speech
Post date: 15 Apr 2023

native-messaging-espeak-ng

SSMLParser

SurveyJS

GoogleNetworkSpeechSynthesis

TTS

DeepSpeech

common-voice

Related posts

Web Speech API is (still) broken on Linux circa 2023

This page summarizes the projects mentioned and recommended in the original post on /r/javascript Deep Learning open-data native-messaging Machine Learning text-to-speech Post date: 15 Apr 2023

native-messaging-espeak-ng

SSMLParser

SurveyJS

GoogleNetworkSpeechSynthesis

TTS

DeepSpeech

common-voice

Related posts

This page summarizes the projects mentioned and recommended in the original post on /r/javascript
Deep Learning open-data native-messaging Machine Learning text-to-speech
Post date: 15 Apr 2023