Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
-
GoogleNetworkSpeechSynthesis
Google's Network Speech Synthesis: Bring your own Google API key and proxy
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Coqui TTS command line tool truly state-of-the-art quality - for some voices, synthesized speech is indistinguishable from natural supports many languages unfortunately, it's slow it's python program - you can install it with pip install TTS
eSpeak NG supports running on Linux, BSD, Mac, Android, Windows, has been compiled to WASM with Emscripten. See also espeak and meSpeak.js.
I requested to Google to Release TTS and STT source code and Google voices as FOSS which you can request over the network here GoogleNetworkSpeechSynthesis. Those are the voices Google Chrome uses for Web Speech API. Feel free to chime on the feature request in in support of Google releasing the source code of its network-based cloud service (that google uses for Web Speech API implementation) TTS and SST code as FOSS.
piper command line tool few English voices have excellent quality (my favorite: "lessac-medium" and "libritts-high") supports 21 languages it is super fast to use it, you need to download binary and models from Github's release page (I don't think there is a Linux package of any kind yet)