Our great sponsors
-
I've been quite happy with Mimic3 lately (https://github.com/MycroftAI/mimic3), the engine that powers Mycroft. It also comes with an easy-to-install Docker image.
-
The best is probably tortoise but you have to run it yourself https://github.com/neonbjb/tortoise-tts
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
I've had good luck with https://github.com/espeak-ng/espeak-ng (for very specific purposes, and I was willing to wrangle IPA)
-
I've had good results with https://github.com/rhasspy/larynx
-
I'm not sure about the licensing of all the models/etc, but Coqui AI's 'TTS' python package is fairly good.
-
-
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
I have heard good things about Mozilla's TTS: https://github.com/mozilla/TTS
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
It's such an obvious answer perhaps is why nobody has commented it. But depending on the use, you might try web speech API synthesis. For example a Windows user might see a Cortana option whereas a Mac user might see Siri.
Demo Here: https://mdn.github.io/dom-examples/web-speech-api/speak-easy...
Read more here https://github.com/mdn/dom-examples/tree/main/web-speech-api
-
text-to-speech-ubuntu
🙊 Setup "selectable" text to speech / TTS on Ubuntu Linux 24.04 22.04 22.10 23.04 23.10 . Ideal for speed reading, programming, editing and writing.
-
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
-
OpenAI’s whisper[1] should do the job for you.
-
tts
Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage. (by tslmy)
Given a URL, this service return an audio file / stream (in WAV format) that reads out the main content of the webpage.
-
For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet
They provide support for running in a Raspberry Pi and it runs in real-time. I have tried the desktop version and the quality is good enough when the audio is clean.
-
If your use case allows for a web API, I've had good experience running OpenTTS[0].
It packages several models, including Coqui AI's TTS which I tend to use the most. There's a handy Docker image, too.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives