espeak-ng
flowtron
Our great sponsors
espeak-ng | flowtron | |
---|---|---|
25 | 6 | |
2,780 | 878 | |
4.9% | 0.8% | |
7.2 | 0.0 | |
4 days ago | 9 months ago | |
C | Jupyter Notebook | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
espeak-ng
-
IAMA senior javascript dev, ask me anything
I'm skeptical about a senior JavaScript developer claiming to be bored. Nonetheless, let's see. How would you go about modifying [this](ng/blob/master/emscripten/espeakng_glue.idl) IDL file, this C++ glue code, and the relevant Make file to compile eSpeak NG to JavaScript with Emscripten with SSML support enabled?
-
Is there a good text to speech program for linux?
eSpeak NG supports running on Linux, BSD, Mac, Android, Windows, has been compiled to WASM with Emscripten. See also espeak and meSpeak.js.
-
Balacoon: python package for text-to-speech
I didnt not release trainy parts to build voices. I am considering, but there is so many packages already (coqui, espnet, piper, nemo, fairseq to name a few) that i focused on usability for now. Support for new languages is a different question. Everyone wants to train fancy neural nets. But support for new language is about writing rules and having language expertise. I did it for English (https://github.com/balacoon/en_us_normalization/tree/c1019cf878aa6baf25d6fff719cf418cca5a3107/production/classify). Doing it for all the other languages would probably take me a lifetime. Other speech synthesis solutions use 17-years old espeak for this purpose (https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md). I introduced the fallback to it in Balacoon too. But generally, it is outdated technology and I believe we should do better.
-
I Created A Web Speech API NPM Package Called SpeechKit
There are espeak-ng https://github.com/espeak-ng/espeak-ng and pocketsphinx https://github.com/cmusphinx/pocketsphinx which can be used locally without making external requests.
-
Ask HN: Are there any good open source Text-to-Speech tools?
I've had good luck with https://github.com/espeak-ng/espeak-ng (for very specific purposes, and I was willing to wrangle IPA)
-
Node.js Native Messaging host
Web Speech API does not provide a means to capture audio output of speechSynthesis.speak(new SpeechSynthesis.speak()). Using Native Messaging we start a local server, send input text or SSML to the local server with fetch(), pass the input data to local speech synthesis engine, in this case espeak-ng, get response back as WAV in the browser, which we parse to Float32Array and write to a MediaStreamTrackGenerator which we then output speakers and/or share with peers (https://github.com/guest271314/native-messaging-espeak-ng; https://github.com/espeak-ng/espeak-ng/tree/master/chromium_extension).
-
Curious with Text To Speech
The most portable way: https://github.com/espeak-ng/espeak-ng
- TTS engine that allows me to add my own MSI files
-
β³ 0 apps added, 78 updated at f-droid.org
eSpeak (version 1.51.1): Text-to-speech engine for Android. Based on the eSpeak engine.
-
How to convert phonetic units into words by writing some code (general software approach)?
This is one I use for a basic robo-assistant voice that you might be able to glance at and get an idea for: https://github.com/espeak-ng/espeak-ng, and I believe the big ML companies have released some neural net code freely, so that might also be an avenue.
flowtron
- [D] What is the best open source text to speech model?
- A thought: we need language and voice synthesis models as free as Stable Diffusion
-
Ask HN: Best FOSS software to read text allowed
If you want free (as open source) software, the NVIDIA research GitHub also has some good tools. For example : https://github.com/NVIDIA/flowtron
What are some alternatives?
RHVoice - a free and open source speech synthesizer for Russian and other languages
TensorFlowTTS - :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
piper - A fast, local neural text to speech system
tacotron - A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
scrcpy - Display and control your Android device
SAM - Software Automatic Mouth - Tiny Speech Synthesizer
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
espnet - End-to-End Speech Processing Toolkit
spokestack-android - Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
RealTimeSingingSynthesizer - Live Coding Singing Synthesizer. Python sinsy-NG wrapper.
WaveRNN - WaveRNN Vocoder + TTS