RealTimeSingingSynthesizer
espeak-ng
Our great sponsors
RealTimeSingingSynthesizer | espeak-ng | |
---|---|---|
2 | 25 | |
55 | 2,780 | |
- | 4.9% | |
1.8 | 7.2 | |
over 2 years ago | 5 days ago | |
C | C | |
GNU General Public License v3.0 only | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
RealTimeSingingSynthesizer
-
How hard do you think it would be to get an UTAU voicebank to work in Foxdot?
Live singing synthesis has been done with Espeak and Sinsy-NG: https://github.com/mathigatti/RealTimeSingingSynthesizer
espeak-ng
-
IAMA senior javascript dev, ask me anything
I'm skeptical about a senior JavaScript developer claiming to be bored. Nonetheless, let's see. How would you go about modifying [this](ng/blob/master/emscripten/espeakng_glue.idl) IDL file, this C++ glue code, and the relevant Make file to compile eSpeak NG to JavaScript with Emscripten with SSML support enabled?
-
Is there a good text to speech program for linux?
eSpeak NG supports running on Linux, BSD, Mac, Android, Windows, has been compiled to WASM with Emscripten. See also espeak and meSpeak.js.
-
Balacoon: python package for text-to-speech
I didnt not release trainy parts to build voices. I am considering, but there is so many packages already (coqui, espnet, piper, nemo, fairseq to name a few) that i focused on usability for now. Support for new languages is a different question. Everyone wants to train fancy neural nets. But support for new language is about writing rules and having language expertise. I did it for English (https://github.com/balacoon/en_us_normalization/tree/c1019cf878aa6baf25d6fff719cf418cca5a3107/production/classify). Doing it for all the other languages would probably take me a lifetime. Other speech synthesis solutions use 17-years old espeak for this purpose (https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md). I introduced the fallback to it in Balacoon too. But generally, it is outdated technology and I believe we should do better.
-
I Created A Web Speech API NPM Package Called SpeechKit
There are espeak-ng https://github.com/espeak-ng/espeak-ng and pocketsphinx https://github.com/cmusphinx/pocketsphinx which can be used locally without making external requests.
-
Ask HN: Are there any good open source Text-to-Speech tools?
I've had good luck with https://github.com/espeak-ng/espeak-ng (for very specific purposes, and I was willing to wrangle IPA)
-
Node.js Native Messaging host
Web Speech API does not provide a means to capture audio output of speechSynthesis.speak(new SpeechSynthesis.speak()). Using Native Messaging we start a local server, send input text or SSML to the local server with fetch(), pass the input data to local speech synthesis engine, in this case espeak-ng, get response back as WAV in the browser, which we parse to Float32Array and write to a MediaStreamTrackGenerator which we then output speakers and/or share with peers (https://github.com/guest271314/native-messaging-espeak-ng; https://github.com/espeak-ng/espeak-ng/tree/master/chromium_extension).
-
Curious with Text To Speech
The most portable way: https://github.com/espeak-ng/espeak-ng
- TTS engine that allows me to add my own MSI files
-
β³ 0 apps added, 78 updated at f-droid.org
eSpeak (version 1.51.1): Text-to-speech engine for Android. Based on the eSpeak engine.
-
How to convert phonetic units into words by writing some code (general software approach)?
This is one I use for a basic robo-assistant voice that you might be able to glance at and get an idea for: https://github.com/espeak-ng/espeak-ng, and I believe the big ML companies have released some neural net code freely, so that might also be an avenue.
What are some alternatives?
RHVoice - a free and open source speech synthesizer for Russian and other languages
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
piper - A fast, local neural text to speech system
scrcpy - Display and control your Android device
SAM - Software Automatic Mouth - Tiny Speech Synthesizer
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
spokestack-android - Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
flowtron - Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
YourTTS - YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
sonic - Simple library to speed up or slow down speech
dicio-android - Dicio assistant app for Android
vosk-android-service - Offline voice typing for Android