EmotiVoice
sshx
EmotiVoice | sshx | |
---|---|---|
5 | 3 | |
6,369 | 5,371 | |
- | - | |
8.9 | 8.2 | |
3 months ago | 7 days ago | |
Python | Rust | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
EmotiVoice
- FLaNK Stack Weekly 12 February 2024
-
WhisperSpeech โ An Open Source text-to-speech system built by inverting Whisper
Interested to see how it performs for Mandarin Chinese speech synthesis, especially with prosody and emotion. The highest quality open source model I've seen so far is EmotiVoice[0], which I've made a CLI wrapper around to generate audio for flashcards.[1] For EmotiVoice, you can apparently also clone your own voice with a GPU, but I have not tested this.[2]
[0] https://github.com/netease-youdao/EmotiVoice
[1] https://github.com/siraben/emotivoice-cli
[2] https://github.com/netease-youdao/EmotiVoice/wiki/Voice-Clon...
-
Microsoft releases Windows AI studio to run and fine tune models locally
Interesting. I'll have to check to be sure, but I think maybe something is happening automagically if you have reasonably up to date nvidia drivers on the host OS, because I was able to run the EmotiVoice TTS docker (which requires nvidia gpu) from WSL2.
https://github.com/netease-youdao/EmotiVoice
- FLaNK Stack Weekly for 13 November 2023
- EmotiVoice: A Multi-Voice and Prompt-Controlled TTS Engine
sshx
What are some alternatives?
Cgml - GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
expectrl - A rust library for controlling interactive programs in a pseudo-terminal
TTS - ๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
react-datasheet-grid - An Airtable-like / Excel-like component to create beautiful spreadsheets.
draw-a-ui - Draw a mockup and generate html for it
clipea - ๐๐ข Like Clippy but for the CLI. A blazing fast AI helper for your command line
MockingBird - ๐AIๆๅฃฐ: 5็งๅ ๅ ้ๆจ็ๅฃฐ้ณๅนถ็ๆไปปๆ่ฏญ้ณๅ ๅฎน Clone a voice in 5 seconds to generate arbitrary speech in real-time
wubloader
lhotse - Tools for handling speech data in machine learning projects.
uploadserver - Simple Rust file server which lets you upload, share, and download files from a web browser. Ready-to-run binaries for Windows, Mac, and Linux. Free/Open-Source alternative to AirDrop/Dropbox for transferring files on your local network without having to install anything. A more sophisticated version of `python3 -m http.server 8000`.
voice100 - Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
cuml - cuML - RAPIDS Machine Learning Library