EmotiVoice
MockingBird
| | EmotiVoice | MockingBird |
|---|---|---|
| Mentions | 5 | 9 |
| Stars | 6,369 | 33,904 |
| Growth | - | - |
| Activity | 8.9 | 5.8 |
| Last commit | 3 months ago | 2 months ago |
| Language | Python | Python |
| License | Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
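As a rough illustration of how the activity figures in the table might be read, the sketch below assumes the score maps linearly onto a percentile, so that 9.0 corresponds to the top 10%. The source gives only that single data point; the linear mapping and the function name `top_share_percent` are assumptions made here for illustration, not the site's actual formula.

```python
def top_share_percent(activity: float) -> float:
    """Return the assumed 'top N%' bracket for an activity score.

    Assumption (not confirmed by the source): the score is a percentile
    rank scaled to 0-10, so 9.0 means top 10% and 10.0 means top 0%.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    # Distance from the maximum score, converted to a percentage.
    return round((10.0 - activity) * 10.0, 1)


if __name__ == "__main__":
    # Scores taken from the comparison table.
    for name, score in [("EmotiVoice", 8.9), ("MockingBird", 5.8)]:
        print(f"{name}: roughly top {top_share_percent(score)}%")
```

Under that assumption, EmotiVoice (8.9) would sit in roughly the top 11% of tracked projects and MockingBird (5.8) in roughly the top 42%.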
EmotiVoice
- FLaNK Stack Weekly 12 February 2024
- WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
Interested to see how it performs for Mandarin Chinese speech synthesis, especially with prosody and emotion. The highest quality open source model I've seen so far is EmotiVoice[0], which I've made a CLI wrapper around to generate audio for flashcards.[1] For EmotiVoice, you can apparently also clone your own voice with a GPU, but I have not tested this.[2]
[0] https://github.com/netease-youdao/EmotiVoice
[1] https://github.com/siraben/emotivoice-cli
[2] https://github.com/netease-youdao/EmotiVoice/wiki/Voice-Clon...
- Microsoft releases Windows AI studio to run and fine tune models locally
Interesting. I'll have to check to be sure, but I think maybe something is happening automagically if you have reasonably up to date nvidia drivers on the host OS, because I was able to run the EmotiVoice TTS docker (which requires nvidia gpu) from WSL2.
https://github.com/netease-youdao/EmotiVoice
- FLaNK Stack Weekly for 13 November 2023
- EmotiVoice: A Multi-Voice and Prompt-Controlled TTS Engine
MockingBird
- TIL cyber criminals, with the help of AI voice cloning software, used a deepfaked voice of a company executive to fool an Emirati bank manager into transferring 35 million dollars into their personal accounts. The bank manager had recognized the executive's voice from having worked with him before.
Actually, there are already open source implementations available, for example, the MockingBird project on GitHub. It supports English and Mandarin Chinese. For those with enough computation power and willingness to try, you can even make your own voice dataset and train the model to generate ‘your’ sound, simply following the project docs.
- Need a deep fake for my girlfriend
piece of recordings MockingBird
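- I came across an interesting project on GitHub: you can import voice samples and train an AI on them, and once training is done it can clone the voice and have it read custom text. Could any programmers here assess how feasible it would be to use this tool to make "dairy products" (乳制品, forum slang)?
Project URL: https://github.com/babysor/MockingBird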
- Hacker News top posts: Dec 28, 2021
Mocking Bird – Realtime Voice Clone for Chinese (45 comments)
- Mocking Bird – Realtime Voice Clone for Chinese
- Clone a voice in 5 seconds to generate arbitrary speech in real-time
- Danke schön Sir
I wonder if there is a way to tie in this AI voice changer. https://github.com/babysor/MockingBird
- Top 10 trending GitHub repos of the week🌟.
- [P] Clone a voice in 5 seconds to generate arbitrary speech in real-time
What are some alternatives?
Cgml - GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
cuckoo - Cuckoo Sandbox is an automated dynamic malware analysis system
draw-a-ui - Draw a mockup and generate html for it
Speech-enhancement - Deep learning for audio denoising
lhotse - Tools for handling speech data in machine learning projects.
SimSwap - An arbitrary face-swapping framework on images and videos with one single trained model!
voice100 - Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
uptrace - Open source APM: OpenTelemetry traces, metrics, and logs
clipea - 📎🟢 Like Clippy but for the CLI. A blazing fast AI helper for your command line
CombineExpectations - Utilities for tests that wait for Combine publishers