|5 months ago||10 days ago|
|MIT License||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
[D] Anomaly detection without training set?
1 project | reddit.com/r/MachineLearning | 22 Jan 2021
Other than that, if you're looking for more complicated models you could try and adapt one of the models from this this GitHub repo for your task (this is an open source repo of some SOTA anomaly detection models).
I'm really starting to enjoy this. My Vehicle Damage System causes so much destruction.
1 project | reddit.com/r/Unity3D | 24 Jan 2022
oh for sure, there have already been commercial games released exploring such technology.. and you can go clone your voice right now on github w/ mit licensed tech if you want. nvm, today's commercial offerings can even add emotion convincingly to their synthesized speech.
2 projects | reddit.com/r/chonglangTV | 23 Jan 2022
How to clone someone’s voice with AI?
1 project | reddit.com/r/AskTechnology | 16 Jan 2022
I synthesized Eddy's voice
1 project | reddit.com/r/lingling40hrs | 12 Jan 2022
Hello! Using this amazing software I took 7 seconds of Eddy's voice (from their lofi video and from their bubble tea video consecutively) and got my 2 favorite results: https://youtu.be/XisqpQmbf1Y. Which is better? If you want them to say anything else you can comment and ill pick my favorites.
Jack Rhysider Voice Cloning
1 project | reddit.com/r/u_dark_net_user | 10 Jan 2022
Clone a voice in 5 seconds to generate arbitrary speech in real-time
3 projects | news.ycombinator.com | 27 Dec 2021
I'm the author of FakeYou.com, so I have a little experience in this area.
This appears to be a repackaging of RealTimeVoiceCloning , albeit with a few additions, such as GSTs.
No matter what the repo claims, your results will depend on high quality data. Lots of it, and with ample fine tuning.
If you're picking this up for a project, HiFi-Gan is pretty much the best vocoder right now. Tacotron still produces great results.3 projects | news.ycombinator.com | 27 Dec 2021
The Return of the Evil Empire!
2 projects | reddit.com/r/Patriots | 6 Dec 2021
Real-Time Voice Cloning for the... voice cloning. It's pretty finicky and works better with shorter phrases. Re-running the final "step" will spit out a different output each time, for better or worse. The result is going to be pretty monotone, so no yelling unfortunately (but perfect for BB). Hardest word to get right was "mafia".
Getting started with a GitHub project, question about Python
1 project | reddit.com/r/learnprogramming | 23 Nov 2021
Hi, I'm looking to try out a GitHub project (https://github.com/CorentinJ/Real-Time-Voice-Cloning) and already feeling in over my head.
Voice-cloning library for conlangs?
3 projects | reddit.com/r/conlangs | 9 Nov 2021
As for synthesis of text using your own voice - you can dig into Real Time Voice Cloning or maybe FastSpeech2, but I am not sure if you can use it with conlangs (and because of ML nature, you need many, many, many training data to get anything interesting).
What are some alternatives?
NeMo - NeMo: a toolkit for conversational AI
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
gpt-2 - Code for the paper "Language Models are Unsupervised Multitask Learners"
GLaDOS-Voice-Assistant - DIY Voice Assistant based on the GLaDOS character from Portal video game series. Works with home assistant!
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
TTS - :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
FastSpeech2 - An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
DeepFaceLab - DeepFaceLab is the leading software for creating deepfakes.
RHVoice - a free and open source speech synthesizer for Russian and other languages
mimic-recording-studio - Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Deep-Learning-Papers-Reading-Roadmap - Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!