|2 months ago||3 months ago|
|GNU General Public License v3.0 or later||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Blender animation augmented with AI
3 projects | reddit.com/r/blender | 18 Aug 2022
How to use github repository installed with pip install
2 projects | reddit.com/r/learnpython | 18 May 2022
I'm finding this somewhat frustrating. I want to try out the code from here, so I tried using conda for the first time. One of the options suggests using pip install on the github link. Then, later it recommends calling
What voice-changing apps are available right now?
4 projects | reddit.com/r/artificial | 29 Jun 2022
We have the TorToiSe repo, the SV2TTS repo, and from here you have the other models like Tacotron 2, FastSpeech 2, and such. A there is a lot that goes into training a baseline for these models on the LJSpeech and LibriTTS datasets. Fine tuning is left up to the user.
Is there a way I could voice clone a character from a movie?
2 projects | reddit.com/r/ArtificialInteligence | 30 May 2022
2 projects | reddit.com/r/chonglangTV | 23 Jan 2022
Clone a voice in 5 seconds to generate arbitrary speech in real-time
3 projects | news.ycombinator.com | 27 Dec 2021
I'm the author of FakeYou.com, so I have a little experience in this area.
This appears to be a repackaging of RealTimeVoiceCloning , albeit with a few additions, such as GSTs.
No matter what the repo claims, your results will depend on high quality data. Lots of it, and with ample fine tuning.
If you're picking this up for a project, HiFi-Gan is pretty much the best vocoder right now. Tacotron still produces great results.3 projects | news.ycombinator.com | 27 Dec 2021
The Return of the Evil Empire!
2 projects | reddit.com/r/Patriots | 6 Dec 2021
Real-Time Voice Cloning for the... voice cloning. It's pretty finicky and works better with shorter phrases. Re-running the final "step" will spit out a different output each time, for better or worse. The result is going to be pretty monotone, so no yelling unfortunately (but perfect for BB). Hardest word to get right was "mafia".
Voice-cloning library for conlangs?
3 projects | reddit.com/r/conlangs | 9 Nov 2021
As for synthesis of text using your own voice - you can dig into Real Time Voice Cloning or maybe FastSpeech2, but I am not sure if you can use it with conlangs (and because of ML nature, you need many, many, many training data to get anything interesting).
Speech Synthesis on Linux
10 projects | news.ycombinator.com | 25 Sep 2021
Help me Immortalise my dying mums voice
2 projects | reddit.com/r/learnmachinelearning | 3 Sep 2021
I m so sorry to hear about your mom. On the topic of preserving her voice things liks this might help. Also I saw you were looking into chat bots you can train custom bots with intents and domains so that they respond in a way your mom does. If this is the kind of thing you looking for let me know I will be able to write a more detailed reply
How can I immortalise my dying mums voice in some sort of voice assistant?
2 projects | reddit.com/r/artificial | 2 Sep 2021
Unofficial implementation by CorentinJ (Corentin Jemine) https://github.com/CorentinJ/Real-Time-Voice-Cloning
What are some alternatives?
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepFaceLab - DeepFaceLab is the leading software for creating deepfakes.
NeMo - NeMo: a toolkit for conversational AI
TTS - :robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
MockingBird - 🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
glados-voice-assistant - DIY Voice Assistant based on the GLaDOS character from Portal video game series. Works with home assistant!
tacotron2 - Tacotron 2 - PyTorch implementation with faster-than-realtime inference
RHVoice - a free and open source speech synthesizer for Russian and other languages
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
gpt-2 - Code for the paper "Language Models are Unsupervised Multitask Learners"
nvdiffrast - Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering
Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).