ubisoft-laforge-daft-exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Ubisoft has their Daft-Exprt code on GitHub, which does a tolerable job of prosody/tone transfer. That's pretty much necessary to make the output sound natural if you're building a cloning pipeline that doesn't use a service's packaged voices. Without it I wouldn't even consider an AI speech pipeline, given how tightly constrained the range of tone is, even with something like replicant studios' actor voices.
Related posts
- Using A.I voices or Sound Fonts (i.e. Undertale or Animal Crossing)
- WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
- [D] What offline TTS Model is good enough for a realistic real-time task?
- [discussion] text to voice generation for textbooks (non-math part)
- [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN)