Our great sponsors
-
No, the reference wavs are only used during inference (asking the already trained model to make predictions). During training, the dataset was huge and it contained a lot of prosodic information - running Capacitron on LJSpeech would not make a lot of sense because the dataset is fairly monotonic. If you're interested about voice cloning from a small amount of data, you should check out the research from Coqui, YourTTS and you can even try out their product-level quality models for zero-shot voice cloning.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.