Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Stem Roller is a free downloadable version of a similar tool (can't comment on a quality comparison) if you want a non-web version:
https://www.stemroller.com/
I will also say -- UberDuck and AI TTS in general, when compared to the SURGE of development and tools that's happened on the image/video side of AI, is TERRIBLE.
UberDuck's community specifically seems geared towards kids making memes -- I suspect they just ended up there and didn't design it that way, but wading through the terrible user created models to find ones that work was tiresome.
I tried to get https://coqui.ai/ setup to do similar things, but have not been successful.
Surely this will all explode in the next 18 mo max
The rules for live DJ sets is very different from copyright rules that limit published music. This is the same as when artists use samples of meme audio, which otherwise would be copyrighted. Guetta has a long very large body of work and deserves the benefit of the doubt when it comes to the question of Eminem okaying it. He very clearly wants there to be a discussion about this. Celebrity impersonators for singing have been a thing for a long time. This isnāt about one artist in particular.
I was playing with Tortoise TTS and was genuinely surprised by how good it is with just a few minutes of clean audio. It didnāt take me hours to train or fine tune, the generation step is sort of long, 5-7 minutes for 30 seconds, but it feels really similar to stable diffusion where you do quick test with slow samples and iterations to find a decent seed, and then you let it do a more complete regeneration. Itās zero shot generation that ran on my laptop 2070 max q and i7 10750h. Itās not perfect but itās believable when layered with music.
https://github.com/152334H/tortoise-tts-fast