A family of diffusion models for text-to-audio generation.
Why do you think that https://github.com/serp-ai/ai-text-to-audio-latent-diffusion is a good alternative to tango
A family of diffusion models for text-to-audio generation.
Why do you think that https://github.com/serp-ai/ai-text-to-audio-latent-diffusion is a good alternative to tango