stable-karlo
naturalspeech2-pytorch
| | stable-karlo | naturalspeech2-pytorch |
|---|---|---|
| Mentions | 10 | 1 |
| Stars | 62 | 1,207 |
| Growth | - | - |
| Activity | 2.3 | 8.3 |
| Last commit | about 1 year ago | 8 months ago |
| Language | Python | Python |
| License | GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
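The exact formula behind the activity number is not given here, so the following is only a minimal sketch of the idea described above: recent commits count more than old ones, and the final number reflects a project's rank among all tracked projects. The exponential decay, the `half_life_days` parameter, and both function names are assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=90.0):
    """Sum one weight per commit; a commit's weight halves every
    `half_life_days` (hypothetical decay, chosen for illustration)."""
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_from_rank(score, all_scores):
    """Map a raw score to a 0-10 scale by the share of tracked projects
    that score strictly lower (assumed percentile mapping)."""
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)


if __name__ == "__main__":
    today = date(2024, 1, 1)
    commits = [date(2024, 1, 1), date(2023, 10, 3)]  # made-up commit dates
    print(weighted_commit_score(commits, today))
    print(activity_from_rank(9, list(range(10))))
```

Under this mapping, a project that outscores 90% of the tracked projects gets 9.0, which matches the "top 10%" reading of the example above.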
stable-karlo
- Super Easy AI Installer Tool (SEAIT) Update 0.1.0
- Unlimited-Size Diffusion Restoration
  This repo uses the SD2 upscaling model in a workflow on top of Karlo, and I've run that on my GPU; there is definitely nothing unusual about the memory requirements: https://github.com/kpthedev/stable-karlo
- [P] stable-karlo - Combining the Karlo diffusion model (based on OpenAI's unCLIP) with Stable-Diffusion v2 upscaling (Local UI + Colab notebook)
- [P] stable-karlo - A UI for Karlo (open-source model based on OpenAI's unCLIP) with Stable-Diffusion v2 upscaling. Google Colab available!
- Diverse examples generated with stable-karlo (link in comments!)
- stable-karlo - A UI for Karlo (open-source model based on OpenAI's unCLIP) with Stable-Diffusion v2 upscaling. Google Colab available!
- [P] Combining Kakaobrain's Karlo text-conditional diffusion model with Stable-Diffusion 2.1 (WebUI)
- I combined Karlo with the Stable Diffusion v2 Upscaler!
naturalspeech2-pytorch
What are some alternatives?
stable-diffusion-tensorflow-IntelMetal - Stable Diffusion in TensorFlow / Keras, Designed for Apple Metal on Intel. Forked from @divamgupta's work [Moved to: https://github.com/soten355/MetalDiffusion]
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
diffusiondb - A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
stable-diffusion-videos - Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
cappr - Completion After Prompt Probability. Make your LLM make a choice
stable-diffusion - Optimized Stable Diffusion able to generate 1088x1088 images on just 4GB GPUs
DiffSinger - DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
alias-free-gan - Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.
espnet - End-to-End Speech Processing Toolkit
dream-factory - Multi-threaded GUI manager for mass creation of AI-generated art with support for multiple GPUs.
ai-text-to-audio-latent-diffusion - text-to-audio-latent-diffusion