Wuerstchen vs CrossAttentionControl
 | Wuerstchen | CrossAttentionControl
---|---|---
Mentions | 1 | 11
Stars | 486 | 1,237
Growth | - | -
Activity | 5.9 | 10.0
Last commit | about 1 month ago | over 1 year ago
Language | Jupyter Notebook | Jupyter Notebook
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Wuerstchen
- Like Diffusion but Faster: The Paella Model for Fast Image Generation
Fully correct. Also, v2 of the paper introduced a model that is bigger and slower but generates better images, so the 500ms figure applied only to the first model we introduced in v1. I also want to mention our new work, as it is closely related to this whole topic of "speeding up models" (either training or sampling): Würstchen: https://github.com/dome272/wuerstchen/
CrossAttentionControl
- "How can I do X?" for image generation.
- The Stable Horde now supports img2img as well as multiple models available at the same time. And we just added SD 1.5
- Is there any way to make Automatic1111 change an image into a different pose/style while keeping the subject of the image intact?
- Cross Attention Control with Stable Diffusion
- First round of results from the new Cross-Attention paper
Stable Diffusion implementation of Cross Attention Github page (Legend!): https://github.com/bloc97/CrossAttentionControl
- Prompt-to-Prompt Image Editing with Cross Attention Control
- Reproducing the method in 'Prompt-to-Prompt Image Editing with Cross Attention Control' with Stable Diffusion
- Prompt-to-Prompt Image Editing with Cross Attention Control in Stable Diffusion
What are some alternatives?
Gen-L-Video - The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
stable-diffusion-webui - Stable Diffusion web UI
Paella - Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
stable-diffusion
stable-diffusion-webui - Stable Diffusion web UI [Moved to: https://github.com/Sygil-Dev/sygil-webui]
Magic123 - [ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
nataili - Nataili is a Python library that provides tools for building multimodal AI applications. With its modular design, Nataili makes it easy to use only the tools you need to build custom AI solutions.
anima - Turn text into video using Stable Diffusion and Google FILM
MultiDiffusion - Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
diffusers - 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
upscayl - 🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.