DemoFusion
sd_lite
DemoFusion | sd_lite | |
---|---|---|
7 | 15 | |
1,876 | 18 | |
1.9% | - | |
8.6 | 4.5 | |
28 days ago | about 1 year ago | |
Jupyter Notebook | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DemoFusion
- List of Stable Diffusion research softwares that I don't think gotten widespread adoption.
- DemoFusion: Democratising High-Resolution Image Generation With No 💰
- DemoFusion - a new upscaling technique
-
💰DemoFusion: High-resolution generation using only a SDXL model and a RTX 3090 GPU!
For more comparison examples, please refer to our project page: https://ruoyidu.github.io/demofusion/demofusion.html.
- [CODE RELEASE!] DemoFusion: Democratising High-Resolution Image Generation With No 💰
sd_lite
- List of Stable Diffusion research softwares that I don't think gotten widespread adoption.
- Comparing 5 recent SD distillation methods SSD/LCM/Turbo to find the best option for low-VRAM users (images and statistical analysis included). SD-Turbo scores significantly higher on aesthetics, the boost to SD-21 is remarkable
-
Latent Jitter: a simple method for generating variations on a prompt to composite into a final image. Stacks well with prompt delay and The Stable Artist to give you 4+ options from a single seed/prompt.
The full details of how to do this are available on Github: latent jitter · thekitchenscientist/sd_lite but I will explain the idea briefly here. I have read this could be done with perlin or simplex noise but the code was too complex for my taste. This gets the job done with only minor modifications to the standard pipe.
-
"SEGA: Instructing Diffusion using Semantic Dimensions": Paper + GitHub repo + web app + Colab notebook for generating images that are variations of a base image generation by specifying secondary text prompt(s). In this example, the secondary text prompt was "smiling". See comment for details.
I did successfully swap the effiel tower for the burj Khalifa but that required additional steps https://github.com/thekitchenscientist/sd_lite/wiki/latent-jitter
-
Are there any sure-fire 100% SFW models for Stable Diffusion? Project for kids
I use it in my pipe as a general image beautifier. https://github.com/thekitchenscientist/sd_lite/wiki/safe-latent-diffusion
-
Messing with the denoising loop can allow you to reach new places in latent space. Over 8+ different research papers/Auto1111 extension ideas in a single pipe. Load once and do lots of different things (SD 2.1 or 1.5)
The pipe is available at sd_lite/pipeline_stable_diffusion_multi.py (github.com) it combines:
-
Comparison of new UniPC sampler method added to Automatic1111
This community has published many XY plots of CFG versus steps. https://github.com/thekitchenscientist/sd_lite/wiki/recommended The consistent theme is low CFG, lower steps; high CFG, more steps. UniPC can reach convergence in as few as 8 steps, so I increased by 1/3 to account for more complex prompts needing longer
-
Create Panorama images of ANY size using less then 6GB VRAM, also x6-10 speed-up and added support for batch mode! A modification of MultiDiffusion. Potato computers of the world rejoice. SD2.0 768 model gives fastest creation of larger sizes but the VAE image slicing means no VRAM spike.
the pipeline is available from github.com and is called in the usual way. The Technique requires the DDIM scheduler.
-
Img2Img as a side-scrolling enhancer - more pictures in the comments
https://github.com/thekitchenscientist/sd_lite is where the code is. Version 1 of the multi-pipe is limited to images 512 high or wide but any number on the other dimension
-
You to can create Panorama images 512x10240+ (not a typo) using less then 6GB VRAM (Vertorama works too). A modification of the MultiDiffusion code to pass the image through the VAE in slices then reassemble. Potato computers of the world rejoice.
Not to be deterred I hacked together some code to blend it all back together after the VAE but before the final colour balance. The pipe code is available on github. thekitchenscientist/sd_lite
What are some alternatives?
Dreambooth-Stable-Diffusion - Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
sd-dynamic-prompts - A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
ComfyUI_experiments - Some experimental custom nodes.
sd-dynamic-thresholding - Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (StableSwarmUI, ComfyUI, and Auto WebUI)
MotionDirector - MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
safe-latent-diffusion - Official Implementation of Safe Latent Diffusion for Text2Image
sliders - Concept Sliders for Precise Control of Diffusion Models
ziplora-pytorch - Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
stable-diffusion-reference-only - img2img version of stable diffusion. Anime Character Remix. Line Art Automatic Coloring. Style Transfer.
erasing - Erasing Concepts from Diffusion Models
Specialist-Diffusion - [CVPR 2023] Specialist Diffusion: Extremely Low-Shot Fine-Tuning of Large Diffusion Models
Concurrent-gif2gif - Experimental Automatic1111 Stable Diffusion WebUI extension, concurrent frame rendering