stable-diffusion-videos
mixture-of-diffusers
stable-diffusion-videos | mixture-of-diffusers | |
---|---|---|
17 | 6 | |
4,234 | 384 | |
- | - | |
2.0 | 4.4 | |
about 1 year ago | 12 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stable-diffusion-videos
- How to create it?
-
Stable Diffusion Text-to-Video WebUI
Main Code: https://github.com/nateraw/stable-diffusion-videos/
-
Messing with the denoising loop can allow you to reach new places in latent space. Over 8+ different research papers/Auto1111 extension ideas in a single pipe. Load once and do lots of different things (SD 2.1 or 1.5)
So I've continued to experiment with how many papers I can fit into a single pipe and have them play nicely together. The images below were created by combining the panorama code from omerbt/MultiDiffusion with the ideas from albarji/mixture-of-diffusers. Also turns out nateraw/stable-diffusion-videos can be seen as a special case of a panorama (in latent space rather than prompt space).
-
Comparison of new UniPC sampler method added to Automatic1111
https://huggingface.co/spaces/tomg-group-umd/pez-dispenser https://huggingface.co/spaces/AIML-TUDA/safe-stable-diffusion https://huggingface.co/spaces/AIML-TUDA/semantic-diffusion https://github.com/nateraw/stable-diffusion-videos
-
Start Frame -> Stable Diffusion + Linear Interpolation -> End Frame
The goal is to make a (short) video out of a given first and last frame. It is similar to what this guy does (https://github.com/nateraw/stable-diffusion-videos (7sec example video half way down page)). But instead of starting and ending with a prompt, I want to start and end with 2 different frames.
-
Stable Diffusion Videos Easy-to-Use Playground & Competition This Week
Hey Y'all! We've been working on a tool that extends Nate Raw's Stable Diffusion Videos repo and makes it as easy as possible to use for artists and are having a competition this week to stress test the beta and see who can use it to make the most compelling short video (40 seconds max)
- Create videos with Stablediffusion. Saw this project and thought someone here might like it.
-
Tried to pull off an ultra smooth video where you don't realize the scenes are changing until after-the-fact so I could make an 8hr background video that won't give seizures
Of course! There might be a better process but mainly used: 1.) Nate Raw's repo for morphing between prompts https://github.com/nateraw/stable-diffusion-videos 2.) Google FILM interpolation to smooth out transitions https://github.com/google-research/frame-interpolation
-
[video] Packed underground rave in North Korea with dj ill kim headlining
There are directions in the readme and an example script.
-
Short interpolation animation between several frames?
This does exactly that - https://github.com/nateraw/stable-diffusion-videos
mixture-of-diffusers
-
ControlNet Tile can generate details for each person in the crowd
Lol. But I know what I started with! I may have linked this above but do check out the mixture of diffusers repo, the image at the top of the readme is what I first tried to reproduce. Here is the prompt from that page:
- Allure of the lake - Txt2Img & region prompter
-
Messing with the denoising loop can allow you to reach new places in latent space. Over 8+ different research papers/Auto1111 extension ideas in a single pipe. Load once and do lots of different things (SD 2.1 or 1.5)
So I've continued to experiment with how many papers I can fit into a single pipe and have them play nicely together. The images below were created by combining the panorama code from omerbt/MultiDiffusion with the ideas from albarji/mixture-of-diffusers. Also turns out nateraw/stable-diffusion-videos can be seen as a special case of a panorama (in latent space rather than prompt space).
-
[R] [N] "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" enables controllable image generation without any further training or finetuning of diffusion models.
No I'm talking about https://github.com/albarji/mixture-of-diffusers but probably not like the spatial controls of MultiDiffusion.
-
You to can create Panorama images 512x10240+ (not a typo) using less then 6GB VRAM (Vertorama works too). A modification of the MultiDiffusion code to pass the image through the VAE in slices then reassemble. Potato computers of the world rejoice.
can you combine this with mixture of diffusers somehow?
-
First version of Stable Diffusion was released on August 22, 2022
If we combine Mixture of Diffusers + MultiDiffusion+ Composer+ cross-domain-compositing and probably some more I'm not thinking of.
What are some alternatives?
sd-dynamic-prompts - A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
multidiffusion-upscaler-for-automatic1111 - Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
frame-interpolation - FILM: Frame Interpolation for Large Motion, In ECCV 2022.
sd-webui-regional-prompter - set prompt to divided region
dain-ncnn-vulkan - DAIN, Depth-Aware Video Frame Interpolation implemented with ncnn library
MultiDiffusion - Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
stable-diffusion-webui - Stable Diffusion web UI [Moved to: https://github.com/Sygil-Dev/sygil-webui]
composer - Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
stable-karlo - Upscaling Karlo text-to-image generation using Stable Diffusion v2.
sd-dynamic-thresholding - Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (StableSwarmUI, ComfyUI, and Auto WebUI)
stable-diffusion-tensorflow-IntelMetal - Stable Diffusion in TensorFlow / Keras, Designed for Apple Metal on Intel. Forked from @divamgupta's work [Moved to: https://github.com/soten355/MetalDiffusion]
Stable-Diffusion-Desktop-Client - Stable Diffusion Desktop client for Windows, macOS, and Linux built in Embarcadero Delphi.