stable-diffusion-videos vs MultiDiffusion

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts (by nateraw)

MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023) (by omerbt)

diffusion-models generative-model image-generation stable-diffusion text-to-image multidiffusion icml

Source Code

multidiffusion.github.io

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

stable-diffusion-videos		MultiDiffusion
	Project
17	Mentions	13
4,234	Stars	920
-	Growth	-
2.0	Activity	4.8
about 1 year ago	Latest Commit	8 months ago
Python	Language	Jupyter Notebook
Apache License 2.0	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

stable-diffusion-videos

Posts with mentions or reviews of stable-diffusion-videos. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-02.

How to create it?
1 project | /r/ChatGPT | 1 Jun 2023
Stable Diffusion Text-to-Video WebUI
3 projects | /r/StableDiffusion | 2 May 2023

Main Code: https://github.com/nateraw/stable-diffusion-videos/
Messing with the denoising loop can allow you to reach new places in latent space. Over 8+ different research papers/Auto1111 extension ideas in a single pipe. Load once and do lots of different things (SD 2.1 or 1.5)
7 projects | /r/StableDiffusion | 15 Mar 2023

So I've continued to experiment with how many papers I can fit into a single pipe and have them play nicely together. The images below were created by combining the panorama code from omerbt/MultiDiffusion with the ideas from albarji/mixture-of-diffusers. Also turns out nateraw/stable-diffusion-videos can be seen as a special case of a panorama (in latent space rather than prompt space).
Comparison of new UniPC sampler method added to Automatic1111
9 projects | /r/StableDiffusion | 11 Mar 2023

https://huggingface.co/spaces/tomg-group-umd/pez-dispenser https://huggingface.co/spaces/AIML-TUDA/safe-stable-diffusion https://huggingface.co/spaces/AIML-TUDA/semantic-diffusion https://github.com/nateraw/stable-diffusion-videos
Start Frame -> Stable Diffusion + Linear Interpolation -> End Frame
1 project | /r/StableDiffusion | 4 Jan 2023

The goal is to make a (short) video out of a given first and last frame. It is similar to what this guy does (https://github.com/nateraw/stable-diffusion-videos (7sec example video half way down page)). But instead of starting and ending with a prompt, I want to start and end with 2 different frames.
Stable Diffusion Videos Easy-to-Use Playground & Competition This Week
1 project | /r/StableDiffusion | 19 Dec 2022

Hey Y'all! We've been working on a tool that extends Nate Raw's Stable Diffusion Videos repo and makes it as easy as possible to use for artists and are having a competition this week to stress test the beta and see who can use it to make the most compelling short video (40 seconds max)
Create videos with Stablediffusion. Saw this project and thought someone here might like it.
1 project | /r/StableDiffusion | 17 Nov 2022
Tried to pull off an ultra smooth video where you don't realize the scenes are changing until after-the-fact so I could make an 8hr background video that won't give seizures
2 projects | /r/StableDiffusion | 8 Nov 2022

Of course! There might be a better process but mainly used: 1.) Nate Raw's repo for morphing between prompts https://github.com/nateraw/stable-diffusion-videos 2.) Google FILM interpolation to smooth out transitions https://github.com/google-research/frame-interpolation
[video] Packed underground rave in North Korea with dj ill kim headlining
1 project | /r/weirddalle | 27 Oct 2022

There are directions in the readme and an example script.
Short interpolation animation between several frames?
3 projects | /r/StableDiffusion | 27 Oct 2022

This does exactly that - https://github.com/nateraw/stable-diffusion-videos

MultiDiffusion

Posts with mentions or reviews of MultiDiffusion. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-08-15.

Opendream: A Non-Destructive UI for Stable Diffusion
4 projects | news.ycombinator.com | 15 Aug 2023

For composing this approach works pretty well
https://multidiffusion.github.io/
Messing with the denoising loop can allow you to reach new places in latent space. Over 8+ different research papers/Auto1111 extension ideas in a single pipe. Load once and do lots of different things (SD 2.1 or 1.5)
7 projects | /r/StableDiffusion | 15 Mar 2023

So I've continued to experiment with how many papers I can fit into a single pipe and have them play nicely together. The images below were created by combining the panorama code from omerbt/MultiDiffusion with the ideas from albarji/mixture-of-diffusers. Also turns out nateraw/stable-diffusion-videos can be seen as a special case of a panorama (in latent space rather than prompt space).
MultiDiffusion Region Control, a prompt on each mask webui extension is out.
3 projects | /r/StableDiffusion | 3 Mar 2023
Hubble Diffusion with MultiDiffusion
1 project | /r/StableDiffusion | 28 Feb 2023

Essentially, I fine-tuned Stable Diffusion 2.1 base (the 512x512) model on the ESA Hubble Deep Space Images & Captions dataset I collected from public Hubble images & captions. After around 33,000 training steps, I saved the model and was really impressed by the results. But I really wanted to be able to generate wallpaper-level quality space images, so I stumbled upon MultiDiffusion: a new project for generating massive panorama images using stable diffusion models. I then used hubble-diffusion-2 along with MultiDiffusion to generate each one of these amazing 2560x1536 images. Each image took a little over an hour to generate on a Google Colab T4 GPU. I used the following prompts for each of these images:
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
1 project | /r/StableDiffusion | 27 Feb 2023
What is the maximum size a 3090 24gb can produce?
1 project | /r/StableDiffusion | 26 Feb 2023

If you need generated and not upscaled 4k for some reason, try something like https://github.com/omerbt/MultiDiffusion
[R] [N] "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" enables controllable image generation without any further training or finetuning of diffusion models.
2 projects | /r/MachineLearning | 24 Feb 2023

Project: https://multidiffusion.github.io/ Paper: https://arxiv.org/abs/2302.08113 GitHub: https://github.com/omerbt/MultiDiffusion
Meet MultiDiffusion: A Unified AI Framework That Enables Versatile And Controllable Image Generation Using A Pre-Trained Text-to-Image Diffusion Model
2 projects | /r/machinelearningnews | 24 Feb 2023

Quick Read: https://www.marktechpost.com/2023/02/24/meet-multidiffusion-a-unified-ai-framework-that-enables-versatile-and-controllable-image-generation-using-a-pre-trained-text-to-image-diffusion-model/ Paper: https://arxiv.org/abs/2302.08113 Github: https://github.com/omerbt/MultiDiffusion Project: https://multidiffusion.github.io/
You to can create Panorama images 512x10240+ (not a typo) using less then 6GB VRAM (Vertorama works too). A modification of the MultiDiffusion code to pass the image through the VAE in slices then reassemble. Potato computers of the world rejoice.
3 projects | /r/StableDiffusion | 23 Feb 2023

So I haven't made many images with Stable Diffusion despite using it heavily. The reason is I've been messing with the internals of the diffusion pipe, to interfere with the diffusion process in different ways. Todays fun result is based on omerbt/MultiDiffusion for making panoramas.
First version of Stable Diffusion was released on August 22, 2022
4 projects | /r/StableDiffusion | 23 Feb 2023

If we combine Mixture of Diffusers + MultiDiffusion+ Composer+ cross-domain-compositing and probably some more I'm not thinking of.

What are some alternatives?

When comparing stable-diffusion-videos and MultiDiffusion you can also consider the following projects:

sd-dynamic-prompts - A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation

stable-diffusion-webui-two-shot - Latent Couple extension (two shot diffusion port)

frame-interpolation - FILM: Frame Interpolation for Large Motion, In ECCV 2022.

sd-webui-controlnet - WebUI extension for ControlNet

dain-ncnn-vulkan - DAIN, Depth-Aware Video Frame Interpolation implemented with ncnn library

mixture-of-diffusers - Mixture of Diffusers for scene composition and high resolution image generation

stable-diffusion-webui - Stable Diffusion web UI [Moved to: https://github.com/Sygil-Dev/sygil-webui]

Diffusion-Models-Papers-Survey-Taxonomy - Diffusion model papers, survey, and taxonomy

stable-karlo - Upscaling Karlo text-to-image generation using Stable Diffusion v2.

openpose-editor - Openpose Editor for AUTOMATIC1111's stable-diffusion-webui

stable-diffusion-tensorflow-IntelMetal - Stable Diffusion in TensorFlow / Keras, Designed for Apple Metal on Intel. Forked from @divamgupta's work [Moved to: https://github.com/soten355/MetalDiffusion]

stable-diffusion-webui-sonar - Wrapped k-diffuison samplers with tricks to improve the generated image quality (maybe?), extension script for AUTOMATIC1111/stable-diffusion-webui

stable-diffusion-videos vs sd-dynamic-prompts MultiDiffusion vs stable-diffusion-webui-two-shot stable-diffusion-videos vs frame-interpolation MultiDiffusion vs sd-webui-controlnet stable-diffusion-videos vs dain-ncnn-vulkan MultiDiffusion vs mixture-of-diffusers stable-diffusion-videos vs stable-diffusion-webui MultiDiffusion vs Diffusion-Models-Papers-Survey-Taxonomy stable-diffusion-videos vs stable-karlo MultiDiffusion vs openpose-editor stable-diffusion-videos vs stable-diffusion-tensorflow-IntelMetal MultiDiffusion vs stable-diffusion-webui-sonar

Compare stable-diffusion-videos vs MultiDiffusion and see what are their differences.

stable-diffusion-videos

MultiDiffusion

stable-diffusion-videos

MultiDiffusion

What are some alternatives?