generative-models vs stable-diffusion-webui

| | generative-models | stable-diffusion-webui |
|---|---|---|
| Mentions | 21 | 2,808 |
| Stars | 22,649 | 131,121 |
| Growth | 4.4% | - |
| Activity | 7.3 | 9.9 |
| Last commit | about 1 month ago | 7 days ago |
| Language | Python | Python |
| License | MIT License | MIT License |
- Stars: the number of stars a project has on GitHub.
- Growth: month-over-month growth in stars.
- Activity: a relative number indicating how actively a project is being developed; recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects we are tracking.
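The exact formula behind the Activity number isn't given here, but a toy sketch can make the idea of recency weighting concrete. Everything in it (the half-life, the function name) is a hypothetical illustration, not the site's actual computation:

```python
# Toy sketch of a recency-weighted activity score (hypothetical; the
# actual formula behind the Activity number is not published here).
def activity_score(commit_ages_days, half_life_days=30.0):
    # A commit from today contributes 1.0; one from `half_life_days` ago
    # contributes 0.5, and so on, so recent work dominates the score.
    return sum(2 ** (-age / half_life_days) for age in commit_ages_days)

print(activity_score([0, 3, 10, 45, 120]))  # recent burst -> higher score
```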
generative-models
- Creating Videos with Stable Video Diffusion
git clone https://github.com/Stability-AI/generative-models.git && cd generative-models
- Show HN: I have created a free text-to-image website that supports SDXL Turbo
- How To Increase Performance Time on MacOS
- Introducing Stable Video Diffusion: Stability AI's New AI Research Tool for Image-to-Video Synthesis
Generative Models by Stability AI GitHub Repository
- image-to-video tutorial
```python
# clone SD repo
!git clone https://github.com/Stability-AI/generative-models.git

# cd into working directory
# the % sets the pwd globally, as usually each command is run in a subshell in Google Colab
%cd /content/generative-models/

# installing dependencies
!pip install -r requirements/pt2.txt
!pip install .

# HACK
# I was getting ModuleNotFoundError: No module named 'scripts'
# This is what ChatGPT suggested (let me know if there is a better way)
file_path = '/content/generative-models/scripts/sampling/simple_video_sample.py'
new_text = "import sys\nsys.path.append('/content/generative-models')\n\n"
with open(file_path, 'r') as file:
    original_content = file.read()
updated_content = new_text + original_content
with open(file_path, 'w') as file:
    file.write(updated_content)

# Need to create a checkpoints/ folder - that is where the system looks for weights
import os
dir_name = 'checkpoints'
if not os.path.exists(dir_name):
    os.makedirs(dir_name)
    print(f"Directory '{dir_name}' created")
else:
    print(f"Directory '{dir_name}' already exists")

# Download weights into the checkpoints/ folder
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="stabilityai/stable-video-diffusion-img2vid",
                filename="svd.safetensors",
                local_dir="checkpoints",
                local_dir_use_symlinks=False)

# I can't remember if this step is needed, but it aims to reduce the memory
# footprint of PyTorch. I kept getting CUDA out of memory; I got these
# instructions from the out-of-memory error message.
os.environ['PYTORCH_CUDA_ALLOC_CONF'] = 'max_split_size_mb:512'
print(os.environ['PYTORCH_CUDA_ALLOC_CONF'])

# Inside scripts/sampling/simple_video_sample.py you need to make 2 updates:
# 1. input_path (line 26): update to the location of your file (I attached
#    Gdrive, so mine was "/content/drive/MyDrive/examples/car.jpeg")
# 2. decoding_t (line 34): update it to 5. You need to do this for memory
#    preservation (CUDA out of memory). I'm not sure if 5 is the best value,
#    but it worked for me.

# Finally, generate the video (output will be in the outputs/ folder)
!python scripts/sampling/simple_video_sample.py
```
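A lighter alternative to the sys.path patch above (an assumption on my part, not from the tutorial, but standard Python behavior) is to point PYTHONPATH at the repo root when invoking the script:

```python
# Instead of prepending sys.path in the file, expose the repo root via
# PYTHONPATH so the `scripts` package resolves without editing anything.
!PYTHONPATH=/content/generative-models python scripts/sampling/simple_video_sample.py
```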
- Stable Video Diffusion
It looks like the Hugging Face page links to their GitHub, which seems to have Python scripts to run these: https://github.com/Stability-AI/generative-models
- GitHub - Stability-AI/generative-models: Generative Models by Stability AI
- How does ComfyUI load SDXL 1.0 so VRAM-efficiently? How do I do the same in vanilla python code?
However, when using the example code from HuggingFace or setting up the Stability-AI/generative-models repo in a Jupyter notebook, I end up using 21 GB of VRAM just to run the default pipeline (with no base model output). If I try to run the extra `base.vae.decode(base_latents)` after generation to get unrefined outputs, I get a CUDA out-of-memory error as it blows past the 24 GB of my NVIDIA RTX 3090.
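For comparison, here is a minimal sketch of the standard diffusers memory-saving switches (fp16 weights, CPU offload, sliced VAE decoding) that typically bring SDXL well under 24 GB. The model ID is the public SDXL base checkpoint; the exact savings are setup-dependent and this is not presented as what ComfyUI does internally:

```python
# Minimal sketch: common diffusers options for cutting SDXL VRAM use.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,   # half-precision weights
    variant="fp16",
    use_safetensors=True,
)
pipe.enable_model_cpu_offload()  # keep only the active submodule on the GPU
pipe.enable_vae_slicing()        # decode latents in slices to cap peak VRAM

image = pipe("a photo of an astronaut riding a horse").images[0]
image.save("sdxl.png")
```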
- SDXL 1.0 is out!
- SDXL 0.9 Anyone having luck NOT centering subjects?
SDXL uses cropping information as part of the conditioning. Images were randomly cropped during training, and the coordinates of the crop were included as two integers at the end of the conditioning vector. If you're using ComfyUI, you can use the CLIPTextEncodeSDXL node to specify where the upper-left corner of the image should appear to be in relation to some hypothetical uncropped image. There is a figure with examples in the SDXL report.
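Outside ComfyUI, the same crop micro-conditioning is exposed in HuggingFace diffusers. A hedged sketch (parameter names are from the diffusers SDXL pipeline; the prompt and offsets are chosen for illustration):

```python
# Sketch: SDXL crop conditioning via diffusers. Pretending the image is a
# crop whose top-left corner sits at (256, 256) of a larger original tends
# to push the subject off-center.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

image = pipe(
    "a portrait of a red fox in the snow",
    original_size=(1024, 1024),
    crops_coords_top_left=(256, 256),  # (top, left) of the pretend crop
    target_size=(1024, 1024),
).images[0]
image.save("offcenter_fox.png")
```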
stable-diffusion-webui
- Show HN: I made an app to use local AI as daily driver
* LLaVA model: I'll add more documentation. You are right, LLaVA cannot generate images. For image generation I don't have immediate plans, but check out these projects for local image generation:
- https://diffusionbee.com/
- https://github.com/comfyanonymous/ComfyUI
- https://github.com/AUTOMATIC1111/stable-diffusion-webui
- AMD Funded a Drop-In CUDA Implementation Built on ROCm: It's Open-Source
I would love to be able to have a native stable diffusion experience; my RX 580 takes 30s to generate a single image. But it does work after following https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki...
I got this up and running on my windows machine in short order and I don't even know what stable diffusion is.
But again, it would be nice to have first class support to locally participate in the fun.
- Ask HN: What is the state of the art in AI photo enhancement?
In Auto1111, that just uses Image.blend. :)
https://github.com/AUTOMATIC1111/stable-diffusion-webui/blob...
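For reference, `Image.blend` is plain Pillow. A minimal sketch (the file names are placeholders, not from the linked code):

```python
# Minimal Pillow sketch of the alpha blend referenced above.
from PIL import Image

base = Image.open("original.png").convert("RGB")
enhanced = Image.open("enhanced.png").convert("RGB").resize(base.size)

# out = base * (1 - alpha) + enhanced * alpha
blended = Image.blend(base, enhanced, alpha=0.5)
blended.save("blended.png")
```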
- How To Increase Performance Time on MacOS
- Can anyone suggest an AI model that can help me enhance a poorly drawn logo?
I used SDXL in the automatic1111 webui for both images. Now that I think about it, the procedure I described was how I made this one, but the one that looks like an illustration was done in two steps: I used the canny ControlNet, as I said, for the outer part of the logo to preserve the shape of the fonts, but I had to turn it off for the boot to give SDXL leeway to add detail and make it look more like a boot.
- Seeking out an experienced and empathetic coding buddy.
That said, please do learn coding, and don't get discouraged when somebody says to learn PyTorch or recommends a Jupyter notebook with no further information on how to translate the skill into images. I would highly recommend some short-term goals. Get your feet wet by taking apart the UIs. The Comfy API documentation is here and the A1111 API documentation is here (there is a difference in completeness; welcome to programming), and a minimal example of calling the A1111 API appears below. Writing nodes or plugins is also a good way to jump into this world. Custom wildcard logic might be very attractive to you if you aren't the type that wants to deal with a nested file structure to simulate logic.
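As a concrete starting point, here is a minimal sketch of calling the A1111 webui REST API. It assumes the webui is running locally and was launched with the --api flag; payload fields beyond prompt and steps are omitted:

```python
# Minimal sketch: text-to-image via the A1111 webui API (--api flag).
import base64
import requests

payload = {"prompt": "a watercolor fox", "steps": 20}
resp = requests.post("http://127.0.0.1:7860/sdapi/v1/txt2img", json=payload)
resp.raise_for_status()

# Images come back base64-encoded in the JSON response.
with open("fox.png", "wb") as f:
    f.write(base64.b64decode(resp.json()["images"][0]))
```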
- can't get it working with an AMD gpu
- SD extension that allows for setting override
Possibly Unprompted? https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/8094
- Need to write an application to use Stable Diffusion on my desktop PC - which resource should I learn to use?
- 4090 Speed Decrease on each Generation/Iteration
version: v1.6.1 • python: 3.10.13 • torch: 2.0.1+cu118 • xformers: 0.0.20 • gradio: 3.41.2 • checkpoint: 6e8d4871f8
What are some alternatives?
background-removal-js - Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.
stable-diffusion-ui - Easiest 1-click way to install and use Stable Diffusion on your computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image. [Moved to: https://github.com/easydiffusion/easydiffusion]
wizmap - Explore and interpret large embeddings in your browser with interactive visualization! 📍
ComfyUI - The most powerful and modular stable diffusion GUI, API, and backend with a graph/nodes interface.
evernote-ai-chatbot
SHARK - High Performance Machine Learning Distribution
gping - Ping, but with a graph
lora - Using Low-rank adaptation to quickly fine-tune diffusion models.
graphic-walker - An open source alternative to Tableau. Embeddable visual analytics.
InvokeAI - InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
xgen - Salesforce open-source LLMs with 8k sequence length.
safetensors - Simple, safe way to store and distribute tensors