web-stable-diffusion
ControlNet
web-stable-diffusion | ControlNet | |
---|---|---|
21 | 127 | |
3,455 | 27,964 | |
1.6% | - | |
4.4 | 4.1 | |
about 2 months ago | 3 months ago | |
Jupyter Notebook | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
web-stable-diffusion
-
GPU-Accelerated LLM on a $100 Orange Pi
Yup, here's their web stable diffusion repo: https://github.com/mlc-ai/web-stable-diffusion
The input is a model (weights + runtime lib) compiled via the mlc-llm project: https://mlc.ai/mlc-llm/docs/compilation/compile_models.html
-
StableDiffusion can now run directly in the browser on WebGPU
The MLC team got that working back in March: https://github.com/mlc-ai/web-stable-diffusion
Even more impressively, they followed up with support for several Large Language Models: https://webllm.mlc.ai/
- Web StableDiffusion
-
[Stable Diffusion] Diffusion stable Web: exécution de diffusion stable directement dans le navigateur sans serveur GPU
[https://github.com/mlc-ai/web-stable-diffusion
-
Now that they started banning stable diffusion on google colab, what's the cheapest and the best way to deploy stable diffusion?
You can run it directly in the browser with WebGPU, https://mlc.ai/web-stable-diffusion/
-
I've got Stable Diffusion integrated into my site now, fully client side with no setup or servers.
Using the amazing work of https://mlc.ai/web-stable-diffusion/ I've got the code moved into a Web Worker and running fully local client side. It does require 2GB's of model files be downloaded (automatically), and takes a few minutes for the first load, but it works and once it's going it only takes 20s to make a 512x512 image.
-
Chrome Ships WebGPU
The Apache TVM machine learning compiler has a WASM and WebGPU backend, and can import from most DNN frameworks. Here's a project running Stable Diffusion with webgpu and TVM [1].
Questions exist around post-and-pre-processing code in folks' Python stacks, with e.g. NumPy and opencv. There's some NumPy to JS transpilers out there, but those aren't feature complete or fully integrated.
[1] https://github.com/mlc-ai/web-stable-diffusion
- Bringing stable diffusion models to web browsers
- mlc-ai/web-stable-diffusion: Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
- Web Stable Diffusion: Running Diffusion Models with WebGPU
ControlNet
-
With the recent developments, It looks like AI art is finally beginning to evolve in the right direction
It`s all possible. Have a look into Automatic1111`s Web UI, ControlNet, OpenPose and, if you don`t have a dedicated GPU with at least 8GB of VRAM, or at least 16GB of RAM to use the CPU, you can also use Stable Horde to use the webUI with a peer-to-peer connection, where you`ll only use a fraction of your resources, but you`ll be able to use local AI models with all the bells and whistles that you won`t get from "state-of-the-art" paid services.
-
AI "Artists" Are Lazy, and the Ultimate Goal of AI Image Generation (hint: its sloth)
Next up is ControlNet. Controlnet, as Illyasviel--creator of controlnet--describes it, "let's us control diffusion models!." ControlNet is a neural network structure to control diffusion models by adding extra connections. [8]. There is more to that than what I described, but the big take-away is that ControlNet takes a preprocessed image that you provide (or is generated) and uses that as a way of constraining the output the sampler's noise generates, allowing you to have a bit more control of the output. ControlNet is typically used for character or scene "artwork", which previously would have been a challenge with just prompting alone (at least with this current architecture).
- Making a ControlNet inpaint for sdxl
-
[P5V6P2] Mother and Daughter (by azfumi)
For your first part of the comment, I can simply refer you to technologies like ControlNet, LoRA and prompt embedding: https://github.com/lllyasviel/ControlNet https://github.com/microsoft/LoRA
- Calling yourself an AI artist is almost exactly the same as calling yourself a cook for heating readymade meals in a microwave
-
Why is the AI not listening to my prompts?
Here you can see what every controlnet preprocessor and model do, to give you an idea of how to use
-
Can't get img2img working well
Ya, it takes awhile to really start getting comfortable with the wonkiness. If you are trying to do something specific, look for a LoRA, but in general I'd recommend you get controlnet so you can feed it a reference image. Another simple trick is to edit the image a bit in GIMP or a photo editor to get the color scheme you like and then feed it back to img2img at low denoising (0.1-0.2) to refine it. You can also add just garishly bad cartoon drawing or photoshop in assets and img2img will usually make something of them and blend them into your image, I find this easier than using img2img scribble.
- ControlNet on A1111 seems to have been broken in the new update
-
Can anyone help me install SD and ControlNet on my Mac pro M1?
If there are no errors, go to the "Extensions" tab, then "Install from URL". There, enter "https://github.com/lllyasviel/ControlNet" then click "Install".
-
According to the poll on the recent thread, /r/dalle2 community decided to keep the subreddit restricted on Reddit.
This is a good place to start reading. Given the open-source nature of SD, there are setups of various difficulty available. A1111 is the "standard" people enjoy because it's easy to plug in new stuff (ControlNet, new models, etc.), but it's not inherently easy to set up and get going. There is an installer for it, but I haven't tried it.
What are some alternatives?
stable-diffusion-webui-directml - Stable Diffusion web UI
InvokeAI - InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
rust-bert - Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
lora - Using Low-rank adaptation to quickly fine-tune diffusion models.
SHA256-WebGPU - Implementation of sha256 in WGSL
LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
wgpu-py - Next generation GPU API for Python
sd-webui-controlnet - WebUI extension for ControlNet
onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
stable-diffusion-webui-prompt-travel - Travel between prompts in the latent space to make pseudo-animation, extension script for AUTOMATIC1111/stable-diffusion-webui.
js-promise-integration - JavaScript Promise Integration
stable-diffusion-webui - Stable Diffusion web UI