artroom-stable-diffusion
mindall-e
artroom-stable-diffusion | mindall-e | |
---|---|---|
8 | 8 | |
219 | 630 | |
- | -0.2% | |
0.0 | 0.0 | |
about 1 year ago | almost 2 years ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
artroom-stable-diffusion
-
Easy-to-use local install of Stable Diffusion released
Github Repo: https://github.com/artmamedov/artroom-stable-diffusion
-
Ran an image of the boys through a few different AI models. Here are a few of the better outcomes.
I use Artroom (Alternative download link), mostly because I'm incompetent and it's the easiest one to set up from all of the things I've found.
- Which is your favorite text to image model overall?
-
Anyone have an idea what the issue is here, attempting to run the optimized scripts but keep hitting the same error, no problem running the normal scripts. Thanks.
wild guess after looking at this.
- I’m buying a 12 GB card for this, how big can I expect to be able to go?
-
image2image throwing errors, unsure how to get it to run
I used a new project called Artroom that did everything for me. I already had the 1.4 model downloaded so I just needed to rename it to model.ckpt and put in the right directory. The creator just added experimental image2image support in the 0.3.0 release, but you can only get that on the authors discord channel at the moment.
-
Cats sitting at a table playing poker
Other pictures are from Pic 1's prompt but with varying parameters. Created using Artroom
-
Stable Diffusion One-Click Install Local GUI
You can get latest from: https://github.com/artmamedov/artroom-stable-diffusion/releases
mindall-e
-
Which is your favorite text to image model overall?
Runner-ups are Craiyon (for being more "creative" than SD), Disco Diffusion, minDALL-E, and CLIP Guided Diffusion.
-
minDALL-E on Conceptual Captions
minDALL-E at replicate.com. (Found here.)
GitHub: https://github.com/kakaobrain/minDALL-E Colab demo: https://colab.research.google.com/drive/1Gg7-c7LrUTNfQ-Fk-BVNCe9kvedZZsAh?usp=sharing
-
We got openAI's DALL-E
For those wondering, this is minDALL-E as u/DEATH_STAR_EXTRACTOR mentioned
-
[P] minDALL-E: PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs
Hello. I introduce an open source project, which released the checkpoint of the text-to-image generation model, DALL-E. Link: https://github.com/kakaobrain/minDALL-E
-
Release: 602M-parameter CLIP-conditioned diffusion model trained on Conceptual 12M (v-diffusion-pytorch)
See also the much chonkier minDALL-E: https://github.com/kakaobrain/minDALL-E Wonder which one is better? Diffusion models are pretty good with CLIP.
-
Kakao Brain releases 1.3 billion parameter text-to-image model minDALL-E. Details in a comment. Example: "a Christmas tree".
According to its GitHub repo, minDALL-E was trained on 14 million image+text pairs from the Conceptual Captions and Conceptual Captions 12M datasets.
What are some alternatives?
disco-diffusion
dalle-mini - DALL·E Mini - Generate images from a text prompt
stable-diffusion-webui - Stable Diffusion web UI
stable-diffusion-webui-docker - Easy Docker setup for Stable Diffusion with user-friendly UI
CLIP-Guided-Diffusion - Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.
VQGAN-CLIP - Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
InvokeAI - InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
stable-diffusion - A latent text-to-image diffusion model