zero123
sd-webui-lobe-theme
| | zero123 | sd-webui-lobe-theme |
|---|---|---|
| Mentions | 6 | 77 |
| Stars | 2,503 | 2,198 |
| Growth | 2.9% | 6.5% |
| Activity | 6.9 | 9.3 |
| Last commit | 5 months ago | 5 days ago |
| Language | Python | TypeScript |
| License | MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
zero123
-
Stable Cascade
Someone with resources will have to train Zero123 [1] with this backbone.
[1] https://zero123.cs.columbia.edu/
-
Stable Zero123: Quality 3D Object Generation from Single Images
This looks like a fine-tune of the classic zero123 (https://github.com/cvlab-columbia/zero123). I’m excited to check out the quality improvements.
Though 3D model synthesis is one use case, I found the less advertised base reprojection model to be more useful for gamedev at the moment. You can generate a multiview spritesheet from an image, and it’s fast enough for synthesis during a gameplay session. I couldn’t get a good quality/time balance to do the same with the 3D models, and the lack of mesh rigging or animation combined with imperfections in a fully 3D model tends to break the suspension of disbelief compared to what players are used to. I’m sure this will change as the tech develops and we layer more AI on top (automatic animation synthesis is an active research area).
If you’re interested in this you might also want to check out deforum (https://github.com/deforum-art/deforum-stable-diffusion) which provides even more powerful camera controls on top of stable diffusion designed for full scenes rather than single objects.
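The multiview spritesheet idea mentioned above can be sketched as a layout plan: pick evenly spaced camera azimuths around the object and assign each rendered view a cell in a grid. This is only an illustration under stated assumptions. The names `SpriteCell`, `plan_spritesheet`, and `sheet_size` are hypothetical, the even 360-degree spacing is an assumption, and the call to the novel-view model itself is left out because the comment does not describe it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpriteCell:
    index: int          # frame number within the sheet
    azimuth_deg: float  # camera rotation around the object for this frame
    x: int              # left pixel offset of the cell in the sheet
    y: int              # top pixel offset of the cell in the sheet


def plan_spritesheet(n_views: int, columns: int, cell_size: int) -> list[SpriteCell]:
    """Space n_views azimuths evenly over 360 degrees and assign each a grid cell."""
    if n_views <= 0 or columns <= 0 or cell_size <= 0:
        raise ValueError("n_views, columns and cell_size must be positive")
    step = 360.0 / n_views
    cells = []
    for i in range(n_views):
        # Fill the grid row by row, left to right.
        row, col = divmod(i, columns)
        cells.append(SpriteCell(i, i * step, col * cell_size, row * cell_size))
    return cells


def sheet_size(n_views: int, columns: int, cell_size: int) -> tuple[int, int]:
    """Return (width, height) in pixels of the sheet that holds all cells."""
    rows = -(-n_views // columns)  # ceiling division
    return (min(columns, n_views) * cell_size, rows * cell_size)


if __name__ == "__main__":
    # Hypothetical example: 8 views, 4 per row, 256-pixel cells.
    for cell in plan_spritesheet(8, 4, 256):
        print(cell)
    print(sheet_size(8, 4, 256))
```

Each planned cell would then be filled by rendering the source image at that cell's azimuth with whatever novel-view model is in use and pasting the result at `(x, y)`.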
-
Text-to-image-to-3D on 16GB GPU after stable-dreamfusion repo update
As described in the stable-dreamfusion repo for image-to-3D using the zero123 model (you can read more about that in their repo here: https://github.com/cvlab-columbia/zero123), I used the 105000 checkpoint of zero123. It took about an hour to go through their initial NeRF generation and cleanup steps to get the model output.
-
NVIDIA presents GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models
Until then https://github.com/cvlab-columbia/zero123 was kinda okay, but practical results often left something to be desired, from the imprecision of the view angles to the at times fanciful re-imaginations of the source object.
-
Zero-1-to-3: Zero-shot One Image to 3D Object
For anyone else who tried to download the weights and got Google Drive throwing a quota error at you, they're working on it: https://github.com/cvlab-columbia/zero123/issues/2
sd-webui-lobe-theme
-
Upscayl – Free and Open Source AI Image Upscaler
Upscayl is very approachable, but lacked many features I needed. I ended up using https://github.com/AUTOMATIC1111/stable-diffusion-webui after upscaling became part of my regular workflow, but for someone who just needs a few images enhanced, it's an ideal tool.
-
The Basics of AI Image Generation: How to create your own AI-generated image using Stable Diffusion on your local machine.
For the Git alternative, simply right-click on the location you want to put the Stable Diffusion and select “Git Bash Here”, then paste this on the CLI: git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Stable Cascade
ComfyUI is similar to Houdini in complexity, but immensely powerful. It's a joy to use.
There are also a large number of resources available for it on YouTube, GitHub (https://github.com/comfyanonymous/ComfyUI_examples), reddit (https://old.reddit.com/r/comfyui), CivitAI, Comfy Workflows (https://comfyworkflows.com/), and OpenArt Flow (https://openart.ai/workflows/).
I still use AUTO1111 (https://github.com/AUTOMATIC1111/stable-diffusion-webui) and the recently released and heavily modified fork of AUTO1111 called Forge (https://github.com/lllyasviel/stable-diffusion-webui-forge).
-
Show HN: I made a local wrapper for Automatic 1111
Seems like an interesting project. Regarding the name, is there permission to use something so similar to AUTOMATIC1111 [1]?
> Diffusers will Cuda out of memory/perform very slowly for huge generations, like 2048x2048 images, while Auto 1111 SDK won't.
Do we have some numbers on this? I have seen AUTOMATIC1111 fall over whilst using only half of the available GPU VRAM; there seems to be some weirdness where it tries to allocate before de-allocating the last batch or something.
> You can use any of the 6 compatible RealEsrgran models/weights with our RealEsrgran pipeline for upscaling images. Here are the model ids:
I've previously had trouble trying to use AUTOMATIC1111 upscalers; it seems like it needs more GPU VRAM than just generating the image already upscaled.
[1] https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Stable Code 3B: Coding on the Edge
You might be thinking of Fooocus: https://github.com/lllyasviel/Fooocus
The Stable Diffusion web interface that got a lot of people's attention originally was Automatic1111: https://github.com/AUTOMATIC1111/stable-diffusion-webui
Fooocus is definitely more beginner-friendly. It does a lot of the prompt engineering for you. Automatic1111 has a ton of plugins, most notably ControlNet, which gives you fine-grained control over the images, but there is a learning curve.
-
Google Imagen 2
-
Free or "practically-free" Ai picture generator?
Stable Diffusion https://github.com/AUTOMATIC1111/stable-diffusion-webui
-
Things to do, to put my old PC to use?
Make it into a stable diffusion server!
-
GTA 6 trailer screencaps, photorealistic style
There's no link version, you have to run it locally. You install it from here
-
Automatic1111 v1.7.0-RC published
Repository: AUTOMATIC1111/stable-diffusion-webui · Tag: v1.7.0-RC · Commit: 48fae7c · Released by: AUTOMATIC1111
What are some alternatives?
stable-diffusion-webui-forge
stable-diffusion-webui - Stable Diffusion web UI
stable-dreamfusion - Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
ComfyUI - The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
StableCascade - Official Code for Stable Cascade
automatic - SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
ComfyUI-DiffusersStableCascade - Simple inference with StableCascade using diffusers in ComfyUI
stable-diffusion-webui-directml - Stable Diffusion web UI
genvs
stable-diffusion-webui-ux - Stable Diffusion web UI UX
deforum-stable-diffusion
stable-diffusion-webui-colab - stable diffusion webui colab