deforum-stable-diffusion
zero123
deforum-stable-diffusion | zero123 | |
---|---|---|
14 | 6 | |
2,131 | 2,489 | |
1.2% | 2.4% | |
7.4 | 6.9 | |
about 2 months ago | 5 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
deforum-stable-diffusion
-
Stable Zero123: Quality 3D Object Generation from Single Images
This looks a fine-tune of the classic zero123 (https://github.com/cvlab-columbia/zero123) I’m excited to check out the quality improvements.
Though 3d model synthesis is one use case, I found the less advertised base reprojection model to be more useful for gamedev at the moment. You can generate a multiview spritesheet from an image, and it’s fast enough for synthesis during a gameplay session. I couldn’t get a good quality/time balance to do the same with the 3d models, and the lack of mesh rigging or animation combined with imperfections in a fully 3d model tends to break the suspension of disbelief compared to what players are used to. I’m this will change as the tech develops and we layer more AI on top (automatic animation synthesis is an active research area).
If you’re interested in this you might also want to check out deforum (https://github.com/deforum-art/deforum-stable-diffusion) which provides even more powerful camera controls on top of stable diffusion designed for full scenes rather than single objects.
-
Smooth Style and Concept Vid2Vid Conversion - The Inner-Reflections Method - Workflow and Tutorial
2/Deforum – Open source and good at developing trippy videos – I have not used this too much (https://github.com/deforum-art/deforum-stable-diffusion)
-
Stability Announces Stable Animation
Why is StabilityAI releasing what seems to be a clone of Deforum ?
- Astral Projection...!
-
How would someone create this AI art video?
might be deforum (project) (webui extension) or img2img loopback wave (webui extension). it's definitely something with img2img though. that's clear from how it's warping.
- Amsterdam trip) Smoking stable diffusion and drinking deforum)
-
First time output video by StableDiffusion
Sorry, I missed it, thanks to Deforum: https://github.com/deforum-art/deforum-stable-diffusion
- Walking in a winter dream
-
Music video remix concept ideas / help needed
Have you seen Deforum? Kinda sounds like you're about to reinvent it. :)
-
Deforum Stable Diffusion Local
I'm talking about it https://github.com/deforum-art/deforum-stable-diffusion It is not possible to put in a separate environment.
zero123
-
Stable Cascade
Someone with resources will have to train Zero123 [1] with this backbone.
[1] https://zero123.cs.columbia.edu/
-
Stable Zero123: Quality 3D Object Generation from Single Images
This looks a fine-tune of the classic zero123 (https://github.com/cvlab-columbia/zero123) I’m excited to check out the quality improvements.
Though 3d model synthesis is one use case, I found the less advertised base reprojection model to be more useful for gamedev at the moment. You can generate a multiview spritesheet from an image, and it’s fast enough for synthesis during a gameplay session. I couldn’t get a good quality/time balance to do the same with the 3d models, and the lack of mesh rigging or animation combined with imperfections in a fully 3d model tends to break the suspension of disbelief compared to what players are used to. I’m this will change as the tech develops and we layer more AI on top (automatic animation synthesis is an active research area).
If you’re interested in this you might also want to check out deforum (https://github.com/deforum-art/deforum-stable-diffusion) which provides even more powerful camera controls on top of stable diffusion designed for full scenes rather than single objects.
-
Text-to-image-to-3D on 16GB GPU after stable-dreamfusion repo update
As described in the stable-dreamfusion repo for the image to 3D using the zero123 model (you can read more about that in their repo here: https://github.com/cvlab-columbia/zero123) I used the 105000 checkpoint of zero123. It took about an hour to go through their initial NeRF generation and cleanup steps to get the model output.
-
NVIDIA presents GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models
Until then https://github.com/cvlab-columbia/zero123 was kinda okay, but practical results often left to be desired, from the imprecision of the view angles to the at times fanciful re-imaginations of the source object.
-
Zero-1-to-3: Zero-shot One Image to 3D Object
For anyone else who tried to download the weights and got Google Drive throwing a quota error at you, they're working on it: https://github.com/cvlab-columbia/zero123/issues/2
What are some alternatives?
stable-diffusion-nvidia-docker - GPU-ready Dockerfile to run Stability.AI stable-diffusion model v2 with a simple web interface. Includes multi-GPUs support.
stable-diffusion-webui-forge
automatic - SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
stable-dreamfusion - Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
diffusion-models-class - Materials for the Hugging Face Diffusion Models Course
StableCascade - Official Code for Stable Cascade
deforum-for-automatic1111-webui - Deforum extension script for AUTOMATIC1111's Stable Diffusion webui [Moved to: https://github.com/deforum-art/sd-webui-deforum]
ComfyUI-DiffusersStableCascade - Simple inference with StableCascade using diffusers in ComfyUI
Stable-diffusion-webui-video-multiprompt - I added multiprompt functionality to this script, so that one can input a path to a file containing the "screenplay".
genvs
deforumed-walk - Take a walk in the generated world.
Fooocus - Focus on prompting and generating