stable-diffusion-webui-depthmap-script
stable-diffusion-webui-dataset-tag-editor
stable-diffusion-webui-depthmap-script | stable-diffusion-webui-dataset-tag-editor | |
---|---|---|
64 | 7 | |
1,594 | 628 | |
- | - | |
8.3 | 6.5 | |
2 months ago | 5 days ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stable-diffusion-webui-depthmap-script
-
PATCHFUSION is really impressive. High resolution depth maps in 16bit. I've been waiting for this. https://github.com/zhyever/PatchFusion
The guide on the github page for the extension is OK: https://github.com/thygate/stable-diffusion-webui-depthmap-script
-
Extension not showing. Depthmap help 🙏
New to SD. I'm trying to get an extension to work (https://github.com/thygate/stable-diffusion-webui-depthmap-script) but opposite to the tutorials the "depth" tab doesn't show after installation. Anyone who can help locate the problem? Thanks!
-
Is anyone working on stereoscopic 3D SD? Is it even possible?
You can use this extension to generate stereoscopic images . . . I don't (yet) dabble in video, so I don't know what it'll do there. I've done a ton of stereo pics with it. My fascination sort of comes and goes. You can do cross-eyed or parllel view as well as red/cyan anaglyphs.
-
GUIDE: Ways to generate consistent environments for comics, novels, etc
Option 8. Use img2img of existing 360 HDRIs, extract their depth maps with the depth extension. Use that as a displacement map on a sphere in Blender, similarly to this, with the refurbished HDRI as an image texture, then take screenshots from a position close to the center of the sphere. You are limited to staying close to the center in order to avoid distortion, but now you have 360 degrees of consistent freedom for a particular scene. If you have 2 or more HDRIs of the same place, even better. You could also combine this with the 3D environments of the other options to use 360 renders as bases for the img2img.
-
Another Ai image to 3d
you have another automatic 1111 extension that allow you to create there also the 3d file, but this consume a lot of vram https://github.com/thygate/stable-diffusion-webui-depthmap-script
-
Get a 16-Bit Controlnet Depth
If you're using A1111 webui there is the depthmap2mask extension which you can install from the extensions tab. It will add a depth tab which will allow you to create 16-bit depth maps among many other things.
-
180 VR - Blue Techno World - (Stable Diffusion + Deforum) stereo video
actually it is very easy to do. What you need is install extension for Stable Diffusion webUI (https://stable-diffusion-art.com/install-windows/) . This extension will generate stereo for you automatically. Name is Depth. (https://github.com/thygate/stable-diffusion-webui-depthmap-script)
- Is it possible for me to approximate a depth map from a generated image and make a 3D model?
- Thanks for loving our Star Wars video! We created a new one for Lord of the Rings. Enjoy this mid-journey to Middle-Earth.
-
Found this site through twitter that slightly animates images. Throwing Stable Diffusion generations into it is pretty awesome. Site in comments.
You can do this inside of a1111 as well with this extension https://github.com/thygate/stable-diffusion-webui-depthmap-script
stable-diffusion-webui-dataset-tag-editor
-
Using hydrus for managing tags of training data
There are few tools for mass tagging data. Each with their own problems. * stable-diffusion-webui-dataset-tag-editor has good features. But it also has bugs that make it nearly unusable. It is also resource heavy as it runs in the webUI with stable diffusion, and stable diffusion always has models loaded. * BooruDatasetTagManager lacks many useful features.
-
What program to use for mass editing tags for training images?
I tried stable-diffusion-webui-dataset-tag-editor but it has a bug where it would get confused and sometimes swap tags from one image to another ruining everything.
-
Experiment AI Anime w/ C-Net 1.1 + GroundingDINO + SAM + MFR (workflow)
Use WD 1.4 tagger (https://github.com/toriato/stable-diffusion-webui-wd14-tagger) to extract prompt words from each frame (threshold 0.65), then use the dataset tag editor (https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor) for batch editing, mainly:
-
Civitai should enforce a replicability check
If you haven't come across them yet, these two guides: this and this are good reads, and this one for info about learning rates. Beyond what those guides give info on, there are two points in which I noticed a large increase in my Lora quality- better captioning, and when I resized all the images to have about the same amount of pixels as was being trained. For captioning I have a text file with types of tags I know I'll have to hit- subject (solo, 1girl, 1boy, those early tags), what kind of perspective- portrait, closeup, full body, etc, where the character is looking (looking up, looking to the side, looking at viewer, etc), what the perspective of the viewer is (from above, from below, pov, etc), and I write down common clothing tags for the character. So I have that off to the side, and then I load up this extension for webui. It has a bit of learning curve, but I point it at what pictures I've gotten and get it to interrogate with all the models it offers except blip, and set the confidence threshold to 0.10 so it's spitting out lots of tags. After it interrogates all the pictures, I use the database feature to remove the duplicate tags, and then I save the database so it creates all the text files. Then I go to the "edit caption of selected image" select an image to caption from the left. At that point on the right the top box should be full of tags, and the bottom one should be empty. I look at my checklist from my textfile and start hitting all the areas I need to, which doesn't take long. Then I look up at the top box and read from left to right, top to bottom, one tag a time, and if it's a relevant tag, I type it in the bottom box.
-
embed txt tags
I have been using this: https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor to get tags on some random images (not for a dataset, just for ease of browsing personal photos and such) unfortunately, this exports as a txt file and doesnt know how to do xmp or tag embedding. does anyone know of a way to emb the exported txt file into the image keywords/categories/whatever it supports (based on format) or a quick way to convert it to an xmp sidecar file? not necessarily related to ai generation, but it is related to ai usage. hopefully someone knows the answer or can point me where to find it.
-
Automatic1111 extensions. What're your must-haves?
Dataset Tag Editor is perfect for editing large datasets and their caption files. It's been around for a couple months and I only found out about it the other day. I could have saved so much time manually editing hundreds of caption files....
-
Questions About Improving Embeddings/Hypernetwork Results
There is one extension I use however: https://github.com/toshiaki1729/stable-diffusion-webui-dataset-tag-editor
What are some alternatives?
MiDaS - Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
BooruDatasetTagManager
a1111-sd-zoe-depth - a1111 sd WebUI extention version of ZoeDepth
sd-webui-additional-networks
multi-subject-render - Generate multiple complex subjects all at once!
stable-diffusion-webui-wd14-tagger - Labeling extension for Automatic1111's Web UI
Thin-Plate-Spline-Motion-Model - [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
kohya-trainer - Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
depthmap2mask - Create masks out of depthmaps in img2img
stable-diffusion-webui - Stable Diffusion web UI
point-e - Point cloud diffusion for 3D model synthesis
sd-webui-image-sequence-toolkit - Extension for AUTOMATIC111's WebUI