MiDaS
multi-subject-render
Our great sponsors
MiDaS | multi-subject-render | |
---|---|---|
27 | 18 | |
4,089 | 359 | |
4.1% | - | |
2.4 | 2.5 | |
3 months ago | about 1 year ago | |
Python | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
MiDaS
-
How to Estimate Depth from a Single Image
The checkpoint below uses MiDaS, which returns the inverse depth map, so we have to invert it back to get a comparable depth map.
-
Distance estimation from monocular vision using deep learning
Hi, I have made use of the KITTI dataset for this, and yes it depends on objects of know sizes. Here I have defined the following classes: Car, Van, Truck, Pedestrian, Person_sitting, Cyclist, Tram, Misc, or DontCare and the predictions are pretty accurate for those classes. Even if it's not the same class, it still recognizes the object since I have made use of the coco names dataset here and that is used along with YOLO for object detection. And there are several already implemented projects that make use of deep learning models trained on 2D datasets to predict 3D distance. This was one of my inspirations for this project: https://blogs.nvidia.com/blog/2019/06/19/drive-labs-distance-to-object-detection/ Furthermore, there are well-documented and researched papers like DistYOLO or MiDaS that makes use of deep learning for depth estimation
-
OMPR V0.6.10 update
-Added AI image depth generator Create your own depth map image at a click of a button. Using the awesome MIDAS3.1 https://github.com/isl-org/MiDaS as the backend and the model "dpt_beit_large_512" for the highest quality depth map. Video and GIF depth map generators coming out next together with the Depth movie player feature.
-
AI that converts a regular 2d image to stereoscopic
It uses MiDaS. That extension may be the most accessible way to use it at home. IDK.
-
Idea: training on magiceye images
Here's the project homepage https://github.com/isl-org/MiDaS
-
MiDaS v3_1 and DiscoDiffusion
The problem came up after MiDaS updated to version V3_1 on Dec 24th. Although the fix works fine, with the new version there are many changes, which for me produces slightly different results. I would like to able to produce results like before. I still clone the MiDaS repo, but then set it back to the last commit before the changes in december, which is 66882994a432727317267145dc3c2e47ec78c38a.
-
File not found error
try: from midas.dpt_depth import DPTDepthModel except: if not os.path.exists('MiDaS'): gitclone("https://github.com/isl-org/MiDaS.git") gitclone("https://github.com/bytedance/Next-ViT.git", f'{PROJECT_DIR}/externals/Next_ViT') if not os.path.exists('MiDaS/midas_utils.py'): shutil.move('MiDaS/utils.py', 'MiDaS/midas_utils.py') if not os.path.exists(f'{model_path}/dpt_large-midas-2f21e586.pt'): wget("https://github.com/intel-isl/DPT/releases/download/1_0/dpt_large-midas-2f21e586.pt", model_path) sys.path.append(f'{PROJECT_DIR}/MiDaS')
-
A quick demo to show how structurally coherent depth2img is compared to img2img using Automatic1111.
Cool. The repo for MiDaS is here. https://github.com/isl-org/MiDaS You can see that they partially trained the model on 3D movies Here's a list of the movies that were used to train it. I wonder if they'll be training a MiDaS v 4.0 as things have moved on quite a bit since it was released in Apr 2021?
-
Boosting Monocular Depth repo
We present a stand-alone implementation of our Merging Operator. This new repo allows using any pair of monocular depth estimations in our double estimation. This includes using separate networks for base and high-res estimations, using networks not supported by this repo (such as Midas-v3), or using manually edited depth maps for artistic use. This will also be useful for scientists developing CNN-based MDE as a way to quickly apply double estimation to their own network. For more details please take a look here.
-
DepthViewer is now live on Steam :)
I'll make the feature to export only the depthmap .png file. If you need the depthmap .png right now you can use the MiDaS python script.
multi-subject-render
-
Creating pictures of multiple people with distinct faces
You can use the multi subject renderer https://github.com/Extraltodeus/multi-subject-render.git
-
Can I use SD to generate group pictures (of say, me and my cousin, or me and multiple cousins)?
Get this Extension, and as always, please read the docs to avoid problems.
-
Find it hard to tune my prompt for more than 2 characters
There's also a script/extension https://github.com/Extraltodeus/multi-subject-render but it's fiddily to get work right, and i think the other workflow is faster.
- Textual Inversion: TI TLDR for the Lazy. How to Make Fake People: Simple TI Traning Using 6 Images and very low Settings. Bonus 1: How to Make Fake People that Look Like Anything you Want. Bonus 2: Why 1980s Nightcrawler dont care about your prompts. With Unedited Image Samples.
-
How to do Multiple chars in 1 image
There are some ideas to create multiple different subjects, such as this extension for automatic (https://github.com/Extraltodeus/multi-subject-render), or Area Composition if you are using ComfyUI (https://comfyanonymous.github.io/ComfyUI_examples/area_composition/).
- How to detail 2 objects, each with its own qualities in prompt?
- Ladies in sexy pajamas
- Uhhhhh
-
Tips for creating picture with multiple characters?
you can do it with https://github.com/Extraltodeus/multi-subject-render but i don't really know how to use it
-
What are you struggling to do?
There is an extension called multi-subject-render that allows you to provide one prompt for the background and a second prompt for the foreground.
What are some alternatives?
stable-diffusion-webui-depthmap-script - High Resolution Depth Maps for Stable Diffusion WebUI
DenseDepth - High Quality Monocular Depth Estimation via Transfer Learning
stable-diffusion-webui-distributed - Chains stable-diffusion-webui instances together to facilitate faster image generation.
stablediffusion - High-Resolution Image Synthesis with Latent Diffusion Models
depthmap2mask - Create masks out of depthmaps in img2img
deeplearning4j-examples - Deeplearning4j Examples (DL4J, DL4J Spark, DataVec) [Moved to: https://github.com/deeplearning4j/deeplearning4j-examples]
sdweb-merge-board - Multi-step automation merge tool. Extension/Script for Stable Diffusion UI by AUTOMATIC1111 https://github.com/AUTOMATIC1111/stable-diffusion-webui
DiverseDepth - The code and data of DiverseDepth
sd-webui-reactor-force - Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111, SD.Next, Cagliostro) with NVIDIA GPU Support
Insta-DM - Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency (AAAI 2021)
Lora-for-Diffusers - The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥