Our great sponsors
-
OpenMVG (open Multiple View Geometry)
open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
BundleFusion
[Siggraph 2017] BundleFusion: Real-time Globally Consistent 3D Reconstruction using Online Surface Re-integration
-
ACMMP
Multi-Scale Geometric Consistency Guided and Planar Prior Assisted Multi-View Stereo (TPAMI 2022)
-
deep-video-mvs
Code for "DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion" (CVPR 2021)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
I am looking for some pointers to papers/open-source software that can do fast and accurate depth estimation from multi-view videos. I am aware of colmap and OpenMVG software but was curious to know if there are others out there that outperform these. I am also interested in any state-of-the-art for depth estimation using unsupervised or self-supervised neural networks.
I am looking for some pointers to papers/open-source software that can do fast and accurate depth estimation from multi-view videos. I am aware of colmap and OpenMVG software but was curious to know if there are others out there that outperform these. I am also interested in any state-of-the-art for depth estimation using unsupervised or self-supervised neural networks.
Take a look at BundleFusion: Paper, Project Page, Source Code and SIFTGPU Source Code, used in BundleFusion source. It has a “server GPU” implementation
Take a look at BundleFusion: Paper, Project Page, Source Code and SIFTGPU Source Code, used in BundleFusion source. It has a “server GPU” implementation
That said, for classical 3D reconstruction from images, COLMAP is by far the best method in terms of usability - performances tradeoff. You can get something a bit better (such as https://github.com/GhiXu/ACMMP), but it is not very user-friendly.
If deep learning is an option, then you might want to check out http://zak.murez.com/atlas/, https://zju3dv.github.io/neuralrecon/, https://github.com/ardaduz/deep-video-mvs and the references therein. These methods can be better than classical ones, especially if overfitted on a specific type of scene.
I have been curious about NeRF and have tried out the software. I really like the idea, but I have found generating the NeRF representation to be too slow and does not do seem to do too well if the number of cameras in the rig are a bit sparse. I have seen that there have been some evolution and variations of the NeRF concept and I am looking into some of them currently.
Regarding Nerf being too slow, you might check out instant nerf from nvidia which has nerf models training in seconds to minutes.
ah, good call-out. Luxonis' DepthAI has solid resources for this here: https://github.com/luxonis/depthai/issues/173 DepthAI runs neural inference on stereo to produce a depth map (among doing other things too)