☆ 4.3 Article

A survey on conventional and learning-based methods for multi-view stereo

PHOTOGRAMMETRIC RECORD (2023)

Journal

PHOTOGRAMMETRIC RECORD

Volume -, Issue -, Pages -

Publisher

WILEY

DOI: 10.1111/phor.12456

Keywords

deep learning; dense reconstruction; depth estimation; MVS; PatchMatch; stereomatching

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

3D reconstruction of scenes using multiple images has been extensively studied in recent years. Multi-view stereo algorithms aim to generate a dense 3D model of the scene, but achieving complete, accurate, and aesthetically pleasing representations remains a challenge. This work provides a survey on the most widely used multi-view stereo methods, discussing the underlying concepts and challenges, with a focus on close-range 3D reconstruction applications.

3D reconstruction of scenes using multiple images, relying on robust correspondence search and depth estimation, has been thoroughly studied for the two-view and multi-view scenarios in recent years. Multi-view stereo (MVS) algorithms aim to generate a rich, dense 3D model of the scene in the form of a dense point cloud or a triangulated mesh. In a typical MVS pipeline, the robust estimations for the camera poses along with the sparse points obtained from structure from motion (SfM) are used as input. During this process, the depth of generally every pixel of the scene is to be calculated. Several methods, either conventional or, more recently, learning-based have been developed for solving the correspondence search problem. A vast amount of research exists in the literature using local, global or semi-global stereomatching approaches, with the PatchMatch algorithm being among the most popular and efficient conventional ones in the last decade. Yet, and despite the widespread evolution of the algorithms, yielding complete, accurate and aesthetically pleasing 3D representations of a scene remains an open issue in real-world and large-scale photogrammetric applications. This work aims to provide a concrete survey on the most widely used MVS methods, investigating underlying concepts and challenges. To this end, the theoretical background and relative literature are discussed for both conventional and learning-based approaches, with a particular focus on close-range 3D reconstruction applications.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3

Not enough ratings

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Aleatoric uncertainty estimation for dense stereo matching via CNN-based cost volume analysis

Max Mehltretter, Christian Heipke

Summary: This study combines the advantages of deep learning and cost volume-based features to propose a new Convolutional Neural Network (CNN) architecture for learning features from volumetric 3D data for uncertainty estimation. Three different uncertainty models are discussed and applied to train the CNN, showing the generality and state-of-the-art accuracy of the proposed method in extensive evaluations on three datasets using three common dense stereo matching methods.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2021)