4.6 Article

Detail Preserved Surface Reconstruction from Point Cloud

Journal

SENSORS
Volume 19, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/s19061278

Keywords

computer vision; 3D reconstruction; point cloud

Funding

  1. National Natural Science Foundation of China [61632003, 61873265, 61573351]

Ask authors/readers for more resources

In this paper, we put forward a new method for surface reconstruction from image-based point clouds. In particular, we introduce a new visibility model for each line of sight to preserve scene details without decreasing the noise filtering ability. To make the proposed method suitable for point clouds with heavy noise, we introduce a new likelihood energy term to the total energy of the binary labeling problem of Delaunay tetrahedra, and we give its s-t graph implementation. Besides, we further improve the performance of the proposed method with the dense visibility technique, which helps to keep the object edge sharp. The experimental result shows that the proposed method rivalled the state-of-the-art methods in terms of accuracy and completeness, and performed better with reference to detail preservation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

View-graph construction framework for robust and efficient structure-from-motion

Hainan Cui, Tianxin Shi, Jun Zhang, Pengfei Xu, Yiping Meng, Shuhan Shen

Summary: The paper proposes an incremental framework for constructing a view-graph to improve its accuracy and robustness. Experimental results show that the new view-graph provides a better foundation for conventional SfM systems compared to many state-of-the-art methods.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Incremental Rotation Averaging

Xiang Gao, Lingjie Zhu, Zexiao Xie, Hongmin Liu, Shuhan Shen

Summary: The paper introduces a simple yet effective rotation averaging pipeline called Incremental Rotation Averaging (IRA), inspired by incremental Structure from Motion techniques. By estimating absolute rotations incrementally, IRA is robust to outliers and achieves accurate rotation averaging results. Key techniques such as initial triplet selection, Weighted Local/Global Optimization, and Re-Rotation Averaging further improve the results.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Article Engineering, Electrical & Electronic

IRA plus plus : Distributed Incremental Rotation Averaging

Xiang Gao, Lingjie Zhu, Hainan Cui, Zexiao Xie, Shuhan Shen

Summary: This work presents an upgraded version of Incremental Rotation Averaging (IRA) called IRA++, which improves the scalability in both accuracy and efficiency by dividing the original Epipolar-geometry Graph (EG) into sub-graphs and performing distributed inner-rotation averaging. The evaluation results on various datasets demonstrate that IRA++ outperforms IRA and other state-of-the-art rotation averaging methods in terms of accuracy and efficiency, especially in large-scale and noise-polluted situations.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Engineering, Electrical & Electronic

Active Learning Based 3D Semantic Labeling From Images and Videos

Mengqi Rong, Hainan Cui, Zhanyi Hu, Hanqing Jiang, Hongmin Liu, Shuhan Shen

Summary: In this paper, an active learning based 3D semantic labeling method is proposed to generate accurate 3D semantic mesh models by integrating 2D semantic segmentation results and 3D mesh models. Through the iterative process of training, fusion, and selection, the labeling quality is improved while reducing the amount of annotation needed.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Engineering, Electrical & Electronic

Incremental Translation Averaging

Xiang Gao, Lingjie Zhu, Bin Fan, Hongmin Liu, Shuhan Shen

Summary: This paper proposes a simple yet effective translation averaging pipeline, Incremental Translation Averaging (ITA), which overcomes limitations in accuracy, robustness, simplicity, and efficiency present in traditional translation averaging methods. ITA computes camera locations incrementally, leading to higher accuracy and robustness, and is robust to measurement outliers and accurate in parameter estimation while being simple and efficient. Comprehensive evaluations on the 1DSfM dataset demonstrate the effectiveness of ITA and its superiority over state-of-the-art translation averaging approaches.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Geography, Physical

FloorUSG: Indoor floorplan reconstruction by unifying 2D semantics and 3D geometry

Jiali Han, Yuzhou Liu, Mengqi Rong, Xianwei Zheng, Shuhan Shen

Summary: This paper proposes a method called FloorUSG for multistage floorplan reconstruction from RGB images and dense 3D mesh. It combines 2D plane instances and 3D plane primitives and accurately recovers the location of the floorplan through 2D-3D primitive fusion. Experimental results show that the method can recover detailed structures of scenes of different scales and reconstruct the floorplan with high robustness.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2023)

Article Engineering, Electrical & Electronic

IRAv3: Hierarchical Incremental Rotation Averaging on the Fly

Xiang Gao, Hainan Cui, Menghan Li, Zexiao Xie, Shuhan Shen

Summary: We propose IRAv3, which is built upon the state-of-the-art rotation averaging method IRA++, to advance the fundamental task in 3D computer vision. The key observation is that the community detection-based Epipolar-geometry Graph (EG) clustering in IRA++ is fixed and does not allow changes, limiting the accuracy of absolute rotation estimation. However, in IRAv3, the EG clustering is performed together with the estimation of absolute rotation for each cluster, allowing for dynamic determination of vertex affiliation. Experimental results on 1DSfM and KITTI odometry datasets demonstrate the effectiveness of IRAv3 in large-scale rotation averaging problems and its advantages over previous works and other state-of-the-art methods.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Civil

Vehicle-Borne Multi-Sensor Temporal-Spatial Pose Globalization via Cross-Domain Data Association

Xiang Gao, Dongdong Tao, Yuqian Liu, Zexiao Xie, Shuhan Shen

Summary: Large-scale urban scene 3D mapping requires sensor pose globalization, and integrating street-view images and vehicle-borne LiDAR points can facilitate this task. Existing methods often assume strict synchronization and exact calibration between cameras and LiDARs, which are difficult to guarantee in practice. To address this, we propose a pipeline for temporal and spatial pose globalization using GNSS/IMU guidance, loosening the assumptions. Our method initializes and refines global poses using multi-sensor pre-calibrations, and performs global optimization with image-based, LiDAR-based, and cross-domain data association and constraint construction. Experimental results on self-collected and KITTI Odometry datasets demonstrate the effectiveness of our method for multi-sensor pose globalization in large-scale urban scene 3D mapping.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images

Mengqi Rong, Hainan Cui, Shuhan Shen

Summary: Inspired by Active Learning and 2D-3D semantic fusion, our proposed framework utilizes rendered 2D images to achieve efficient semantic segmentation of large-scale 3D scenes with only a few 2D image annotations. By rendering perspective images in the 3D scene and fine-tuning a pre-trained network for segmentation, we can project and fuse dense predictions onto the 3D model. Through an iterative process of rendering-segmentation-fusion, difficult-to-segment image samples can be generated without complex 3D annotations, resulting in label-efficient 3D scene segmentation. Experimental results on three large-scale datasets demonstrate the effectiveness of our method compared to state-of-the-art approaches.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Proceedings Paper Automation & Control Systems

Multi-Camera-LiDAR Auto-Calibration by Joint Structure-from-Motion

Diantao Tu, Baoyu Wang, Hainan Cui, Yuqian Liu, Shuhan Shen

Summary: This paper proposes a novel calibration pipeline that can automatically calibrate multiple cameras and LiDARs in a Structure-from-Motion (SfM) process, eliminating the need for manual design of calibration objects.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

MMA: Multi-Camera Based Global Motion Averaging

Hainan Cui, Shuhan Shen

Summary: This paper presents a tailor-made multi-camera based motion averaging system that utilizes fixed relative poses to improve the accuracy and robustness of SfM. The algorithm achieves superior accuracy and robustness compared to state-of-the-art methods, as demonstrated by experiments on various datasets.

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE (2022)

Article Remote Sensing

Dense context distillation network for semantic parsing of oblique UAV images

Youli Ding, Xianwei Zheng, Yiping Chen, Shuhan Shen, Hanjiang Xiong

Summary: In this paper, a dense context distillation network (DCDNet) is proposed for semantic segmentation of oblique unmanned aerial vehicle (UAV) images. DCDNet effectively learns distortion-robust feature representation by densely and selectively gathering useful context from dual-scale feature maps. It also incorporates joint supervision and multi-scale feature aggregation for better learning and prediction, achieving a state-of-the-art segmentation performance on the challenging UAVid dataset with a mIoU score of 72.38%.

INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION (2022)

Article Computer Science, Artificial Intelligence

Recent Advances in Conventional and Deep Learning-Based Depth Completion: A Survey

Zexiao Xie, Xiaoxuan Yu, Xiang Gao, Kunqian Li, Shuhan Shen

Summary: Depth completion is the task of recovering pixelwise depth from incomplete and noisy depth measurements, and it is important for various computer vision applications. Traditional image processing techniques were used in the past, but deep learning methods, especially for LiDAR-image-based depth completion, have become increasingly popular. This article reviews the related works and discusses future research directions for depth completion.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

VidSfM: Robust and Accurate Structure-From-Motion for Monocular Videos

Hainan Cui, Diantao Tu, Fulin Tang, Pengfei Xu, Hongmin Liu, Shuhan Shen

Summary: With the popularization of smartphones, more high-quality videos are available, leading to an increase in the scale of scene reconstruction. A tailor-made framework is proposed to solve the problems caused by high-resolution and high frame rate videos, aiming to achieve accurate and robust structure-from-motion based on monocular videos. The key ideas include utilizing the spatial and temporal continuity of video sequences for improved reconstruction accuracy and robustness, as well as leveraging the redundancy of video sequences to enhance efficiency and scalability. The system is able to integrate data from different video sequences for simultaneous reconstruction.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Proceedings Paper Automation & Control Systems

Semantically Guided Multi-View Stereo for Dense 3D Road Mapping

Mingzhe Lv, Diantao Tu, Xincheng Tang, Yuqian Liu, Shuhan Shen

Summary: The paper proposes a novel semantically guided Multi-View Stereo method for dense 3D road mapping, which integrates semantic information into the process to improve completeness and handle holes and outliers in low-textured areas. Experimental results demonstrate that the method achieves superior completeness with comparable accuracy for 3D road mapping compared to state-of-the-art MVS methods.

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) (2021)

No Data Available