4.7 Article

Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

期刊

ACM TRANSACTIONS ON GRAPHICS
卷 41, 期 4, 页码 -

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3528223.3530075

关键词

Polarimetric appearance; 3D reconstruction; material appearance; shape

资金

  1. MSIT/IITP of Korea [RS-2022-00155620, 2017-0-00072]
  2. Samsung Research Funding Center [SRFC-IT2001-04]
  3. Samsung Electronics
  4. Microsoft Research Asia
  5. European Research Council (ERC) under the EU [682080]
  6. Spanish Ministry of Science and Innovation [PID2019-105004GB-I00]
  7. Govern of Aragon (project BLINDSIGHT)
  8. NIRCH of Korea [2021A02P02-001]
  9. European Research Council (ERC) [682080] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

Ellipsometry techniques are used to measure the polarization information of materials, but traditional methods are time-consuming and require cumbersome devices. This paper presents a sparse ellipsometry method that can capture both polarimetric reflectance information and the 3D shape of objects using a portable device. The results are in strong agreement with a ground-truth dataset of polarimetric BRDFs of real-world objects.
Ellipsometry techniques allow to measure polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors. This results in cumbersome capture devices, carefully calibrated in lab conditions, and in very long acquisition times, usually in the order of a few days per object. Recent techniques allow to capture polarimetric spatially-varying reflectance information, but limited to a single view, or to cover all view directions, but limited to spherical objects made of a single homogeneous material. We present sparse ellipsometry, a portable polarimetric acquisition method that captures both polarimetric SVBRDF and 3D shape simultaneously. Our handheld device consists of off-the-shelf, fixed optical components. Instead of days, the total acquisition time varies between twenty and thirty minutes per object. We develop a complete polarimetric SVBRDF model that includes diffuse and specular components, as well as single scattering, and devise a novel polarimetric inverse rendering algorithm with data augmentation of specular reflection samples via generative modeling. Our results show a strong agreement with a recent ground-truth dataset of captured polarimetric BRDFs of real-world objects.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Theory & Methods

Multimodality in VR: A Survey

Daniel Martin, Sandra Malpica, Diego Gutierrez, Belen Masia, Ana Serrano

Summary: Virtual reality (VR) is a rapidly growing technology with the potential to revolutionize content creation and consumption. This survey examines the role and benefits of multimodality in VR, showcasing its importance in enhancing user experience, improving overall performance, and enabling unprecedented abilities in skill and knowledge transfer.

ACM COMPUTING SURVEYS (2022)

Article Computer Science, Software Engineering

A Generative Framework for Image-based Editing of Material Appearance using Perceptual Attributes

J. Delanoy, M. Lagunas, J. Condor, D. Gutierrez, B. Masia

Summary: This article presents an image-based editing method that modifies the material appearance of an object by changing high-level perceptual attributes. The method utilizes a two-step generative network to drive appearance changes and generate images with high-frequency details. To train the network, the researchers augmented an existing material appearance dataset with perceptual judgments of high-level attributes obtained through crowd-sourced experiments, and employed training strategies that avoided the need for original-edited image pairs. The perception of appearance in the edited images was validated through a user study.

COMPUTER GRAPHICS FORUM (2022)

Article Computer Science, Software Engineering

Modelling Surround-aware Contrast Sensitivity for HDR Displays

Shinyoung Yi, Daniel S. Jeon, Ana Serrano, Se-Yoon Jeong, Hui-Yong Kim, Diego Gutierrez, Min H. Kim

Summary: Despite advances in display technology, many applications still rely on outdated psychophysical datasets. We conducted experiments to explore contrast sensitivity and developed a novel function that predicts human contrast sensitivity more accurately. We also provide a practical version that is backward compatible and consistently produces good results in existing applications.

COMPUTER GRAPHICS FORUM (2022)

Article Management

Towards assisting the decision-making process for content creators in cinematic virtual reality through the analysis of movie cuts and their influence on viewers' behavior

Carlos Maranes, Diego Gutierrez, Ana Serrano

Summary: Virtual Reality (VR) is becoming increasingly popular with the commercialization of personal devices. However, the development of Cinematic VR (CVR) content is still in the exploratory stage. Users have partial or complete control of the camera in this medium, which can hinder the delivery of a pre-established narrative. In particular, during movie cuts, viewers may miss key elements of the story. This work explores studies on viewers' behavior and provides guidelines and methods for filmmakers to make decisions during filming and editing.

INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH (2023)

Article Ophthalmology

Age norms for grating acuity and contrast sensitivity in children using eye tracking technology

E. Esteban-Ibanez, T. Perez-Roche, E. Prieto, O. Castillo, A. Fanlo-Zarazaga, A. Alejandre, D. Gutierrez, M. Ortin, V. Pueyo

Summary: Visual acuity and contrast sensitivity are key measures in assessing visual function in children, with digital devices playing an increasingly important role in evaluation. Eye tracking technology enables objective and quick measurement of children's visual function without the need for experienced examiners.

INTERNATIONAL OPHTHALMOLOGY (2022)

Article Multidisciplinary Sciences

Larger visual changes compress time: The inverted effect of asemantic visual features on interval time perception

Sandra Malpica, Belen Masia, Laura Herman, Gordon Wetzstein, David M. Eagleman, Diego Gutierrez, Zoya Bylinskii, Qi Sun

Summary: Time perception is influenced by manipulations to visual inputs, with changes in visual properties impacting both millisecond-level and longer interval time judgments. While high-level cognitive effects elicited by visual inputs affect time perception, they are difficult to measure and control due to confounding semantic information. This study investigates the impact of asemantic visual properties on interval time perception and reveals a consistent pattern where larger visual changes shorten perceived time in intervals up to 3 minutes, contrary to their effect on millisecond-level perception. These findings have significant real-world implications as they can alter participants' time perception.

PLOS ONE (2022)

Article Computer Science, Software Engineering

SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360° videos

Edurne Bernal Berdun, Daniel Martin Serrano, Diego Gutierrez Perez, Belen Masia Corcoy

Summary: Virtual reality has the potential to change the way people consume content, but much is still unknown about the grammar and visual language of this new medium. This study proposes a novel saliency prediction model that combines spherical convolutions and recurrent neural networks to extract and model spatio-temporal features from 360 degrees videos. The model outperforms previous works and successfully mimics human visual attention when exploring dynamic 360 degrees videos.

COMPUTERS & GRAPHICS-UK (2022)

Article Computer Science, Software Engineering

Semi-supervised 3D shape segmentation with multilevel consistency and part substitution

Chun-Yu Sun, Yu-Qi Yang, Hao-Xiang Guo, Peng-Shuai Wang, Xin Tong, Yang Liu, Heung-Yeung Shum

Summary: The lack of fine-grained 3D shape segmentation data is a major challenge for developing learning-based 3D segmentation techniques. In this study, we propose an effective semi-supervised method that learns 3D segmentations using a combination of labeled and unlabeled data. Our approach incorporates a novel multilevel consistency loss to ensure consistent network predictions across different levels of perturbed 3D shapes, and a part substitution scheme to augment labeled data for improved training. Extensive validation on different tasks demonstrates the superior performance of our method compared to existing semi-supervised and unsupervised pre-training approaches.

COMPUTATIONAL VISUAL MEDIA (2023)

Article Engineering, Manufacturing

Automated Visual Inspection of Defects in Transparent Display Layers Using Light-Field 3-D Imaging

Hyeonjoong Jang, Sanghoon Cho, Daniel S. S. Jeon, Dahyun Kang, Myeongho Song, Changhyun Park, Jaewon Kim, Min H. H. Kim

Summary: In this work, a novel visual inspection method using light-field 3D imaging is proposed to detect defects on a display panel. By acquiring high-resolution depth information of defects located inside transparent layers without powering the panel, the physical locations of defects can be estimated. The types of defects and their layer locations can be automatically classified along the depth axis in multiple transparent layers. Experimental results validate the successful detection and classification of various display defects.

IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING (2023)

Article Optics

Microlens array camera with variable apertures for single-shot high dynamic range (HDR) imaging

Young-Gil Cha, Jiwoong Na, Hyun-Kyung Kim, Jae-Myeong Kwon, Seok-Haeng Huh, Seung-Un Jo, Chang-Hwan Kim, Min H. Kim, Ki-Hun Jeong

Summary: This article presents a microlens array camera with variable apertures for high dynamic range imaging. The camera captures low dynamic range images with different F-stop values and reconstructs HDR images with expanded dynamic range and high resolution.

OPTICS EXPRESS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras

Andreas Meuleman, Hakyeong Kim, James Tompkin, Min H. Kim

Summary: High-accuracy per-pixel depth is crucial for computational photography, and smartphones now employ multimodal camera systems to achieve this. However, the low resolution and limited active illumination power of ToF sensors make it challenging to produce accurate high-resolution depth. By fusing RGB stereo and ToF information, these issues can be overcome. A key problem, however, is the unknown pose of the floating lens caused by the optical stabilization of the main color sensor, which breaks the geometric relationships between the multimodal image sensors. To address this, an automatic calibration technique based on dense 2D/3D matching is proposed, which can estimate the parameters of the stabilized main RGB sensor from a single snapshot. Fusion is achieved through deep learning using a real-world training dataset with depth supervision. Evaluation results demonstrate higher accuracy compared to existing baselines.

COMPUTER VISION - ECCV 2022, PT I (2022)

Proceedings Paper Computer Science, Artificial Intelligence

GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation

Yu Deng, Jiaolong Yang, Jianfeng Xiang, Xin Tong

Summary: The study aims to generate 3D-consistent images with controllable camera poses through 3D-aware image generative modeling. A novel approach is proposed to regulate point sampling and radiance field learning on 2D manifolds, addressing the limitations in handling fine details and stable training in existing generators.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Article Computer Science, Artificial Intelligence

SkeletonNet: A Topology-Preserving Solution for Learning Mesh Reconstruction of Object Surfaces From RGB Images

Jiapeng Tang, Xiaoguang Han, Mingkui Tan, Xin Tong, Kui Jia

Summary: This paper focuses on the challenging task of learning 3D object surface reconstructions from RGB images and proposes a method that learns and uses a topology-preserved skeletal shape representation to assist the surface reconstruction. Through experiments, the proposed method is shown to be effective and outperforms existing methods in the surface reconstruction task.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

暂无数据