4.8 Article

Reflectance and Natural Illumination from Single-Material Specular Objects Using Deep Learning

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2017.2742999

Keywords

Reflectance maps; intrinsic images; reflectance; natural illumination; specular shading; convolutional neural networks

Funding

  1. Toyota Research Institute
  2. FWO [G086617N]


In this paper, we present a method that estimates reflectance and illumination information from a single image depicting a single-material specular object from a given class under natural illumination. We follow a data-driven, learning-based approach trained on a very large dataset, but in contrast to earlier work we do not assume one or more components (shape, reflectance, or illumination) to be known. We propose a two-step approach, where we first estimate the object's reflectance map, and then further decompose it into reflectance and illumination. For the first step, we introduce a Convolutional Neural Network (CNN) that directly predicts a reflectance map from the input image itself, as well as an indirect scheme that uses additional supervision, first estimating surface orientation and afterwards inferring the reflectance map using a learning-based sparse data interpolation technique. For the second step, we suggest a CNN architecture to reconstruct both Phong reflectance parameters and high-resolution spherical illumination maps from the reflectance map. We also propose new datasets to train these CNNs. We demonstrate the effectiveness of our approach for both steps through extensive quantitative and qualitative evaluation on both synthetic and real data, as well as through numerous applications that show improvements over the state of the art.
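To make the term "reflectance map" concrete, the sketch below renders one in the forward direction: it shades a sphere, so that each pixel corresponds to one surface normal, using a basic Phong model under a few directional lights. This is only an illustration of the quantity the abstract refers to, not the authors' CNN pipeline or their exact Phong parameterization. The function name, the orthographic viewer, and the use of discrete directional lights in place of a spherical illumination map are all assumptions made for this example.

```python
import numpy as np

def phong_reflectance_map(kd, ks, shininess, light_dirs, light_rgb, size=64):
    """Hypothetical helper: shade a sphere so each pixel is indexed by its normal.

    Assumes an orthographic viewer along +z and a basic Phong model; this is
    an illustrative sketch, not the method from the paper.
    """
    view = np.array([0.0, 0.0, 1.0])

    # Pixel grid over [-1, 1]^2; pixels inside the unit disc map to normals.
    ys, xs = np.meshgrid(np.linspace(-1, 1, size),
                         np.linspace(-1, 1, size), indexing="ij")
    r2 = xs**2 + ys**2
    mask = r2 <= 1.0
    zs = np.sqrt(np.clip(1.0 - r2, 0.0, None))
    normals = np.stack([xs, ys, zs], axis=-1)            # (size, size, 3)

    # Normalize light directions without modifying the caller's array.
    light_dirs = np.asarray(light_dirs, dtype=float)
    light_dirs = light_dirs / np.linalg.norm(light_dirs, axis=1, keepdims=True)
    light_rgb = np.asarray(light_rgb, dtype=float)       # (L, 3)

    n_dot_l = normals @ light_dirs.T                     # (size, size, L)

    # Mirror direction of each light about the normal: r = 2 (n.l) n - l
    refl = 2.0 * n_dot_l[..., None] * normals[:, :, None, :] - light_dirs
    r_dot_v = np.clip(refl @ view, 0.0, None)            # (size, size, L)

    # Specular lobe only where the light is above the surface.
    spec = np.where(n_dot_l > 0.0, r_dot_v ** shininess, 0.0)

    diffuse = np.clip(n_dot_l, 0.0, None) @ light_rgb    # (size, size, 3)
    specular = spec @ light_rgb                          # (size, size, 3)

    rmap = kd * diffuse + ks * specular
    rmap[~mask] = 0.0                                    # outside the sphere
    return rmap, mask

# Usage: one white light pointing straight at the viewer.
rmap, mask = phong_reflectance_map(0.5, 0.5, 10, [[0, 0, 1]], [[1, 1, 1]], size=65)
```

The second step described in the abstract goes the other way: given such a map, recover the Phong parameters and the illumination, which is the ill-posed part that the paper addresses with a learned model.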



Recommended

Article Computer Science, Artificial Intelligence

Residual Tuning: Toward Novel Category Discovery Without Labels

Yu Liu, Tinne Tuytelaars

Summary: Discovering novel visual categories from unlabeled images is crucial for intelligent vision systems, and we propose a residual-tuning approach to overcome the tradeoff between preserving features on labeled data and adapting features on unlabeled data. Our method achieves consistent and considerable gains on benchmark tests, reducing the performance gap to the fully supervised learning setup.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Agriculture, Multidisciplinary

Inline nondestructive internal disorder detection in pear fruit using explainable deep anomaly detection on X-ray images

Tim Van De Looverbosch, Jiaqi He, Astrid Tempelaere, Klaas Kelchtermans, Pieter Verboven, Tinne Tuytelaars, Jan Sijbers, Bart Nicolai

Summary: X-ray radiography has been investigated as a technique for internal quality inspection of pears in storage, with multiple deep anomaly detection methods showing effectiveness in detecting pears with internal cavity and browning disorders. The best performing methods were found to be on par with a state-of-the-art multisensor disorder detection method.

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2022)

Article Medicine, General & Internal

Deep-Learning-Based Thrombus Localization and Segmentation in Patients with Posterior Circulation Stroke

Riaan Zoetmulder, Agnetha A. E. Bruggeman, Ivana Isgum, Efstratios Gavves, Charles B. L. M. Majoie, Ludo F. M. Beenen, Diederik W. J. Dippel, Nikkie Boodt, Sanne J. den Hartog, Pieter J. van Doormaal, Sandra A. P. Cornelissen, Yvo B. W. E. M. Roos, Josje Brouwer, Wouter J. Schonewille, Anne F. V. Pirson, Wim H. van Zwam, Christiaan van der Leij, Rutger J. B. Brans, Adriaan C. G. M. van Es, Henk A. Marquering

Summary: In this study, an automatic method for thrombus localization and segmentation on CT images in patients with posterior circulation stroke (PCS) was developed. The method achieved good results in localizing and segmenting thrombi. Restricting the volume-of-interest (VOI) to the brainstem improved the precision and recall of thrombus localization.

DIAGNOSTICS (2022)

Article Computer Science, Artificial Intelligence

CLAD: A realistic Continual Learning benchmark for Autonomous Driving

Eli Verwimp, Kuo Yang, Sarah Parisot, Lanqing Hong, Steven McDonagh, Eduardo Perez-Pellitero, Matthias De Lange, Tinne Tuytelaars

Summary: In this paper, a new Continual Learning benchmark for Autonomous Driving (CLAD) is introduced, focusing on object classification and object detection problems. The benchmark utilizes SODA10M, a large-scale dataset related to autonomous driving. Existing continual learning benchmarks are reviewed and discussed, showing that most of them are extreme cases. Online classification benchmark CLAD-C and domain incremental continual object detection benchmark CLAD-D are introduced. The inherent difficulties and challenges are examined through a survey of top-3 participants in a CLAD-challenge workshop at ICCV 2021. Possible pathways to improve the current state of continual learning and promising directions for future research are discussed.

NEURAL NETWORKS (2023)

Article Agronomy

Synthetic data for X-ray CT of healthy and disordered pear fruit using deep learning

Astrid Tempelaere, Tim Van De Looverbosch, Klaas Kelchtermans, Pieter Verboven, Tinne Tuytelaars, Bart Nicolai

Summary: This study proposes a method to generate synthetic CT images using a conditional GAN (cGAN) to overcome the challenges of obtaining large annotated datasets. The performance of the predictor was evaluated quantitatively and visually, showing that the cGAN effectively generated CT images of healthy and defective fruit based on annotations.

POSTHARVEST BIOLOGY AND TECHNOLOGY (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations

Thomas Verelst, Paul K. Rubenstein, Marcin Eichner, Tinne Tuytelaars, Maxim Berman

Summary: Multi-label image classification is more practical for real-world scenarios than single-label classification due to the presence of multiple objects in natural images. However, annotating every object of interest is time-consuming and expensive. In this study, we propose an Expected Negative loss to train multi-label classifiers using datasets where each image is annotated with a single positive label. To handle the uncertainty of other classes, we generate a set of expected negative labels based on prediction consistency. Additionally, we introduce a novel spatial consistency loss to improve supervision by maintaining consistent spatial feature maps for each training image. Our experiments on various datasets demonstrate the effectiveness of the Expected Negative loss in combination with consistency and spatial consistency losses, and we achieve improved multi-label classification mAP on ImageNet-1K using the ReaL multi-label validation set.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

SimGlim: Simplifying glimpse based active visual reconstruction

Abhishek Jha, Soroush Seifi, Tinne Tuytelaars

Summary: In active visual exploration, it is crucial to sample informative local observations for modeling global context. This paper proposes the use of vision transformers instead of CNNs for such agents and introduces a transformer-based active visual sampling model called SimGlim. The model utilizes the transformer's self-attention architecture to predict the best next location based on the current observable environment. Experimental results demonstrate the effectiveness of the proposed method in image reconstruction and comparisons against existing methods are provided. Ablation studies are also conducted to analyze the importance of design choices in the overall architecture.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Barlow constrained optimization for Visual Question Answering

Abhishek Jha, Badri Patro, Luc Van Gool, Tinne Tuytelaars

Summary: This paper proposes a novel regularization method called COB to improve the information content of the joint space in visual question answering models. It reduces redundancy by minimizing the correlation between learned feature components, disentangling semantic concepts. The model aligns the joint space with the answer embedding space and shows improved accuracy on VQA datasets.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Global-Local Self-Distillation for Visual Representation Learning

Tim Lebailly, Tinne Tuytelaars

Summary: The downstream accuracy of self-supervised methods depends on the proxy task and the quality of gradients extracted during training. Incorporating local cues in the proxy task can improve model accuracy on downstream tasks. We propose a geometric approach for matching local representations in self-distillation, which outperforms similarity-based methods, especially in low-data regimes. In those regimes, similarity-based matching is in fact highly detrimental to model performance compared to the baseline without local self-distillation.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss

Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens

Summary: This paper revisits the weakly supervised cross-modal face-name alignment task and proposes SECLA and SECLA-B models. These models use appropriate loss functions to learn the alignments between names and faces in a neural network setting. SECLA maximizes the similarity scores between faces and names in a weakly supervised fashion, while SECLA-B learns to align names and faces from easy to hard cases, further improving the performance.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

CrOC: Cross-View Online Clustering for Dense Visual Representation Learning

Thomas Stegmuller, Tim Lebailly, Behzad Bozorgtabar, Tinne Tuytelaars, Jean-Philippe Thiran

Summary: In this paper, we propose a method for learning dense visual representations without labels by discovering and segmenting the semantics of views through an online clustering mechanism. The resulting method is highly generalizable and does not require cumbersome pre-processing steps.

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR (2023)

Article Computer Science, Artificial Intelligence

A Continual Learning Survey: Defying Forgetting in Classification Tasks

Matthias De Lange, Rahaf Aljundi, Marc Masana, Sarah Parisot, Xu Jia, Ales Leonardis, Greg Slabaugh, Tinne Tuytelaars

Summary: This article introduces the application of artificial neural networks in continual learning, focusing on task incremental classification. It proposes a new framework for continually evaluating the stability-plasticity trade-off of the network and performs experimental comparisons of 11 state-of-the-art continual learning methods, evaluating their strengths and weaknesses by considering different benchmark datasets.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Proceedings Paper Automation & Control Systems

RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground Cues

Klaas Kelchtermans, Tinne Tuytelaars

Summary: The gap between simulation and the real world hampers the application of machine learning in computer vision and reinforcement learning. This study addresses this issue by focusing on camera-based navigation and utilizing various techniques to successfully bridge the gap between simulation and the real world.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Generative Negative Text Replay for Continual Vision-Language Pretraining

Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He

Summary: This study focuses on learning VLP models with sequential chunks of image-text pair data and proposes pseudo text replay and multi-modal knowledge distillation to tackle the forgetting issue in continual learning. The experiments demonstrate the superiority of the proposed method in zero-shot image classification and image-text retrieval tasks.

COMPUTER VISION, ECCV 2022, PT XXXVI (2022)

Article Computer Science, Artificial Intelligence

Three types of incremental learning

Gido M. van de Ven, Tinne Tuytelaars, Andreas S. Tolias

Summary: Deep neural networks face challenges in continual learning, with different scenarios of continual learning having varying challenges and effectiveness. Distinguishing between task-incremental, domain-incremental, and class-incremental learning is an important foundation for organizing the continual learning field.

NATURE MACHINE INTELLIGENCE (2022)
