Article

Tone mapping high dynamic range images based on region-adaptive self-supervised deep learning

Journal

SIGNAL PROCESSING-IMAGE COMMUNICATION
Volume 102

Publisher

ELSEVIER
DOI: 10.1016/j.image.2021.116595

Keywords

Tone mapping; High dynamic range; Region-adaptive self-supervised learning

Funding

  1. Guangdong Basic and Applied Basic Research Foundation, China [2021A1515011584]

This paper presents a new region-adaptive self-supervised deep learning technique for HDR image tone mapping. The experimental results demonstrate that this technique achieves excellent performance in preserving overall contrasts, revealing fine details, and eliminating visual artifacts.
This paper presents a region-adaptive self-supervised deep learning (RASSDL) technique for high dynamic range (HDR) image tone mapping. The RASSDL tone mapping operator (TMO) is a convolutional neural network (CNN) that is trained on local image regions and can seamlessly tone map images of arbitrary size. The TMO is trained against a self-supervising target that automatically adapts to each local image region based on its information content. The target is designed so that the tone-mapped output balances preserving the relative contrast of the original scene against keeping fine details visible, giving a faithful reproduction of the HDR scene. Unlike many existing TMOs that require manual parameter tuning, RASSDL is parameter-free and completely automatic. Experimental results demonstrate that the RASSDL TMO can achieve state-of-the-art performance in preserving overall contrast, revealing fine details, and avoiding visual artifacts.

Recommended

Article Computer Science, Information Systems

Geographical and temporal huff model calibration using taxi trajectory data

Shuhui Gong, John Cartlidge, Ruibin Bai, Yang Yue, Qingquan Li, Guoping Qiu

Summary: The Huff model is calibrated using GTWR to show significant geographical and temporal variations in attractiveness and travel cost parameters, with wealthy customers being more sensitive to a shopping centre's attractiveness. Factors such as customer wealth, spare time, and travel mode influence shopping behaviors, and there are differences in customer behaviors between New York and Shenzhen, particularly at weekends. The GTWR calibration and identification of factors affecting urban travel behaviors can contribute to optimizing urban transportation design.

GEOINFORMATICA (2021)

Article Computer Science, Theory & Methods

Combined window filtering and its applications

Hui Yin, Yuanhao Gong, Guoping Qiu

Summary: The article introduces a new local window based image processing framework called combined window filtering (CWF), which combines full window filtering strategy (FWF) and side window filtering strategy (SWF) to achieve improved edge-preserving and texture-removing capabilities. By using different filtering strategies for edges and textures, the new framework significantly enhances the performance in applications.

MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Relative geometry-aware siamese neural network for 6DOF camera relocalization

Qing Li, Jiasong Zhu, Rui Cao, Ke Sun, Jonathan M. Garibaldi, Qingquan Li, Bozhi Liu, Guoping Qiu

Summary: This study proposes a relative geometry-aware Siamese neural network, which enhances the performance of deep learning methods by explicitly leveraging the relative geometry constraints between images. Multi-task learning is used to predict absolute and relative poses, while shared-weight twin networks are regularized to ensure correct estimations globally and locally. Additionally, an adaptive metric distance loss is designed to learn features capable of distinguishing poses of visually similar images from different locations.

NEUROCOMPUTING (2021)

Article Computer Science, Interdisciplinary Applications

End-to-End Fovea Localisation in Colour Fundus Images With a Hierarchical Deep Regression Network

Ruitao Xie, Jingxin Liu, Rui Cao, Connor S. Qiu, Jiang Duan, Jon Garibaldi, Guoping Qiu

Summary: This paper presents a new end-to-end fovea localisation method based on a hierarchical coarse-to-fine deep regression neural network, with innovative features including multi-scale feature fusion and self-attention techniques. Extensive experimental results demonstrate state-of-the-art performances, and a comprehensive ablation study and analysis validate the technical soundness and effectiveness of the overall framework and its constituent components.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2021)

Article Engineering, Electrical & Electronic

Deep Cross-Modal Representation Learning and Distillation for Illumination-Invariant Pedestrian Detection

Tianshan Liu, Kin-Man Lam, Rui Zhao, Guoping Qiu

Summary: This paper proposes a method based on cross-modal feature learning and knowledge distillation to address illumination-invariant pedestrian detection. By inserting feature learning modules at multiple levels and incorporating a segmentation auxiliary task, the multimodal network is trained end-to-end. Furthermore, a knowledge distillation framework is introduced to train a student detector with only RGB images as input, reducing the reliance on thermal images. Experimental results demonstrate the robust performance of the proposed method on a public dataset.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

Towards Disentangling Latent Space for Unsupervised Semantic Face Editing

Kanglin Liu, Gaofeng Cao, Fei Zhou, Bozhi Liu, Jiang Duan, Guoping Qiu

Summary: In this paper, a new technique called STIA-WO is presented to disentangle the latent space for unsupervised semantic face editing. By applying STIA-WO to GAN, a StyleGAN named STGAN-WO is developed, which achieves better attribute editing than state-of-the-art methods.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Geography, Physical

Improving synthetic 3D model-aided indoor image localization via domain adaptation

Qing Li, Rui Cao, Jiasong Zhu, Xianxu Hou, Jun Liu, Sen Jia, Qingquan Li, Guoping Qiu

Summary: The paper proposes a domain adaptation-based approach to improve the accuracy of indoor image localization using synthetic images. By incorporating a multi-level constrained pose regression network and a feature-level discriminator network, the proposed method effectively reduces the performance gaps between real and synthetic images.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2022)

Article Environmental Sciences

Clustering-Based Representation Learning through Output Translation and Its Application to Remote-Sensing Images

Qinglin Li, Bin Li, Jonathan M. Garibaldi, Guoping Qiu

Summary: This paper proposes a clustering-based method for representation learning of remote-sensing images. It introduces a metric to measure the discriminativeness of representations and develops an algorithm to achieve even distribution of samples while preserving their neighborhood relations.

REMOTE SENSING (2022)

Article Environmental Sciences

Improving Image Clustering through Sample Ranking and Its Application to Remote Sensing Images

Qinglin Li, Guoping Qiu

Summary: This paper proposes a novel method for image clustering by optimizing sample ranking and weighted training, and demonstrates its effectiveness through extensive experiments.

REMOTE SENSING (2022)

Article Computer Science, Artificial Intelligence

Class-Imbalanced Deep Learning via a Class-Balanced Ensemble

Zhi Chen, Jiang Duan, Li Kang, Guoping Qiu

Summary: In this work, ensemble learning is embedded into deep convolutional neural networks to address the issue of class imbalance in model learning. A new loss function is designed to rectify the bias towards majority classes and improve the performance significantly.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Supervised Anomaly Detection via Conditional Generative Adversarial Network and Ensemble Active Learning

Zhi Chen, Jiang Duan, Li Kang, Guoping Qiu

Summary: This paper presents a new supervised anomaly detector called Ensemble Active Learning Generative Adversarial Network (EAL-GAN). The EAL-GAN uses a conditional GAN to generate balanced training data and introduces an innovative ensemble learning loss function and an active learning algorithm to overcome the challenges of class imbalance and high labeling cost. Extensive experimental results show that the new detector consistently outperforms other methods by significant margins.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Proceedings Paper Computer Science, Software Engineering

A Novel Structure Adaptive Algorithm for Feature-preserving 3D Mesh Denoising

Wenming Tang, Yuanhao Gong, Guoping Qiu

Summary: This paper introduces a novel algorithm for 3D mesh filtering (SAF) based on mesh structural adaptation, which achieves feature-preserving denoising by protecting corners, edges, and planes. Through experimental data, SAF has been shown to outperform or be comparable to state-of-the-art methods in feature-preserving denoising at different noise levels.

2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Self-Supervised Video Super-Resolution by Spatial Constraint and Temporal Fusion

Cuixin Yang, Hongming Luo, Guangsen Liao, Zitao Lu, Fei Zhou, Guoping Qiu

Summary: This paper proposes a method that uses a middle-layer feature loss for the video super-resolution task. The loss allows a deeper network architecture, and the method achieves superior results compared to other methods.

PATTERN RECOGNITION AND COMPUTER VISION, PT III (2021)

Proceedings Paper Computer Science, Artificial Intelligence

A DISCRETE SCHEME FOR COMPUTING IMAGE'S WEIGHTED GAUSSIAN CURVATURE

Yuanhao Gong, Wenming Tang, Lebin Zhou, Lantao Yu, Guoping Qiu

Summary: The newly proposed discrete computation scheme does not require second order differentiability, is more accurate, has a smaller support region, and is computationally more efficient.

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2021)

Article Engineering, Electrical & Electronic

Image tone mapping based on clustering and human visual system models

Xueyu Han, Ishtiaq Rasool Khan, Susanto Rahardja

Summary: This paper proposes a clustering-based TMO method by embedding human visual system models to adapt to different HDR scenes. The method reduces computational complexity using a hierarchical scheme for clustering and enhances local contrast by superimposing details and controlling color saturation by limiting the adaptive saturation parameter. Experimental results show that the proposed method achieves improvements in generating high quality tone-mapped images compared to competing methods.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

YOLO-PAI: Real-time handheld call behavior detection algorithm and embedded application

Zuopeng Zhao, Tianci Zheng, Kai Hao, Junjie Xu, Shuya Cui, Xiaofeng Liu, Guangming Zhao, Jie Zhou, Chen He

Summary: The research team developed a handheld phone detection network called YOLO-PAI, which successfully achieved real-time detection and underwent testing under various conditions. Experimental results show that YOLO-PAI reduces network structure parameters and computational costs while maintaining accuracy, outperforming other popular networks in terms of speed and accuracy.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

ClGanNet: A novel method for maize leaf disease identification using ClGan and deep CNN

Vivek Sharma, Ashish Kumar Tripathi, Purva Daga, M. Nidhi, Himanshu Mittal

Summary: In this study, a novel ClGan method is proposed for automated plant disease detection. The method reduces the number of parameters and addresses the issues of vanishing gradients, training instability, and non-convergence by using an encoder-decoder network. Additionally, an improved loss function is introduced to stabilize the learning process and optimize weights effectively. Furthermore, a new plant leaf classification method called ClGanNet is introduced, achieving 99.97% training accuracy and 99.04% testing accuracy using the least number of parameters.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

Individual tooth segmentation in human teeth images using pseudo edge-region obtained by deep neural networks

Seongeun Kim, Chang-Ock Lee

Summary: This article introduces a method for segmenting individual teeth in human teeth images by using deep neural networks to obtain pseudo edge-regions and applying active contour models for segmentation.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)