4.5 Article

Hierarchical visual comfort assessment for stereoscopic image retargeting

Journal

SIGNAL PROCESSING-IMAGE COMMUNICATION
Volume 95, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.image.2021.116236

Keywords

Visual comfort assessment; Stereoscopic image retargeting; Hierarchical; Hi-VCA; Hybrid distortion

Funding

  1. NSFC, China [U1908209, 61632001]
  2. National Key Research and Development Program of China [2018AAA0101400]

Ask authors/readers for more resources

This paper proposes a Hierarchical Visual Comfort Assessment (Hi-VCA) scheme for SIR, which considers hybrid distortions and incorporates valid Local-SSIM and Dual Natural Scene Statistics (D-NSS) features to measure structural distortion and information loss. Extensive experiments show that Hi-VCA outperforms state-of-the-art schemes in handling hybrid distortions.
Stereoscopic Image Retargeting (SIR) has made it possible for the popularity of 3D application. Meanwhile, the adjustments brought to images may affect the visual comfort when enjoying 3D service. While for SIR, previous Visual Comfort Assessment (VCA) methods often cannot perform well, because they only analyze the influence of disparity on discomfort and do not take into account the effects from the unique and complex distortions of SIR. In this paper, we propose a Hierarchical Visual Comfort Assessment (Hi-VCA) scheme for SIR, considering hybrid distortions including structure, information, semantic distortions usually occurring in retargeting, and binocular incongruity existing in stereoscopic multimedia. Specifically, we first propose valid Local-SSIM and Dual Natural Scene Statistics (D-NSS) features to measure structural distortion and information loss. Considering disparity adjustments may brought by SIR, we design the binocular incongruity measurement by analyzing various binocular anomaly perception mechanisms of HVS. Finally, CNN-based feature is utilized to ensure the correct delivery of semantic information. Each measurement is complementary in describing visual comfort degradation and they are further aggregated. Extensive experiment results on published SIR database SIRD and two ordinary databases IEEE-SA and NBU 3D-VCA, demonstrate Hi-VCA has superior performance by better handling hybrid distortions compared to state-of-the-art schemes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
Article Engineering, Electrical & Electronic

Image tone mapping based on clustering and human visual system models

Xueyu Han, Ishtiaq Rasool Khan, Susanto Rahardja

Summary: This paper proposes a clustering-based TMO method by embedding human visual system models to adapt to different HDR scenes. The method reduces computational complexity using a hierarchical scheme for clustering and enhances local contrast by superimposing details and controlling color saturation by limiting the adaptive saturation parameter. Experimental results show that the proposed method achieves improvements in generating high quality tone-mapped images compared to competing methods.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

YOLO-PAI: Real-time handheld call behavior detection algorithm and embedded application

Zuopeng Zhao, Tianci Zheng, Kai Hao, Junjie Xu, Shuya Cui, Xiaofeng Liu, Guangming Zhao, Jie Zhou, Chen He

Summary: The research team developed a handheld phone detection network called YOLO-PAI, which successfully achieved real-time detection and underwent testing under various conditions. Experimental results show that YOLO-PAI reduces network structure parameters and computational costs while maintaining accuracy, outperforming other popular networks in terms of speed and accuracy.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

ClGanNet: A novel method for maize leaf disease identification using ClGan and deep CNN

Vivek Sharma, Ashish Kumar Tripathi, Purva Daga, M. Nidhi, Himanshu Mittal

Summary: In this study, a novel ClGan method is proposed for automated plant disease detection. The method reduces the number of parameters and addresses the issues of vanishing gradients, training instability, and non-convergence by using an encoder-decoder network. Additionally, an improved loss function is introduced to stabilize the learning process and optimize weights effectively. Furthermore, a new plant leaf classification method called ClGanNet is introduced, achieving 99.97% training accuracy and 99.04% testing accuracy using the least number of parameters.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

Individual tooth segmentation in human teeth images using pseudo edge-region obtained by deep neural networks

Seongeun Kim, Chang-Ock Lee

Summary: This article introduces a method for segmenting individual teeth in human teeth images by using deep neural networks to obtain pseudo edge-regions and applying active contour models for segmentation.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)