Perceptually Aware Image Retargeting for Mobile Devices

IEEE TRANSACTIONS ON IMAGE PROCESSING (2018)

Volume 27, Issue 5, Pages 2301-2313

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2017.2779272

Keywords

Mobile platform; retarget; perceptual; gaze behavior; deep feature; probabilistic model

Funding

Natural Science Foundation of Zhejiang Province [LQ16F030006]
National Natural Science Foundation of China [61503110, 61572169, 61472266]
National University of Singapore (Suzhou) Research Institute, Suzhou, China
Fundamental Research Funds for the Central Universities

Abstract

Retargeting aims at adapting an original high-resolution photograph/video to a low-resolution screen with an arbitrary aspect ratio. Conventional approaches are generally based on desktop PCs, since the computation might be intolerable for mobile platforms (especially when retargeting videos). Typically, only low-level visual features are exploited, and human visual perception is not well encoded. In this paper, we propose a novel retargeting framework that rapidly shrinks a photograph/video by leveraging human gaze behavior. Specifically, we first derive a geometry-preserving graph ranking algorithm, which efficiently selects a few salient object patches to mimic the human gaze shifting path (GSP) when viewing a scene. Afterward, an aggregation-based CNN is developed to hierarchically learn the deep representation for each GSP. Based on this, a probabilistic model is developed to learn the priors of the training photographs that are marked as aesthetically pleasing by professional photographers. We utilize the learned priors to efficiently shrink the corresponding GSP of a retargeted photograph/video to maximize its similarity to those from the training photographs. Extensive experiments have demonstrated that: 1) our method requires less than 35 ms to retarget a 1024x768 photograph (or a 1280x720 video frame) on popular iOS/Android devices, which is orders of magnitude faster than the conventional retargeting algorithms; 2) the retargeted photographs/videos produced by our method significantly outperform those of its competitors based on a paired-comparison-based user study; and 3) the learned GSPs are highly indicative of human visual attention according to the human eye tracking experiments.
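The patch-selection step described in the abstract can be illustrated with a minimal sketch. It is not the paper's geometry-preserving graph ranking algorithm, whose details are not given here; it swaps in a generic PageRank-style ranking over a spatial affinity graph. The function names (`rank_patches`, `gaze_shifting_path`), the Gaussian affinity, the parameters `sigma`, `damping`, `iters`, and `k`, and the toy patch data are all assumptions made for illustration.

```python
import numpy as np

def rank_patches(centers, saliency, sigma=0.25, damping=0.85, iters=100):
    """Score patches with a PageRank-style walk on a spatial affinity graph.

    Hypothetical stand-in for the paper's ranking step: nearby patches
    reinforce each other, and the walk restarts according to saliency.
    """
    centers = np.asarray(centers, dtype=float)
    saliency = np.asarray(saliency, dtype=float)
    # Pairwise squared distances between patch centers (normalized coordinates).
    diff = centers[:, None, :] - centers[None, :, :]
    d2 = (diff ** 2).sum(axis=-1)
    # Gaussian affinity; no self-loops.
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    # Column-normalize so each column is a probability distribution.
    P = W / W.sum(axis=0, keepdims=True)
    prior = saliency / saliency.sum()
    r = prior.copy()
    for _ in range(iters):
        r = damping * P @ r + (1.0 - damping) * prior
    return r

def gaze_shifting_path(centers, saliency, k=3):
    """Return indices of the k top-ranked patches, best first, plus all scores."""
    scores = rank_patches(centers, saliency)
    order = np.argsort(-scores)[:k]
    return order.tolist(), scores

# Toy input (assumed): five patch centers in [0, 1]^2 with saliency values.
centers = [(0.2, 0.3), (0.25, 0.35), (0.8, 0.2), (0.5, 0.9), (0.75, 0.25)]
saliency = [0.9, 0.4, 0.7, 0.1, 0.6]
path, scores = gaze_shifting_path(centers, saliency, k=3)
print("GSP patch indices:", path)
```

The ordered indices play the role of a gaze shifting path: a short sequence of salient patches visited from most to least highly ranked. In the paper, a deep representation of each path is then learned by an aggregation-based CNN, a step this sketch does not cover.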

Image Features Influence Reaction Time: A Learned Probabilistic Perceptual Model for Saccade Latency

Budmonde Duinkharjav, Praneeth Chakravarthula, Rachel Brown, Anjul Patney, Qi Sun

Summary: This article explores the disconnect between human saccadic behaviors and spatial visual acuity through psychophysical studies. It develops a perceptual model that predicts temporal gaze behavior and validates the model using objective measurements and user studies. The article also demonstrates that sub-threshold image modifications commonly introduced in graphics pipelines can significantly alter human reaction timing.

ACM TRANSACTIONS ON GRAPHICS (2022)