4.8 Article

A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2018.2840724

Keywords

Photo cropping; attention box prediction; aesthetics assessment; deep learning

Funding

  1. Beijing Natural Science Foundation [4182056]
  2. National Basic Research Program of China [2013CB328805]
  3. Fok Ying-Tong Education Foundation for Young Teachers
  4. US NSF [1618398, 1449860, 1350521]
  5. Direct For Computer & Info Scie & Enginr
  6. Division Of Computer and Network Systems [1449860] Funding Source: National Science Foundation

Ask authors/readers for more resources

We study the problem of photo cropping, which aims to find a cropping window of an input image to preserve as much as possible its important parts while being aesthetically pleasant. Seeking a deep learning-based solution, we design a neural network that has two branches for attention box prediction (ABP) and aesthetics assessment (AA), respectively. Given the input image, the ABP network predicts an attention bounding box as an initial minimum cropping window, around which a set of cropping candidates are generated with little loss of important information. Then, the AA network is employed to select the final cropping window with the best aesthetic quality among the candidates. The two sub-networks are designed to share the same full-image convolutional feature map, and thus are computationally efficient. By leveraging attention prediction and aesthetics assessment, the cropping model produces high-quality cropping results, even with the limited availability of training data for photo cropping. The experimental results on benchmark datasets clearly validate the effectiveness of the proposed approach. In addition, our approach runs at 5 fps, outperforming most previous solutions. The code and results are available at: https://github.com/shenjianbing/DeepCropping.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available