Journal
PATTERN RECOGNITION
Volume 59, Issue -, Pages 176-187
Publisher
ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2016.01.034
Keywords
Stereoscopic image; Quality assessment; Convolutional neural network (CNN)
Funding
- NSFC Grant [61203253, 61573222, 61233014, 61401167]
- Open Program of Jiangsu Key Laboratory of 3D Printing Equipment and Manufacturing [3DL201502]
- Major Research Program of Shandong Province [2015ZDXX0801A02]
- Key Lab of ICSP MOE China
In this paper, we propose to learn the structures of stereoscopic images with a convolutional neural network (CNN) for no-reference quality assessment. Taking image patches from the stereoscopic images as inputs, the proposed CNN learns local structures that are sensitive to human perception and representative for perceptual quality evaluation. By stacking multiple convolution and max-pooling layers, the structures learned in lower convolution layers are composed and convolved into higher levels to form a fixed-length representation. A multilayer perceptron (MLP) then summarizes the learned representation into a final value that indicates the perceptual quality of the stereo image patch pair. Depending on the inputs, two CNNs are designed: a one-column CNN that takes only the image patch from the difference image as input, and a three-column CNN that takes the image patches from the left-view image, right-view image, and difference image as input. The CNN parameters for stereoscopic images are learned and transferred from a large number of 2D natural images. Evaluated on the public LIVE phase-I, LIVE phase-II, and IVC stereoscopic image databases, the proposed no-reference metric achieves state-of-the-art performance for quality assessment of stereoscopic images and is even competitive with existing full-reference quality metrics. © 2016 Elsevier Ltd. All rights reserved.
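The inputs to the two networks described in the abstract can be illustrated with a short sketch. It is not the authors' implementation: the abstract does not define the difference image, the patch size, or the sampling stride, so the pixel-wise `left - right` difference, the 32-pixel patch size, the non-overlapping stride, and all function names below are assumptions made purely for illustration.

```python
from typing import List, Tuple

Image = List[List[float]]  # grayscale image stored as rows of pixel values


def difference_image(left: Image, right: Image) -> Image:
    """Pixel-wise left minus right (an assumed definition of the difference image)."""
    if len(left) != len(right) or any(len(a) != len(b) for a, b in zip(left, right)):
        raise ValueError("left and right views must have the same shape")
    return [[l - r for l, r in zip(lrow, rrow)] for lrow, rrow in zip(left, right)]


def crop(img: Image, top: int, col: int, size: int) -> Image:
    """Return the size x size patch whose top-left corner is (top, col)."""
    return [row[col:col + size] for row in img[top:top + size]]


def patch_triplets(
    left: Image, right: Image, size: int = 32, stride: int = 32
) -> List[Tuple[Image, Image, Image]]:
    """Collect co-located (left, right, difference) patches.

    A three-column network would consume all three patches of a triplet;
    a one-column network would consume only the last (difference) patch.
    The default size and stride are hypothetical values.
    """
    diff = difference_image(left, right)
    height = len(left)
    width = len(left[0]) if height else 0
    triplets = []
    for top in range(0, height - size + 1, stride):
        for col in range(0, width - size + 1, stride):
            triplets.append(
                (
                    crop(left, top, col, size),
                    crop(right, top, col, size),
                    crop(diff, top, col, size),
                )
            )
    return triplets
```

For example, calling `patch_triplets(left, right, size=2, stride=2)` on two 4 x 4 views yields four triplets; taking `triplet[2]` from each gives the difference patches that a one-column variant would use on its own.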