Article

Learning spatial-temporal features for video copy detection by the combination of CNN and RNN

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jvcir.2018.05.013

Keywords

Video copyright; CNN; Sequence matching; SiameseLSTM

Funding

  1. National Natural Science Foundation of China [61374194]
  2. National Key Science & Technology Pillar Program of China [2014BAG01DB03]
  3. Key Research & Development Program of Jiangsu Province [BE2016739]
  4. Priority Academic Program Development of Jiangsu Higher Education Institutions

Abstract

Following the rapid development of network multimedia, online video copyright protection has become a hot topic in recent research. However, video copy detection remains a challenging task in video analysis and computer vision, due to the large variations in scale and illumination of copied content. In this paper, we propose a novel deep learning based approach, in which we jointly use a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) to solve the specific problem of detecting copied segments in videos. We first utilize a Residual Convolutional Neural Network (ResNet) to extract frame-level content features, and then employ a SiameseLSTM architecture for spatial-temporal fusion and sequence matching. Finally, the copied segments are localized by a graph-based temporal network. We evaluate the proposed CNN-RNN based approach on VCDB, a public large-scale video copy dataset, and the experimental results demonstrate the effectiveness and robustness of our method, which achieves significant performance improvements over the state of the art.
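The final localization step described above can be sketched as follows. This is a hedged illustration, not the authors' code: it assumes a precomputed frame-pair similarity matrix (which in the paper would come from the ResNet + SiameseLSTM front end) and finds the longest temporally consistent chain of matched frame pairs, a common formulation of graph-based temporal alignment. The function name, threshold, and gap parameter are illustrative choices.

```python
def detect_copy_segment(sim, sim_thresh=0.8, max_gap=3):
    """Locate a copied segment from a query-vs-reference frame
    similarity matrix `sim` (sim[i][j] = similarity of query frame i
    and reference frame j). Returns ((q_start, q_end), (r_start, r_end))
    for the longest temporally consistent chain, or None if no frame
    pair clears the threshold. Illustrative sketch only."""
    # Nodes of the temporal graph: frame pairs above the threshold.
    nodes = [(i, j) for i, row in enumerate(sim)
             for j, s in enumerate(row) if s >= sim_thresh]
    if not nodes:
        return None
    nodes.sort()  # lexicographic order makes the graph a DAG

    # Longest-path dynamic programming over the DAG: an edge connects
    # (pi, pj) -> (i, j) when both time indices advance by a bounded gap.
    best_len = {n: 1 for n in nodes}
    prev = {n: None for n in nodes}
    for idx, (i, j) in enumerate(nodes):
        for (pi, pj) in nodes[:idx]:
            if 0 < i - pi <= max_gap and 0 < j - pj <= max_gap:
                if best_len[(pi, pj)] + 1 > best_len[(i, j)]:
                    best_len[(i, j)] = best_len[(pi, pj)] + 1
                    prev[(i, j)] = (pi, pj)

    # Backtrack from the endpoint of the longest chain.
    end = max(nodes, key=lambda n: best_len[n])
    chain = []
    while end is not None:
        chain.append(end)
        end = prev[end]
    chain.reverse()
    (q0, r0), (q1, r1) = chain[0], chain[-1]
    return (q0, q1), (r0, r1)
```

For example, a diagonal run of high similarities in the matrix (query frames 1-4 matching reference frames 2-5) would be chained into a single detected segment, while isolated spurious matches fall outside the longest path and are discarded.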

