☆ 4.6 Article

Deep canonical correlation analysis with progressive and hypergraph learning for cross-modal retrieval

NEUROCOMPUTING (2016)

期刊

NEUROCOMPUTING

卷 214, 期 -, 页码 618-628

出版社

ELSEVIER

DOI: 10.1016/j.neucom.2016.06.047

关键词

Progressive; Semantic; Hypergraph; Search-based

类别

Computer Science, Artificial Intelligence

资金

Chinese National Natural Science Foundation [61471049, 61372169, 61532018]
Postgraduate Innovation Fund of SICE, BUPT

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This paper deals with the problem of modeling Internet images and associated texts for cross-modal retrieval such as text-to-image retrieval and image-to-text retrieval. We start with deep canonical correlation analysis (DCCA), a deep approach for mapping text and image pairs into a common latent space. We first propose a novel progressive framework and embed DCCA in it. In our progressive framework, a linear projection loss layer is inserted before the nonlinear hidden layers of a deep network. The training of linear projection and the training of nonlinear layers are combined to ensure that the linear projection is well matched with the nonlinear processing stages and good representations of the input raw data are learned at the output of the network. Then we introduce a hypergraph semantic embedding (HSE) method, which extracts latent semantics from texts, into DCCA to regularize the latent space learned by image view and text view. In addition, a search-based similarity measure is proposed to score relevance of image-text pairs. Based on the above ideas, we propose a model, called DCCA-PHS, for cross-modal retrieval. Experiments on three publicly available data sets show that DCCA-PHS is effective and efficient, and achieves state-of-the-art performance for unsupervised scenario. (C) 2016 Elsevier B.V. All rights reserved.

Deep canonical correlation analysis with progressive and hypergraph learning for cross-modal retrieval

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Deep canonical correlation analysis with progressive and hypergraph learning for cross-modal retrieval

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文