4.7 Article

A Survey on Canonical Correlation Analysis

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2019.2958342

关键词

Correlation; Kernel; Computational modeling; Probabilistic logic; Data models; Analytical models; Bayes methods; Canonical correlation analysis; survey; machine learning

资金

  1. National Natural Science Foundation of China [61671480]
  2. Fundamental Research Funds for the Central Universities, China University of Petroleum (East China) [18CX07011A]

向作者/读者索取更多资源

In recent years, CCA has been widely used in data analysis, with various models proposed based on different machine learning mechanisms over the past 80 years. However, there is a lack of in-depth review on the latest developments in CCA. This survey provides a well-organized overview of CCA and its extensions, categorizing different progresses and showcasing representative mathematical models for each group.
In recent years, the advances in data collection and statistical analysis promotes canonical correlation analysis (CCA) available for more advanced research. CCA is the main technique for two-set data dimensionality reduction such that the correlation between the pairwise variables in the common subspace is mutually maximized. Over 80-years of developments, a number of CCA models have been proposed according to different machine learning mechanisms. However, the field lacks an insightful review for the state-of-art developments. This survey targets to provide a well-organized overview for CCA and its extensions. Specifically, we first review the CCA theory from the perspective of both model formation and model optimization. The association between two popular solution methods, i.e., eigen value decomposition (EVD) and singular value decomposition (SVD), are discussed. Following that, we present a taxonomy of current progresses and classify them into seven groups: 1) multi-view CCA, 2) probabilistic CCA, 3) deep CCA, 4) kernel CCA, 5) discriminative CCA, 6) sparse CCA and 7) locality preserving CCA. For each group, we demonstrate two or three representative mathematical models, identifying their strengths and limitations. We summarize the representative applications and numerical results of these seven groups in real-world practices, collecting the data sets and open-sources for implementation. In the end, we provide several promising future research directions that can improve the current state of the art.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据