☆ 4.5 Article

Outlier-Robust PCA: The High-Dimensional Case

IEEE TRANSACTIONS ON INFORMATION THEORY (2013)

期刊

IEEE TRANSACTIONS ON INFORMATION THEORY

卷 59, 期 1, 页码 546-572

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIT.2012.2212415

关键词

Dimension reduction; outlier; principal component analysis (PCA); robustness; statistical learning

类别

Computer Science, Information Systems Engineering, Electrical & Electronic

资金

Ministry of Education of Singapore through the National University of Singapore [R-265-000-384-133]
U.S. National Science Foundation [EFRI-0735905, EECS-1056028]
Defence Threat Reduction Agency [HDTRA 1-08-0029]
Israel Science Foundation [890015]
Directorate For Engineering [1056028] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Principal component analysis plays a central role in statistics, engineering, and science. Because of the prevalence of corrupted data in real-world applications, much research has focused on developing robust algorithms. Perhaps surprisingly, these algorithms are unequipped-indeed, unable-to deal with outliers in the high-dimensional setting where the number of observations is of the same magnitude as the number of variables of each observation, and the dataset contains some (arbitrarily) corrupted observations. We propose a high-dimensional robust principal component analysis algorithm that is efficient, robust to contaminated points, and easily kernelizable. In particular, our algorithm achieves maximal robustness-it has a breakdown point of 50% (the best possible), while all existing algorithms have a breakdown point of zero. Moreover, our algorithm recovers the optimal solution exactly in the case where the number of corrupted points grows sublinearly in the dimension.

Outlier-Robust PCA: The High-Dimensional Case

期刊

IEEE TRANSACTIONS ON INFORMATION THEORY

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Outlier-Robust PCA: The High-Dimensional Case

期刊

IEEE TRANSACTIONS ON INFORMATION THEORY

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文