September 19, 2012

Outlier-Robust PCA: The High-Dimensional Case

Key Points

Key points are not available for this paper at this time.

Abstract

Principal component analysis plays a central role in statistics, engineering, and science. Because of the prevalence of corrupted data in real-world applications, much research has focused on developing robust algorithms. Perhaps surprisingly, these algorithms are unequipped-indeed, unable-to deal with outliers in the high-dimensional setting where the number of observations is of the same magnitude as the number of variables of each observation, and the dataset contains some (arbitrarily) corrupted observations. We propose a high-dimensional robust principal component analysis algorithm that is efficient, robust to contaminated points, and easily kernelizable. In particular, our algorithm achieves maximal robustness-it has a breakdown point of 50% (the best possible), while all existing algorithms have a breakdown point of zero. Moreover, our algorithm recovers the optimal solution exactly in the case where the number of corrupted points grows sublinearly in the dimension.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Huan Xu

Jiangxi University of Traditional Chinese Medicine

Constantine Caramanis

The University of Texas at Austin

Shie Mannor

Technion – Israel Institute of Technology

Journals

IEEE Transactions on Information Theory

Actions

Institutions

The University of Texas at Austin

National University of Singapore

Technion – Israel Institute of Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Outlier-Robust PCA: The High-Dimensional Case

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study