Key points are not available for this paper at this time.
Principal component analysis plays a central role in statistics, engineering, and science. Because of the prevalence of corrupted data in real-world applications, much research has focused on developing robust algorithms. Perhaps surprisingly, these algorithms are unequipped-indeed, unable-to deal with outliers in the high-dimensional setting where the number of observations is of the same magnitude as the number of variables of each observation, and the dataset contains some (arbitrarily) corrupted observations. We propose a high-dimensional robust principal component analysis algorithm that is efficient, robust to contaminated points, and easily kernelizable. In particular, our algorithm achieves maximal robustness-it has a breakdown point of 50% (the best possible), while all existing algorithms have a breakdown point of zero. Moreover, our algorithm recovers the optimal solution exactly in the case where the number of corrupted points grows sublinearly in the dimension.
Building similarity graph...
Analyzing shared references across papers
Loading...
Huan Xu
Jiangxi University of Traditional Chinese Medicine
Constantine Caramanis
The University of Texas at Austin
Shie Mannor
Technion – Israel Institute of Technology
IEEE Transactions on Information Theory
The University of Texas at Austin
National University of Singapore
Technion – Israel Institute of Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Xu et al. (Wed,) studied this question.
synapsesocial.com/papers/6a2145c9a2a97f3a085ae35e — DOI: https://doi.org/10.1109/tit.2012.2212415