Two Sigma uses third-party advertising and advertising analytics cookies that allow us and our partners to serve your more relevant advertisements across platforms. You may accept or decline our use of these kinds of cookies by selecting “accept” or “decline” below. For more information about our privacy practices, please visit our Cookies Policy here.

Learn More

Data Science

Sparse PCA from Sparse Linear Regression

Authors: Mӑdӑlina Persu (Two Sigma), Guy Bresler, Sung Min Park

To be presented at: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada

Abstract: Sparse Principal Component Analysis (SPCA) and Sparse Linear Regression (SLR) have a wide range of applications and have attracted a tremendous amount of attention in the last two decades as canonical examples of statistical problems in high dimension. A variety of algorithms have been proposed for both SPCA and SLR, but an explicit connection between the two had not been made. We show how to efficiently transform a black-box solver for SLR into an algorithm for SPCA: assuming the SLR solver satisfies prediction error guarantees achieved by existing efficient algorithms such as those based on the Lasso, the SPCA algorithm derived from it achieves near state of the art guarantees for testing and for support recovery for the single spiked covariance model as obtained by the current best polynomial time algorithms. Our reduction not only highlights the inherent similarity between the two problems, but also, from a practical standpoint, allows one to obtain a collection of algorithms for SPCA directly from known algorithms for SLR. We provide experimental results on simulated data comparing our proposed framework to other algorithms for SPCA.