Regularization methods for learning incomplete matrices

AbstractWe use convex relaxation techniques to provide a sequence of solutions to the matrix completionproblem. Using the nuclear norm as a regularizer, we provide simple and very efficient algorithms forminimizing the reconstruction error subject to a bound on the nuclear norm. Our algorithm iterativelyreplaces the missing elements with those obtained from a thresholded SVD. With warm starts this allowsus to efficiently compute an entire regularization path of solutions. 1 Introduction In many applications measured data can be represented in a matrix X m×n , for which only a relativelysmall number of entries are observed. The problem is to “complete” the matrix based on the observedentries, and has been dubbed the matrix completion problem [CCS08, CR08, RFP07, CT09, KOM09]. The“Netflix” competition is a primary example, where the data is the basis for a recommender system. Therows correspond to viewers and the columns to movies, with the entry X ij being the rating ∈{1,...,5}byviewer i for movie j. There are 480K viewers and 18K movies, and hence 8.6 billion (8.6 ×10

[1]  I. Johnstone,et al.  Wavelet Shrinkage: Asymptopia? , 1995 .

[2]  J. Lasserre A trace inequality for matrix product , 1995, IEEE Trans. Autom. Control..

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  R. Larsen Lanczos Bidiagonalization With Partial Reorthogonalization , 1998 .

[5]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[6]  Trevor Hastie,et al.  Imputing Missing Data for Gene Expression Arrays , 2001 .

[7]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[8]  Tommi S. Jaakkola,et al.  Weighted Low-Rank Approximations , 2003, ICML.

[9]  Patrick L. Combettes,et al.  Signal Recovery by Proximal Forward-Backward Splitting , 2005, Multiscale Model. Simul..

[10]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[11]  Cun-Hui Zhang PENALIZED LINEAR UNBIASED SELECTION , 2007 .

[12]  Lieven Vandenberghe,et al.  Interior-Point Method for Nuclear Norm Approximation with Application to System Identification , 2009, SIAM J. Matrix Anal. Appl..

[13]  Andrea Montanari,et al.  Matrix completion from a few entries , 2009, ISIT.

[14]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[15]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[16]  Andrea Montanari,et al.  Matrix completion from a few entries , 2009, 2009 IEEE International Symposium on Information Theory.

[17]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[18]  Shiqian Ma,et al.  Fixed point and Bregman iterative methods for matrix rank minimization , 2009, Math. Program..