Optimization and estimation on manifolds

How to make the best decision? This general concern, pervasive in both research and industry, is what optimization is all about. Optimization is a field of applied mathematics concerned with making the best use—according to some quantitative criterion called the cost function—of our degrees of freedom called the variables, possibly under some constraints. Optimization problems come in various forms. We consider continuous variables with differentiable cost functions. Furthermore, and this is central to our investigation, we assume that the variables are constrained to belong to a Riemannian manifold, that is, to a smooth space. Building upon prior theory, we develop Manopt, a toolbox which considerably simplifies the use of Riemannian optimization. We apply this tool to two applications. First, we study low-rank matrix completion, which appears in recommender systems. Such systems aim at predicting which movies, books, etc. different users might appreciate, based on partial knowledge of their preferences. Second, we study synchronization of rotations. This is a central player in the reconstruction of 3D computer models of physical objects based on scans of their surface. In both cases, Riemannian optimization provides competitive, scalable and accurate algorithms. Both applications constitute estimation problems. In estimation, one wishes to determine the value of unknown parameters based on noisy measurements. We address the following fundamental question: given a noise level on the measurements, how accurately can one hope to estimate the parameters? This prompts us to further develop Cramer-Rao bounds when the parameter space is a manifold. Applied to synchronization, these bounds bring about practical implications. First, they suggest that in many nontrivial scenarios, our estimation algorithm could be optimal. Second, they reveal the defining features that make a synchronization task more or less difficult, hinting at which measurements should be acquired.

[1]  K. Leichtweiss Zur Riemannschen Geometrie in Grassmannschen Mannigfaltigkeiten , 1961 .

[2]  R. Bouldin The Pseudo-Inverse of a Product , 1973 .

[3]  H. Karcher Riemannian center of mass and mollifier smoothing , 1977 .

[4]  B. O'neill Semi-Riemannian Geometry With Applications to Relativity , 1983 .

[5]  Peter G. Doyle,et al.  Random Walks and Electric Networks: REFERENCES , 1987 .

[6]  P. Diaconis,et al.  The Subgroup Algorithm for Generating Uniform Random Variables , 1987, Probability in the Engineering and Informational Sciences.

[7]  F. Markley Attitude determination using vector observations and the singular value decomposition , 1988 .

[8]  Alfred O. Hero,et al.  Lower bounds for parametric estimation with constraints , 1990, IEEE Trans. Inf. Theory.

[9]  Jorge Nocedal,et al.  Global Convergence Properties of Conjugate Gradient Methods for Optimization , 1992, SIAM J. Optim..

[10]  C. R. Rao,et al.  Information and the Accuracy Attainable in the Estimation of Statistical Parameters , 1992 .

[11]  I. Chavel Riemannian Geometry: Subject Index , 2006 .

[12]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[13]  V. Girko A Matrix Equation for Resolvents of Random Matrices with Independent Blocks , 1996 .

[14]  John M. Lee Riemannian Manifolds: An Introduction to Curvature , 1997 .

[15]  Alan Edelman,et al.  The Geometry of Algorithms with Orthogonality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[16]  B. C. Ng,et al.  On the Cramer-Rao bound under parametric constraints , 1998, IEEE Signal Processing Letters.

[17]  Shun-ichi Amari,et al.  Natural Gradient Learning for Over- and Under-Complete Bases in ICA , 1999, Neural Computation.

[18]  Shun-ichi Amari,et al.  Methods of information geometry , 2000 .

[19]  Nicholas I. M. Gould,et al.  Trust Region Methods , 2000, MOS-SIAM Series on Optimization.

[20]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[21]  Thomas L. Marzetta,et al.  Parameter estimation problems with singular information matrices , 2001, IEEE Trans. Signal Process..

[22]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[23]  Y. Chikuse Statistics on special manifolds , 2003 .

[24]  A. Dembo,et al.  Spectral measure of large random Hankel, Markov and Toeplitz matrices , 2003, math/0307330.

[25]  François Fouss,et al.  The Principal Components Analysis of a Graph, and Its Relationships to Spectral Clustering , 2004, ECML.

[26]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[27]  João M. F. Xavier,et al.  The Riemannian geometry of certain parameter estimation problems with singular Fisher information matrices , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[28]  W. Hager,et al.  A SURVEY OF NONLINEAR CONJUGATE GRADIENT METHODS , 2005 .

[29]  Paul Van Dooren,et al.  On the pseudo-inverse of the Laplacian of a bipartite graph , 2005, Appl. Math. Lett..

[30]  S.T. Smith,et al.  Covariance, subspace, and intrinsic Crame/spl acute/r-Rao bounds , 2005, IEEE Transactions on Signal Processing.

[31]  João M. F. Xavier,et al.  Intrinsic variance lower bound (IVLB): an extension of the Cramer-Rao bound to Riemannian manifolds , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[32]  N. Linial,et al.  Expander Graphs and their Applications , 2006 .

[33]  Henry Cohn,et al.  Universally optimal distribution of points on spheres , 2006, math/0607446.

[34]  Anant Sahai,et al.  Cramér-Rao-Type Bounds for Localization , 2006, EURASIP J. Adv. Signal Process..

[35]  Pierre-Antoine Absil,et al.  Joint Diagonalization on the Oblique Manifold for Independent Component Analysis , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[36]  Kaare Brandt Petersen,et al.  The Matrix Cookbook , 2006 .

[37]  F. Mezzadri How to generate random matrices from the classical compact groups , 2006, math-ph/0609050.

[38]  Robert E. Mahony,et al.  Optimization Algorithms on Matrix Manifolds , 2007 .

[39]  A. Jamakovic,et al.  On the relationship between the algebraic connectivity and graph's robustness to node and link failures , 2007, 2007 Next Generation Internet Networks.

[40]  Peter D. Hoff,et al.  Simulation of the Matrix Bingham–von Mises–Fisher Distribution, With Applications to Multivariate and Relational Data , 2007, 0712.4166.

[41]  John B. Moore,et al.  Optimisation-on-a-manifold for global registration of multiple 3D point sets , 2007, Int. J. Intell. Syst. Technol. Appl..

[42]  U. Helmke,et al.  Nonsmooth Riemannian Optimization with Applications to Sphere Packing and Grasping , 2007 .

[43]  Pierre-Antoine Absil,et al.  Trust-Region Methods on Riemannian Manifolds , 2007, Found. Comput. Math..

[44]  J. Hespanha,et al.  Estimation on graphs from relative measurements , 2007, IEEE Control Systems.

[45]  Igor Grubisic,et al.  Efficient Rank Reduction of Correlation Matrices , 2007 .

[46]  Stefano Soatto,et al.  Wide-Sense Estimation on the Special Orthogonal Group , 2008, Commun. Inf. Syst..

[47]  Stella X. Yu,et al.  Angular embedding: From jarring intensity differences to perceived luminance , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  C. Donati-Martin,et al.  The largest eigenvalues of finite rank deformation of large Wigner matrices: Convergence and nonuniversality of the fluctuations. , 2007, 0706.0136.

[49]  Fabian J. Theis,et al.  Soft Dimension Reduction for ICA by Joint Diagonalization on the Stiefel Manifold , 2009, ICA.

[50]  E. Weinan,et al.  Fast algorithm for extracting the diagonal of the inverse matrix with application to the electronic structure analysis of metallic systems , 2009 .

[51]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2009, Found. Comput. Math..

[52]  Yonina C. Eldar,et al.  On the Constrained CramÉr–Rao Bound With a Singular Fisher Information Matrix , 2009, IEEE Signal Processing Letters.

[53]  S. Yun,et al.  An accelerated proximal gradient algorithm for nuclear norm regularized linear least squares problems , 2009 .

[54]  Andrea Montanari,et al.  Matrix Completion from Noisy Entries , 2009, J. Mach. Learn. Res..

[55]  R. Vidal,et al.  Distributed 3-D localization in camera networks , 2009 .

[56]  Yehuda Koren,et al.  The BellKor Solution to the Netflix Grand Prize , 2009 .

[57]  Alain Sarlette,et al.  Consensus Optimization on Manifolds , 2008, SIAM J. Control. Optim..

[58]  Andrea Montanari,et al.  Low-rank matrix completion with noisy observations: A quantitative comparison , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[59]  D. Bernstein Matrix Mathematics: Theory, Facts, and Formulas , 2009 .

[60]  Sewoong Oh,et al.  A Gradient Descent Algorithm on the Grassman Manifold for Matrix Completion , 2009, ArXiv.

[61]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[62]  Zhi-Quan Luo,et al.  Semidefinite Relaxation of Quadratic Optimization Problems , 2010, IEEE Signal Processing Magazine.

[63]  Francis R. Bach,et al.  Low-Rank Optimization on the Cone of Positive Semidefinite Matrices , 2008, SIAM J. Optim..

[64]  Yoram Bresler,et al.  ADMiRA: Atomic Decomposition for Minimum Rank Approximation , 2009, IEEE Transactions on Information Theory.

[65]  Wotao Yin,et al.  Alternating direction augmented Lagrangian methods for semidefinite programming , 2010, Math. Program. Comput..

[66]  W. J. Russell,et al.  Optimal estimation on the graph cycle space , 2010, ACC 2010.

[67]  Andrea Montanari,et al.  Regularization for matrix completion , 2010, 2010 IEEE International Symposium on Information Theory.

[68]  Yurii Nesterov,et al.  Generalized Power Method for Sparse Principal Component Analysis , 2008, J. Mach. Learn. Res..

[69]  Tiefeng Jiang,et al.  SPECTRAL DISTRIBUTIONS OF ADJACENCY AND LAPLACIAN MATRICES OF RANDOM GRAPHS , 2010, 1011.2608.

[70]  William Moran,et al.  Estimation and Registration on Graphs , 2010, ArXiv.

[71]  Philipp Birken,et al.  Numerical Linear Algebra , 2011, Encyclopedia of Parallel Computing.

[72]  Gongguo Tang,et al.  Lower Bounds on the Mean-Squared Error of Low-Rank Matrix Reconstruction , 2011, IEEE Transactions on Signal Processing.

[73]  Gongguo Tang,et al.  Constrained Cramér–Rao Bound on Robust Principal Component Analysis , 2011, IEEE Transactions on Signal Processing.

[74]  Mikael Carmona,et al.  Algorithm for Sensor Network Attitude Problem , 2011, ArXiv.

[75]  S. R. Jammalamadaka,et al.  Directional Statistics, I , 2011 .

[76]  Massimo Fornasier,et al.  Low-rank Matrix Recovery via Iteratively Reweighted Least Squares Minimization , 2010, SIAM J. Optim..

[77]  Ameet Talwalkar,et al.  Divide-and-Conquer Matrix Factorization , 2011, NIPS.

[78]  Olgica Milenkovic,et al.  Subspace Evolution and Transfer (SET) for Low-Rank Matrix Completion , 2010, IEEE Transactions on Signal Processing.

[79]  Nicolas Gillis,et al.  Low-Rank Matrix Approximation with Weights or Missing Data Is NP-Hard , 2010, SIAM J. Matrix Anal. Appl..

[80]  Pierre-Antoine Absil,et al.  RTRMC: A Riemannian trust-region method for low-rank matrix completion , 2011, NIPS.

[81]  A. Singer Angular Synchronization by Eigenvectors and Semidefinite Programming. , 2009, Applied and computational harmonic analysis.

[82]  Silvere Bonnabel,et al.  Linear Regression under Fixed-Rank Constraints: A Riemannian Approach , 2011, ICML.

[83]  Jochen Trumpf,et al.  L1 rotation averaging using the Weiszfeld algorithm , 2011, CVPR 2011.

[84]  Bamdev Mishra,et al.  Low-rank optimization for distance matrix completion , 2011, IEEE Conference on Decision and Control and European Control Conference.

[85]  Xiaoming Yuan,et al.  Recovering Low-Rank and Sparse Components of Matrices from Incomplete and Noisy Observations , 2011, SIAM J. Optim..

[86]  Yoel Shkolnisky,et al.  Three-Dimensional Structure Determination from Common Lines in Cryo-EM by Eigenvectors and Semidefinite Programming , 2011, SIAM J. Imaging Sci..

[87]  Silvere Bonnabel,et al.  Regression on Fixed-Rank Positive Semidefinite Matrices: A Riemannian Approach , 2010, J. Mach. Learn. Res..

[88]  Benedikt Wirth,et al.  Optimization Methods on Riemannian Manifolds and Their Application to Shape Space , 2012, SIAM J. Optim..

[89]  Olgica Milenkovic,et al.  A Geometric Approach to Low-Rank Matrix Completion , 2010, IEEE Transactions on Information Theory.

[90]  Yin Zhang,et al.  Solving a low-rank factorization model for matrix completion by a nonlinear successive over-relaxation algorithm , 2012, Mathematical Programming Computation.

[91]  Vincent D. Blondel,et al.  Cramér-Rao bounds for synchronization of rotations , 2012, ArXiv.

[92]  Amit Singer,et al.  Eigenvector Synchronization, Graph Rigidity and the Molecule Problem , 2011, Information and inference : a journal of the IMA.

[93]  Ira Kemelmacher-Shlizerman,et al.  Global Motion Estimation from Point Matches , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[94]  Amit Singer,et al.  Exact and Stable Recovery of Rotations for Robust Synchronization , 2012, ArXiv.

[95]  Yaron Lipman,et al.  Sensor network localization by eigenvector synchronization over the euclidean group , 2012, TOSN.

[96]  Yousef Saad,et al.  Scaled Gradients on Grassmann Manifolds for Matrix Completion , 2012, NIPS.

[97]  Stella Yu,et al.  Angular Embedding: A Robust Quadratic Criterion , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[98]  Daphna Weinshall,et al.  Online Learning in the Embedded Manifold of Low-rank Matrices , 2012, J. Mach. Learn. Res..

[99]  Bamdev Mishra,et al.  A Riemannian geometry for low-rank matrix completion , 2012, ArXiv.

[100]  Amit Singer,et al.  A Cheeger Inequality for the Graph Connection Laplacian , 2012, SIAM J. Matrix Anal. Appl..

[101]  Amit Singer,et al.  Approximating the Little Grothendieck Problem over the Orthogonal Group , 2013, ArXiv.

[102]  Bart Vandereycken,et al.  Low-Rank Matrix Completion by Riemannian Optimization , 2013, SIAM J. Optim..

[103]  Leonidas J. Guibas,et al.  Consistent Shape Maps via Semidefinite Programming , 2013, SGP '13.

[104]  Marc Sebban,et al.  A Survey on Metric Learning for Feature Vectors and Structured Data , 2013, ArXiv.

[105]  Silvere Bonnabel,et al.  Stochastic Gradient Descent on Riemannian Manifolds , 2011, IEEE Transactions on Automatic Control.

[106]  Amit Singer,et al.  Orientation Determination of Cryo-EM Images Using Least Unsquared Deviations , 2012, SIAM J. Imaging Sci..

[107]  Bamdev Mishra,et al.  Low-Rank Optimization with Trace Norm Penalty , 2011, SIAM J. Optim..

[108]  Christopher Ré,et al.  Parallel stochastic gradient algorithms for large-scale matrix completion , 2013, Mathematical Programming Computation.

[109]  Amit Singer,et al.  Noisy dynamic simulations in the presence of symmetry: Data alignment and model reduction , 2011, Comput. Math. Appl..

[110]  Laurent Jacques,et al.  The PANOPTIC Camera: A Plenoptic Sensor with Real-Time Omnidirectional Capability , 2013, J. Signal Process. Syst..

[111]  Amit Singer,et al.  Robust estimation of rotations from relative measurements by maximum likelihood , 2013, 52nd IEEE Conference on Decision and Control.

[112]  Nicolas Boumal,et al.  Interpolation and Regression of Rotation Matrices , 2013, GSI.

[113]  Bart Vandereycken,et al.  Low-rank tensor completion by Riemannian optimization , 2014 .

[114]  Bamdev Mishra,et al.  Manopt, a matlab toolbox for optimization on manifolds , 2013, J. Mach. Learn. Res..

[115]  Gilles Meyer,et al.  Two Newton methods on the manifold of fixed-rank matrices endowed with Riemannian quotient geometries , 2012, Computational Statistics.

[116]  Bamdev Mishra,et al.  Fixed-rank matrix factorizations and Riemannian low-rank optimization , 2012, Comput. Stat..

[117]  Alexandre d'Aspremont,et al.  Phase recovery, MaxCut and complex semidefinite programming , 2012, Math. Program..

[118]  Hiroyuki Sato,et al.  A new, globally convergent Riemannian conjugate gradient method , 2013, 1302.0125.