A Riemannian Approach for Graph-Based Clustering by Doubly Stochastic Matrices

Convex optimization is a well-established area with applications in almost all fields. However, these convex methods can be rather slow and computationally intensive for high dimensional problems. For a particular class of problems, this paper considers a different approach, namely Riemannian optimization. The main idea is to view the constrained optimization problem as an unconstrained one over a restricted search space (the manifold). Riemannian optimization explicitly exploits the geometry of the problem and often reduces its dimension, thereby potentially allowing significant speedup as compared to conventional approaches. The paper introduces the doubly stochastic, the symmetric, and the definite multinomial manifolds which generalize the simplex. The method is applied to a convex and a non-convex graph-based clustering problem. Theoretical analysis and simulation results demonstrate the efficiency of the proposed method over the state of the art as it outperforms conventional generic and specialized solvers, especially in high dimensions.

[1]  Bamdev Mishra,et al.  Manopt, a matlab toolbox for optimization on manifolds , 2013, J. Mach. Learn. Res..

[2]  Benedikt Wirth,et al.  Optimization Methods on Riemannian Manifolds and Their Application to Shape Space , 2012, SIAM J. Optim..

[3]  Richard Sinkhorn,et al.  Concerning nonnegative matrices and doubly stochastic matrices , 1967 .

[4]  Erkki Oja,et al.  Clustering by Low-Rank Doubly Stochastic Matrix Decomposition , 2012, ICML.

[5]  Robert E. Mahony,et al.  Optimization Algorithms on Matrix Manifolds , 2007 .

[6]  Amnon Shashua,et al.  Doubly Stochastic Normalization for Spectral Clustering , 2006, NIPS.

[7]  Yaguang Yang Globally Convergent Optimization Algorithms on Riemannian Manifolds: Uniform Framework for Unconstrained and Constrained Optimization , 2007 .

[8]  J. Csima,et al.  The DAD Theorem for Symmetric Non-negative Matrices , 1972, J. Comb. Theory, Ser. A.

[9]  Babak Hassibi,et al.  Manifold Optimization Over the Set of Doubly Stochastic Matrices: A Second-Order Geometry , 2018, IEEE Transactions on Signal Processing.

[10]  Feiping Nie,et al.  Structured Doubly Stochastic Matrix for Graph Based Clustering: Structured Doubly Stochastic Matrix , 2016, KDD.

[11]  Babak Hassibi,et al.  Similarity clustering in the presence of outliers: Exact recovery via convex program , 2016, 2016 IEEE International Symposium on Information Theory (ISIT).

[12]  Stephen J. Wright,et al.  Numerical Optimization (Springer Series in Operations Research and Financial Engineering) , 2000 .

[13]  G. Hurlbert,et al.  A SHORT PROOF OF THE BIRKHOFF-VON NEUMANN THEOREM , 2012 .

[14]  Richard Sinkhorn A Relationship Between Arbitrary Positive Matrices and Doubly Stochastic Matrices , 1964 .

[15]  Junbin Gao,et al.  Heterogeneous Tensor Decomposition for Clustering via Manifold Optimization , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Shun-ichi Amari,et al.  Methods of information geometry , 2000 .