论文信息 - Convex Perturbations for Scalable Semidefinite Programming

Convex Perturbations for Scalable Semidefinite Programming

Many important machine learning problems are modeled and solved via semidefinite programs; examples include metric learning, nonlinear embedding, and certain clustering problems. Often, off-the-shelf software is invoked for the associated optimization, which can be inappropriate due to excessive computational and storage requirements. In this paper, we introduce the use of convex perturbations for solving semidefinite programs (SDPs), and for a specific perturbation we derive an algorithm that has several advantages over existing techniques: a) it is simple, requiring only a few lines of MATLAB, b) it is a first-order method, and thereby scalable, and c) it can easily exploit the structure of a given SDP (e.g., when the constraint matrices are low-rank, a situation common to several machine learning SDPs). A pleasant byproduct of our method is a fast, kernelized version of the large-margin nearest neighbor metric learning algorithm (Weinberger et al., 2005). We demonstrate that our algorithm is effective in finding fast approximations to large-scale SDPs arising in some machine learning applications.

Inderjit S. Dhillon | Suvrit Sra | Brian Kulis

[1] Pietro Perona,et al. Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[2] Andrzej Stachurski,et al. Parallel Optimization: Theory, Algorithms and Applications , 2000, Scalable Comput. Pract. Exp..

[3] Inderjit S. Dhillon,et al. Information-theoretic metric learning , 2006, ICML '07.

[4] Sanjoy Dasgupta,et al. Robust Euclidean embedding , 2006, ICML.

[5] B. Martinet,et al. R'egularisation d''in'equations variationnelles par approximations successives , 1970 .

[6] Paul Tseng. Convergence and Error Bound for Perturbation of Linear Programs , 1999, Comput. Optim. Appl..

[7] Lorenzo Torresani,et al. Large Margin Component Analysis , 2006, NIPS.

[8] Kilian Q. Weinberger,et al. Fast solvers and efficient implementations for distance metric learning , 2008, ICML '08.

[9] Yurii Nesterov,et al. Interior-point polynomial algorithms in convex programming , 1994, Siam studies in applied mathematics.

[10] Jitendra Malik,et al. SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11] O. Mangasarian. Normal solutions of linear programs , 1984 .