Multiple Kernel Learning with Gaussianity Measures

Kernel methods are known to be effective for nonlinear multivariate analysis. One of the main issues in their practical use is the choice of kernel, and kernel selection and kernel learning have been studied extensively. Multiple kernel learning (MKL) is one of the most promising kernel optimization approaches. Kernel methods have been applied to various classifiers, including Fisher discriminant analysis (FDA). FDA yields the Bayes-optimal classification axis when the class-conditional distributions in the feature space are Gaussian with a shared covariance structure. Based on this fact, an MKL framework built on the notion of Gaussianity is proposed. As a concrete implementation, an empirical characteristic function is adopted to measure Gaussianity in the feature space associated with a convex combination of kernel functions, and two MKL algorithms are derived. Experimental results on several data sets show that the proposed kernel learning followed by FDA offers strong classification performance.
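To make the idea concrete, the sketch below illustrates one possible way to score the Gaussianity of data in the empirical feature space induced by a convex combination of kernels, using an empirical characteristic function (ECF). It is a minimal illustration under assumptions, not the paper's algorithm: the RBF base kernels, the use of kernel PCA as the empirical feature map, the random sampling of frequency vectors, and all function names (rbf_gram, combined_gram, empirical_feature_map, ecf_gaussianity) are choices made here for the sketch.

```python
# A minimal, hypothetical sketch (not the paper's implementation): score how
# Gaussian each class looks in the empirical feature space of a convex
# combination of RBF kernels, using an empirical characteristic function (ECF).
import numpy as np

def rbf_gram(X, gamma):
    """Gram matrix of the RBF kernel k(x, y) = exp(-gamma * ||x - y||^2)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * d2)

def combined_gram(X, gammas, weights):
    """Convex combination K = sum_m w_m K_m with w_m >= 0 and sum_m w_m = 1."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wm * rbf_gram(X, g) for wm, g in zip(w, gammas))

def empirical_feature_map(K, dim=5):
    """Low-dimensional coordinates in the empirical feature space (kernel PCA)."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n        # centering matrix
    Kc = H @ K @ H
    vals, vecs = np.linalg.eigh(Kc)
    idx = np.argsort(vals)[::-1][:dim]         # leading eigenpairs
    vals, vecs = np.clip(vals[idx], 1e-12, None), vecs[:, idx]
    return vecs * np.sqrt(vals)                # n x dim sample coordinates

def ecf_gaussianity(Z, n_freq=20, scale=1.0, seed=0):
    """Mean squared gap between the ECF of Z and the characteristic function of
    a Gaussian with matching mean and covariance; smaller = more Gaussian."""
    rng = np.random.default_rng(seed)
    mu, Sigma = Z.mean(axis=0), np.cov(Z, rowvar=False)
    T = rng.normal(scale=scale, size=(n_freq, Z.shape[1]))    # frequency vectors
    ecf = np.exp(1j * Z @ T.T).mean(axis=0)
    gcf = np.exp(1j * T @ mu - 0.5 * np.einsum('fi,ij,fj->f', T, Sigma, T))
    return float(np.mean(np.abs(ecf - gcf) ** 2))

# Toy comparison of two fixed kernel weightings on two-class data.
rng = np.random.default_rng(1)
X0, X1 = rng.normal(size=(60, 2)), rng.normal(size=(60, 2)) + 3.0
X, y = np.vstack([X0, X1]), np.array([0] * 60 + [1] * 60)
for w in ([0.9, 0.1], [0.1, 0.9]):
    Z = empirical_feature_map(combined_gram(X, gammas=[0.1, 10.0], weights=w))
    score = sum(ecf_gaussianity(Z[y == c]) for c in (0, 1))   # per-class sum
    print(w, score)
```

In an MKL setting of the kind the abstract describes, such a per-class discrepancy would presumably serve as the objective driving the optimization of the kernel weights on the simplex, with kernel FDA applied afterwards; the loop above merely compares two fixed weightings on toy data.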
