论文信息 - Kernel PCA and De-Noising in Feature Spaces

Kernel PCA and De-Noising in Feature Spaces

Kernel PCA as a nonlinear feature extractor has proven powerful as a preprocessing step for classification algorithms. But it can also be considered as a natural generalization of linear principal component analysis. This gives rise to the question how to use nonlinear features for data compression, reconstruction, and de-noising, applications common in linear PCA. This is a nontrivial task, as the results provided by kernel PCA live in some high dimensional feature space and need not have pre-images in input space. This work presents ideas for finding approximate pre-images, focusing on Gaussian kernels, and shows experimental results using these pre-images in data reconstruction and de-noising on toy examples as well as on real world data.

[1] Saburou Saitoh,et al. Theory of Reproducing Kernels and Its Applications , 1988 .

[2] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.

[3] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[4] Christopher J. C. Burges,et al. Simplified Support Vector Decision Rules , 1996, ICML.

[5] Sun-Yuan Kung,et al. Principal Component Neural Networks: Theory and Applications , 1996 .

[6] Bernhard Schölkopf,et al. Support vector learning , 1997 .

[7] Bernhard Schölkopf,et al. Fast Approximation of Support Vector Kernel Expansions, and an Interpretation of Clustering as Approximation in Feature Spaces , 1998, DAGM-Symposium.

[8] Bernhard Schölkopf,et al. Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.