Kernel VA-Files for Nearest Neighbor Search in Large Image Databases

Many data partitioning index methods perform poorly in high dimensional space. The vector approximation file (VA-File) approach overcomes some of the difficulties of high dimensional vector spaces, but cannot be applied when using a kernel distance metric in the data measurement space. This paper introduces a novel KVA-File (kernel VA-File) that extends VAFile to kernel-based retrieval methods. A key observation is that kernel metrics may be non-linear in the input data space but is still linear in an induced feature space. It is this linear invariance in the induced feature space that enables KVA-File to work with kernel metrics. An efficient approach to approximating vectors in an induced feature space is presented with the corresponding upper and lower distance bounds. Thus an effective indexing method is provided for kernel-based image retrieval methods. Experimental results using large image data sets (approximately 100,000 images with 463 dimensions of measurement) validate the efficacy of our method.

[1]  Bikramjit Banerjee,et al.  Kernel Index for Relevance feedback Retrieval , 2002, International Conference on Fuzzy Systems and Knowledge Discovery.

[2]  Richard Bellman,et al.  Adaptive Control Processes - A Guided Tour (Reprint from 1961) , 2015, Princeton Legacy Library.

[3]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[4]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[5]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[6]  Thomas S. Huang,et al.  One-class SVM for learning in image retrieval , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[7]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[8]  Nello Cristianini,et al.  Latent Semantic Kernels , 2001, Journal of Intelligent Information Systems.

[9]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[10]  Michael E. Tipping Sparse Kernel Principal Component Analysis , 2000, NIPS.

[11]  Jing Peng,et al.  Adaptive quasiconformal kernel metric for image retrieval , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003 .

[13]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[14]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[15]  James V. Candy,et al.  Adaptive and Learning Systems for Signal Processing, Communications, and Control , 2006 .

[16]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.