IPSILON: Incremental Parsing for Semantic Indexing of Latent Concepts

A new framework for content-based image retrieval, which takes advantage of the source characterization property of a universal source coding scheme, is investigated. Based upon a new class of multidimensional incremental parsing algorithm, extended from the Lempel-Ziv incremental parsing code, the proposed method captures the occurrence pattern of visual elements from a given image. A linguistic processing technique, namely the latent semantic analysis (LSA) method, is then employed to identify associative ensembles of visual elements, which lay the foundation for intelligent visual information analysis. In 2-D applications, incremental parsing decomposes an image into elementary patches that are different from the conventional fixed square-block type patches. When used in compressive representations, it is amenable in schemes that do not rely on average distortion criteria, a methodology that is a departure from the conventional vector quantization. We call this methodology a parsed representation. In this article, we present our implementations of an image retrieval system, called IPSILON, with parsed representations induced by different perceptual distortion thresholds. We evaluate the effectiveness of the use of the parsed representations by comparing their performance with that of four image retrieval systems, one using the conventional vector quantization for visual information analysis under the same LSA paradigm, another using a method called SIMPLIcity which is based upon an image segmentation and integrated region matching, and the other two based upon query-by-semantic-example and query-by-visual-example. The first two of them were tested with 20 000 images of natural scenes, and the others were tested with a portion of the images. The experimental results show that the proposed parsed representation efficiently captures the salient features in visual images and the IPSILON systems outperform other systems in terms of retrieval precision and distortion robustness.

[1]  Robert M. Gray,et al.  Image retrieval using color histograms generated by Gauss mixture vector quantization , 2004, Comput. Vis. Image Underst..

[2]  Jerome R. Bellegarda,et al.  A novel word clustering algorithm based on latent semantic analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Thierry Pun,et al.  The Truth about Corel - Evaluation in Image Retrieval , 2002, CIVR.

[4]  Vittorio Castelli,et al.  Digital Imagery: Fundamentals , 2002 .

[5]  William I. Grosky,et al.  From features to semantics: some preliminary results , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[6]  Gaurav Harit,et al.  Word image based latent semantic indexing for conceptual querying in document image databases , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[7]  Nuno Vasconcelos,et al.  Minimum probability of error image retrieval , 2012, IEEE Transactions on Signal Processing.

[8]  Sethuraman Panchanathan,et al.  Image and video indexing using vector quantization , 1997, Machine Vision and Applications.

[9]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Francesco G. B. De Natale,et al.  Content-Based Image Retrieval by Feature Adaptation and Relevance Feedback , 2007, IEEE Transactions on Multimedia.

[11]  Chia-Hung Yeh,et al.  Content-based image retrieval through compressed indices based on vector quantized images , 2006 .

[12]  Daniel Gatica-Perez,et al.  Modeling Semantic Aspects for Cross-Media Image Indexing , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[14]  Zhongfei Zhang,et al.  Effective Image Retrieval Based on Hidden Concept Discovery in Image Database , 2007, IEEE Transactions on Image Processing.

[15]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[16]  Alex Pentland,et al.  Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[17]  Djemel Ziou,et al.  Image Collection Organization and Its Application to Indexing, Browsing, Summarization, and Semantic Retrieval , 2007, IEEE Transactions on Multimedia.

[18]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[19]  Thrasyvoulos N. Pappas,et al.  Perceptually based techniques for image segmentation and semantic classification , 2007, IEEE Communications Magazine.

[20]  Daniel Gatica-Perez,et al.  PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.

[21]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[22]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[23]  James A. Storer,et al.  Reduced complexity content-based image retrieval using vector quantization , 2006, Data Compression Conference (DCC'06).

[24]  P. Praks,et al.  Latent Semantic Indexing for Image Retrieval Systems , 2003 .

[25]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[27]  Guojun Lu,et al.  Image indexing and retrieval based on vector quantization , 2007, Pattern Recognit..

[28]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[29]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[30]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[31]  P. Praks,et al.  On SVD-free latent semantic indexing for image retrieval for application in a hard industrial environment , 2003, IEEE International Conference on Industrial Technology, 2003.

[32]  George Economou,et al.  Color-based image retrieval using vector quantization and multivariate graph matching , 2005, IEEE International Conference on Image Processing 2005.

[33]  Fabrice Souvannavong,et al.  Using Structure for Video Object Retrieval , 2004, CIVR.

[34]  R. Keys Cubic convolution interpolation for digital image processing , 1981 .

[35]  Tom Minka,et al.  Interactive learning with a "society of models" , 1997, Pattern Recognit..

[36]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[37]  Luo Si,et al.  Effective automatic image annotation via a coherent language model and active learning , 2004, MULTIMEDIA '04.

[38]  James Ze Wang,et al.  IRM: integrated region matching for image retrieval , 2000, ACM Multimedia.

[39]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[40]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[41]  Luc Van Gool,et al.  Modeling scenes with local descriptors and latent aspects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[42]  M. Vetterli Image Retrieval Using Latent Semantic Indexing , 1997 .

[43]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[44]  Rosalind W. Picard,et al.  Interactive Learning Using a "Society of Models" , 2017, CVPR 1996.

[45]  Hyeran Byun,et al.  FRIP: a region-based image retrieval tool using automatic image segmentation and stepwise Boolean AND matching , 2005, IEEE Transactions on Multimedia.

[46]  Nuno Vasconcelos,et al.  Bridging the Gap: Query by Semantic Example , 2007, IEEE Transactions on Multimedia.

[47]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[48]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[49]  Kyuseok Shim,et al.  WALRUS: a similarity retrieval algorithm for image databases , 1999, IEEE Transactions on Knowledge and Data Engineering.

[50]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[51]  Biing-Hwang Juang,et al.  Multidimensional Incremental Parsing for Universal Source Coding , 2008, IEEE Transactions on Image Processing.

[52]  Biing-Hwang Juang,et al.  Two dimensional incremental parsing for image compression , 2007, 2007 15th European Signal Processing Conference.

[53]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[54]  Susu Yao,et al.  Just noticeable distortion model and its applications in video coding , 2005, Signal Process. Image Commun..

[55]  Guoping Qiu Color image indexing using BTC , 2003, IEEE Trans. Image Process..

[56]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[57]  Anil K. Jain,et al.  Image classification for content-based indexing , 2001, IEEE Trans. Image Process..

[58]  Nuno Vasconcelos,et al.  Scene classification with low-dimensional semantic spaces and weak supervision , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[60]  Joo-Hwee Lim,et al.  Medical-Image Retrieval Based on Knowledge-Assisted Text and Image Indexing , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[61]  Milind R. Naphade,et al.  Extracting semantics from audio-visual content: the final frontier in multimedia retrieval , 2002, IEEE Trans. Neural Networks.

[62]  James Ze Wang,et al.  Toward Bridging the Annotation-Retrieval Gap in Image Search , 2007, IEEE MultiMedia.

[63]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[64]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Hichem Frigui,et al.  Unsupervised Image Segmentation and Annotation for Content-Based Image Retrieval , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[66]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.