PicSOM-self-organizing image retrieval with MPEG-7 content descriptors

Development of content-based image retrieval (CBIR) techniques has suffered from the lack of standardized ways for describing visual image content. Luckily, the MPEG-7 international standard is now emerging as both a general framework for content description and a collection of specific agreed-upon content descriptors. We have developed a neural, self-organizing technique for CBIR. Our system is named PicSOM and it is based on pictorial examples and relevance feedback (RF). The name stems from "picture" and the self-organizing map (SOM). The PicSOM system is implemented by using tree structured SOMs. In this paper, we apply the visual content descriptors provided by MPEG-7 in the PicSOM system and compare our own image indexing technique with a reference system based on vector quantization (VQ). The results of our experiments show that the MPEG-7-defined content descriptors can be used as such in the PicSOM system even though Euclidean distance calculation, inherently used in the PicSOM system, is not optimal for all of them. Also, the results indicate that the PicSOM technique is a bit slower than the reference system in starting to find relevant images. However, when the strong RF mechanism of PicSOM begins to function, its retrieval precision exceeds that of the reference system.

[1]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[2]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[3]  Pasi Koikkalainen,et al.  Self-organizing hierarchical feature maps , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[4]  Pasi Koikkalainen,et al.  Progress with the Tree-Structured Self-Organizing Map , 1994, ECAI.

[5]  James H. Burrows,et al.  Secure Hash Standard , 1995 .

[6]  Tom Minka,et al.  Modeling user subjectivity in image libraries , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[7]  Samuel Kaski,et al.  Self organization of a massive text document collection , 1999 .

[8]  Alberto Del Bimbo,et al.  Visual information retrieval , 1999 .

[9]  Erkki Oja,et al.  Self-Organizing Maps for Content-Based Image Database Retrieval , 1999 .

[10]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[11]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Erkki Oja,et al.  PicSOM - A Framework for Content-Based Image Database Retrieval using Self-Organizing Maps , 1999 .

[13]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[14]  Erkki Oja,et al.  PicSOM - content-based image retrieval with self-organizing maps , 2000, Pattern Recognit. Lett..

[15]  Neil J. Gunther,et al.  Benchmark for image retrieval using distributed systems over the Iinternet: BIRDS-I , 2000, IS&T/SPIE Electronic Imaging.

[16]  E. Oja,et al.  COMPARISON OF TECHNIQUES FOR CONTENT-BASED IMAGE RETRIEVAL , 2001 .

[17]  Michael S. Lew,et al.  Principles of Visual Information Retrieval , 2001, Advances in Pattern Recognition.

[18]  Shih-Fu Chang,et al.  Overview of the MPEG-7 standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[19]  Erkki Oja,et al.  Self-Organizing Maps of Web Link Information , 2001, WSOM.

[20]  Erkki Oja,et al.  Self-Organising Maps as a Relevance Feedback Technique in Content-Based Image Retrieval , 2001, Pattern Analysis & Applications.

[21]  Erkki Oja,et al.  Statistical Shape Features for Content-Based Image Retrieval , 2004, Journal of Mathematical Imaging and Vision.