Image and Video Retrieval

We present a user-centric system for visualization and layout for content-based image retrieval and browsing. Image features (visual and/or semantic) are analyzed to display and group retrievals as thumbnails in a 2-D spatial layout which conveys mutual similarities. Moreover, a novel subspace feature weighting technique is proposed and used to modify 2-D layouts in a variety of context-dependent ways. An efficient computational technique for subspace weighting and re-estimation leads to a simple user-modeling framework whereby the system can learn to display query results based on layout examples (or relevance feedback) provided by the user. The resulting retrieval, browsing and visualization engine can adapt to the user's (time-varying) notions of content, context and preferences in style of interactive navigation. Monte Carlo simulations with synthetic "user-layouts" as well as pilot user studies have demonstrated the ability of this framework to accurately model or "mimic" users by automatically generating layouts according to their preferences.

[1]  Rosalind W. Picard A Society of Models for Video and Image Libraries , 1996, IBM Syst. J..

[2]  Hayit Greenspan,et al.  Finding Pictures of Objects in Large Collections of Images , 1996, Object Representation in Computer Vision.

[3]  Michael S. Lew,et al.  Quality Measures for Interactive Image Retrieval with a Performance Evaluation of Two 3x3 Texel-based Methods , 1997, ICIAP.

[4]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[5]  Dragutin Petkovic,et al.  Challenges and Opportunities for Pattern Recognition and Computer Vision Research in Year 2000 and Beyond , 1997 .

[6]  M. Lew,et al.  Optimal supports for image matching , 1996, 1996 IEEE Digital Signal Processing Workshop Proceedings.

[7]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Michael S. Lew,et al.  Information theory and face detection , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[9]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Anil K. Jain,et al.  On image classification: city vs. landscape , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[11]  F. Dirfaux Key frame selection to represent a video , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[12]  S. Kullback,et al.  Information Theory and Statistics , 1959 .

[13]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[14]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[15]  A. Murat Tekalp,et al.  Digital Video Processing , 1995 .

[16]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[17]  Michael S. Lew Next-Generation Web Searches for Visual Content , 2000, Computer.