A Joint Optimization Model for Image Summarization Based on Image Content and Tags

As an effective technology for navigating a large number of images, image summarization is becoming a promising task with the rapid development of image sharing sites and social networks. Most existing summarization approaches use the visual-based features for image representation without considering tag information. In this paper, we propose a novel framework, named JOINT, which employs both image content and tag information to summarize images. Our model generates the summary images which can best reconstruct the original collection. Based on the assumption that an image with representative content should also have typical tags, we introduce a similarity-inducing regularizer to our model. Furthermore, we impose the lasso penalty on the objective function to yield a concise summary set. Extensive experiments demonstrate our model out-performs the state-of-the-art approaches.

[1]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[2]  S SawhneyHarpreet,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995 .

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  Yangqiu Song,et al.  ImageHive: Interactive Content-Aware Image Summarization , 2012, IEEE Computer Graphics and Applications.

[5]  Xi Chen,et al.  Smoothing proximal gradient method for general structured sparse regression , 2010, The Annals of Applied Statistics.

[6]  Xian-Sheng Hua,et al.  Interactive browsing via diversified visual summarization for image search results , 2011, Multimedia Systems.

[7]  Jianping Fan,et al.  Image collection summarization via dictionary learning for sparse representation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Xiaojin Zhu,et al.  Improving Diversity in Ranking using Absorbing Random Walks , 2007, NAACL.

[10]  Steven M. Seitz,et al.  Scene Summarization for Online Image Collections , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11]  Changsheng Xu,et al.  Context saliency based image summarization , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[12]  Chun Chen,et al.  Document Summarization Based on Data Reconstruction , 2012, AAAI.

[13]  Mor Naaman,et al.  Generating summaries and visualization for large collections of geo-referenced photographs , 2006, MIR '06.

[14]  Chong-Wah Ngo,et al.  Evaluating bag-of-visual-words representations in scene classification , 2007, MIR '07.

[15]  Youssef Hadi,et al.  Video summarization by k-medoid clustering , 2006, SAC '06.

[16]  Pinaki Sinha Summarization of archived and shared personal photo collections , 2011, WWW.

[17]  Andreas Krause,et al.  Submodular Dictionary Selection for Sparse Representation , 2010, ICML.

[18]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[19]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[21]  Jianping Fan,et al.  Effective summarization of large-scale web images , 2011, MM '11.