Automatic Name-Face Alignment to Enable Cross-Media News Retrieval

A new algorithm is developed in this paper to support automatic name-face alignment for achieving more accurate cross-media news retrieval. We focus on extracting valuable information from large amounts of news images and their captions, where multi-level image-caption pairs are constructed for characterizing both significant names with higher salience and their cohesion with human faces extracted from news images. To remedy the issue of lacking enough related information for rare name, Web mining is introduced to acquire the extra multimodal information. We also emphasize on an optimization mechanism by our Improved Self-Adaptive Simulated Annealing Genetic Algorithm to verify the feasibility of alignment combinations. Our experiments have obtained very positive results.

[1]  Alexander C. Berg,et al.  Who's In the Picture , 2004, NIPS 2004.

[2]  Andrew Zisserman,et al.  Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video , 2006, BMVC.

[3]  Duy-Dinh Le,et al.  Unsupervised Face Annotation by Mining the Web , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[4]  Thomas Mensink,et al.  Improving People Search Using Query Expansions , 2008, ECCV.

[5]  Yoke San Wong,et al.  Optimization of multi-pass milling using parallel genetic algorithm and parallel genetic simulated annealing , 2005 .

[6]  Cordelia Schmid,et al.  Automatic face naming with caption-based supervision , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Gholam Ali Rezai Rad,et al.  A Genetic Programming-PCA Hybrid Face Recognition Algorithm , 2011, J. Signal Inf. Process..

[8]  Cordelia Schmid,et al.  Face recognition from caption-based supervision , 2010 .

[9]  Andrew Zisserman,et al.  Taking the bite out of automated naming of characters in TV video , 2009, Image Vis. Comput..

[10]  Pinar Duygulu Sahin,et al.  A Graph Based Approach for Naming Faces in News Photos , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Tamara L. Berg,et al.  names and faces. , 1982, The Physician and sportsmedicine.

[12]  Marie-Francine Moens,et al.  Naming People in News Videos with Label Propagation , 2011, IEEE MultiMedia.

[13]  Yi Yang,et al.  Ranking with local regression and global alignment for cross media retrieval , 2009, ACM Multimedia.

[14]  Marie-Francine Moens,et al.  Cross-Media Alignment of Names and Faces , 2010, IEEE Transactions on Multimedia.

[15]  Jun Yang,et al.  Finding Person X: Correlating Names with Visual Appearances , 2004, CIVR.

[16]  Marie-Francine Moens,et al.  Text Analysis for Automatic Image Annotation , 2007, ACL.

[17]  Thomas Mensink,et al.  Improving People Search Using Query Expansions , 2008, ECCV.

[18]  Takeo Kanade,et al.  Name-It: Naming and Detecting Faces in News Videos , 1999, IEEE Multim..

[19]  Yee Whye Teh,et al.  Names and faces in the news , 2004, CVPR 2004.

[20]  Duy-Dinh Le,et al.  Finding Important People in Large News Video Databases Using Multimodal and Clustering Analysis , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[21]  Frank Werner,et al.  Simulated annealing and genetic algorithms for minimizing mean flow time in an open shop , 2008, Math. Comput. Model..

[22]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[23]  Takeo Kanade,et al.  Name-It: association of face and name in video , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Jianping Fan,et al.  Quantitative Characterization of Semantic Gaps for Learning Complexity Estimation and Inference Model Selection , 2012, IEEE Transactions on Multimedia.

[25]  Qingming Huang,et al.  Naming faces in broadcast news video by image google , 2008, ACM Multimedia.