论文信息 - Decision-Level Fusion for Audio-Visual Laughter Detection

Decision-Level Fusion for Audio-Visual Laughter Detection

Laughter is a highly variable signal, which can be caused by a spectrum of emotions. This makes the automatic detection of laughter a challenging, but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio-visual laughter detection is performed by fusing the results of separate audio and video classifiers on the decision level. This results in laughter detection with a significantly higher AUC-ROC than single-modality classification.

[1] Maja Pantic,et al. Particle filtering with factorized likelihoods for tracking facial features , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[2] Nebojsa Jojic,et al. A Graphical Model for Audiovisual Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3] Chih-Jen Lin,et al. A Practical Guide to Support Vector Classication , 2008 .

[4] Maja Pantic,et al. Spontaneous vs. posed facial behavior: automatic analysis of brow actions , 2006, ICMI '06.

[5] Tom Fawcett,et al. An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[6] Min Xu,et al. Affective content analysis in comedy and horror videos by audio emotional event detection , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[7] Massimo Piccardi,et al. Fusing Face and Body Display for Bi-modal Emotion Recognition: Single Frame Analysis and Multi-frame Post Integration , 2005, ACII.

[8] David A. van Leeuwen,et al. Automatic discrimination between laughter and speech , 2007, Speech Commun..

[9] Zhigang Deng,et al. Analysis of emotion recognition using facial expressions, speech and multimodal information , 2004, ICMI '04.

[10] Akinori Ito,et al. Smile and laughter recognition using speech processing and face recognition from conversation video , 2005, 2005 International Conference on Cyberworlds (CW'05).

[11] Johannes D. Krijnders,et al. CASSANDRA: audio-video sensor fusion for aggression detection , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[12] Maja Pantic,et al. Audiovisual discrimination between laughter and speech , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13] Maja Pantic,et al. Fusion of audio and visual cues for laughter detection , 2008, CIVR '08.

[14] Nick Campbell,et al. No laughing matter , 2005, INTERSPEECH.

[15] J. Trouvain. Segmenting Phonetic Units in Laughter , 2003 .

[16] Ananth N. Iyer,et al. Emotion Detection From Infant Facial Expressions And Cries , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[17] Gerhard Rigoll,et al. Bimodal fusion of emotional data in an automotive environment , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[18] Daniel P. W. Ellis,et al. Laughter Detection in Meetings , 2004 .

[19] Yuxiao Hu,et al. Audio-Visual Spontaneous Emotion Recognition , 2007, Artifical Intelligence for Human Computing.

[20] Boris Reuderink. Fusion for Audio-Visual Laughter Detection , 2007 .

[21] Zhihong Zeng,et al. Audio-Visual Affect Recognition , 2007, IEEE Transactions on Multimedia.

[22] Dae-Jong Lee,et al. Emotion recognition from the facial image and speech signal , 2003, SICE 2003 Annual Conference (IEEE Cat. No.03TH8734).

[23] Andrea Lockerd Thomaz,et al. LAFCam: Leveraging affective feedback camcorder , 2002, CHI Extended Abstracts.

[24] Timothy F. Cootes,et al. Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[25] Ling Guan,et al. Recognizing human emotion from audiovisual information , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[26] Dariu M. Gavrila,et al. Audio-video sensor fusion for aggression detection. , 2007 .

[27] Juergen Luettin,et al. Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..