Detecting Emotions in Speech

Human language carries many kinds of information. In human-computer interaction, the detection of the emotional state of a speaker as reflected in his or her utterances is crucial. In this investigation we will explore how acoustic and prosodic information can be used to detect the emotional state of a speaker. We will show how prosodic information can be combined with acoustic information within a hidden Markov model architecture, which allows observations to be made at a rate appropriate for the phenomena being modeled. Using this architecture, we will demonstrate that prosodic information adds discriminative power to the overall system.
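
One way to realize such a two-stream architecture is to train separate per-emotion HMMs on acoustic frames (at a fast frame rate) and on prosodic frames (at a slower, prosody-appropriate rate), then fuse their length-normalized log-likelihoods at decision time. The sketch below illustrates this under explicit assumptions: the emotion labels, feature matrices, and stream weight `w` are illustrative placeholders, and `hmmlearn` stands in for whatever HMM toolkit is actually used; it is a minimal sketch, not the system described here.

```python
# Minimal sketch of two-stream emotion classification with per-emotion HMMs.
# Assumptions: hmmlearn as the HMM toolkit; placeholder emotion labels;
# acoustic features at a fast rate (e.g. 10 ms frames) and prosodic features
# at a slower rate; a hand-set fusion weight `w`.
import numpy as np
from hmmlearn.hmm import GaussianHMM

EMOTIONS = ["neutral", "anger", "sadness"]  # placeholder label set

def train_models(features_by_emotion, n_states):
    """Fit one Gaussian HMM per emotion on a single feature stream.
    `features_by_emotion` maps label -> list of (T, d) frame matrices."""
    models = {}
    for emotion, utterances in features_by_emotion.items():
        X = np.vstack(utterances)               # stack all frames
        lengths = [len(u) for u in utterances]  # per-utterance frame counts
        m = GaussianHMM(n_components=n_states,
                        covariance_type="diag", n_iter=20)
        m.fit(X, lengths)
        models[emotion] = m
    return models

def classify(acoustic, prosodic, acoustic_models, prosodic_models, w=0.5):
    """Score one utterance under both streams and fuse log-likelihoods.
    `acoustic` is a (T_a, d_a) matrix at the acoustic frame rate;
    `prosodic` is a (T_p, d_p) matrix at a slower prosodic rate.
    Length-normalizing each score makes the two rates comparable."""
    scores = {}
    for emotion in EMOTIONS:
        ll_a = acoustic_models[emotion].score(acoustic) / len(acoustic)
        ll_p = prosodic_models[emotion].score(prosodic) / len(prosodic)
        scores[emotion] = (1 - w) * ll_a + w * ll_p  # weighted fusion
    return max(scores, key=scores.get)
```

In this sketch the claim that prosody adds discriminative power corresponds to the fused score with `w > 0` outperforming the acoustic-only score (`w = 0`); the weight itself would be tuned on held-out data.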