A probabilistic approach to acoustic echo clustering and suppression

This paper introduces an approach to cluster and suppress acoustic echo signals in hands-free, full-duplex speech communication systems. We employ the instantaneous recursive estimate of the magnitude squared coherence (MSC) of the echo line signal and the microphone signal, and model it with a two-component Beta mixture distribution. Since we consider the case of multiple microphone pickup, we further integrate the normalized recording vector as location feature into the proposed approach to achieve reliable soft decisions on the echo presence. The location information has been widely used for clustering-based blind source separation, and can be modeled using a Watson mixture distribution. Simulation evaluations of the proposed method show that it can achieve significant echo suppression performance.

[1]  Barak A. Pearlmutter,et al.  Soft-LOST: EM on a Mixture of Oriented Lines , 2004, ICA.

[2]  Ted S. Wada,et al.  Enhancement of Residual Echo for Robust Acoustic Echo Cancellation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  K. Mardia,et al.  The complex Watson distribution and shape analysis , 1999 .

[4]  I. M. Pyshik,et al.  Table of integrals, series, and products , 1965 .

[5]  Yun-Sik Park,et al.  Frequency Domain Acoustic Echo Suppression Based on Soft Decision , 2009, IEEE Signal Processing Letters.

[6]  Ivan Tashev Coherence based double talk detector with soft decision , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Ted S. Wada,et al.  A system approach to acoustic echo cancellation in robust hands-free teleconferencing , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[8]  Tomohiro Nakatani,et al.  An integration of source location cues for speech clustering in distributed microphone arrays , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Christof Faller,et al.  Robust Acoustic ECHO Control using a Simple ECHO Path Model , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[11]  Joon-Hyuk Chang,et al.  Frequency-Domain Double-Talk Detection Based on the Gaussian Mixture Model , 2010, IEEE Signal Processing Letters.

[12]  Hiroshi Sawada,et al.  A Two-Stage Frequency-Domain Blind Source Separation Method for Underdetermined Convolutive Mixtures , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[13]  Maria Hansson,et al.  A double-talk detector based on coherence , 1996, IEEE Trans. Commun..

[14]  Reinhold Häb-Umbach,et al.  Blind speech separation employing directional statistics in an Expectation Maximization framework , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  C. Avendano Acoustic echo suppression in the STFT domain , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[16]  Suvrit Sra,et al.  The multivariate Watson distribution: Maximum-likelihood estimation and other aspects , 2011, J. Multivar. Anal..