Inter-Channel Decorrelation by Sub-Band Resampling for Multi-Channel Acoustic Echo Cancellation

An inter-channel decorrelation procedure is highly recommended for multi-channel acoustic echo cancellation (AEC) to directly assist adaptive filtering algorithms in overcoming the so-called “non-uniqueness” problem. Although various methods have been proposed in the past to mitigate the problem and lower the misalignment of the adaptive filter, the introduced audible distortion often limits the effectiveness of those algorithms. In this paper, we investigate in detail the decorrelation by resampling technique and a proper design strategy for sub-band resampling (SBR), which permits finely tuned, frequency specific control of the trade-off between decorrelation, measured in terms of the (magnitude-squared) coherence, and audio quality, evaluated in terms of objective speech quality measures and a subjective listening test. Our most recent study provides a deep analysis of the performance bounds of the resampling procedure by analyzing the relationship between resampling, coherence, and achievable steady-state misalignment. We provide a novel, theoretically justifiable, and perceptually motivated strategy for SBR to achieve a fast multi-channel AEC convergence rate while maintaining the highest audio quality when compared to other conventional decorrelation procedures.

[1]  Ted S. Wada,et al.  On Dealing with Sampling Rate Mismatches in Blind Source Separation and Acoustic Echo Cancellation , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[2]  Ted S. Wada,et al.  On the performance of the robust acoustic echo cancellation system with decorrelation by sub-band resampling , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Ted S. Wada,et al.  MULTI-CHANNEL ACOUSTIC ECHO CANCELLATION BASED ON RESIDUAL ECHO ENHANCEMENT WITH EFFECTIVE CHANNEL , 2010 .

[4]  Ted S. Wada,et al.  Acoustic echo cancellation based on independent component analysis and integrated residual echo enhancement , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[5]  G. Carter Coherence and time delay estimation , 1987, Proceedings of the IEEE.

[6]  Ted S. Wada,et al.  Inter-channel decorrelation by sub-band resampling in frequency domain , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Jacob Benesty,et al.  New insights into the stereophonic acoustic echo cancellation problem and an adaptive nonlinearity solution , 2002, IEEE Trans. Speech Audio Process..

[8]  Ted S. Wada,et al.  Inter-Channel Decorrelation by Resampling via Time-Domain Interpolation Filters Derived from the Time-Shifting Property , 2012, IWAENC.

[9]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10]  J. L. Hall,et al.  Stereophonic acoustic echo cancellation-an overview of the fundamental problem , 1995, IEEE Signal Processing Letters.

[11]  Jacob Benesty,et al.  A better understanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation , 1998, IEEE Trans. Speech Audio Process..

[12]  Jacob Benesty,et al.  Investigation of several types of nonlinearities for use in stereo acoustic echo cancellation , 2001, IEEE Trans. Speech Audio Process..

[13]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[14]  Akihiko Sugiyama,et al.  A stereo echo canceller with simultaneous input-sliding and sliding-period control , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Karin Ackermann,et al.  Advances In Network And Acoustic Echo Cancellation , 2016 .

[16]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[17]  Ted S. Wada,et al.  Decorrelation by resampling in frequency domain for multi-channel acoustic echo cancellation based on residual echo enhancement , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[18]  Yann Joncour,et al.  A stereo echo canceler with correct echo-path identification based on an input-sliding technique , 1997, IEEE Trans. Signal Process..

[19]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[20]  Ted S. Wada,et al.  Enhancement of Residual Echo for Robust Acoustic Echo Cancellation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Jacob Benesty,et al.  On Regularization in Adaptive Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Walter Kellermann,et al.  Acoustic Echo Cancellation for Surround Sound using Perceptually Motivated Convergence Enhancement , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[23]  J. Shynk Frequency-domain and multirate adaptive filtering , 1992, IEEE Signal Processing Magazine.

[24]  Jacob Benesty,et al.  Stereophonic acoustic echo cancellation: analysis of the misalignment in the frequency domain , 2006, IEEE Signal Processing Letters.