Principal angles approach to time-domain filter design for target cancelation

We consider the design of a pair of time-domain filters to achieve target signal cancelation in a multi-source environment. The problem is formulated as a minimization of a sum squared error cost function with respect to the pair of finite impulse response cancelation filters. Direct minimization is achieved through an alternating gradient descent based method, whereas a novel method based on the method of principal angles is proposed which exploits the singular value decomposition. Simulation studies show that the gradient descent method suffers from slow convergence but this is overcome by the method based on principal angles which also achieves a lower cost than the gradient descent approach. The cancelation filters are then combined with an adaptive filtering scheme to address a video-informed audio source separation problem and preliminary results suggest good performance in terms of objective measures.

[1]  Barbara G Shinn-Cunningham,et al.  Localizing nearby sound sources in a classroom: binaural room impulse responses. , 2005, The Journal of the Acoustical Society of America.

[2]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Christian Jutten,et al.  Visual voice activity detection as a help for speech source separation from convolutive mixtures , 2007, Speech Commun..

[4]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[5]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[6]  Patrick A. Naylor,et al.  Alternating fixed-point algorithm for stereophonic acoustic echo cancellation , 2002 .

[7]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[8]  Gene H. Golub,et al.  Matrix computations , 1983 .

[9]  Yonggang Zhang,et al.  Multimodal blind source separation for moving sources , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Pierre Comon,et al.  Handbook of Blind Source Separation: Independent Component Analysis and Applications , 2010 .

[11]  Israel Cohen,et al.  Analysis of two-channel generalized sidelobe canceller (GSC) with post-filtering , 2003, IEEE Trans. Speech Audio Process..

[12]  Muhammad Salman Khan,et al.  Multimodal (audio-visual) source separation exploiting multi-speaker tracking, robust beamforming and time-frequency masking , 2012, IET Signal Process..

[13]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[14]  Miao Yu,et al.  A Multimodal Approach to Blind Source Separation of Moving Sources , 2010, IEEE Journal of Selected Topics in Signal Processing.